Web Server Hardware and Software Chapter 8

advertisement
Chapter 8
Web Server Hardware
and Software
Learning Objectives
In this chapter, you will learn about:
• Web server hardware considerations
• Measuring the performance of Web server hardware
• The way that individual computers are combined to
provide large-scale Web services
• Web server software including Apache, Microsoft
Internet Information Server, and iPlanet Web Server.
Other software that works with Web server software to
accomplish the basic operations of a Web site.
Web Server Hardware
and Performance Evaluation
• Today a Web site may be the first place customers go
to conduct business with traditional companies, while
electronic commerce sites have become the main
business focus for many organizations.
• The two main ingredients in a Web server are its
hardware and it Web server software.
Types of Web Sites
• An important first step in planning a Web server is to
determine what the company wants to accomplish
with the server.
• The company must estimate how many visitors will be
connecting to the Web site and what types of files will
be delivered through the site.
• Companies create Web sites in a wide variety of forms
including simple development sites, intranets,
information-only sites for customers, business-tobusiness portals, storefronts, or content-delivery.
Development Sites
• The simplest Web site and the least costly to
implement is a development site.
• A development site can reside on an existing PC and
can be developed with low-cost Web site building
tools, such as Microsoft FrontPage or Macromedia
Dreamweaver.
• Testers can access the site through their PCs on the
existing LAN.
Intranets
• Corporate intranets house internal memos, corporate
policy handbooks, expense account worksheets,
budgets, newsletters, and a variety of other corporate
documents.
• Intranets are shielded from the Internet, they do not
require additional security software to protect them
against threats from outside the company.
Transaction-Processing Sites
• Transaction-processing sites, such as business-tobusiness and business-to-consumer electronic
commerce sites must be available 24 hours a day,
seven days a week; that is, they must be highreliability servers.
• Transaction-processing sites must also have spare
server computers to handle high traffic volumes.
• Transaction-processing sites must also run security
software.
Content Delivery Sites
• Content delivery sites, such as The Wall Street
Journal, The New York Times, and C-Net, sell and
deliver content, such as news, histories, summaries,
and other digital information.
• Content must be presented rapidly on the visitor’s
screen.
• Visitors must be able to locate articles quickly with a
fast and precise search engine.
Web Hosting Choices
• The company must decide whether to run servers
in-house or through third-party Web and e-commerce
providers.
• Companies should consider:
– scalability of e-commerce
– bandwidth of the Web site
– a local third-party ISP
• Contact Web Host Guild (WHG) for help.
Web Hosting Choices
Web Hosting Choices
• When making Web server hosting decisions, a
company should ask whether the hardware, or
platform and software combination, can be upgraded
when the traffic on its site increases.
• Using a service provider’s shared or dedicated
hosting services instead of building an in-house
server or using a co-location service means that the
staffing burden shifts from the company to the Web
host.
Web Hosting Choices
Operating Systems for Web Servers
• Hardware decisions go hand in hand with operating
system and application server software choices.
• Companies should consider:
– Speed -- A fast server is better than a slower one
– Internal and external traffic to occur on the server
– Scalability of the server hardware
Test Reports on Various Web Servers
Building a Scalable E-commerce
Systems
• Three layers in E-commerce systems:
• Web server layer
• Middle-tier layer
• Backend layer
• An application server is a middle-tier software and
hardware combination that lies between the Internet
and a corporate backend server.
Web Server Performance Evaluation
• Benchmarking is a form of testing used to compare
the performance of hardware and software.
• Hardware and operating systems are key areas for
benchmarking.
• The speed of its connection can affect a Web server’s
performance.
• Throughput and response time can be used to
measure a server’s Web page delivery capability.
Web Server Benchmark Software
• Web server benchmark software types are listed
below:
– NetBench
– ServerBench
– SPEC SFS97
– SPECweb99
– WCAT
– WebBench
– WebStone
Desirable Features of Web Servers
• Web servers are located on the Internet or intranets,
usually behind firewalls.
• The duties and features of Web servers differ
depending on whether they are publicly accessible.
• Web server software features depend on the software
package being used.
Core Capabilities
• The most fundamental duty of a Web server is to
process and respond to Web client requests that are
sent using the HTTP protocol.
• For dynamic pages, the server uses an architecture
with three or more tiers that invokes other programs.
Indexing and Searching
• Search engine and indexing programs are important
elements of many Web servers.
• Search engines, or search tools, search either a
specific site or the entire Web for requested
documents.
• An indexing program can provide full-text indexing
that generates an index for all documents stored on a
server.
Data Analysis
• Web servers can capture visitor information, including
data about who is visiting a Web site, how long the
visitor’s Web browser viewed the site, the date and
time of each visit, and which pages were displayed.
• Two of the most popular Web log file analysis
programs are the Analog Web server log file analyzer
and the WebTrends Web server log file analyzer.
• Figure 8-4 shows part of WebTrends log file analysis
report.
Data Analysis
Site Management Tools
• FrontPage has some site management capabilities.
• HomeSite is a site management tool that validates
graphics, computes page-download times for modem
connection, validates links, and validates HTML
codes.
Site Management Tools
• Application construction uses Web editors and
extensions to produce Web pages.
• Some Web development systems provide simple tools
to create Web pages.
• Some tools can be used to create dynamic features
without the need to know CGI or use API coding.
Link Checking
• A link checker examines each page on the site and
reports on any URLs that are broken, that seem to be
broken, or that are in some way incorrect.
• An orphan file is a file on a Web site that is not linked
to any page.
• Other important site management features include
script checking and HTML validation.
• A dead link, when clicked, displays an error message
rather than a Web page.
Link Checking
• Free link-checking and Web site validation programs,
such as Elsop Linkscan, can be launched by entering
the address of a Web site’s home page and checking
a few boxes.
• Commercial site checkers, such as Big Brother
software from Watchfire, produce more
comprehensive results and more detailed site
analyses than do the free products.
Link Checking
Remote Server Administration
• With remote server administration, a Web site
administrator can control a Web site from any
Internet-connected computer.
• Although all Web sites provide administrative
controls, it is convenient for an administrator to be
able to fix the server from wherever he or she
happens to be.
Remote Server Administration Tool
Dynamic Content
• Dynamic content is nonstatic information constructed
in response to a Web client’s request.
• Using Open DataBase Connectivity (ODBC), the Web
server can assemble information from disparate
database systems.
• Active server pages (ASP) is a server-side scripting
mechanism to build dynamic sites and Web
applications.
Dynamic Content
• Most Web sites today provide dynamic Web pages
using an approach called server-side scripting.
• In server-side scripting, programs running on the Web
server create Web pages before sending them back to
the requesting Web clients as parts of response
messages.
• Microsoft uses Active server pages (ASP).
• Sun uses Java server pages (JSP).
• Apache developed Hypertext Preprocessor (PHP).
Web Server Software
• There are two distinct Web servers in the market:
intranet servers and public Web servers.
• Three of the most popular Web server programs are:
• Apache HTTP Server
• Microsoft Internet Information Server
• Netscape Enterprise Server
Apache HTTP Server
• The Apache HTTP Server is free and performs
efficiently.
• Apache runs on many operating systems and the
hardware that supports them.
• Apache has a built-in search engine and HTML
authoring tools and supports FTP.
Apache HTTP Server
• Apache can be managed from either a server console
or a Web server.
• Apache supports Server Side Includes (SSI), Active
server pages and Java Servlets.
Microsoft Internet Information Server (IIS)
• IIS comes bundled with Microsoft’s Windows NT
Server and 2000 Server operating systems.
• IIS includes an integrated search engine.
• IIS supports FTP, permits administration from a
remote browser.
• IIS combines HTML pages, ActiveX components, and
scripts to produce dynamic pages.
Microsoft Internet Information Server (IIS)
iPlanet Enterprise Server
• The iPlanet enterprise server runs on such operating
systems as AIX, Digital UNIX, HP-UX, Irix, Solaris, and
Windows NT.
• iPlanet Web server provides a powerful development
environment that supports development of Webbased applications that can run on the Internet, an
intranet, or an extranet.
• iPlanet Web server’s management tools allow
administrators to manage users and monitor server
activity interactively.
iPlanet Enterprise Server
Determining Web Server Hardware and
Software Information
• You can determine the type of hardware and software
most Web sites are running by visiting Netcraft.
• Netcraft software examines the designated Web site
and returns both the Web server hardware and
software information.
Determining Web Server Hardware and
Software Information
Server Architectures and Server Utilities
• Companies that operate more than one Web server
must decide how to configure their servers to provide
site visitors with the best service possible.
• The different ways that servers can be connected to
each other and to related hardware, such as routers
and switches, is called server architecture.
Web Server Architectures
• Large electronic commerce Web sites must deliver
millions of individual Web pages every day.
• They must also process thousands of customer and
vendor transactions each day.
• The large collection of servers that these sites have
are called server farms.
Web Server Architectures
• One approach to Web server architecture is called a
centralized architecture, which uses a few very large
and very fast computers.
• Another approach is a decentralized architecture,
which uses a large number of servers.
• Most large decentralized sites use load-balancing
systems.
Load-Balancing Systems
• A load-balancing switch is a piece of network
hardware that monitors the workloads of the servers
attached to it and assigns incoming Web traffic to the
server that has the most available capacity.
• In a simple load-balancing system, the traffic that
enters the site from the Internet, encounters the loadbalancing switch, which then directs the traffic to the
Web server best able to handle the traffic.
Load-Balancing System Architecture
Complex Load-Balancing
Search Engines
• A search engine is a special kind of Web page
software that finds other Web pages that match a
word or phrase you entered.
• A Web directory is a listing of hyperlinks to Web
pages that is organized into hierarchical categories.
• Search engines contain three major parts: spider,
index, and utility.
Intelligent Agents
• Software agents have been in use for a number of
years.
• An intelligent agent is a program that performs
functions, such as information gathering, information
filtering, or mediation running, in the background on
behalf of a person or entity.
• Research reveals that software agents will become
extremely important in the electronic commerce field
sooner than later.
Intelligent Agents
• Because software agents are always running in the
background they can help reduce the workload that
people normally take on in locating, thinking about,
negotiating, and purchasing goods and services.
• Examples of agent systems include Best Web and
MySimon.
Download