Implementing a Loadbalanced Web Server System Architecture of

Implementing a Load-balanced Web Server System

Architecture of A Cluster-based Web System Courtesy: IBM Research Report, The state of the art in the locally distributed Web Server systems.

Architecture of Our Web Server Cluster ht tp re qu File Service es ts Database Service Web Server 1 Load Distributor File Service Video-On-Demand ht tp re qu es ts (grid 1. cs. ucr. edu) Web Server 2

Our Web Server Cluster The whole web server only provides one visible web address to the outside world. Each Web Server is able to provide two kinds of web services. The load distributor distributes the incoming requests among the servers according to either content-aware or content-unaware load balancing strategies.

Tasks to do to set up the system Building up the web services on the servers n n n File Services Video-on-demand Services Database Services Implementing the load distributor on the frond-end node n n Content aware request distribution Content unaware request distribution

Building Up the Web Services File service n n Built on top of Apache server. File set is generated by SPECWEB 99. Video-On-Demand n n Real MPEG 2 movies are stored in a specific directory on the Apache server. Client video streaming software (Video. Lan) is installed and automatically launched by the Apache server. Database service n Built on top of Apache and My. SQL.

Video On Demand Service Video. LAN project (Open Source Media Streaming Solution) n Targeting multimedia streaming of MPEG-1, MPEG-2, MPEG-4 and Div. X files, DVDs, digital satellite channels, digital terrestial television channels and live videos on a high -bandwidth IPv 4 or IPv 6 network in unicast or multicast. Client-server Architecture n n Server streams MPEG-1, MPEG-2 and MPEG-4 / Div. X files, DVDs and live videos on the network in unicast or multicast. Client receives, decodes and displays MPEG stream.

Video. Lan System

Building Up Video-On-Demand Service in Our Web Server Video. Lan client-server software is installed Server can stream movies to the client in realtime through UDP/RTP or HTTP/TCP For video-on-demand service using HTTP/TCP, only the client is needed. The client software (vlc) is automatically launched once the Apache server detects that it is a video file.

Load Balancing Schemes Content Unaware Scheme n n Choose a server before receiving the URL request Round Robin Content Aware Schemes n n Choose a server to dispatch a request after receiving and looking at the URL request Balance load according to different URL request w For database service — Database Server w For video-on-demand service — Multimedia Server w For file service — Round Robin

Implementing the Load Distributor Install the TCPSP n The tcp splicing is a technique to splice two connections inside the kernel, so that data relaying between the two connections can be run at near router speeds. Write the Distributor program in C language n n Two load balancing strategies are implemented The installed kernel module TCPSP is invoked to perform TCP splicing Run the distributor program in the application level

Flow Chart of the Load Distributor (content aware) Distributor Child Process Listen for incoming connections on port 8888 Establish a TCP connection with the chosen server Accept the connection Read the URL request Choose a server according to the request type and load balancing scheme Create a child process to do further processing Splice two TCP connections Write the URL request to the second TCP connection Monitor the two TCP connections and close them when no more activities are going on End

Flow Chart of the Load Distributor (content unaware) Distributor Listen for incoming connections on port 8888 Accept the connection Choose a server according to the load balancing scheme Create a child process to do further processing Child Process Establish a TCP connection with the chosen server Splice two TCP connections Read the URL request Write the URL request to the second TCP connection Monitor the two TCP connections and close them when no more activities are going on End

Comparison with Gage : A Qo. S Aware Web Server System n “Performance Guarantees for Cluster-Based Internet Services”, Chang Li, State University of New York at Stony Brook. The load distributor is implemented as a kernel module. It is faster but can only implement content-unaware load balancing. Gage doesn’t provide a variety of web services.

Planned Performance Measurement Let all servers provide file service, use SPECWEB 99 to test the performance of the cluster-based file server. Compare the time taken to service a Database request through the load distributor with that without the load distributor.

SPECWEB 99

Let’s go to the lab to see DEMO!
- Slides: 17