Research on cloud computing application in the peer-to-peer based video-on-demand systems Authors:

advertisement
Research on cloud computing application in the
peer-to-peer based video-on-demand systems
Authors:
SUN Bing-jue, WU Kai-jun
Speaker :吳靖緯 MA0G0101
2011 3rd International Workshop on Intelligent
Systems and Applications (ISA),
On page(s): 1- 4, May 2011
2012.04.13
Outline
• Introduction
• Related work
• System design
• Simulation and result analysis
• Conclusion
2
Introduction
• In the peer-to-peer based video-on-demand systems, when the
number of users online reaches a certain scale.
• It will greatly influence the bandwidth and the speed of
response server.
• According to the usage analysis of P2P/VOD video system,
user demand video via the cache server which will generate a
cache file for the next user call.
3
Introduction
• This raises the question that if the original video files have
1000.
• So it is possible to generate 1,000 cache files which remain in
the cache server within the server's disk.
• When the buffer disk is full, it will clear the cache files
automatically and move the new file to the disk, but in the
process of deleting and writing files, the client appears the
phenomenon of the video buffer, which is blocking in the play.
4
Introduction
• Traditional method is to use load-balancing with multiple
cache servers to solve this problem.
• Because load-balancing server can’t distinguish between user
needs, resulting in more than one cache servers stored the same
cache files in the video.
• According to the thinking of "cloud computing", the paper
designed to work a cross platform of cache server cluster based
on operation stored of Hadoop synchronization which applied
in P2P/VoD.
5
Related work
• Figure 1 is a flow chart of Hadoop for data processing and
transmission of map, reduce data phase.
6
Related work
• Hadoop processing needs to be done by using 3 data transfer
process: file input to the map, map transfer to reduce, reduce
output to HDFS.
• When they run in the same time, system manage the data
segmentation, job scheduling, load balancing, fault tolerance
and network communications.
• MapReduce provides a simple interface that enables
researchers to design parallel and distributed applications
easily.
7
Related work
The data flow analysis on demand
• Map read data from the server in the way to pull the data.
• The output data of Map side is first written to local disk, it
notified JobTracker when the task is completed.
8
Related work
The integration of computing and storage
9
System design
• The whole system is divided into hardware and software.
• Hardware components include: Web system modules, file
server system modules, DNS server, storage module and the
composition of cache server cluster.
• The design of software components: the operating environment
of server will be mainly based on Windows 2003 server
operating system and FreeBSD operating systems.
10
System design
• Database store data by using a distributed way of SQLserver.
• Parts of the site use ASP.net compiler to achieve the
management of background data and the view of front static
page.
• Play system of video program uses the design of P2P
technology.
11
System design
Video services implement the process in the system which was
shown in the following.
• Internet users first access to the video website for inquiring the
video information, then send the video request.
• Web server inquire the information of video address in the
database after it obtain video request, and then send the
requests of video data to the file server.
12
System design
• File server send video data for users in a variety of ways:
Direct transmission, transmission through the cache server,
users through other Internet video services transmitted by P2P.
• The eventual realization of the video data streams transmitted
parallelly to the client, complete high-definition video
playback.
13
Simulation and result analysis
• The experimental data from access logs of our school VOD
system user.
• The average daily traffic of VOD system is up to 1000 requests
/ day, the system of the user logs records the various interactive
operations in detail, a unique user ID correspond to one visit,
but the user's interactive operation, one visit may contain
multiple user’s requests.
14
Simulation and result analysis
• The experiment compare the traditional system of load
balancing plus 3 sets of buffer server with Hadoop cluster, the
results shown in Table 1.
15
Simulation and result analysis
• Figure4 shows that when the traditional manner executes,
execution time almost linearly increases.
16
Simulation and result analysis
• Storage occupancy rate of Cache server is true as shown in
Figure 5.
17
Simulation and result analysis
• Theoretically, for a particular cluster, the trend of execution
time that is slowly grow, with the growth of input data, will not
last.
• From this experiment can be seen, when the configuration of
parameters accord with the scale of cluster, Hadoop framework
compared to the traditional way has a huge advantage for timeconsuming aspects of data processing in large.
18
Conclusion
• The simulation of P2P/VoD video systems which designed
based on "Hadoop" technology theoretically realizes unlimited
expansion and unlimited data computing power.
• This design does not require too much hardware investment,
and transform the existing equipment of network transmission,
you can achieve fast data transmission.
• And facing the possibly increases of future traffic, the
implementation does not change the system structure and
upgrading core equipment, just add the appropriate equipment
to meet the demand.
19
Conclusion
• If such P2P/VoD system which is promoted widely in the
major institutions can form a large cluster of the streaming
media.
• These cluster nodes joined together to form a huge video
network through the network between the cache server farms,
streaming media will become the main platform for online
teaching.
20
Download