PAKISTAN-US SCIENCE AND TECHNOLOGY COOPERATIVE PROGRAM 2007 JOINT PROJECT PROPOSAL Please provide the required information briefly. Condensed statements are preferred. However, the boxes in Section 3 may be expanded as necessary. The maximum limit of the project document is 10 pages. Please do not forget to attach the one-page curricula vitae of all the participating scientists. Project title: Measurement and Analysis for the Global-grid and PERN’s Internet end-to-end Performance (MAGPIE) Field and subfield of proposed project Internet Communications: Network Performance Measurement and Analysis Duration: 3 Years Starting date: January 2008 PAKISTAN SIDE Principal Investigator: Prof. Dr. Arshad Ali NUST Institution of Information Technology Date Signature Address 166A, Street 9 Chaklala Scheme III Rawalpindi Pakistan 46000 Telephone 92-519280433 Fax 92-519280782 E-mail arshad.ali@niit.edu.pk Head of Institution/Vice-Chancellor/Rector Lt. General (R) Syed Shujaat Hussain, Rector, National University of Sciences and Technology Tamiz-Ud-Din Road, PO BOX 297 Rawalpindi, Pakistan Email: rector@nust.edu.pk Phone: +92-51-9271575, +92-51-9271576 FAX : +92-51-9271577 US SIDE Date Principal Investigator: Dr. Les Cottrell Stanford University/SLAC Signature Address Stanford University/Stanford Linear Accelerator Center 2575 Sand Hill Road, MS 96 Menlo Park, CA 94025 Telephone Fax E-mail (650) (650) cottrell@slac.stanford.edu 926-2523 926-3329 Head of Institution/Vice-Chancellor/Rector c/o Meredith O’Connor, Managing Senior Contract and Grants Officer Stanford University Office of Sponsored Research 320 Panama Street Stanford, CA 94305 e-mail: meredith.oconnor@stanford.edu 1 2. Project Summary Title of the Project: Measurement and Analysis for the Global-grid and PERN’s Internet end-to-end Performance (MAGPIE) Pakistan Side U.S Side Co-Director: Prof. Dr. Arshad Ali Co-Director: Dr. Johnathan Dorfan Institution: NUST Institution of Information Institution: Stanford University/Stanford Linear Technology (NIIT) Accelerator Center (SLAC) The growth of the Internet has fostered intellectual and economic growth of both developed and developing countries [i]. Fundamental to this is the operation and maintenance of the underlying Internet technologies to keep the network functioning and performant. Specifically, advancements in network connectivity have improved scientists’ ability to collaborate and helped accelerate the rate of scientific discovery, see for example [ii]. Modern science relies on the global Internet to create large and physically distributed scientific work environments. Thus scientists have become more dependent on networking, and network problems have increasing significance. The pursuit of modern science is thus dependent upon cyber-infrastructure designed to support the efficient diagnosis of network issues along end-to-end paths traversing multiple networks. We outline a proposal to enable Pakistan’s advanced education and research facilities to better understand and utilize their network connections and for Pakistan’s Education and Research Network (PERN) to become a world leading National Research and Education Network (NREN). Viewing case studies such as [iii] it is apparent that by monitoring the network, even for Pakistan alone, one can identify the fragility of computer networks; the problematic routing of traffic e.g. between sites in Islamabad and Rawalpindi going via London; and despite well provisioned backbones, last mile problems causing heavy congestion and hence back end-user experience of the Internet. Understanding network performance can also pose challenges for distributed collaborators. Scientific communities usually span many physical organizations. The distributed nature of these communities makes it difficult for campus network engineers or end-users to effectively find or fix endto-end performance problems. Campus networks are usually interconnected via regional networks and national backbones and may span many domains. Each of these domains usually maintains explicit control over who can access information and infrastructure and what performance data is shared, if any. Often, considerable time and effort must be expended to convince physical organizations to begin testing and it can still be a difficult task to isolate the issue, and even more time can be spent evaluating different solutions. It is therefore difficult to perform tuning and troubleshooting in distributed environments due to the need to access information from many different sources and the human coordination required. Hence, it is critical that any solution to the problem of network performance monitoring and diagnostics must be scalable and universal to enable investigators to easily gather network performance data. More importantly, the framework for this should be capable of conducting automatic analysis, diagnosis and reporting of network problems for different types of users (e.g. end-users, backbone administrators etc.). SLAC, Internet2, ESnet, GEANT, University of Delaware, Georgia Tech and RNP (Brazil) have begun a global project to deploy such monitoring infrastructures under the perfSONAR framework [iv]. We propose a collaborative effort between SLAC/US and NIIT/Pakistan to help further develop the various tools and components that will facilitate perfSONAR and apply these tools to the new PERN network including major end-sites. We intend to help build core services and infrastructure to facilitate the continued global support of perfSONAR and enable successful scientific collaborations between Pakistani scientists and their peers worldwide. We will also tutor and supervise students at NIIT and offer select graduates the opportunity to work in the US, where they will gain knowledge of research and development in the domain of network performance monitoring as well as valuable experiences of working at a world class leading research institute, contacts and best practices in many fields, that they will take back to Pakistan. 2 3. Project Description 3(a) Background and Rationale To efficiently manage any network one needs to be able to measure it. This includes measuring and understanding current and long-term performance, identifying and reporting problems both end-to-end and within the network itself, and providing forecasts of both long and near term performance. Without such information network managers lack planning information, users and network administrators do not know what to expect; problems are reported by users, and network administrators spend all their time fire-fighting; locations and sizes of bottlenecks and their behavior are unknown; and users and applications are unable to dynamically optimize their network usage. The extent of ensuring that the network ‘works’ is demonstrated by the fact that network researchers and end-users have always had a need for network measurement, monitoring, and analysis tools. This need dates back to the early days of the Internet when the “ping” command was written. While tools like the ubiquitous “ping” and “traceroute” commands will continue to be the front-line tools used by most network engineers, more sophisticated tools are needed to directly measure one way delays, throughput and bandwidth, utilization, non layer 3 paths, etc. Network monitoring tools and how they are used should answer both basic connectivity type questions and more complex “what’s wrong with this path” type questions. Previous efforts to develop these new tools can be sorted into two major classes: (1) individual tools that can be installed on Internet hosts, (2) measurement tools deployed in a single test domain. The first class of projects deals with the development of individual measurement tools. These refer to two types of measurement tools: passive and active. Passive tools (e.g., tcpdump [v], OCx-mon [vi], Netflow [vii], and Web100 [viii]) make use of application-generated data packets that flow through the network. A copy of the data packet is passed to the tool for some type of processing. Each tool typically generates its own unique output format. As with active tools (described below), the output of the tool is usually handed over to a tool expert for interpretation and analysis. Active tools (e.g., Iperf [ ix], Thrulay [x], NDT [xi], NPAD [xii]) use a client/server model to generate test packets. The major advantage of these tools is that individuals and small communities can download and install the tools on an as needed basis. This ensures that tools are used in a wide variety of operating environments, much larger than the developers could have achieved. A major disadvantage is that these tools may not be already installed when problems are reported. Another deterrent is that remote administrators may be reluctant to install them when they are not sure how the tool works or what its reporting capabilities are. Finally, tools that require manual action at both ends usually require coordination between remote administrators and an implicit trust relationship, reducing ease of use. The second class of projects deploy network-only measurement or monitoring tools inside some test domain. In addition to individual tools, larger research projects (such as AMP [xiii], Surveyor [xiv], IEPM-BW [xv], PingER [xvi], RIPE’s Test Project [xvii]) have been established to generate large-scale measurement and monitoring programs. These projects tend to use both active and passive test tools to perform their measurement and monitoring functions. A major disadvantage to such systems developed in such projects is that they have lacked community involvement in the design and are ultimately limited by the ability of the research team to support such an infrastructure. As a direct result, such complete systems are hard for non-project members to extend as they have lacked community involvement in their design. Experience has shown that such projects (see for example AMP and Surveyor) fade away as funding shifts and people move on to new projects, and those subsequent projects rarely build on what has come before. Therefore, a single, monolithic, global measurement system is unrealistic. Federation and neutral information exchange standards-based formats and a modular mix-and-match architecture are necessary. This preserves local autonomy while promoting global utility. Moreover, reports and unified access to the measurement results are needed to provide useful analysis and diagnostics in order to identify, diagnose and help resolve network performance and connectivity problems to ensure that users can view the network as a reliable, predictable utility. 3 3(b) Problem Statement The next generation of the PERN backbone, PERN2 [xviii], will soon be deployed to interconnect over 60 universities across Pakistan and to the rest of the world. It will offer immense possibilities for the various network users across diverse disciplines/endeavors such as medical research [xix] [xx, particle physics [xxi], agro-informatics [xxii], media-communications [xxiii] [xxiv] and distance learning [xxv]. Figure 1 shows the historical connectivity of the existing international link between SLAC and NIIT. As the figure shows, network users typically experience very poor performance (RTTs > 250ms and very variable and heavy losses) and network fragility (100% loss) that makes collaborative technologies such as video conferencing technically unfeasible at least until late 2006 when performance improved. As PERN2 is deployed, it is important to not only understand but also to monitor the network infrastructure and the end-to-end performance to ensure that the return on investment of such a large project is understood and that network users are able to utilize such infrastructures. Figure 1: PingER RTT (blue) in msec. and loss (green) % since May ‘05 thru May ‘07 measured from SLAC to NIIT To understand the problems of network performance and monitoring, and their effect on network users, the research community and the local industry, it is beneficial to note how these groups typically deal with network performance measurement and monitoring issues these days. Consider two scholars from different universities (but connected by the PERN network) arranging a video conference to facilitate their collaboration. Hypothetically, when they setup the call, one scholar finds that the video stream is poor. The scholar then contacts his/her local network administrator seeking help in resolving the matter. After some preliminary tests the network administrator suspects that the reason for the poor video quality is insufficient available bandwidth. For this reason he/she plans to use the Iperf tool to test the links. However, in order to use Iperf there is a requirement of a server component. Consequently, after hours or possibly days of telephone tag and coordination with network administrators within each administrative domain, the tests are conducted and a single link identified behaving as the bottleneck. Later the network administrators corroborate to fix the problem and scholars are informed that the link would be available after the repair has been made. This is all with the assumption that the network administrators within each administrative domain forming the link were available and willing to support the tests. Thus in such scenarios it is safe to say that days of effort from multiple network administrators is required to first identify the problem, and later fix it. Reducing the hours and number of participants required to find and report these types of problems is a major goal of this proposal. Not only is the time to repair reduced, but the frustration level of the scientist is also reduced. Knowing where a problem is, and when it is likely to be repaired, can help scientists develop contingency plans. In addition to making manual trouble reporting and resolution easier, a network measurement infrastructure can assist network operators with the day-to-day monitoring tasks that can automatically find and report problems. Waiting for trouble reports from users before noticing a problem is another serious issue with today’s complex network. While a typical operations staff quickly notices and fixes a major outage or a problem with a well-used application, a niche connection or application may fail for weeks or months before a trouble report makes it into the Network Operational Center (NOC). See for example [xxvi] where dramatically reduced throughputs went un-noticed and un-reported for more than two months, yet when reported took only hours to fix. In the mean time, the end-user is frustrated and unable to 4 effectively use the network to perform a specific task. At a higher level, the impact of this work on society will mirror that of the Internet itself; by making networks easier to operate and use, scientists and end-users alike will have an improved experience. This translates to scientific and commercial benefits from the ability to more effectively use existing applications or take advantage of new ones [xxvii]. Today, numerous scientific disciplines have developed globally distributed work environments that depend on the network. Some examples include: The international High Energy and Nuclear Physics (HENP) community is nearing completion of the Large Hadron Collider (LHC) [xxviii] facility located at CERN in Switzerland. The LHC will allow thousands of scientists, engineers, and university professors access to petabytes of particle physics data. Data will be stored at multiple national repositories and processed at dozens to hundreds of sites around the globe. Both NUST and the National Centre for Physics (NCP) at Quaid e Azam University and others are building computer clusters to be used for simulations and analysis for LHC and others. Without highly reliable and high performance network connections to CERN, Europe and the US, these clusters will have limited global impact as they will be unable to replicate and transfer the large amounts of data required for simulation and analysis. Cisco (Dr. Horst Dumke and Dr. Masum Hassan) in collaboration with NIIT (Dr. Arshad Ali) has been aggressively pursuing the establishment of a ‘Health e-Grid’ [is there a REF]. Due to the fact that sufficient trained medical professionals are not available in Pakistan (even in urban areas), providing health facilities to the population (of 160 million with a growth rate of 2.09%) in general is a challenging task. The intention behind the Health e-Grid project is to setup a tele-medicine infra-structure so as to provide basic medical facilities to the population in remote areas. A step towards providing affordable health care to Pakistani public is to make use of Grid technologies to provide the facility of telemedicine. Given the critical nature of such Grid applications, the setup, management and maintenance of PERN – the network over which the applications are intended to execute – is of vital importance. The Virtual University [xxix] of Pakistan delivers education through a judicious combination of broadcast television and the Internet. The lectures and miscellaneous literature is made available over the PERN in the form of streaming media and documents/archives from the Virtual University’s servers. Here too the predictable behavior of the underlying network is desirable which can only be achieved if it is suitably monitored, the network is reliable and anomalies are either identified in time to limit major disruptions.. Organizations such as Pakistan Agricultural Research Council, Agriculture University Faisalabad, Agriculture University, Peshawar have been collaborating under the AgroInformatics [xxx] initiative and have achieved significant results. Such initiatives have only been possible due to the availability of means to conduct collaborative research. Existence of networks such as PERN encourage such collaborations by providing suitable means for communications among the collaboration. On a more technical level, the rapid detection and correction of network performance problems and infrastructure faults represent unresolved problems facing all these scientific communities. It is difficult to overstate the amount of effort that large-scale distributed projects such as these spend on network debugging and performance tuning. That this effort is duplicated across projects represents a significant waste of time and energy. Moreover, many scientists are forced to become part-time network engineers, which is an inefficient use of their time and effort. Therefore, a network monitoring infrastructure should focus on the following: Implement techniques to share network performance data across domains to enable a broader snapshot of network performance with the end-to-end view. Enable and accommodate new types of analysis that are continually being developed. Sometimes, these are done using completely new tools that generate information on completely 5 new metrics. Often, they represent new ways of correlating existing diagnostic data types. Be easily adaptable to changes in network technology to support new devices and provide useful diagnostic data. This may include ease of extensibility to new data sources – new tools are produced frequently and the system must not be biased toward any particular tool. Secure, policy-based access: the ability to provide protection for who can access what and under what conditions at a local level is critical for a single system framework to be accepted. Community-driven: the architecture of the system must be completely open (and open source) to encourage the reintegration of extensions and the piecemeal upgrading of modular components. Another aspect of network performance is that Network Operational Centers (NOCs) typically are focused on monitoring the performance of their own network domains. However, real Internet users will have to traverse many different domains in order to do their work. As such, the typical end-to-end performance experienced by real network users, is often nowhere near that which NOCs would assume. Example limits in the performance of end-to-end experiences include: Badly tuned network components (including mismatched speed or duplex issues, stack limitations or even TCP algorithmic limitations). Routing problems; including mis-configured routers and switches; non-optimal routes; poor backup routes; flapping etc; all causing unpredictable performance. Overloaded networks which have been badly provisioned to support the traffic that upon it. Denial of Service attacks; maintenance; power problems; firewall limits; port blocking; congestion; improper host configurations; application problems. Real-life examples of all of these can be found in the SLAC Case Studies referenced above. 3(c) Prior Experience / Capability The perfSONAR consortium initiated in partnership by Internet2 [xxxi], the European Union-funded GÉANT2 project [xxxii], and DoE Energy Sciences Network (ESnet) [ xxxiii], has been working, in recent years, to provide an open set of protocols and a reference implementation to address cross-domain sharing of network measurements and metrics. SLAC recently joined the consortium - providing their experience and skills in deploying the PingER and IEPM-BW network monitoring infrastructures; their expertise in network data mining and analysis; as well as their close involvement with HENP [xxxiv] and the LHC [xxxv]. The perfSONAR project is focused on building open source, flexible, modular, and extensible performance middleware to greatly simplify the process of gathering and sharing network performance information across multiple administrative domains to simplify the process of debugging network issues and to provide performance data to network-sensitive applications without redundant measurements. Early prototypes of the perfSONAR framework are deployed in Europe and in North and South America and serve as a central component of the network monitoring functionality for the global network dedicated to serving the LHC project. The rapid adoption of the early prototype of the perfSONAR system by various NRENs and the LHC project is a testament to the need that it fills. The perfSONAR project originated with the desire to unify the efforts of Internet2’s End-to-End Performance Initiative Performance Environment System (E2E piPES) project [xxxvi] and the EU’s GÉANT2 Joint Research Activity on monitoring and measurement (GN2-JRA1) [xxxvii], ultimately involving parallel efforts in the U.S. DoE’s ESnet. The desired result of this project, a common performance middleware, is referred to as the perfSONAR framework. This project seeks to enable consistent access to network performance data across a large fraction of the world’s advanced research and education networks. The effort is in its second year of funding in the EU and receives a small amount of seed money from Internet2 and ESnet in the United States. Some components of this framework exist in prototype form, but much remains to be done. This framework specifically defines a federated network measurement infrastructure that can be used by a large number of research scientists and scientific collaborations. The perfSONAR project is 6 still at a very early phase, with much work to be done yet, but it is beginning to have a significant impact already and to generate interoperable, independent development of additional services, which is a testament to the level of community support. perfSONAR defines a series of mid-level services that can be used to create a measurement infrastructure. It does not mandate the use of a specific tool or set of tools to be used to generate or capture measurement data. Rather, it defines a standard XML-based storage schema that allows multiple tools to generate and consume data. This increases the usability and reach of these tools, making them more applicable to a larger number of potential users. perfSONAR does not require realtime intervention to deploy tools or to coordinate the data collection process. Instead, a standard authentication mechanism allows interested parties (e.g., end-users, scientists, dedicated support staff, network operators) to pre-register themselves in a domain-managed database. Once registered, these users can present their credentials to a remote domain’s federated authorization service and, if approved (according to locally-determined policies, and home institution defined identity), they can begin troubleshooting a problem. In summary, the perfSONAR framework will ultimately: Be easily extensible to new types of measurement and analysis tools. Tool developers will be able to focus on how the tool operates, not on how to deploy the tool widely enough for it to become useful. Scale to global size. Multi-national science communities will be able to focus on science, not just run networks. Contain flexible policy for sharing across administrative domains. Local domains will retain control over the policies that define how specific resources can be accessed and what data will be shared with whom according to uniform, organization-agreed-upon policies, not ad hoc decisions based on personal relationships. Use common data exchange formats. The OGF [xxxviii] NM-WG [xxxix] schemata will be used to ensure that different tools and archives can exchange data effectively. Use common discovery mechanisms. Users will be able to find measurements points and archival data to assist in troubleshooting an application’s behavior. Use generic service components for tracking resource utilization and user authorization to acquire those resources for performance testing. Piggyback on existing federated trust authentication infrastructures. We believe the perfSONAR framework provides the necessary services and infrastructure to facilitate a rich and extensible monitoring system for PERN2 from which further network research and opportunities can arise. Various funded projects in the past have failed to reach critical mass in the realm of network monitoring due to their lack of open source and open community traits. Furthermore, projects, such as AMP, MonALISA [xl], IEPM-BW, PingER and Surveyor, have attempted to bundle data generation, data sharing, and analysis with visualization in an attempt to do everything. However, they have been hampered by the relative lack of standards for sharing performance data, limiting the breadth of data (both in type and deployment) available to be consumed. Many members of the perfSONAR development team have been directly involved with creating or using these tools and the lessons learned during those developments have been incorporated into the perfSONAR framework. SLAC has been a leader in network monitoring and performance tuning for many years and has over fourteen years of computer networking experience in the wide area with experiences ranging from the deployment of the first Internet in China [xli], capturing the Internet2 Land Speed Record twice [xlii], winning the Supercomputing Bandwidth Challenge three years in succession (2003, 2004 and 2005 [xliii]), to the monitoring of the digital divide between developed and developing countries [ xliv]. SLAC also has extensive experience developing and deploying network infrastructures such as the PingER suite and the IEPM-BW suite – both of which are still collecting data on over 125 countries from over 40 sites throughout the world. 7 With the benefits of decentralized network monitoring provided by the perfSONAR framework, SLAC has shifted its focus from the deployment and collection of metrics to the analysis and presentation of network performance. An important aspect to the analysis of network performance data is the understanding of the baseline performances and the effects of seasonal effects such as the increase of network traffic during work days and hours. This is illustrated in the figure below for throughput measurements from SLAC to the University of Florida, where it is seen (blue line) that at the weekend measured performance is higher and then oscillates (day down, up at nights) during the week. The attempt to use an exponentially weighted moving average (light blue smooth line) to predict the performance lags the data and, following the weekend, the differences are outside the acceptable range (below the orange line) for a sufficiently long duration (the black line is the number of events falling outside the threshold) that an event is detected. Figure 2: Detection of anomalous events using the plateau algorithm [REF] from SLAC to U. Florida The net result is that there is a higher probability for detecting false events at the end of a weekend or the start of a weekday. This is illustrated in the plot below where we see the number of events triggered by the simple non-seasonal Kolmogorov-Smirnov [xlv] algorithm is weighted towards 21:00 hours Pacific time (which is 9:00am Pakistan time, or the start of the work day when utilization increases and congestion starts to get bad). Figure 3: Frequency of detecting events as a function of time of day using the Kolmogorov-Smirnov algorithm on available bandwidth between SLAC and NIIT. Thus for paths with seasonal variations we need to investigate more sophisticated techniques including Holt-Winters [xlvi] and ARMA/ARIMA [xlvii] models. The authors of this proposal have built prototypes using Holt-Winters [xlviii] and investigated ARMA/ARIMA [xlix] for forecasting and event detection. An example of the use is shown in Figure 4. Given the noisiness of the data (dots) it is first smoothed (blue line), Holt-Winters is then used to forecast the performance (red line of bandwidth capacity) and provide expected deviations (black lines). If the smoothed data goes outside the deviations for sufficient number of points then an event is signaled. It is seen that the Holt-Winters forecast tracks the seasonality of the data, and detects the sudden and persistent drop in performance on Friday evening/Saturday morning July 31, 2004. 8 Figure 4: Time series of bandwidth capacity measured between SLAC and U. Indiana with forecasting by HoltWinters, and the detection of an event. The above is a simple example of the types of analysis that network engineers and end-users would analyze to diagnose a network problem. As the perfSONAR framework enables the collection and storage of results in a simple and consistent format, it offers new opportunities that will lead to a broad wealth of network performance data – data that will invariably take a network expert an impossibly long time to analyze and document. Furthermore, with such abundance of data, it is likely that simple problems will take a while to finally be detected, or worse still, fail to be detected at all. Therefore, data mining techniques and new visualizations should be incorporated into the data collection and analysis to highlight the important features and make it useful and not onerous. In addition, the proliferation of data would mean that there could be tens of thousands of time series data of multiple metrics between multiple pairs of hosts; and to determine the scope and locations of performance problems would be nigh on impossible. This naturally leads to automated systems that will help analyze, detect, present and report on potential problems – soon after it happens or in some cases even before the problem occurs through the analysis and forecasting of certain metrics such as bit error rates between two devices [REF?]. There is therefore a need to reduce the data volume into manageable and easy to understand formats (tables, graphs etc) whereby network engineers and operational managers can quickly determine the performance characteristics of their network. Using example techniques presented above for event detection, this proposal will also focus on the automated detection, alerting and diagnosis of such network performance problems on a global scale. This will be done using heuristic models of network fault detection [l] combined with the cross correlation of events in time series for a given metric for a given path; also other related time series for different metrics, different target hosts and probe sources; plus information from intermediate points along the network path(s) looking for shared experiences will be used to eliminate or pin-point common possible causes. The ultimate goal of such complicated analysis will help increase the accuracy of problems, reduce the frequency of false positives and provide more diagnostic information by eliminating possibilities and helping pin-point the possible cause(s). Alerts will be generated in the form of emails with attached diagnostic information. To reduce the annoyance factor of multiple alerts we will also use techniques (such as triggering an event state when one threshold is passed and resetting it when recovery passes another threshold (hysteresis)), and by not un-necessarily repeating alerts for the subsequent re-detection of the same event. Over the last two and a half years, SLAC has a proven track record in supervising and training Pakistani NIIT students both at NIIT and at SLAC to be productive, acquire knowledge and share it and their expertise to benefit Pakistan/PERN network activities. These students have come from classes that contain 50:50 ratio of male:female. In the past there have three women students who worked with MAGGIE and have recently graduated. Numerous interesting and innovative projects have spawned from this relationship (previously funded under the MAGGIE project [li]) which has developed several measurement, analysis tools and international case studies available in the public domain. Examples include: PingER management; IEPM-BW deployment at about a dozen sites; the ABwE measurement probe [lii]; yearly ICFA SCIC Network Monitoring Reports [liii] giving quantitative, non-subjective measures of the Digital Divide from an Internet point of view; detailed case studies [liv] of the network performance within and between the region and the rest of the world for: Pakistan, South Asia, Latin 9 America, Palestine, and Sub-Saharan Africa; and numerous prestigious invited presentations around the world for example the American Physical Society [lv], ESnet/Washington [lvi], Internet2/Washington [lvii], ICTP/Italy [lviii], Mumbai/India [lix], Pakistan [lx] and ICTP/Romania [lxi]; SLAC has also hosted visits by NIIT staff and faculty enabling faculty members to gain valuable experience and promote the research culture in their classrooms. SLAC staff have also visited Pakistan, learnt in more detail of the needs and possibilities, and shared their expertise and case studies directly with the students and faculty both of NIIT, other universities in Islamabad/Rawalpindi, and with PERN, HEC and related institutions. This has resulted in useful observations and recommendations that have been made to PERN and HEC among others. 3(d) Scope and Objectives Given the need of network monitoring and the analysis of network data in order to make it useful, the scope of this proposal is threefold: To apply the perfSONAR framework to PERN, this will involve amending and or creating new services, from the existing perfSONARservices, which are tailored towards the specific requirements of the Pakistani research network – i.e. the software. Develop new and innovative presentation, reporting, network problem detection and alerting techniques based upon the perfSONAR framework using the experience gained from previous projects and make them production quality for use on PERN2 – i.e. the analysis. To foster the partnership between NIIT/Pakistan and the SLAC/U.S. to improve the quality and relevance of Pakistani higher education in science and technical fields. This project also aims to develop new forecasting tools and identify network anomalies as a means by which network problem detection can be extended. Building on analysis techniques used at SLAC, the forecasting will use advanced statistical models such as Holt-Winters and ARMA/ARIMA. To assist in divide and conquer techniques for diagnosis and pin-pointing of network performance issues, we will also work with PERN to install one or more nodes in their network. A major purpose of these nodes will be to make on-demand measurements (e.g. ping, thrulay) with sufficient Authentication, Authorization and Auditing. These will also assist in dividing possible problem sources up to assist in pin-pointing the cause. We will ensure the software is configurable to meet different sites’ configurations. In addition we will work closely with PERN to get secure access to backbone router/switch information (e.g. utilization, errors, configuration information such as location etc.). Bringing PERN into the project will help the whole Pakistani educational and research community join and share experiences with a growing community of worldwide NRENs. In addition, across these NRENs, there will be common monitoring and assisted diagnosis to assist users in effectively using their network connections. This will be critical for users such as scientists and researchers using the NCP [lxii] and NUST [lxiii] clusters. The deployment of a perfSONAR installation for PERN2 will be staged. Initial end-site installations will be at SLAC and NIIT. These will help to create documentation drafts, and installation procedures while improving the deployability of the framework. Following this we shall install at endsites where we have existing strong partnerships. These include the National Centre for Physics (NCP) at Quaid e Azam University (QAU) and COMSATS where there are strong scientific programs, concentrations of senior scientists and Grid activities - we believe that application at these sites will offer the best use-cases for the advanced technologies and techniques outlined in the next sections. We are working to extend our partnerships to include sites in Karachi (e.g. SSUET and/or FAST-NU) and elsewhere in Pakistan with similar strengths. Typically the machine to host the monitoring software at a site will be provided and administered by the site with sufficient power and connectivity. Using open source software and operating systems (Linux), we will reduce the cost of such machines whilst also increasing their security. Persuading endsites to contribute their own machines will also reduce the site-specific administration and security/deployment issues while improving the “buy-in” partnership of the end-site personnel. The expertise of SLAC’s, and more recently NIIT’s, involvement in global network monitoring 10 will be applied to demonstrate the middleware components/services of the perfSONAR framework. An important part of this proposal builds on SLAC’s and NIITs continued commitment to help develop the intellect, skills and ability of Pakistani students. It does this by training NIIT graduate students to contribute to research and development of computer networks to benefit the Pakistani research and education community. Selected students will spend time at SLAC, which is highly regarded as one of the foremost research facilities in the world; and have access to high performance computing and networking facilities not currently available in Pakistan. They will also have access to advanced high-speed networks in the US and Europe, and will work with acknowledged experts in the network monitoring field. Thereby they will be developing highly skilled human resources for Pakistan by conducting applied research and developing network monitoring and end-to-end network performance measurement tools/infrastructure. This will greatly increase their skills in very relevant areas for Pakistan and not only will they contribute to the development of new technologies but also share the experience and will be in a position to contribute positively to the Pakistani society and economy. As a tentative schedule, in the first year we intend to identify, extend and customize, package and deploy and validate the necessary performance services across Pakistan. In the second year we intend to apply and evaluate mechanisms that will be productive in detecting and pin-pointing network performance problems, of which in the third year we will ensure that the most useful are hardened into production quality services that will be used across PERN and potentially worldwide. 3(e) Methods The primary objective of this proposal is to deploy, enhance and customize perfSONAR middleware services for an Autonomous System (PERN2) in Pakistan. In addition to this we intend to analyze and evaluate algorithms for network weather forecasting, event detection and diagnoses. It will deal with the deployment, extension of the services and integration of new tools into the perfSONAR framework. The main focus of this proposal is to utilize the expertise of SLAC’s history in network monitoring in order to help guide students at NIIT to understand and contribute effectively to the global perfSONAR framework, and also towards building a useful network monitoring framework for Pakistan. Collectively, a series of perfSONAR modular services will be developed and deployed to provide the performance middleware upon which new tools and analysis service will monitor important devices along the PERN infrastructure and provide end-to-end performance analysis by being deployed at select academic institutions around Pakistan: Measurement: Measurement Point (MP) services will be developed to provide access to active and passive network monitoring tools available on PERN. They will create and/or publish monitoring information related to active and passive measurements. Here, we intend to investigate suitable network monitoring tools required to gather network performance statistics for analysis and determine their usefulness and feasibility. Existing tools such as ping, traceroute, iperf, SNMP [lxiv] utilization (Router/Switch utilization), BWCTL [lxv], Thrulay and OWAMP [lxvi] will be deployed to provide the core performance metrics that will allow problem detection and analysis. Storage: Measurement Archive (MA) services will be deployed to allow the storage of MP data that will provide historical information regarding the performance of the network. This is an important service that will provide the necessary data to determine trends for forecasting and anomaly detection. Topology Service: It is a non-trivial task to engineer the network without having an understanding of the dynamic nature way in which network paths diverge. Snapshots of how the complex algorithms enable the Internet to remain interconnected results in the network topology. This service will provide the information regarding the topology of the network. This topology will be used extensively in visualization of the weather maps which will help network engineers to analyze the network efficiently. It will also provide the data from which problems in network paths can be easily detected and investigated. 11 Decentralized discovery: Maintaining the list of services manually or having static labels for them is not suitable for a dynamic environment. Thus it is reasonable not only to have a lookup service, but also a decentralized lookup service to determine the access points the services. MP’s, MA’s and topology services will register with. The lookup services will provide DNS [lxvii] like mechanisms to discover the (virtual) location of the various perfSONAR services. Security: It is apparent that sharing information regarding Autonomous Systems with users without suitable Authentication, Authorization or even Accounting (AAA) is not a reasonable concept. Thus there is a need to provide suitable AAA services to the infrastructure. This would allow for authorized access to trusted user groups only with a provision for logging the events. Building upon the perfSONAR concepts, it will utilize existing efforts made by Shibboleth [lxviii] and eduGAIN [lxix] to provide identity management using existing SAML [lxx] standards. The next major phase is to provide reporting and diagnostic services such as aggregation and summarizations of the relevant performance measurement information, and to visualize it in a manner that is in accordance with the best practices. This requires that MA’s have been deployed and accessible such that performance metrics can be obtained. In this regard, we intend to develop and integrate: Anomaly Detection and Diagnosis Services - Currently, diagnosis of network problems requires much administrative manpower and time to determine the cause of the problem(s). The need is to quickly and automatically identify and diagnose network problems and dramatically decrease the period that users experience degraded performance. This aims to automatically gather relevant information, and use heuristical analysis to enable location (e.g. network or host issues), identification (narrowing down of the cause), and verification (double-checking against maintenance tickets etc.) of events. Alert Service - notify relevant network administrators in the event of an anomaly. Some basic functionality is already in place to provide alerts within the IEPM-BW framework when performance changes occur. This project will both provide a more generic method to the automated analysis of network performance data, and also investigate and implement advanced algorithms for such analysis and reporting Network Performance Forecasting - The prediction of network performance will greatly aid large scale data replication applications. However, such techniques are currently not very well understood. This project will investigate and report upon techniques to provide reliable forecasts of network performance based upon past performance. Reporting – The provision of manager level reports on the various network performances is necessary to help network managers make important choices in the identification of potential network problems and divert necessary resources to those parts of the network. Similarly, high level governmental reports should be provided to offer guidance on the International health of Pakistani’s Internet. NIIT students under guidance from SLAC and NIIT staff will design, develop, test, integrate, install, document and support network measurement, analysis and presentation tools, plus infrastructure management tools. In addition selected NIIT graduate and PhD students will spend a year at SLAC working closely with SLAC’s experts in a world class research center, and take courses at Stanford University. In addition it will provide opportunities for the NIIT faculty to visit the high quality research environment at SLAC and acquire the latest trends in network and related technologies. Finally it will enable SLAC staff to visit Pakistan, learn in more detail of the needs, and share their expertise, and vice-versa. 3(f) International Cooperation The division of effort between Pakistan and the U.S. is optimal. The co-supervision of NIIT students and their engagement in important network monitoring projects with an international dimension adds a dramatic widening of their experience. Choosing students at NIIT to be involved in this project and then further selecting just the best to spend time at SLAC is an enormous incentive. The students visiting SLAC will work with leading network experts at a world class facility, experience 12 tuition from world renowned professors and should become leaders in their field. This way, not only will the student and faculty contribute to the development of new technologies but they also share their experiences and are in a position to contribute positively to the Pakistani society and economy. As such all funding for this proposal is aimed towards supporting student activities and contribution and is provisioned with approximately one-third / two-thirds split of the stipend costs of the students. The members of this proposal in both the US and Pakistan offer their expertise and time gratis as a commitment to their dedication towards this project. 3(g) Relevance Even after embracing information technology for more than over a decade, Pakistan lacks in the availability of sufficient skilled human resource. A number of reasons may be attributed to this lack of quality human resource. A major attribute to this situation may be the lack of opportunity and resources. We believe that the the project MAGPIE addresses such issues (and achieves the funding program’s objectives) in the following manner: This project will contribute to the nation’s development of human resource as the students and faculty working under this project shall be given the opportunity to gain knowledge of research and development in the domain of network performance monitoring as well as valuable experiences and best practices by working with renowned institutes such as SLAC. During a period of three years it is estimated that the direct beneficiaries of the project will include two PhDs, two Masters, eight undergraduate students and four faculty/staff members. However the project will have a multiplier effect on approximately sixteen faculty members and several research associates, undergraduate and post-graduate students as a consequence of interaction with the direct beneficiaries. The local industry such as Micronet Broadband Pvt. Limited [lxxi] shall also benefit from the experiences since they are in close collaboration with NIIT regarding three development projects (maintenance of network-route consistency, dealing with DoS attacks on the service network and open software for network setup and management) to fulfill their local requirements. This project will also enhance the capabilities of PERN to function as a network with over 60 universities throughout the country. The collaborators of this proposal believe that just connecting the universities is not sufficient; planning maintenance and continual improvement of service levels is also equally important, as is the ability to pro-actively respond to problems. With the team’s rich expertise in Network Monitoring and Measurement of the digital divide, the analyzed data would be highly valuable to the decision makers in the government. As a consequence of this project the participants will not only gain knowledge of network performance measurement and analysis, but will also have the experience of working on highperformance networks. This trained human resource will then be of immense importance to diverse organizations in the local industry such as Internet service providers, telecomm operators as well as the software industry. 3(h) Results Statement Design and deploy a perfSONAR compliant infrastructure on PERN2 that will offer very valuable and varied network performance data across both the backbone and selected end-sites across Pakistan. Using this data, design innovative and production quality mechanisms to present, analysis, diagnose and pin-point network performance problems that will be beneficial to both network engineers and Internet end users. Pakistani students and faculty who participate will gain experience in high performance networks and innovative research in networks and their monitoring; and be enabled to contribute to Pakistani education and growth. 13 3(i) Additional Benefits Building a large international community invested in deploying the perfSONAR measurement framework is as important as developing the actual software. The perfSONAR framework is being developed in an open community by consensus, with all community-developed code given a BSD-style open-source license. There is thus no impediment to contributions from Pakistan or Pakistan taking benefit from the developments from this project. Academic institutes participating in this project will have access to a large scale monitoring network in Pakistan which will provide them practical hands-on experience. Students and researcher from these institutes shall be able to incorporate their own research ideas into this infrastructure. This infrastructure will make it easier for educators to bring the network into the classroom. Early prototype components of the perfSONAR system have been featured in courses at the University of Delaware [lxxii]. Daily use of the network for routine teaching activities depends on a network infrastructure that can meet the needs of the teaching application. A perfSONAR-enabled network will allow educators and support staff to easily verify and demonstrate that the network is operating properly. Members of the perfSONAR consortium are already engaged with resolving performance issues on production networks. By continuing to work with operations staff, local administrators and faculty, the team will promote the tools and the advantages of self service over waiting for third parties to run tests. For example Georgia Tech. and SLAC are deploying campus wide network monitoring infrastructure. These infrastructures will be perfSONAR enabled, allowing faculty and central support staff to view performance and schedule tests within and beyond campus. This proposal will help build bridges across the world, extend education to both male and female students, reduce misunderstanding and help reduce radical, fundamentalist indoctrination of hate. 14 3(j) Budget (in equivalent US $) (For Year 1) Pakistan Side Requested Approved Items requested US Side Requested Approved Equipment (Attach Proforma invoice) Four deskside computers & monitors for students $3450 Consumables Services (Attach Proforma invoice) Travel Domestic travel (2 trips/year) $3450 Support personnel Four graduate student for 12 months ($900/mo from SLAC) $51568 Miscellaneous Tuition TOTAL US $ $58468 15 Budget (in equivalent US $) (For Year 2, if requested) Pakistan Side Requested Approved Items requested US Side Requested Approved Equipment (Attach Proforma invoice) Compute server for analysis and data storage $1785 Consumables Services (Attach Proforma invoice) Travel Domestic travel (2 trips/year) 3571 Support personnel Four graduate students for 12 months ($932/mo from SLAC) $53373 Miscellaneous TOTAL US $ $58729 16 Budget (in equivalent US $) (For Year 3, if requested) Pakistan Side Requested Approved Items requested US Side Requested Approved Equipment (Attach Proforma invoice) Consumables Services (Attach Proforma invoice) Travel Domestic travel (2 trips/year) $3696 Support personnel Four graduate students for 12 months ($964/mo from SLAC) $55241 Miscellaneous TOTAL US $ $58937 17 Budget (in equivalent US $) (TOTAL REQUESTED for all years of project) Pakistan Side Requested Approved Items requested US Side Requested Approved Equipment Four deskside computers & monitors for students Compute server for analysis and data storage $3450 $1785 Consumables Services Travel Domestic travel (2 trips/year) $10,716 Support personnel Four graduate students for 12 months each year for 3 years $170,943 Miscellaneous TOTAL US $ $186,595 OTHER FUNDING. Please list the source and amount of any other funds received or applied for to support this project: 18 3(k) Executive Matters / Budget Requirements i. Timetable (Please indicate each major step in project evolution referring to the time schedule) 2008 ID Task Name Q1 1 Deployment of the perfSONAR framework for PERN2 2010 Measurement – Measurement Point (MP) services will be developed to provide access to active and passive network monitoring tools available on PERN2. 288d 3 Storage – Measurement Archive (MA) services will be deployed to allow the storage of MP data that will provide historical information regarding the performance of the network. 432d 4 Topology – The topology service will be deployed to provide the information regarding the topology of the network. 288d 5 Discovery – The lookup service shall be deployed to provide DNS like mechanisms to discover the (virtual) location of the various perfSONAR services. 288d 6 Security - Suitable services of Authentication, Authorization and Accounting (AAA) shall be deployed for the management of the perfSONAR monitoring infrastructure. 288d 7 Integration and Testing 144d Algorithms for Network Performance Monitoring, Event Detection and Diagnosis, and extension of the perfSONAR framework Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 764d 2 8 2009 Duration 686d 9 Anomaly Detection and Diagnosis Services 288d 10 Alert Service 288d 11 Network Performance Forecasting 432d 12 Reporting Service 288d Figure 5 Time line of project tasks ii. Request justification (Please provide detailed justification for each budget item requested) Support Personnel: The twelve graduate students who will be visiting SLAC for one year each, will be funded from the SLAC side at $900 per month. Miscellaneous Expenses: Over the three years we will need a workstation and screen for each of four students currently at SLAC. Since workstations have a rough lifetime of three years we are asking to procure four workstations each with a flat screen in the first year. We estimate the cost of each workstation and and screen at about $750. In the second year we will procure a server to host the data and analysis tasks at a cost of about $1500. All costs will be escalated at 3.5% for years two and three. The Stanford University fringe benefit rate of 3.8% for graduate students has been applied to salary costs, SLAC’s indirect costs will be charged at 15% to all items, following the proposal instructions, although SLAC’s normal indirect rate is 39%. Dr. Les Cottrell, head of the Network Computing Group, will direct the project at SLAC (10% effort / no cost to the project). Other researchers (35% FTE/no cost to project) will provide coordination, guidance, training, quality control and integration. SLAC is a department of Stanford University, under a Management and Operating contract with the US Department of Energy. All SLAC expenditures under this grant are incurred under the cost principles 19 governing SLAC as outlined in the DoE/SLAC contract. iii. Travel details (Please indicate number, duration, timing, and justification for each visit proposed for each side) Research Students visit / stay at SLAC (US): A major part of this collaborative effort is to provide students with high quality research environment. Nine students are to be sent to SLAC over the three year duration for research visits of one year’s duration for each student. During this visit students will have a chance to grow in an environment which has a proven track record of basic and applied research. During this visit students will also be able to take courses at Stanford University. NIIT Faculty visit/stay at SLAC for collaborative meetings: This collaborative project requires frequent meetings between NIIT and SLAC mentors and students. Most of the communication is through e-mails, phone calls and video conferences which are very useful but not enough. It is required that faculty of NIIT working on this project visit SLAC for a period of 2 weeks each year so that faculty members are able to guide students in a better fashion. SLAC supervisors visit to NIIT: Some students working with this research group will have a chance to visit SLAC as internees. For guidance of other students it is required that a SLAC mentor visits NIIT once a year for a period of two weeks each year, so that even if a student does not visit SLAC there is a strong interaction with existing students and new students can be attracted and introduced. Conference Travel & per diem: We are anticipating two publications per year in this project. We have decided to publish one of those in international conferences to establish the usefulness and importance of our research. We have also decided to publish one of the research papers in a domestic conference so that other Pakistani institutes and students also benefit from our work. Local Pakistan travel: This category involves travel by Network Assistant or research student for the installation/upgrade/ deployment of the server(s). This is an ongoing activity which shall continue for the whole duration of project. SLAC requests funding to cover the cost of two domestic (U.S.) trips/year to attend conferences, present findings, meet with and share information with collaborators in perfSONAR. iv. Additional notes (Please provide additional appropriate information, if any) 3(l) Curricula Vitae (Please attach brief CVs—1 or 2 pages each—for all Pakistani and US participants. Please do not include lengthy publications lists or copies of publications.) Co-PI (Pakistan): Prof. Dr Arshad Ali is a senior IT professional and academicians. He completed his PhD studies in 1992 from University of Pittsburgh, USA in the Design, Simulation and Fabrication of Optical Waveguides for Optical Computers. He has over 26 national and international publications to his credit. Dr Arshad has been awarded Pakistan Academy of Sciences and COMSTECH gold medal for IT research. Currently he is working as Dean, NUST Institute of Information Technology (NIIT), Rawalpindi. Dr Arshad enjoys good rapport with IT industry which helped in establishment of CISCO Local Academy program, attracted donation of state of the art computer Lab from INTEL, USA and a teradata Lab from NCR Pakistan for NUST Institute of IT (NIIT). Dr Arshad initiated research collaboration with Center for European Nuclear Research (CERN), Switzerland and earned the Associate Institute status of CMS-CERN for National University of Sciences & Technology (NUST). Dr Harvey Newman from CALTECH USA (29 faculty members include Nobel Laureates) has committed 40,000 US dollars per year grant to strengthen this research collaboration 20 initiative. The research collaboration with Stanford University (SLAC Project), USA has also helped in attracting 100,000 US dollars funding from US State department. Dr Arshad formed a joint consortium with University of the West of England UK, Beijing Institute of Technology China, University of Savoie France and NUST Pakistan which attracted 400,000 Euro research funding from European Union. He is actively pursuing research in Distributed computing and Grid Technologies in collaboration with University of Melbourne Australia and University of Portsmouth UK. Prof. Dr Arshad is also contributing in launching of open source resource center in collaboration with Pakistan Software Export Board, Comtec (Communication Technologies) Japan and NUST, Pakistan. Co-PI (US): Dr. R. Les. Cottrell received his BSc (Hons) in 1962 and his PhD in Nuclear Physics in 1967 from Manchester University, England. He then joined the Stanford Linear Accelerator Center (SLAC) as a research physicist in High Energy Physics, focusing on real-time data-acquisition and analysis in the Nobel 1990 prize winning group that discovered the quark. In 1972/3 he spent a year’s leave of absence as a visiting scientist at CERN in Geneva, Switzerland. In 1979/80 he spent another year’s leave of absence this time as a visiting scientist at the IBM UK Laboratories at Hursley, England. While at IBM he developed a dynamic graphical cursor that was granted United States Patent 4,688,181. On returning from IBM UK, in 1980, he became the manager of SLAC’s computer networking. In 1982 he became the Assistant Director of SLAC’s Computer Services (SCS). From 1995-1997 he was the Acting Director of SCS. He is currently the Assistant Director of SCS and leads SLAC’s computer networking, telecommunications, and remote collaboration support groups. He is also SLAC’s representative on the ESnet Site Coordinating Committee (ESCC), the chairman of the International Committee on Future Accelerators (ICFA) Standing Committee on Inter-regional Connectivity (SCIC) Network Monitoring Working Group. He has served on many advisory groups such as for the Superconducting Super Collider (SSC), the American Institute of Physics, IHEP/Beijing, Internet2, and FNAL, as well as technical committees such as chairing the ESCC Network Monitoring Working Group, and being a member of the Global Grid Foundation’s Network Monitoring Working Group. He is also the PI of the DoE funded Internet End-to-end Performance Monitoring (IEPM) project which has attracted funding of almost $2M since 1997. He was the leader of the effort that in 1994 established the first permanent Internet connection to mainland China. He was co-PI of teams that won the Internet2 Land Speed record twice, an achievement listed in the Guinness Book of Records (2004). He was a co-leader of teams from Caltech and SLAC and other institutes that won the prestigious SuperComputing Bandwidth Challenge three years in succession. He is the SLAC PI of the DoE funded Terapaths project, the NIIT/MAGGIE project, and the collaboration with ICTP, Trieste. 4. References i M. Desai, S. Fukuda-Parr, C. Johansson, F. Sagasti, "Measuring the Technology Achievement of Nations and the Capacity to Participate in the Network Age", Journal of Human Development, Vol. 3, No. 1 ii H. Newman, “ICFA SCIC Report for High Energy Physics” available at http://monalisa.cern.ch:8080/Slides/SCIC2006/ICFASCICReportV17_020706.doc iii “Network Problem Cases”, available at http://www.slac.stanford.edu/grp/scs/net/case/html/ iv perfSONAR, available at http://www.perfsonar.net/ v LBL, “tcpdump”, available at http://www.tcpdump.org/. vi NLANR, “OCXMON”, available at http://moat.nlanr.net/Papers/ieee-comms-nlanr-html/node4.html. vii Cisco, “Netflow”, available at http://www.cisco.com/warp/public/732/Tech/nmp/netflow/index.shtml. viii “Web100”, available at http://www.web100.org/. 21 NLANR, “Iperf”, available at http://dast.nlanr.net/Projects/Iperf/, Internet2 “Thrulay”, available at http://sourceforge.net/projects/thrulay/. xi Internet2, “Network Diagnostic Tool”, available at http://e2epi.internet2.edu/ndt/ xii Internet2, “Network Path and Application Diagnosis”, available at http://web100.internet2.edu:8200/. xiii NLANR, “Introduction to the NLANR AMP Project for HPC sites”, available at http://amp.nlanr.net/AMP/ xiv Advanced Network Services, “Surveyor”, available at http://www.advanced.org/surveyor/. xv IEPM: “Internet End-to-end Performance Monitoring - Bandwidth to the World (IEPM-BW) project”, available at http://www-iepm.slac.stanford.edu/bw/. xvi W. Matthews and R. L. Cottrell “The PingER Project: Active Internet Performance Monitoring for the HENP Community”, IEEE Communications Magazine, May 2000. xvii RIPE, “Test Traffic Measurement Service”, available at http://www.ripe.net/test-traffic/index.html xviii HEC, "Request for Proposal (RFP) for Establishment of 10GbE Metro Area Network for PERN2 in Islamabad/Rawalpindi Region", Higher Education Commission, H-9 Islamabad, document HEC/RFP/PERN2/564-2006, available at http://www.hec.gov.pk/new/eReforms/download/RFP%20for%20Metro%20Network%20for%20PERN% 20in%20Isb.pdf xix Aga Khan University, “Clinical Epidemiology Unit”, available at http://www.aku.edu/research/ceuonprj.shtml xx Pakistan Medical Research Council, “Research Activities”, available at http://www.pmrc.org.pk/researchactivities.htm. xxi CERN, “Pakistani Physics”, available at http://cerncourier.com/main/article/39/7/21. xxii National University of Computer and Emerging Studies, “2nd Workshop on Agro-Informatics”, available at http://www.nu.edu.pk/NewsDetails.aspx?nid=78 xxiii Associated Press of Pakistan, “Media University in Pakistan – a significant leap into the future”, available at http://www.app.com.pk/en/index.php?option=com_content&task=view&id=5631&Itemid=2 xxiv National College of Arts, Lahore, “Research and Publication Centre”, available at http://www.nca.edu.pk/rpc/main.htmlhttp://www.nca.edu.pk/rpc/main.html. xxv “Virtual Iniversity of Pakistan”, available at http://www.vu.edu.pk/.. xxvi SLAC, “Throughput from SLAC to Caltech reduced by a factor of 5”, available at http://www.slac.stanford.edu/grp/scs/net/case/caltech/ xxvii See for example: Wachovia, “Wachovia Benefits from Improved Inter-Office Collaboration with Cisco TelePresence”, available at http://newsroom.cisco.com/dlls/2007/ts_060407.html. xxviii CERN, “Large Hadron Collider”, available at http://cern.ch/lhc/. xxix “Virtual University”, Pakistan, http://www.vu.edu.pk/ xxx “Agro-informatics”, http://www.agroit.com/reports.html xxxi “Internet2”, available at http://www.internet2.edu/. xxxii “GEANT2”, available at http://www.geant2.net/. xxxiii DoE, “Energy Sciences Network”, available at http://www.es.net/. xxxiv SLAC, “BaBar Public Information & Visitor Pages”, available at http://www.slac.stanford.edu/BFROOT/. xxxv SLAC/ATLAS, “The US ATLAS Western Tier 2 Computing Center”, available at http://wt2.slac.stanford.edu/. xxxvi Internet2, “E2E Performance Initiative Performance Environment System Architecture”, available at http://e2epi.internet2.edu/e2epipes/ ix x 22 GEANT2, “Joint Research Activity on monitoring and measurement (GN2-JRA1)”, available at http://wiki.perfsonar.net/jra1-wiki/index.php/JRA1_Main. xxxviii “Open Grid Forum”, available at http://www.ogf.org/. xxxix “Network Monitoring Working Group”, available at http://nmwg.internet2.edu/. xl Caltech, “MONitoring Agents using a Large Integrated Services Architecture”, available at http://monalisa.caltech.edu/ xli R. L. Cottrell, C. Granieri, L. Fan, R. Xu, Y. Karita, “Networking with China”, CHEP94, also SLACPUB-6468, April 1994. xlii SLAC, “Internet2 Land Speed Records”, available at http://www-iepm.slac.stanford.edu/lsr2/. xliii “Global Lambdas for Particle Physics Analysis”, available at http://wwwiepm.slac.stanford.edu/monitoring/bulk/sc2005/hiperf.html xliv Edited by R. L. Cottrell, “ICFA SCIC Network Monitoring Report”, available at http://www.slac.stanford.edu/xorg/icfa/icfa-net-paper-jan07/ xlv Brockwell P. and Davis R, “Introduction to Time Series and Forecasting”, Springer New York, 1996 xlvi NIST, “e-handbook of statistics”, available at http://www.itl.nist.gov/div898/handbook. xlvii Box, George and Jenkins, Gwilym (1970) Time series analysis: Forecasting and control, San Francisco: Holden-Day xlviii R. L.Cottrell, C. Logg, M. Chhaparia, M. Grigoriev, F. Haro, F. Nazir, M. Sandford, “Evaluation of Techniques to Detect Significant Network Performance Problems using End-t-end Active Network measurements”, NOMS, April 2006. xlix Saqib, F., "Network Weather Forecasting for Management and Analysis of Global Grid and Internet end-to-end Performance" (BS thesis, National University of Sciences and Technology, Pakistan, 2006). l A. Iqbal, Y. T.Li, R. L Cottrell, “Event Diagnosis using Correlation”, ICIMP 07. li SLAC/NIIT, “Measurement and Analysis of the Global Grid and Internet End-to-end performance (MAGGIE)”, available at http://maggie.niit.edu.pk/newwebsite/index.html lii J. Navratil, R. L. Cottrell “ABwE: A Practical Aooroach to Available Bandwidth Estimation”, PAM 2003, also available at http://moat.nlanr.net/PAM2003/PAM2003papers/3781.pdf liii ICFA/SCIC, “ICFA-SCIC Working Group on Monitoring”, available at http://www.slac.stanford.edu/xorg/icfa/scic-netmon/. liv SLAC, “PingER Case Studies”, available at https://confluence.slac.stanford.edu/display/IEPM/PingER lv R. L. Cottrell, S. Khan, “Quantitative Measurements of the Digital Divide”, available at http://www.slac.stanford.edu/grp/scs/net/talk07/aps-apr07.ppt. lvi R. L. Cottrell, “Lessons Learned Monitoring”, available at http://www.slac.stanford.edu/grp/scs/net/talk07/lessons.ppt. lvii R. L. Cottrell, S, Khan, “Quantifyintg the Need for Improved Network Performance for South Asia”, available at http://www.slac.stanford.edu/grp/scs/net/talk07/sasia-case-apr07.ppt. lviii R. L. Cottrell, , A. Rehmatullah, J. Williams and A. Mehdi, “Quanifying the Digital Divide from an Internet Point of View”, available at http://www.slac.stanford.edu/grp/scs/net/talk06/digital-divideoct06.ppt. lix A. Rehmatullah, R. L. Cottrell and J. Williams, “Quantifying the Digital Divide: A Scientific Overview of the Connectivity of South Asian and African Coountries”, available at http://www.slac.stanford.edu/grp/scs/net/papers/chep06/paper-final.pdf. lx R. L. Cottrell, U. Kalim, S. Khan, and A. Mehdi, “Visualizing the Digital Divide from an Internet Point of View and Some Challenges”, available at http://www.slac.stanford.edu/grp/scs/net/talk07/comsatsmar07.ppt. xxxvii 23 R. L. Cottrell, A. Rehmatullah, J. Williams and A. Mehdi, “Internet Monitoring and Results for the Digital Divide”, available at http://www.slac.stanford.edu/grp/scs/net/talk06/digital-divide-romaniaoct06.ppt lxii NCP, “Network Centre for Physics”, available at http://www.ncp.edu.pk/. lxiii NIIT, “The Centre for High Performance Scientific Computing at NIIT”, available at http://www.ncp.edu.pk/ lxiv Wikipedia, “Simple Network Management Protocol”, available at http://en.wikipedia.org/wiki/Simple_Network_Management_Protocol. lxv Internet2, “Bandwidth Test Controller (BWCTL)”, available at http://e2epi.internet2.edu/bwctl/ lxvi Internet2, “One-way Ping (OWAMP)”, available at http://e2epi.internet2.edu/owamp/. lxvii “DNS.NET homepage”, available at http://www.dns.net/. lxviii Internet2, “Shibboleth”, available at http://shibboleth.internet2.edu/. lxix GEANT2, “eduGAIN2 interconnecting established AA infrastructures”, available at http://www.fsknet.dk/text/pdf/04_eduGAIN_GNOMIS-Presentation.pdf lxx Wikipedia, “Security Access Markup Language”, available at http://en.wikipedia.org/wiki/SAML. lxxi Micronet, “Welcome to Micronet Broadband (Pvt.) Ltd.”, available at http://www.dsl.net.pk/. lxxii At the University of Delaware’s Computer Science department, the Distributed Computing Course (CISC879) has covered efficient exchange of network performance information for distributed computing extensively, using perfSONAR as a case study. lxi 24