Communication-Efficient Distributed Monitoring of Thresholded Counts Ram Keralapura, UC-Davis Graham Cormode, Bell Labs Jai Ramamirtham, Bell Labs Introduction Monitoring is critical to managing distributed networked systems Main challenges: Continuous Distributed Resource-constrained environments June 28, 2006 Ram Keralapura, UCDavis 2 Thresholded Counts New fundamental class of problems “Tracking counts for an event beyond a given threshold value with user-specified accuracy” Motivating scenarios: Total # of connections to a server when it exceeds the normal operational condition (ex, DDoS attacks) Total traffic to a particular destination prefix when it exceeds the pre-defined limit Tracking the total number of cars on a highway June 28, 2006 Ram Keralapura, UCDavis 3 Thresholded Counts (cont’d) Two key properties Threshold value User specified tracking accuracy 0 Nˆ T when N T (1 ) N Nˆ N when N T June 28, 2006 Ram Keralapura, UCDavis 4 System Architecture m remote sites (or monitors) and a coordinator site (or central node) Non-continuous updates cv,1 Counts can be positive, negative, or fractional Ignore network delays and losses Local thresholds at remote sites Remote Site m Remote Site 1 Coordinator Site (Central Node) Remote Site 2 Remote Site i cv,2 June 28, 2006 cv , m Ram Keralapura, UCDavis cv ,i 5 Approach Every remote monitor i, maintains a set of local thresholds: ti, j , j 0,1,2... Local count at monitor i, should always lie between two neighboring thresholds ti,f (i ) Ni ti,f (i )1 Global estimate at the central node: Nˆ m t i,f (i ) i 1 June 28, 2006 Ram Keralapura, UCDavis 6 Approach (cont’d) Maximum error in the global estimate should satisfy: m 0 (t i,f (i ) 1 ti,f (i ) ) N when N T i 1 Two methods to set local thresholds Static thresholding Adaptive thresholding June 28, 2006 Ram Keralapura, UCDavis 7 Static Thresholding Problem: For given values T and , we have to determine t i , j , j [0, ) such that, j 0 : ti, j 1 ti, j and ti,0 0 m (t i,f (i ) 1 i 1 June 28, 2006 ti,f (i ) ) m t m i,f (i ) i 1 Ram Keralapura, UCDavis when t i,f (i ) 1 T i 1 8 N Uniform T N̂ Max error = N3 T N 2T m T m N̂ N2 N1 0 Monitor-1 Monitor-2 N3 Monitor-3 Central Node Blended threshold assignment N Proportional N Max error = (1 ) N3 2 N̂ T N N̂ (1 ) 1 0 Monitor-1 N1 N2 Monitor-2 Monitor-3 N3 Central Node Static Thresholding (cont’d) Blended threshold assignment T ti , j (1 )ti , j 1 (1 ) where 0 1 m ti ,0 0 and when 1, ti ,1 1 0 uniform threshold assignment 1 proportional threshold assignment Complexity: m N O log1 1 T June 28, 2006 Ram Keralapura, UCDavis 10 Adaptive Thresholding Every monitor maintains only two threshold values: tiL and tiH Problem: For given values T and , and a threshold violation from monitor k , determine t iH for all the monitors such that, i : tiH tiL m t i 1 June 28, 2006 iH tiL m t m iL when i 1 Ram Keralapura, UCDavis t iH T i 1 11 T Slack t3H N̂ t1H t2H t1L Monitor-1 Nˆ (1 )T t3L t2L Monitor-2 Central Node Monitor-3 Basic Adaptive Algorithm N̂ t3H T t1H t3L Nˆ (1 )T t2H t1L Monitor-1 t2L Monitor-2 Monitor-3 Central Node Experimental Setup Built a simulator with m monitoring nodes and a central node Implemented all the static and adaptive algorithms Data set: Public traces from NLANR June 28, 2006 Ram Keralapura, UCDavis 13 Count Accuracy June 28, 2006 Ram Keralapura, UCDavis 14 Validating the Theoretical Model [T , , N ] [T , , N ] June 28, 2006 Ram Keralapura, UCDavis 15 Comparing Costs – Static and Adaptive Cases [T , , N ] June 28, 2006 Ram Keralapura, UCDavis 16 Related Work Top-k monitoring [Babcock et al] Heavy-hitter definition Adaptive filters for continuous queries [Olston et al] Distributed triggers [Jain et al] Distributed continuous queries but does not address the thresholded counts problem Simplified version of the thresholded counts problem Randomized algorithms with statistical guarantees Geometric approach for threshold functions [Sharfman et al] Focus is mainly on non-linear functions June 28, 2006 Ram Keralapura, UCDavis 17 Summary We defined a fundamental class of problems called “Thresholded Counts” We proposed algorithms to address the problem – static and adaptive Analyzed the complexities of these algorithms and provided proofs Using experiments, we showed the effectiveness of our algorithms June 28, 2006 Ram Keralapura, UCDavis 18 Future Work Building the monitoring system for real networks to explore the practical aspects of our framework Address scalability issues Sensor networks IP network monitoring For example, hierarchical monitoring architecture Extend for different query types with thresholded nature For example, arithmetic combinations June 28, 2006 Ram Keralapura, UCDavis 19 Thank you!! Questions?? Contact: rkeralapura@ucdavis.edu June 28, 2006 Ram Keralapura, UCDavis 20