12 Key Analytical Trends / Themes Silicon India BI Summit Hyderabad – Aug-2011 Derick Jose Vice President-DAS ( MindTree ) Context 12 Battle tested themes distilled from real life experiences Not based on Forrester or Gartner Spans Emerging markets + Advanced markets Example + Specific call to action Slide 2 Theme-1 : Information monetization Slide 3 I have terabytes of consumer data . How can I create a new revenue stream ? Real life example : GPS info product catalogue Travel behavior Segmentation Travel behavior Media asset placement and optimisation Demographic data GIS/Road data Cut thru data Village visitor data Highway traveler Travel Behavior Analysis Congestion Forecaster Government road/town planners Misc data Events data Macro economic data Understanding drivers of traffic Survey data Census data Retail / Mall planning 4 $ impacting actions from 3 info products • Action-1 : Traffic Signal optimization • Action-2 : Bill board setup • Action-3 : Catchment interception and new outlet store creation • Action-4 : Road creation/widening/ Policy changes to relieve congestion buildup SPECIFIC ACTION : Evolve SPECIALISTS to “curate” monetizable scenarios from data Theme-2 : Analytics can solve problems customers never thought existed Are 6there patterns in Slide data which can signal an issue which are not apparent to my naked eyes ? Real life example : Channel Cannibalization Detector In emerging markets, did Modern trade cannibalize General trade during summer season for beverage drinks Analytical models can reveal if there is a systematic sales cannibalization pattern SPECIFIC ACTION : Don’t wait for customer problem statements. Prime latent demand Theme-3 : Mining Digital Consumer Behavior Which are the most engaged actions a Digital consumer takes on the platform ? Slide 8 Real life example : Understanding WOM behavior DAN VINAY FREE SAMPLE LINK NEW RAZOR LAUNCH ARTICLE MELANIE SCOTT HAIR CARE TIPS ARTICLE CONFIGURE UR OWN PRODUCT LINK LINK TO RESONATING DISCUSSION KRISHNAN FREE PERFUME SAMPLE LINK BARGAIN COUPONS GOOD REVIEW OF FACE WASH NUGENT SANJOY RADHA FREE SAMPLE LINK ACTION : Think Digital Platform as a Lab where understand drivers of consumer intent Theme-4 : Unstructured Data Mining Slide 10 What hidden patterns reside in unstructured data ? Real life example : Sentiment analysis of opinions on www.yelp.com Sentiment metric Value Posts by source (For ABC Brand) 136 ABC Brand Sentiment index 2.70 ABC Brand Buzz velocity (for 2008-2009) 8.03 posts/month (+ve) Competitive Buzz index (with respect to Compet Brand) 0.56 More No. of posts entered (on yelp.com) for Compet Brand than for ABC Brand Competitive Sentiment index (with respect to Compet Brand) 2.18 Ratio of ABC Brand sentiment index to McDonald sentiment Index is greater than 1, this implies consumers have better experience with ABC Brand as compared to Compet Brand Competitive buzz velocity (For Compet Brand, for 20082009) 12.64 posts/month (+ve) For this exercise MindTree sourced the postings from yelp.com Eopinions and Twitter have more entries on ABC Brand sentiments and can be mined subsequently No. of Positive sentiments is greater than No. of Negative sentiments (Based on Yelp.com) Indicates 8.03 posts are coming up per month at yelp.com. Further slope of buzz curve is +ve. Indicates 12.64 posts for Compet Brand are coming up per month at yelp.com. Further slope of buzz curve is +ve. ACTION : Consider mining unstructured data for insights in addition to structured data Slide 11 Theme-5 : Examining value at the intersect What new questions can I ask at the intersection of online/offline behavior ? Slide 12 Real Life Example : Search & Booking SEARCH DATA - Logged & Flushed BOOKING DATA ACTION : Dig deeper at the intersect ! Theme-6 : Ensemble models How can we combine analytical models in innovative ways ? Slide 14 Real life example : Segmentation + Text mining in Fleet ind Av. Gallons pcpm Av. No of Av. Ancillary Transactions Revenue pcpm Av. No. of Retention Calls Av. Late Fees Av. Activation Rate Av. MOM Growth Population 106.4 5.5 248.8 0.7 324.1 0.7 1.03 Stable Underdogs 62.5 3.6 83.4 0.3 87.7 0.5 1.05 Miniature Laggards 77.3 4.6 90.0 0.2 118.3 0.9 1.03 Cash Cows 179.9 8.0 1098.6 0.6 2965.3 0.7 0.96 Dark Horses 122.3 6.1 1196.2 0.4 534.6 0.7 0.93 Sulking Mediocres 101.5 5.2 63.2 4.6 279.2 0.6 1.07 Front-runners 276.4 12.4 163.8 0.7 371.0 0.7 1.04 Slide 15 Ensemble Model in Telecom Churn Prediction Models Behavorial Segmentation CDR data Billing data Social Network analysis Maven list Inbound calls -Activations -Inquiries -Service calls Outbound calls -Collection calls -Campaign calls Cluster-2 High credit risk Cluster-1 Low credit risk Customer = “Scott Nugent”, Customer = “V Komrala Text mining keywords Regression model Customer = “Joe Henry”, Customer = “A Sampath Neural networks Customer = “Scott + Nugent”, churn score+= Scoring 1 Customer+= Social “Joe Henry”, churn score =0 ACTION : Combine Segmentation text mining Models media analysis Customer = “V Komrala”, churn score =0 Customer = “A Sampath”, churn score =1 Theme-7 : Verticalized Data Model Frameworks • • • • • ARTS -Retail ACCORD -Insurance SCOR -Supply Chain CDISC -Clinical Trials CMAT – Customer Analytics Which Slide 17 prefabricated industry specific data models and analytical processes can I use to jumpstart my solution ? Real life example : ACCORD based Claims model for a leading US based Insurance provider ACTION : Which industries and sub processes do I need to build knowledge in Vertical specific Slide 18 standards ? Theme-8 : Analytics penetrating unconventional areas Telemedicine + Student scoring models Slide 19 Real life example : Location based intelligent alerts Here when the district health care officer clicks on location the most statistically significant patterns from t/chi / regression test will be filtered and shown ACTION : Which non traditional, data rich processes can we apply analytics in to unlock value? Theme-9 : Need varied ways to triangulate predictors & hypothesis What are the various means to unearth causal levers which influence a business outcome ? Slide 21 4 important learning's in surfacing predictors and behavioral hypothesis HUNCHES + STATISTICS = FORCE MULTIPLIER 1. Informed Market “Intuition's + Analytics = Force multiplier 2. Intuitions are NOT bad ! Its not “either or” …”And” mindset 3. 6 ways to get “informed hunches” regarding what potentially caused a business outcome 4. Triangulate ! Real Life Example : Triangulating Surfacing policy renewal predictors 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. Recency of a claims denial Tenure of agent Overall experience of agent ( total experience ) Automated deduction or cash/cheque based ( payment mode ) No of unanswered call center calls in last 8 weeks Frequency of outbound triggers for renewal Recency of phone bound renewal trigger % Change in renewal commissions to the agent ( driven by policy ) 3 month ratio of inbound calls to outbound calls Range of channels for interaction – Agent / Internet / Mobile / call center Outbound watch list : Frequency of occurrence of specific keywords in outbound call interaction Inbound watch list : Frequency of occurrence of specific keywords in inbound call interaction Recency of last payment Policy attributes : Type of policyholder/location / type of coverage/Policy cost / Sum assured / Issue age / Policy tenure / Range of products covered ACTION : Consider spending a “immersive day in life of session” to surface real world predictors Theme-10 : Increase in demand for “data driven story tellers” • • • • • Wired for narratives Package data patterns in narrative format Ability to connect the dots From Presenting Insights to presenting ‘data dr Right brained + left brained How can I maximize the impact of data patterns to business users ? Slide 24 The “T” shaped Analytics Individual Breadth of Knowledge Depth of Expertise Right brained and left brained analytics professional Frame issues- creative Blue print – left brained Interpretation - creative ACTION : Have the right cross functional team for an analytical project Theme-11 : Reverse innovation in Analytics 2 real world examples of reverse innovation Fraud : Open source Telemedicine : Mobile analytics ACTION : Identify opportunities at the intersection of open source and industry sub processes Slide 28 Telemedicine – Patient Side VC Telemedicine – Doctor Side Application Tabs to view Case sheet, initiate VC, etc Patient Case Sheet, Review records Option to view ECG – Offline/Live Patient Vital Statistics Telemedicine – Doctor Side Live ECG and VC. ECG viewer Integrated into application Patient Side Video PIP Option – Doctor side Video Telemedicine analytics framework 1 2 3 4 5 PATIENT DIAGNOSIS REPOSITORY Pulse/Height/Weight -Temperature/Heart rate -BP/ ECG / SpO2 -- Diagnostics Rural patient data capture Satellite transmission Data reception Patient repository Info syndication biz model 11 13 9 Insurance 14 6 Doctors observations text miner Pharmaceutical Diagnostic text mining 12 Central early warning control center unit 8 Remote Diagnostics 10 Govt Hospitals 7 Field health officer Diagnostic scanner ( chi square / t tester ) Secure hosting platform for Rural Patient Diagnostics data Diagnostic scanning process Theme-12 : Feel good insights vs. the one Transformative insight ! 3 Real world examples of Transformative insights 1. Outlet analytics : Outlet recommendations resulting in crores of ruppees of additional revenue 2. People scoring model : Person who did last interview and TAT were more important predictors than salary 3. Uplift modeling : Sister brand cannibalization Summary of 12 key analytical trends & themes Information Monetization Industry specific data models – ARTS/SCOR/ACCORD Solving problems customers never thought existed Analytics penetrates unconventional industries and processes – Health care, Education Handling unstructured information Skills to triangulate predictors Digital consumer behavior analytics Data narrative story telling & “T” shaped Value at the intersect - Multi Channel – Online individuals will offline Reverse Innovation in Analytics – “ Ensemble models : Segmentation + text mining Need for transformative analytics vs “feel good” analytics Slide 35 Big Data will explode & Distilling significant few patterns is key The “Data” refinery Oil refineries & “Data” refineries • Oil fuelled the industrial revolution . Data will fuel the services revolution • Raw or unprocessed crude oil is not generally useful. Raw or unprocessed data is also not useful • Companies like Google and face book are building large “data refineries” to distill consumer insights Analytics is going to be an interesting ride for next 5 years ! Key is to Enjoy the journey !!!