Predictive Analytics Low-Risk Strategies for High-Impact Projects Course Outline Introduction Core Concepts Beyond Traditional Statistics o ‘Assumptions’ of Traditional Statistics o Shift your thinking… Behaviors of Interest Goal of Modeling Modeling Human Behavior Components of Mathematical Models Uses of Formulas Winning at the ‘game’ we call business Attributes of a game Project Success Survey Predictive Analytics ROI Survey Predictive Analytics Business Goals Analytic Goals Why Predictive Analytics? Lab 1: Introduction Types of Models Response Risk Attrition Activation Cross-Sell and Up-Sell Profile Analysis Segmentation Net Present Value Lifetime Value Why Predictive Analytics? Low-Risk / High-ROI Project Design Low-Risk / High-ROI Projects Phased Development Cycle Positive Impact Behavior Modeling Negative Impact Behavior Modeling Conflict Resolution Modeling Ranking Across the Continuum Dimensionality Enhancement Refining Precision Forecasting Lab 2: Opportunity Conceptualization The CRISP-DM Process Model Environment Development Data Sandbox CRISP Development o Business Understanding o Relationship Solution Space o Determine Modeling Objectives o Data Understanding o Data Preparation o Modeling A Sampling of Commercial Data Mining Software Products o Validation o Implementation Predictive Analytics is Analysis… not Engineering Business Understanding Project Team Performance Metrics o Determine Business Objectives o Conversion: Objectives to Metrics o Handling Multiple Metrics o Lift and Gains Chart Interpretation o Custom Performance Charts o Enhancing Performance with Threshold Evaluation o Calculating the Current Baseline: Uplift Analysis Lab 3: Performance Metrics Modeling Objectives o The Case for Classification o Prioritize the Dependent Variable o Precision Requirements o Training for what “should be done…” not what “was done” o Confirming Compatibility o Defining Modeling Objectives o Resource Availability Experimental Design o How Much Data is Needed to Develop a Model? o Training Data for Classification o Training Data for Prediction o How Many Variables? o Purpose of Experimental Design o Experimental Design: Statistics vs. Predictive Analytics o Data Sets Used o Type of Data Distribution Lab 4: Experimental Design & Data Sandbox Construction Data Understanding Data Set Determination Availability Requirements Planning Data Quality Issues o Data Errors o Outliers o Missing Data Data Types: Behavioral Characteristics o Demographic Data o Behavioral Data o Psychographic Data Data Types: Mathematical Characteristics o Qualitative Variables Categorical Data Nominal Data o Quantitative Variables Ordinal Data Interval Data Continuous Data Lab 5: Data Understanding Data Preparation Data Representation Expectations o Natural Values o Binning o Bin Boundary Determination o Open Ended Ranges o Collapsed Sets o 1ofN Representations o Thermometer Representation o Bipolar Representation o Fuzzy Boundaries o Multiple Boundary Strategies o Controlling Error Data Transformation Expectations o Conversion to Linear o Converting the Shape of the Distribution o Ratios o Roll-ups o Domain Specific Transformations Data Resource Consumption Considerations Data Extraction for Replicability Lab 6: Data Preparation Modeling Matching Techniques to the Project Goals o Classification Modeling Techniques o Forecasting Modeling Techniques Variable Selection Candidate Model Evaluation Lab 7: Model Development Validation & Evaluation Lab 8: Model Evaluation & Validation Deployment End User Interface Model Run Cycle Model Maintenance Summary and Next Steps Formal Project Assessment o Business Understanding o Data Understanding o Report of Findings & Recommendations Resources