11 University of Minnesota December 3rd, 2010 Nathan Hubbell, FCAS John Renze, PhD, FCAS Agenda • Travelers – Broad Overview – Analytics Career Opportunities • Predictive Modeling – Generalized Linear Models (GLM’s) 3 About Us • Offers property and casualty solutions to individuals and companies of all sizes • Second-largest commercial insurer in the U.S. Revenue of • Second-largest personal insurer $25 billion and total through the independent agency channel assets of $110 billion • No. 98 on the Fortune 500 list of largest in fiscal year 2009 U.S. companies • Representatives in every U.S. state, Canada, Ireland and the United Kingdom • A member of the Dow Jones Industrial Average – the only insurance company on the list 4 Company History The Saint Paul – 1853 Travelers – 1864 Seeing the need for a local insurance company, Alexander Wilkin and 16 fellow Saint Paul businessmen founded St. Paul Fire and Marine Insurance. J.G. Batterson and nine others formed Travelers Insurance Company for the purpose of insuring travelers against death or injury while journeying by railway or steamboat. St. Paul Travelers – 2004 The St. Paul and Travelers merged on April 1, 2004 forming The St. Paul Travelers Companies, Inc. Travelers – 2007 The company name changed to The Travelers Companies, Inc. and began trading on the New York Stock Exchange under the symbol TRV. The 137-year-old insurance icon, the red umbrella, was reinstated. 5 Our Organization • Business Insurance: Offers a broad array of property casualty, specialty insurance and related services to businesses of all sizes. • Financial, Professional & International Insurance: Includes international products and surety, crime and financial liability products that use creditbased underwriting processes. • Personal Insurance: Offers products including automobile, homeowners, renters and condominium policies to individual consumers. • Claim: Includes 13,000 trained Claim professionals in four countries and all 50 states who respond to customers 24 hours a day, seven days a week, 365 days a year. 6 What is insurance and what good does it serve? Insurance restores individuals to the financial state they were in prior to a loss (e.g. car accident; tree fell on house) For this benefit, customers pay a premium to the insurance company If a customer doesn’t have a loss, then their premium A. Helps the insurance company cover the loss another customer did have B. Keeps the insurance company functioning so it can continue providing this service If the customer does have a loss, it’s the other insureds that are helping them! 7 Analytics at Travelers – Who are we? Across the four business units, we form a large (100+) and diverse community of Ph.D., Masters and Bachelors holders in the following disciplines: mathematics statistics physics actuarial science computer science business … and more! 8 Analytics at Travelers – Who are we? What makes Travelers special? Teamwork isn’t a buzzword here – it’s real and we live it • We share information & technology openly with each other • Learning something new can be as simple as asking a local expert; they make the time, despite busy schedules • We each have a unique combination of strengths; we are valued for them and our managers help us grow into the careers we desire 9 Analytics at Travelers – What do we do? We ask and answer questions requiring sophisticated analyses • • • • • How much will it cost to insure a customer? How expensive will this claim be? How likely is it that this customer will purchase our product? How many claims adjusters will we need in two years? What new statistical methods will help move our business into the future? 10 Analytics at Travelers – What do we do? Why are these questions hard? Example: How much it costs to insure an auto customer It’s impossible to predict if someone is going to A. Get into an accident B. The type of accident (telephone pole, another vehicle) C. How bad the accident will be But if we have enough customers, we can start to group them… in group A) we expect 1 / 10 to get into an accident, costing on average $1000 in group B) etc. 11 Analytics at Travelers – What do we do? Why it matters… The more finely we group, the more accurate the price. In that case: 1. If our competitors charge more, the customer will choose us and we will grow profitably 2. If our competitors charge less, the customer will choose them and they’ll grow unprofitably Either way, we win! The trick is finding the right groups, and getting the right price for them… 12 Analytics at Travelers - Methodologies To stay ahead of our competitors, we sift through the literature searching for the most applicable techniques. Some examples: – – – – GAMs Elastic net & adaptive LASSO MARS Gradient boosted trees We pick the methods that have real-world value to our business and give us the competitive edge We do a significant amount of proprietary methodology development internally 13 Analytics at Travelers - Technology We have millions of customers. To support this volume of data and facilitate the use of the latest methodologies, we rely on cutting-edge technology Teradata data warehouses – hundreds of TB at our fingertips! Multi-processor linux servers for analytic software • • • • • SAS R Salford Systems Custom software (C++, FORTRAN) …and more! 14 Analytics at Travelers – Why us? We are at the cutting-edge We grow our people; we give them the training and opportunities they need to move their careers ahead We are a team – it’s our combined focus that makes Travelers the leader of the industry that we are today. 15 Predictive Modeling 16 Predictive Modeling • Using Generalized Linear Models (GLM’s) and other statistical methods to predict exposure to loss at detailed level. – Recently, property-casualty insurance companies have embraced predictive modeling as a strategic tool for competing in the marketplace. • Originally introduced as a method of increasing precision for personal auto insurance pricing • Extended to homeowners and commercial lines • Today, it is applied in areas such as marketing, underwriting, pricing, and claims management 17 How do you differentiate your rates? Automobile – – – – – – – – – – – – – Age Gender Marital status # vehicles # drivers Home policy Driving record Years Licensed Limits Prior Insurance Student/Nonstudent Location (Garage/driven) Annual Mileage Homeowners – – – – – – – – – – – Age of home # occupants Primary / Secondary Prior claim experience Construction Protection Roof Type Location (CAT?) Amount of Insurance Auto Policy Responsibility of owner 18 How do you set the prices? • Old Way: – – – – Group data by class class relativities Sort data into age groups age relativities Group data by territory territorial relativities Rating factor = class x age x territory 19 What is wrong with the old way? Example Size of car Class Claim Count Large cars Medium cars Small cars • Age Exposures Frequency Relativity (number of (probability of cars) a claim) 15 400 0.038 1 110 1700 0.065 1.725 143 900 0.159 4.237 Class Old Young Claim Count Exposures Frequency Relativity (number of (probability of cars) a claim) 80 188 1800 1200 0.044 0.157 1 3.325 • A young driver of a small car would be charged • 4.237 x 3.325 = 14.088 times what an old driver of a large car would be charged. • Important point: Some of this effect is double-counted, as size of car is correlated with age. • (numbers are illustrative only) 20 Possible Solution: Multiple Linear Regression • E[Y] = a0 + a1X1 + …+ anXn • Two Key Assumptions: – Y is Normally distributed random variable. – Variance of Y is constant (homoscedastic). 21 Problems With Regression – Part I • Y is NOT normally distributed. – Number of claims is discrete – Claim sizes are skewed to the right – Probability of an event is in [0,1] 22 Problems With Regression – Part II • Variance of Y is NOT constant. – Varies by expected loss. • High frequency losses have less variance. • High severity losses have more variance. – Varies by exposure. 23 Problems With Regression – Part III 2.5 2 1.5 1 0.5 88 85 82 79 76 73 70 67 64 61 58 55 52 49 46 43 40 37 34 31 28 25 22 19 0 16 Loss Relativity 3.5 Nonlinear relationship between X’s and Y’s. 3 Example: Age of driver Driver Age (numbers are illustrative only) 24 Generalized Linear Models (GLMs) • E[Y] = g-1(a0 + a1X1 + …+ anXn) • Fewer restrictions: – Non-linear relationships. • g(x) = x Additive model • g(x) = exp(x) Multiplicative model • g(x) = 1 / (1+exp(x)) Logistic model 25 Generalized Linear Models (GLMs) • E[Y] = g-1(a0 + a1X1 + …+ anXn) • Fewer restrictions: – Y can be from any exponential family of distributions. • Poisson (number of claims) • Binomial (probability of renewing) • Gamma (loss severity) 26 Generalized Linear Models (GLMs) • E[Y] = g-1(a0 + a1X1 + …+ anXn) • Fewer restrictions: – Variance depends on the expected mean. • Normal: Variance is constant. • Poisson: Variance equals mean. • Gamma: Variance equals mean squared. 27 Generalized Linear Models (GLMs) • What’s the catch? – No closed form solution. – Use maximum likelihood estimation. • Iterative process. • Make a guess and linearize. • Solve the linear problem to find next guess. – Increased computational complexity. 28 Key Steps in Model Building – Part I • What are you modeling? • How will you implement? • Gather and clean internal data. • Link other sources: internal and external. • Create training and validation sets 29 Key Steps in Model Building – Part II • Build Model on Training Set • Univariate analysis – statistically test each predictor • Build multivariate models using significant predictors • Select best multivariate predictive model Keep most relevant predictors Principle of parsimony – simplicity is good 30 Key Steps in Model Building – Part III • Measure predictive power on validation set • Was training set over fit? • Peer review • Implement • Post-implementation monitoring • Adjust with new knowledge 31