Multiple Linear Regression: Cloud Seeding By: Laila Rozie Vimal Introduction What is Cloud Seeding? – Treatment of individual clouds or storm systems to achieve an increase in rainfall. – Treatment = massive amount of Silver iodide (1001000g per cloud) – The experiment took place in the Florida. – 24 days were considered suitable for seeding on the basis of measured suitability criterion S-Ne. – optimal days for seeding are those: When seedability is large natural rainfall early in the day is small. Objective of the Experiment Analyze the data to see how rainfall is related to the explanatory variables and determine the effectiveness of seeding. Multiple Linear Regression It attempts to model the relationship between two or more explanatory variables, and a response variable by fitting a linear equation to observed data. What are explanatory variable? – they are the independent variables in the experiment used to explain the response variable. What is the response variable? – They are the dependent variables. Explanatory variables Seeding: A factor indicating whether seeding action occurred: So yes and no Time: number of days after the first day of experiment Cloud cover: percent cloud cover in that experimental area. Measure using a radar. Prewetness: total rainfall an hour before seeding echo motion: whether radar echo was moving or stationary SNe: Suitability criteria Response Variable The amount of rain measured in cubic meters * 10^7 Multiple Correlation Coefficient The correlation between the rainfall and all the explanatory variables is given by the value of R². the set of predictor variables X1, X2, ... is used to explain variability of the criterion variable Y Assumptions All data are drawn from populations following normal distribution All data are homoskedastic meaning constant variance. All explanatory variables are measured without error. Avoidance of multicolinearily- so when the explanatory variable start to show some correlation among each other. So it is important to have the correlation between each pair of explanatory variables approximates to zero. co linearity is a problem because it can make the regression difficult or misleading to interpret. Multiple Linear Regression Model yi = 0 + 1xi1 + 2xi2 + ... pxip + εi for i = 1,2, ... n. Analysis of variance The ANOVA calculations for the multiple linear regression is identical except the degrees of freedom are adjusted to reflect the number of explanatory variables in the model. There is also an F-test used, which does not indicate which of the parameters x is not equal to zero, but only that atleast one of them is linearly related to the response variable. Homework Define the explanatory variable and the response variable? (List what they in terms of this experiment) Explain what each term (variable) means in the MLR model.