WEEK 8: REGRESSIONS AND
REPORTING RESULTS
Predictive Modeling
DATA 5400
WEEK 8 HOMEWORK DUE SUNDAY
2 OCTOBER 25, 2021 DATA SCIENCE
• U N D E R S TA N D I N G Y O U R R E S U LT S I N T E R P R E T I N
G & R E P O R T I N G Y O U R F I N D I N G S
LINEAR REGRESSION:
3 OCTOBER 25, 2021 DATA SCIENCE
• U N D E R S TA N D I N G Y O U R R E S U LT S I N T E R P R E T I N
G & R E P O R T I N G Y O U R F I N D I N G S
MULTIPLE REGRESSION:
4 OCTOBER 25, 2021 DATA SCIENCE
• Using Cars dataset, create linear model in R
CREATING A LINEAR MODEL
5 OCTOBER 25, 2021 DATA SCIENCE
PLOT CARS DATASET
6 OCTOBER 25, 2021 DATA SCIENCE
• The lm() command is used to fit linear models
• Use lm() function to create a Single Linear Regression model on
the cars dataset
• Reminder ?lm to get to the help documentation on the function
• Run the model to see the information and extract that as part of
results interpretation
L M () FUNCTION
7 OCTOBER 25, 2021 DATA SCIENCE
• Output shows coefficients and intercept, can manually check that
calculations compare to the linear regression formula
INTERPRET RESULTS
8 OCTOBER 25, 2021 DATA SCIENCE
ADD FITTED LINE TO SCATTER PLOT
9 OCTOBER 25, 2021 DEPARTMENT NAME HERE
ULEARN DATASET | L M WITH MULTIPLE FEATURES
10 OCTOBER 25, 2021 DATA SCIENCE
CREATE MODEL ON HEIGHT AND AGE
11 OCTOBER 25, 2021 DATA SCIENCE
REPORTING STATISTICS FROM YOUR MODEL
12 OCTOBER 25, 2021 DATA SCIENCE
1. Variable Coefficients
2. P values
3. Model Statistics
CREATE MODEL WITH MULTIPLE RESULTS
13 OCTOBER 25, 2021 DATA SCIENCE
SUPPORTING YOUR HYPOTHESIS
14 OCTOBER 25, 2021 DATA SCIENCE
R E P O RT I N G R E G R E S S I O N R E S U LT S
15 OCTOBER 25, 2021 DATA SCIENCE
REPORTING YOUR RESULTS
16 OCTOBER 25, 2021 DATA SCIENCE
FORMATTING AND REFERENCING A TABLE
17 OCTOBER 25, 2021 DATA SCIENCE
• What factors lead to increases in percentage of alumni who
donate?
• Research shows students who are more satisfied with their contact
with teachers are more likely to graduate, one might suspect
smaller classes + low student-faculty ratio -> satisfied graduates
• Look at the giving model data for 48 Universities to determine
which factors are those that lead to higher giving outcomes
PROBLEM STATEMENT
18 OCTOBER 25, 2021 DATA SCIENCE
1. Use the methods of the descriptive statistics summarize the data.
2. Write a directional hypothesis to explain the relationship between graduation rate and alumni giving.
3. Test your hypothesis using a simple linear regression model that can be used to predict the alumni giving
rate, given the graduation rate. Report and discuss your findings using appropriate model statistics.
4. Write directional hypotheses to explain the relationship between the other variables in the database and
alumni giving.
5. Develop a multiple linear regression model that could be used to predict the alumni giving rate using
Graduate Rate, % of Classes Under 20, and Student/Faculty Ratio as independent variables. Report and
discuss your findings using appropriate model statistics.
6. What conclusions and recommendations can you derive from your analysis?
7. Which universities are achieving a substantially higher alumni giving rate than would be expected, given
their Graduate Rate, % of Classes Under 20, and Student/Faculty Ratio?
8. Discuss any limitations of your current model and make recommendations for future research (e.g. ways
to improve the model, inclusion of other independent variables to improve the explanatory strength of
your model.)
MANAGERIAL REPORT
19 OCTOBER 25, 2021 DEPARTMENT NAME HERE
20 OCTOBER 25, 2021 DATA SCIENCE
• Variables/Features
• University
• State
• Graduation Rate
• % of Class Under 20
• Student-Faculty Ratio
• Alumni Giving Rate
MINI PROJECT
21 OCTOBER 25, 2021 DATA SCIENCE
ALUMNI GIVING DATA SUMMARY
22 OCTOBER 25, 2021 DATA SCIENCE
• We believe that students at colleges with smaller classes are more
likely to give back to the college and therefore the college will
have a higher giving rate
EXAMPLE HYPOTHESIS
23 OCTOBER 25, 2021 DATA SCIENCE
CREATE MODEL AND VIEW INITIAL RESULTS
24 OCTOBER 25, 2021 DATA SCIENCE
25 OCTOBER 25, 2021 DATA SCIENCE
26 OCTOBER 25, 2021 DATA SCIENCE
Don't use plagiarized sources. Get Your Custom Essay on
Factors lead to increases in percentage of alumni who donate
Just from $13/Page