Midterm Quiz 2 - Verified MM Learners
Midterm Quiz 2 due Apr 12, 2021 1500 HKT Completed
90 Minute Time Limit
Instructions
Work alone. Do not collaborate with or copy from anyone else.
Work the problems in any order
...
Midterm Quiz 2 - Verified MM Learners
Midterm Quiz 2 due Apr 12, 2021 1500 HKT Completed
90 Minute Time Limit
Instructions
Work alone. Do not collaborate with or copy from anyone else.
Work the problems in any order you wish, but submit each answer before
ending the exam.
You may use any of the following resources:
One sheet (both sides) of handwritten (not photocopied or scanned) notes
If any question seems ambiguous, use the most reasonable interpretation (i.e.
don't be like Calvin):
Good Luck!
4/29/2021 Midterm Quiz 2 - Verified MM Learners | Midterm Quiz 2 | ISYE6501x Courseware | edX
https://courses.edx.org/courses/course-v1:GTx+ISYE6501x+1T2021/courseware/f712bb2a96ff46b0bc8d775293bfc91d/1b57ff6ea64c40cf8f4eb69d2b… 2/28
This is the beginning of Midterm Quiz 2. Please make sure that you submit all
your answers before the time runs out. Once you submit an answer to a
question, you cannot change it. There is no overall Submit button.
After submitting all answers, please click the "End my Exam" button, above,
before exiting from ProctorTrack to complete your exam.
Information for Question 1
There are five questions labeled "Question 1." Answer all five questions. For
each of the following five questions, select the probability distribution that
could best be used to model the described scenario. Each distribution might be
used, zero, one, or more than one time in the five questions.
These scenarios are meant to be simple and straightforward; if you're an expert
in the field the question asks about, please do not rely on your expertise to fill in
all the extra complexity (you'll end up making the questions below more difficult
than I intended).
Question 1
1.4/1.4 points (graded)
Number of eggs inspected until the first cracked one is found
Geometric
You have used 1 of 1 attempt
Question 1
1.4/1.4 points (graded)
Number of phone calls made by a telemarketer until one is answered
Geometric
Submit
4/29/2021 Midterm Quiz 2 - Verified MM Learners | Midterm Quiz 2 | ISYE6501x Courseware | edX
https://courses.edx.org/courses/course-v1:GTx+ISYE6501x+1T2021/courseware/f712bb2a96ff46b0bc8d775293bfc91d/1b57ff6ea64c40cf8f4eb69d2b… 3/28
You have used 1 of 1 attempt
Question 1
0.0/1.4 points (graded)
Number of arrivals to the ID-check queue at an airport each minute
Exponential
You have used 1 of 1 attempt
Question 1
0.0/1.4 points (graded)
Number of faces correctly identified by deep learning DL software until an
error is made
Weibull
You have used 1 of 1 attempt
Question 1
1.4/1.4 points (graded)
Time between people clicking an online banner ad
Exponential
You have used 1 of 1 attempt
Submit
Submit
Submit
Submit
4/29/2021 Midterm Quiz 2 - Verified MM Learners | Midterm Quiz 2 | ISYE6501x Courseware | edX
https://courses.edx.org/courses/course-v1:GTx+ISYE6501x+1T2021/courseware/f712bb2a96ff46b0bc8d775293bfc91d/1b57ff6ea64c40cf8f4eb69d2b… 4/28
Questions 2a, 2b
5.0/10.0 points (graded)
Five classification models were built for predicting whether a neighborhood will
soon see a large rise in home prices, based on public elementary school ratings
and other factors. The training data set was missing the school rating variable
for every new school 3% of the data points).
Because ratings are unavailable for newly-opened schools, it is believed that
locations that have recently experienced high population growth are more likely
to have missing school rating data.
Model 1 used imputation, filling in the missing data with the average school
rating from the rest of the data.
Model 2 used imputation, building a regression model to fill in the missing
school rating data based on other variables.
Model 3 used imputation, first building a classification model to estimate
(based on other variables) whether a new school is likely to have been built
as a result of recent population growth (or whether it has been built for
another purpose, e.g. to replace a very old school), and then using that
classification to select one of two regression models to fill in an estimate of
the school rating; there are two different regression models (based on other
variables), one for neighborhoods with new schools built due to population
growth, and one for neighborhoods with new schools built for other reasons.
Model 4 used a binary variable to identify locations with missing information.
Model 5 used a categorical variable: first, a classification model was used to
estimate whether a new school is likely to have been built as a result of
recent population growth; and then each neighborhood was categorized as
"data available", "missing, population growth", or "missing, other reason".
4/29/2021 Midterm Quiz 2 - Verified MM Learners | Midterm Quiz 2 | ISYE6501x Courseware | edX
https://courses.edx.org/courses/course-v1:GTx+ISYE6501x+1T2021/courseware/f712bb2a96ff46b0bc8d775293bfc91d/1b57ff6ea64c40cf8f4eb69d2b… 5/28
a. If school ratings cannot be reasonably well-predicted from the other factors,
and new schools built due to recent population growth cannot be reasonably
well-classified using the other factors, which model would you recommend?
b. In which of the following situations would you recommend using Model 3? All
predictions and classifications below are using the other factors.
[Show More]