
Georgia Institute Of Technology ISYE 6501 Homework 10 Complete Solutions - Introduction To Analytics Modeling - GTX ISYE 6501

Document Content and Description Below

ISYE6501 HOMEWORK 10

Question 14.1

The breast cancer data set breast-cancer-wisconsin.data.txt from http://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/ (description a ... t http://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Original%29 ) has missing values.

1. Use the mean/mode imputation method to impute values for the missing data.
2. Use regression to impute values for the missing data.
3. Use regression with perturbation to impute values for the missing data.
4. (Optional) Compare the results and quality of classification models (e.g., SVM, KNN) built using (1) the data sets from questions 1, 2, 3; (2) the data that remains after data points with missing values are removed; and (3) the data set when a binary variable is introduced to indicate missing values.

Question 15.1

Describe a situation or problem from your job, everyday life, current events, etc., for which optimization would be appropriate. What data would you need?

I worked at a bank, and our fraud agents needed to review direct deposits that we had identified as suspicious. I built a logistic regression model to identify the deposits that are more likely to be suspicious. The challenge is that we can only use the model score to prioritize the queue. Deposits with higher amounts might also have a higher priority. Moreover, deposits are processed in batches at different times of the day, so the quantity of deposits differs depending on the time of day, the day of the week, and the week of the month, and agents have limited time to review the suspicious deposits. Since agents are a limited resource, optimization would be appropriate for estimating the number of agents needed at any given time of day.

The data that I would need is:

* The list of deposits at any given time/day, with the amount and fraud score of each.
* The team budget, the maximum number of agents on weekdays vs. weekends, and the average time spent reviewing each deposit.
* The minimum penetration rate (reviewed deposits over total deposits).
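The staffing question in the answer to 15.1 can be sketched as a small integer program: minimize the number of agents in each review window, subject to the minimum penetration rate and the weekday/weekend agent cap. The sketch below is only an illustration under simplifying assumptions that are not in the original answer: windows are treated as independent (so the problem decomposes and each window's minimum is a ceiling division), every review takes the average time, and the names `Slot`, `agents_needed`, and `staffing_plan` are hypothetical. It ignores the fraud score and deposit amount, which would enter a fuller model through the ordering of the review queue.

```python
import math
from dataclasses import dataclass


@dataclass
class Slot:
    """One review window, e.g. 'Mon 09:00' (hypothetical structure)."""
    label: str
    n_deposits: int   # suspicious deposits arriving in this window
    max_agents: int   # staffing cap for this window (weekday vs. weekend)


def agents_needed(slot, min_penetration, minutes_per_review, minutes_per_agent):
    """Smallest number of agents meeting the penetration target.

    Returns None when the target cannot be met within the slot's cap.
    """
    # Reviews required so that reviewed / total >= min_penetration.
    required_reviews = math.ceil(min_penetration * slot.n_deposits)
    # Whole reviews one agent can complete in the window (assumed constant).
    per_agent = minutes_per_agent // minutes_per_review
    if per_agent == 0:
        return None
    agents = math.ceil(required_reviews / per_agent)
    return agents if agents <= slot.max_agents else None


def staffing_plan(slots, min_penetration, minutes_per_review, minutes_per_agent):
    """Map each slot label to its minimum agent count (None if infeasible)."""
    return {
        s.label: agents_needed(s, min_penetration, minutes_per_review, minutes_per_agent)
        for s in slots
    }


if __name__ == "__main__":
    # Made-up volumes purely for illustration.
    slots = [Slot("Mon 09:00", 200, 10), Slot("Sat 09:00", 120, 2)]
    plan = staffing_plan(slots, min_penetration=0.5,
                         minutes_per_review=3, minutes_per_agent=60)
    for label, agents in plan.items():
        print(label, "infeasible under cap" if agents is None else agents)
```

A slot reported as infeasible signals that either the cap or the penetration target would have to change. Comparing the sum of the feasible agent counts against the team budget is the natural next check, and coupling windows together (for example, through shifts that span several windows) would turn this into a genuine integer program that needs a solver.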

Last updated: 3 years ago


Document information


About the document


Uploaded On

May 16, 2022

Number of pages

10

Written in

All

Seller


Tessa
