Engineering > QUESTIONS & ANSWERS > ISYE 6501: Introduction to Analytics Modeling Homework 2 (All)

ISYE 6501: Introduction to Analytics Modeling Homework 2

Document Content and Description Below

Question 4.1 – Clustering Models Describe a situation or problem from your job, everyday life, current events, etc., for which a clustering model would be appropriate. List some (up to 5) predicto... rs that you might use. Our company is exploring the use of on-demand and shared-space office facilities for our employees. This would help us shift our facility footprint from a few larger office buildings that may not align well to our operating model to numerous smaller facility spaces, presumably better aligned to operations. We are considering the following predictors: 1. Proximity to our clients a. Current clients b. Prospective and/or former clients 2. Proximity to our staff (zip codes) 3. Proximity to major airports and interstates 4. Facility cost ($/sf) Question 4.2 – Iris Clustering The iris data set iris.txt contains 150 data points, each with four predictor variables and one categorical response. The predictors are the width and length of the sepal and petal of flowers and the response is the type of flower. The data is available from the R library datasets and can be accessed with iris once the library is loaded. It is also available at the UCI Machine Learning Repository (https://archive.ics.uci.edu/ml/datasets/Iris ). The response values are only given to see how well a specific method performed and should not be used to build the model. Use the R function kmeans to cluster the points as well as possible. Report the best combination of predictors, your suggested value of k, and how well your best clustering predicts flower type. Examining the tabular data reveals four attributes (sepal length, sepal width, petal length, and petal width) and three species. Plotting the petal lengths vs widths; sepal lengths vs widths; petal lengths vs sepal widths; and sepal lengths vs petal widths collectively suggest three clusters. Sepal length vs sepal width is not as revealing as the other three; and petal length vs petal width shows the best cluster separation based on similar sizes within each species (and vary significantly between each species). The plots are show for comparison on the following page. This study source was downloaded by 100000842525582 from CourseHero.com on 05-13-2022 05:33:43 GMT -05:00 https://www.coursehero.com/file/32154435/ISYE6501-Homework-2docx/ Since the plotted data suggests three clusters, initial k means clustering was conducted using all four attributes and k = 3. Plotting the elbow diagram reveals a bend in the curve with diminishing returns around k=3 or k=4. Using kmeans, an expectation-maximization algorithm, with k=3 revealed a sum of squares of 88.4%, an accuracy of 89.333%, and a cluster center distance of 78.85144. Adjusting k to 2 revealed a lower sum of squares of 77.6%, and accuracy of 98.0%, and a cluster center distance of 152.348. Adjusting k to 4 revealed a higher sum of squares of 91.6%, an accuracy of 84%, and a distance to cluster center of 71.75951. The distance reveals how well Plotting the predicted clusters reveals how well the data has split up among the different species. “Petal Width v Petal Length” cluster assignment reveals the best clustering, as shown below. This study source was downloaded by 100000842525582 from CourseHero.com on [Show More]

Last updated: 1 year ago

Preview 1 out of 5 pages

Reviews( 0 )

Recommended For You

 History> QUESTIONS & ANSWERS > ISYE 6501 Midterm 2. Complete Solutions - Introduction To Analytics Modeling - GTX ISYE 6501 (All)

preview
ISYE 6501 Midterm 2. Complete Solutions - Introduction To Analytics Modeling - GTX ISYE 6501

90 Minute Time Limit Instructions Work alone. Do not collaborate with or copy from anyone else. You may use any of the following resources: One sheet (both sides) of handwritten (not photocopied o...

By bundleHub Solution guider , Uploaded: Jul 13, 2022

$10

 History> QUESTIONS & ANSWERS > Complete Solutions - Introduction To Analytics Modeling - GTX ISYE 6501 (All)

preview
Complete Solutions - Introduction To Analytics Modeling - GTX ISYE 6501

For each of the following five questions, select the probability distribution that could best be used to model the described scenario. Each distribution might be used, zero, one, or more than one ti...

By bundleHub Solution guider , Uploaded: Jul 13, 2022

$10

 Engineering> QUESTIONS & ANSWERS > HomeWork #1 EDX GTx: ISYE6501x - Introduction to Analytics Modeling (All)

preview
HomeWork #1 EDX GTx: ISYE6501x - Introduction to Analytics Modeling

HomeWork #1 EDX GTx: ISYE6501x - Introduction to Analytics Modeling Mónica Rojas May 17, 2020 Table of Contents Results...............................................................................

By Nutmegs , Uploaded: May 20, 2022

$9

 Engineering> QUESTIONS & ANSWERS > ISYE 6501: Introduction to Analytics Modeling Homework 3 (All)

preview
ISYE 6501: Introduction to Analytics Modeling Homework 3

Overview This week’s lesson involves data preparation, including outlier identification, handling outliers, and an introduction to change detection. Data preparation involves inspecting data visual...

By Nutmegs , Uploaded: May 20, 2022

$8.5

 Engineering> QUESTIONS & ANSWERS > KSM GTx: ISYE6501x Introduction to Analytics Modeling Week 13: Homework 12 (All)

preview
KSM GTx: ISYE6501x Introduction to Analytics Modeling Week 13: Homework 12

KSM GTx: ISYE6501x Introduction to Analytics Modeling Week 13: Homework 12 April 8, 2020 Question 18.1 Describe analytics models and data that could be used to make good recommendations to the p...

By Nutmegs , Uploaded: May 19, 2022

$7.5

 Information Technology> QUESTIONS & ANSWERS > Georgia Institute Of Technology ISYE 6501 Homework 7 Complete Solutions - Introduction To Analytics Modeling - GTX ISYE 6501 (All)

preview
Georgia Institute Of Technology ISYE 6501 Homework 7 Complete Solutions - Introduction To Analytics Modeling - GTX ISYE 6501

Georgia Institute Of Technology ISYE 6501 Homework 7 Complete Solutions - Introduction To Analytics Modeling - GTX ISYE 6501

By Tessa , Uploaded: May 13, 2022

$10

 Information Technology> QUESTIONS & ANSWERS > Georgia Institute Of Technology ISYE 650 - 113 week 2 homework solutions Introduction To Analytics Modeling - GTX ISYE 6501 (All)

preview
Georgia Institute Of Technology ISYE 650 - 113 week 2 homework solutions Introduction To Analytics Modeling - GTX ISYE 6501

Question 1 Describe a situation or problem from your job, everyday life, current events, etc., for which a clustering model would be appropriate. List some (up to 5) predictors that you might use....

By Tessa , Uploaded: May 13, 2022

$10

 Information Technology> QUESTIONS & ANSWERS > Georgia Institute Of Technology ISYE 6501 Homework 2 Complete Solutions - Introduction To Analytics Modeling - GTX ISYE 6501 (All)

preview
Georgia Institute Of Technology ISYE 6501 Homework 2 Complete Solutions - Introduction To Analytics Modeling - GTX ISYE 6501

Question 4.1 At my job we use clustering in an image system to understand the minerals that are coming into our processing facility. Comparing the images to known mineral images, the system can quan...

By Tessa , Uploaded: May 13, 2022

$10

 Information Technology> QUESTIONS & ANSWERS > Georgia Institute Of Technology ISYE 6501 Homework 2 (Complete Solution) Introduction To Analytics Modeling - GTX ISYE 6501 (All)

preview
Georgia Institute Of Technology ISYE 6501 Homework 2 (Complete Solution) Introduction To Analytics Modeling - GTX ISYE 6501

Georgia Institute Of Technology ISYE 6501 Homework 2 (Complete Solution) Introduction To Analytics Modeling - GTX ISYE 6501

By Tessa , Uploaded: May 13, 2022

$10

 Information Technology> QUESTIONS & ANSWERS > Georgia Institute Of Technology ISYE 6501 week 2 hw solutions Introduction To Analytics Modeling - GTX ISYE 6501 (All)

preview
Georgia Institute Of Technology ISYE 6501 week 2 hw solutions Introduction To Analytics Modeling - GTX ISYE 6501

Question 1 Using the same data set as Homework 1 Question 2 use the ksvm or kknn function to find a good classifier: (a) using cross-validation for the k-nearest-neighbors model; and (b) splitting...

By Tessa , Uploaded: May 13, 2022

$10

$7.00

Add to cart

Instant download

Can't find what you want? Try our AI powered Search

OR

GET ASSIGNMENT HELP
86
0

Document information


Connected school, study & course



About the document


Uploaded On

May 20, 2022

Number of pages

5

Written in

Seller


seller-icon
Nutmegs

Member since 2 years

570 Documents Sold


Additional information

This document has been written for:

Uploaded

May 20, 2022

Downloads

 0

Views

 86

Document Keyword Tags

THE BEST STUDY GUIDES

Avoid resits and achieve higher grades with the best study guides, textbook notes, and class notes written by your fellow students

custom preview

Avoid examination resits

Your fellow students know the appropriate material to use to deliver high quality content. With this great service and assistance from fellow students, you can become well prepared and avoid having to resits exams.

custom preview

Get the best grades

Your fellow student knows the best materials to research on and use. This guarantee you the best grades in your examination. Your fellow students use high quality materials, textbooks and notes to ensure high quality

custom preview

Earn from your notes

Get paid by selling your notes and study materials to other students. Earn alot of cash and help other students in study by providing them with appropriate and high quality study materials.

WHAT STUDENTS SAY ABOUT US


What is Browsegrades

In Browsegrades, a student can earn by offering help to other student. Students can help other students with materials by upploading their notes and earn money.

We are here to help

We're available through e-mail, Twitter, Facebook, and live chat.
 FAQ
 Questions? Leave a message!

Follow us on
 Twitter

Copyright © Browsegrades · High quality services·