data science presentation
TRANSCRIPT
![Page 1: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/1.jpg)
VacAdvisor
Fred N. Kiwanuka, Fellow, Insight Data Science
August 6, 2015
Fred N. Kiwanuka Fellow Insight Data Science VacAdvisor
![Page 2: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/2.jpg)
What are my vacation options on a specified budget?
![Page 3: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/3.jpg)
The Data
![Page 4: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/4.jpg)
The Data
![Page 5: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/5.jpg)
The Data
![Page 6: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/6.jpg)
The Data
![Page 7: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/7.jpg)
Conceptual Framework
![Page 8: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/8.jpg)
Algorithm: Clustering
![Page 9: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/9.jpg)
Algorithm: Clustering
![Page 10: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/10.jpg)
Algorithm: Clustering
![Page 11: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/11.jpg)
Cluster Validation
![Page 12: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/12.jpg)
Cluster Validation
Table: Cluster Validation

| Number of Clusters | WSS (×10³) | City | Cities Closest to Centroid |
|---|---|---|---|
| 2 | 10.32 | Seattle | Detroit, Charlotte, South Bend |
| 3 | 9.93 | Seattle | Boston, Phoenix, Detroit |
| 4 | 9.91 | Seattle | Charlotte, South Bend, Minneapolis |
| 5 | 8.40 | Seattle | Detroit, Charlotte, South Bend |
| 7 | 9.62 | Seattle | Detroit, Charlotte, South Bend |
| 10 | 9.63 | Seattle | Sacramento, San Jose, Columbus |
| 12 | 9.52 | Seattle | Sacramento, San Jose, Columbus |

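
The within-cluster sum of squares (WSS) reported per cluster count can be illustrated with a small, dependency-free sketch. It assumes plain Lloyd's k-means with Euclidean distance, which the slides do not confirm; the function names, the toy points, and the placeholder city names are all hypothetical and are not the VacAdvisor data.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def assign(points, centroids):
    """Index of the nearest centroid for every point."""
    return [min(range(len(centroids)), key=lambda j: sq_dist(p, centroids[j]))
            for p in points]

def kmeans(points, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means (assumed algorithm); returns (centroids, labels)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # initialise from k distinct data points
    for _ in range(n_iter):
        labels = assign(points, centroids)
        new_centroids = []
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:
                # Move the centroid to the mean of its members.
                new_centroids.append(
                    tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                # Keep the old centroid if a cluster ends up empty.
                new_centroids.append(centroids[j])
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, assign(points, centroids)

def wss(points, centroids, labels):
    """Within-cluster sum of squares: squared distance of each point to its centroid."""
    return sum(sq_dist(p, centroids[l]) for p, l in zip(points, labels))

def closest_to_centroid(points, names, centroid, n=3):
    """Names of the n points nearest to a given centroid."""
    order = sorted(range(len(points)), key=lambda i: sq_dist(points[i], centroid))
    return [names[i] for i in order[:n]]

if __name__ == "__main__":
    # Hypothetical toy features (e.g. two cost dimensions per city).
    names = ["city_a", "city_b", "city_c", "city_d", "city_e", "city_f"]
    points = [(1.0, 2.0), (1.5, 1.8), (8.0, 8.0), (9.0, 9.5), (1.2, 0.5), (8.5, 9.0)]
    for k in (1, 2, 3):
        centroids, labels = kmeans(points, k)
        print(k, round(wss(points, centroids, labels), 3),
              closest_to_centroid(points, names, centroids[0]))
```

Because Lloyd's algorithm only finds a local optimum from a random start, WSS is not guaranteed to decrease monotonically as the number of clusters grows, which may be one reason values in a table like the one above do not fall steadily.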
![Page 13: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/13.jpg)
Silhouette Score
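
For reference, the silhouette score can be sketched as follows. This is a minimal stdlib version assuming Euclidean distance and the standard definition s(i) = (b - a) / max(a, b), where a is the mean distance from point i to the other members of its cluster and b is the lowest mean distance to any other cluster. The function names and toy points are hypothetical; the slides do not say how the score was computed.

```python
import math

def euclid(a, b):
    """Euclidean distance between two equal-length tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def silhouette_score(points, labels):
    """Mean silhouette coefficient over all points (range -1 to 1)."""
    clusters = {}
    for p, l in zip(points, labels):
        clusters.setdefault(l, []).append(p)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    total = 0.0
    for p, l in zip(points, labels):
        own = clusters[l]
        if len(own) == 1:
            continue  # by convention a singleton cluster contributes s(i) = 0
        # a: mean distance to the other members of the same cluster
        # (the distance from p to itself is 0, so divide by len - 1).
        a = sum(euclid(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(sum(euclid(p, q) for q in other) / len(other)
                for m, other in clusters.items() if m != l)
        denom = max(a, b)
        if denom > 0:
            total += (b - a) / denom
    return total / len(points)

if __name__ == "__main__":
    points = [(0, 0), (0, 1), (10, 0), (10, 1)]
    print(silhouette_score(points, [0, 0, 1, 1]))  # compact, well-separated labelling
    print(silhouette_score(points, [0, 1, 0, 1]))  # labelling that mixes the two groups
```

Values near 1 indicate compact, well-separated clusters, values near 0 indicate overlapping clusters, and negative values suggest points assigned to the wrong cluster.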
![Page 14: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/14.jpg)
Silhouette Score
![Page 15: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/15.jpg)
Silhouette Score
![Page 16: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/16.jpg)
Silhouette Score
![Page 17: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/17.jpg)
Silhouette Score
![Page 18: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/18.jpg)
Cluster Initialization and Validation
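Table: Cluster Initialization and Validation

| Algorithm | Time (s) | Homogeneity | Completeness | V-measure | ARI | AMI |
|---|---|---|---|---|---|---|
| k-means | 0.03 | 0.971 | 0.971 | 0.971 | 0.988 | 0.970 |
| VQ | 0.04 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| PCA + k-means | 0.00 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| Mean Shift | 0.24 | 1.000 | 0.970 | 0.972 | 0.980 | 0.972 |
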
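
The homogeneity, completeness, and V-measure columns compare a clustering against reference labels. The sketch below computes them from their entropy-based definitions using only the standard library; the function names and toy labels are hypothetical, and the slides do not state which reference labels were used. ARI and AMI are omitted here; scikit-learn provides all five metrics under `sklearn.metrics` (for example `homogeneity_score` and `adjusted_rand_score`).

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy H(labels) in nats."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())

def conditional_entropy(target, given):
    """Conditional entropy H(target | given) in nats."""
    n = len(target)
    joint = Counter(zip(target, given))
    given_counts = Counter(given)
    return -sum((c / n) * math.log(c / given_counts[g])
                for (_, g), c in joint.items())

def homogeneity_completeness_v(truth, pred):
    """Return (homogeneity, completeness, V-measure) for a predicted clustering."""
    h_truth, h_pred = entropy(truth), entropy(pred)
    # Homogeneity: each cluster contains members of a single class.
    homo = 1.0 if h_truth == 0 else 1.0 - conditional_entropy(truth, pred) / h_truth
    # Completeness: all members of a class land in the same cluster.
    compl = 1.0 if h_pred == 0 else 1.0 - conditional_entropy(pred, truth) / h_pred
    # V-measure: harmonic mean of the two.
    v = 0.0 if homo + compl == 0 else 2 * homo * compl / (homo + compl)
    return homo, compl, v

if __name__ == "__main__":
    truth = [0, 0, 1, 1]
    print(homogeneity_completeness_v(truth, [1, 1, 0, 0]))  # same partition, renamed clusters
    print(homogeneity_completeness_v(truth, [0, 1, 2, 3]))  # every point in its own cluster
```

A homogeneity of 1 together with a lower completeness means every cluster is pure but at least one class is split across several clusters.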
![Page 19: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/19.jpg)
Fred N. Kiwanuka
PhD (Groningen), MSc (London), MIT (Fellow)
![Page 20: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/20.jpg)
Mobile Malaria Diagnosis
![Page 21: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/21.jpg)
Classification Challenge
![Page 22: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/22.jpg)
Feature Engineering
Number of images: 50,000, with a 60-element feature vector for each image
Perimeter
Moment of Inertia [4 features]
Elongation
Jaggedness
Circularity
Moment features [9 features]
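
A few of the listed shape features can be sketched for an object outline given as a closed polygon. The definitions here are assumptions, since the slides do not give formulas: circularity is taken as 4πA / P², and elongation as one minus the ratio of the short to the long side of the axis-aligned bounding box. All function names are hypothetical, and the moment and jaggedness features are not covered.

```python
import math

def perimeter(contour):
    """Length of a closed polygon given as a list of (x, y) vertices."""
    n = len(contour)
    return sum(
        math.hypot(contour[(i + 1) % n][0] - contour[i][0],
                   contour[(i + 1) % n][1] - contour[i][1])
        for i in range(n))

def area(contour):
    """Polygon area via the shoelace formula."""
    n = len(contour)
    s = 0.0
    for i in range(n):
        x0, y0 = contour[i]
        x1, y1 = contour[(i + 1) % n]
        s += x0 * y1 - x1 * y0
    return abs(s) / 2.0

def circularity(contour):
    """Assumed definition: 4*pi*A / P**2 (1 for a circle, lower for other shapes)."""
    p = perimeter(contour)
    return 4.0 * math.pi * area(contour) / (p * p)

def elongation(contour):
    """Assumed definition: 1 - short/long side of the axis-aligned bounding box."""
    xs = [x for x, _ in contour]
    ys = [y for _, y in contour]
    w, h = max(xs) - min(xs), max(ys) - min(ys)
    long_side, short_side = max(w, h), min(w, h)
    return 0.0 if long_side == 0 else 1.0 - short_side / long_side

if __name__ == "__main__":
    square = [(0, 0), (1, 0), (1, 1), (0, 1)]
    rect = [(0, 0), (4, 0), (4, 1), (0, 1)]
    for name, shape in (("square", square), ("rect", rect)):
        print(name, perimeter(shape), circularity(shape), elongation(shape))
```

In a pipeline like the one described, such per-object values would be concatenated with the remaining features into the per-image feature vector.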
![Page 23: Data Science Presentation](https://reader030.vdocuments.net/reader030/viewer/2022032513/55d15272bb61eb756d8b458f/html5/thumbnails/23.jpg)
Results