dataanalytics damien lafferty, james daly, niall turbitt georg steinbuß

18
DATA ANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

Upload: anne-dalton

Post on 18-Jan-2016

219 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

DATAANALYTICS

Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

Page 2: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

DATA ANALYTICS•Examining Raw Data

•Drawing Conclusions

•Lake & Stream

Page 3: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

DATA ANALYTICSDATA LAKE DATA STREAM• Storage• Long-term

historical data• Easy

• Real-time data• Parallel analysis• Difficult

Page 4: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

UAll Lowercase

All Uppercase Cl s et ssiylanAr

Page 5: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

UAll vowels

All consonants Cl s et ssiylanAr

Page 6: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

Undergraduate Degree

Facebook Friends

Archery Clubs

Home Town

Housemates

Work Placement Graduate

Mixer

Page 7: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

Technical CommunicationNiall Turbitt

Georg Christian

Lu Xin

Sean Cawley

Damien Lafferty

Facebook Friends

Page 8: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

Structured Data

Unstructured Data

Page 9: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

• 80% of all data is unstructured data

• Unstructured data estimated at 3,000,000 petabytes

Dublin

Cork

• Relative distance from the Earth to Jupiter

Page 10: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

TEXT

•Forms the majority of unstructured data

•Nearly one million bits of content shared on Facebook every minute

•Over 100,000 tweets per minute

Page 11: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

TEXT MINING EXAMPLE

• People’s mood on coffee, wine, beer and soda from Twitter

• Compare tweets to database of positive and negative words

• Calculate a sentiment score:

Score = # of Positive Words - # of Negative Words

• If Score > 0 - 'positive opinion'

• If Score < 0 - 'negative opinion'

• If Score = 0 - 'neutral opinion'

Page 12: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß
Page 13: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß

WHY?

Page 14: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß
Page 15: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß
Page 16: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß
Page 17: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß
Page 18: DATAANALYTICS Damien Lafferty, James Daly, Niall Turbitt Georg Steinbuß