big data in media
DESCRIPTION
This survey of big data and advanced technology as it relates to media was given to an industry association in November of 2012. Rocket FuelTRANSCRIPT
![Page 1: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/1.jpg)
“Big Data” for Media
November 20, 2012
This presentation was prepared by SoundView Technology Group and is being provided to the Danish Media Association to promote discussion and provide information. All information contained herein is freely shareable by the association as well as anyone participating in the workshop. Attribution is appreciated but not required.
![Page 2: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/2.jpg)
2
1979-1981 Hardware, Programming
1981-1992
Applied AI, CMU, IBM
1993-2004 Equity Research, Banking,
Brokerage, Investing
2004-Present Advisor, Entrepreneur, Researcher, Publisher,
Programmer
Kris Tuttle
Related Work
![Page 3: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/3.jpg)
3
Eddie Obeng: After Midnight
![Page 4: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/4.jpg)
4
Eddie Obeng: After Midnight
![Page 5: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/5.jpg)
5
Eddie Obeng: After Midnight
![Page 6: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/6.jpg)
6
Discussion Points
• Flyover of recent significant related events and developments
• Unpacking “Big Data” and looking at the pieces
• The intersection of big data and media: • The obvious – advertising, targeting, conversions • Data analysis = content? • Content discovery, generation and ranking • Hyper-personalization & attenuation
• Wrap up and transition to Q&A
© SoundView Technology Group 2012
![Page 7: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/7.jpg)
7
Flyover – Evolution of what’s News
Source: Monday Note
© SoundView Technology Group 2012
![Page 8: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/8.jpg)
8
Flyover – Artist Control
![Page 9: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/9.jpg)
9
Flyover – “Amateur” Content Development
The Kickstarter model is disrupting the way things are created, produced and initially adopted.
![Page 10: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/10.jpg)
10
Flyover – Sourcing & Distribution
![Page 11: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/11.jpg)
11
Flyover – Sourcing & Distribution
![Page 12: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/12.jpg)
12
Flyover – Data Analysis as Content
![Page 13: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/13.jpg)
13
Flyover – Mobile
![Page 14: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/14.jpg)
Big Data Applications
14
How Big Data Looks
© SoundView Technology Group 2012
Smörgåsboard
Size Speed
Sloppy
![Page 15: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/15.jpg)
15
Unpacking Big Data
© SoundView Technology Group 2012
• Often not that big in terms of size often less than 1TB
• There is real big data and it’s sometimes enormous ~20 PB/day
• Tends to have additional special features:
o Multiple sources – transactional, log files, databases, geospatial,
o Semi or unstructured – clickstreams, html, sensor data
o Shorter duration – seconds to days versus weeks to months
o More flexible – sometimes scheme-less, and built for speed
• Today the default technology for most projects is Hadoop
• Some machine learning is becoming a common feature
![Page 16: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/16.jpg)
16
Tools are Immature
© SoundView Technology Group 2012
![Page 17: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/17.jpg)
17
New DB Technologies
© SoundView Technology Group 2012
![Page 18: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/18.jpg)
18
New Language Technology
© SoundView Technology Group 2012
![Page 19: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/19.jpg)
19
Data for Targeting & Conversion
© SoundView Technology Group 2012
![Page 20: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/20.jpg)
20
Data for Targeting & Conversion
• Integrating geospatial data for local advertising
• Intersecting social data with news and search for customization
• Going beyond “A/B testing” and using real-time data analysis to improve content on the fly
• Saved data opens the door to machine learning and better algorithms
© SoundView Technology Group 2012
![Page 21: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/21.jpg)
21
Data Analysis as Content
• Increased building of proprietary data and surveys – no brainer
• In-house data scientists and coding capabilities will help
• There are quite a few good text analysis and processing tools out there – NLTK, Python
• NoSQL and Network DB tecnology can help
© SoundView Technology Group 2012
![Page 22: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/22.jpg)
22
Data-driven Editorial & Contributions
© SoundView Technology Group 2012
![Page 23: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/23.jpg)
23
Using Data to Hyper-Personalize
• The future of reader relationships is about more than content quality – readers will grow to expect highly personalized content
• Managing, distributing and “learning” from collected data requires a
much more sophisticated view of how content is represented, managed and distributed
© SoundView Technology Group 2012
![Page 24: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/24.jpg)
24
Getting Started – Education
© SoundView Technology Group 2012 http://bigdatauniversity.com/
![Page 25: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/25.jpg)
25
Getting Started – Data Sources
© SoundView Technology Group 2012
![Page 26: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/26.jpg)
26
Getting Started – Vendors
© SoundView Technology Group 2012
![Page 27: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/27.jpg)
27
Getting Hygge with Big Data
© SoundView Technology Group 2012
• We are still in the very early days of implementation – nothing is precluded
• Appoint or hire a “data czar” - more than a point person, the one who can translate the technology into implementation opportunities
• For each line of business map revenue growth and profit margins to variables that might be improved with more data and better analysis
• In-house custom coding capability is a strategic advantage – data integration remains a big challenge
• Start building some proprietary data – surveys, histories, aggregations
• Are any ideas worth implementing in the local market? (Zite, HuffingPost, NYT/Nate Silver)
• Experiment, experiment, experiment
![Page 28: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/28.jpg)
28
![Page 29: Big Data in Media](https://reader034.vdocuments.net/reader034/viewer/2022042813/548257d95906b5f1048b465a/html5/thumbnails/29.jpg)
Kris Tuttle
+1-617-934-1877 (US)
+33(0)6.7439.8593 (France)