spark summit keynote by shaun connolly
TRANSCRIPT
![Page 1: Spark Summit Keynote by Shaun Connolly](https://reader036.vdocuments.net/reader036/viewer/2022062823/5875ba151a28ab8b618b803b/html5/thumbnails/1.jpg)
Accelerating Enterprise Spark
Shaun ConnollyHortonworks Strategy
@shaunconnolly
![Page 2: Spark Summit Keynote by Shaun Connolly](https://reader036.vdocuments.net/reader036/viewer/2022062823/5875ba151a28ab8b618b803b/html5/thumbnails/2.jpg)
Apache Spark Unlocks Enormous Potential of Data in
the Enterprise
![Page 3: Spark Summit Keynote by Shaun Connolly](https://reader036.vdocuments.net/reader036/viewer/2022062823/5875ba151a28ab8b618b803b/html5/thumbnails/3.jpg)
Page 3 © Hortonworks Inc. 2011 – 2016. All Rights Reserved
Personalized Online Ads
Petabytes of Weblogs Analyzed with Spark at Scale• Data streams from a vast array of
desktop and mobile devices• 13 billion daily events processed
latency as low as 40 milliseconds • No data cleansing necessary prior
to analysis with Apache Spark• 2 clusters consolidated into 1
YARN-based HDP cluster• Launched new product Webtrends
Explore™ -- powered by HDP
Per-Customer Click Path
Web LogAnalysis
SQL Server Offload
“We’re able to…look at this data set and process it and do predictions, behavioral analysis. We can do things that allow us to determine ROI for different actions and behavioral patterns.”
Peter Crossley, Chief Architect
Behavioral Segmentation
Ad Click Predictions
LCV Analysis
![Page 4: Spark Summit Keynote by Shaun Connolly](https://reader036.vdocuments.net/reader036/viewer/2022062823/5875ba151a28ab8b618b803b/html5/thumbnails/4.jpg)
Page 4 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
New Use Cases
Cable Company: Optimize Advertising• Monitor channel changes with Spark Streaming• Correlate changes with Ads/Programming• Allocate Ads real time: Show ads to user who are
watching a show and will stay for > over 20 seconds
Railroad Company: Real-time View of State of Track• Optimize the track and train maintenance • Large volume and granularity of track data• GeoSpatial analytics is critical
![Page 5: Spark Summit Keynote by Shaun Connolly](https://reader036.vdocuments.net/reader036/viewer/2022062823/5875ba151a28ab8b618b803b/html5/thumbnails/5.jpg)
Spark TrendsImplications for the Enterprise
Data API Enterprise Ready /”Hardened”
Data Science is still the Frontier
![Page 6: Spark Summit Keynote by Shaun Connolly](https://reader036.vdocuments.net/reader036/viewer/2022062823/5875ba151a28ab8b618b803b/html5/thumbnails/6.jpg)
Page 6 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
ETL, Streaming, Reporting, Analytics
Must Integrate into Existing Environments
A Critical Tool in the Enterprise Tool Box
The Data API
![Page 7: Spark Summit Keynote by Shaun Connolly](https://reader036.vdocuments.net/reader036/viewer/2022062823/5875ba151a28ab8b618b803b/html5/thumbnails/7.jpg)
Page 7 © Hortonworks Inc. 2011 – 2015. All Rights Reserved
HA, DR, Tooling, Debugging, Operations
Security, Encryption, Governance Models
Scale
Implications of Enterprise-Ready / “Hardened”
![Page 8: Spark Summit Keynote by Shaun Connolly](https://reader036.vdocuments.net/reader036/viewer/2022062823/5875ba151a28ab8b618b803b/html5/thumbnails/8.jpg)
Agile Analytics & Data Science
Need to Democratize
Easy and Better Tooling
Train and Encourage More People to Join Us
![Page 9: Spark Summit Keynote by Shaun Connolly](https://reader036.vdocuments.net/reader036/viewer/2022062823/5875ba151a28ab8b618b803b/html5/thumbnails/9.jpg)
Hortonworks Strategy for Enterprise Spark at Scale
Agile Analytics & Data Science
Accelerate Capabilities for the Enterprise
Innovate at the Core
![Page 10: Spark Summit Keynote by Shaun Connolly](https://reader036.vdocuments.net/reader036/viewer/2022062823/5875ba151a28ab8b618b803b/html5/thumbnails/10.jpg)
Stay tuned…. March 1
![Page 11: Spark Summit Keynote by Shaun Connolly](https://reader036.vdocuments.net/reader036/viewer/2022062823/5875ba151a28ab8b618b803b/html5/thumbnails/11.jpg)
Thank You!Shaun Connolly
@shaunconnolly