taming the beast: extracting value from hadoop
TRANSCRIPT
![Page 1: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/1.jpg)
John L Myers
Enterprise Management Associates
Managing Research Director
@johnlmyers44
Taming the Beast:
Extracting Value from Hadoop
Ingo Mierswa
RapidMiner
Founder & CTO
![Page 2: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/2.jpg)
Panel Moderator
Lyndsay Wise, Research Director, EMA
Lyndsay has over 10 years experience in software
research, BI consulting, and strategy development,
specializing in software evaluation and best-fit solution
selection. Her focus at EMA is on data integration, data
governance, cloud technologies, data visualization,
analytics, and collaboration.
Slide 2 © 2015 Enterprise Management Associates, Inc.
![Page 3: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/3.jpg)
Featured Speakers
John Myers, Managing Research Director, EMA
John has over 10 years of experience working in areas related to business
analytics in professional services consulting and product development
roles. Additionally, John helps organizations solve their business analytics
problems, whether they relate to operational platforms – such as customer
care or billing – or applied analytical applications – such as revenue
assurance or fraud management.
Ingo Mierswa, Founder & CTO, RapidMiner
Ingo, an industry-veteran data scientist, is the founder and CTO of
RapidMiner, the industry’s #1 open source platform for predictive
analytics. Ingo is passionate about the technological innovation enabled
by the open source community and envisions a world where easy-to-use
predictive analytics software empowers all business analysts and data
scientists. Ingo is the author of numerous award-winning publications
about predictive analytics and big data, and has spoken at countless
industry events.
Slide 3 © 2015 Enterprise Management Associates, Inc.
![Page 4: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/4.jpg)
A PDF of the PowerPoint
presentation will be available
Event Presentation
Logistics for Today’s Webinar
Slide 4 © 2015 Enterprise Management Associates, Inc.
An archived version of the event recording will be
available at www.enterprisemanagement.com
• Log questions in the Q&A panel located on the
lower right corner of your screen
• Questions will be addressed during the Q&A
session of the event
Questions
Event Recording
![Page 5: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/5.jpg)
Join the Conversation…
Submit your questions or comments to the panel
using: @wiseanalytics @johnlmyers44 @rapidminer
#predictiveanalytics
Slide 5 © 2015 Enterprise Management Associates, Inc.
![Page 6: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/6.jpg)
Topic #1:
Issues With Data Lakes
![Page 7: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/7.jpg)
Adoption of Hadoop-based Data Lake Architectures
Slide 7 © 2015 Enterprise Management Associates, Inc.
![Page 8: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/8.jpg)
Topic #2:
Obstacles Implementing
Analytics On Hadoop
![Page 9: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/9.jpg)
Obstacles Implementing Analytics
Slide 9 © 2015 Enterprise Management Associates, Inc.
![Page 10: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/10.jpg)
Topic #3:
Processing Requirements for
Predictive Analytics
![Page 11: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/11.jpg)
Required Processing and Compute Latency
for Big Data Projects
Slide 11 © 2015 Enterprise Management Associates, Inc.
![Page 12: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/12.jpg)
©2015 RapidMiner, Inc. All rights reserved. - 12 -
Architecture of Hadoop
Orchestration node
Worker nodes
![Page 13: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/13.jpg)
©2015 RapidMiner, Inc. All rights reserved. - 13 -
Leverage Hadoop’s Compute Capacity
• Design advanced analytics workflows in your predictive analytics platform
• Ensure your solution automatically translates predictive analytics needs into native Hadoop code, e.g., MapReduce, Hive, Pig, Spark, etc.
• Push predictive analytic instructions into your Hadoop
• Hadoop performs calculations across the entire Hadoop cluster for a holistic view of your data
• Data remains in Hadoop Results are delivered to the business
• Recommendations
– GUI workflow language (code-free)
– Don’t forget about security
ResultsAnalytic instructions
translated to native
Hadoop
Calculations
Results
operationalized in
business processes
Predictive Analytics Platform
![Page 14: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/14.jpg)
Topic #4:
Successful Big Data Analytics
Projects
![Page 15: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/15.jpg)
Project Success
Slide 15 © 2015 Enterprise Management Associates, Inc.
![Page 16: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/16.jpg)
©2015 RapidMiner, Inc. All rights reserved. - 16 -
![Page 17: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/17.jpg)
©2015 RapidMiner, Inc. All rights reserved. - 17 -
OPERATIONALIZEPredictive Decisions
Close the Loop BetweenInsight and Action
Embed predictive models into critical business processes
Recommend best options for human or automated actions
©2015 RapidMiner, Inc. All rights reserved. - 17 -
![Page 18: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/18.jpg)
Topic #5:
Best Practices For
Implementing
Advanced/Modern Analytics
![Page 19: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/19.jpg)
©2015 RapidMiner, Inc. All rights reserved. - 19 -
EFFORTLESS Predictive Analytics
Immediately Empower Analysts to Anticipate
Opportunity & Risk
Easily Combine Any Data at Unlimited Scale with Any Model
Code-Free, Lightning-Fastand Intuitive
©2015 RapidMiner, Inc. All rights reserved. - 19 -
![Page 20: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/20.jpg)
Topic #6:
Use Of Mixed Environments
For Implementation Of Big
Data Analytics
![Page 21: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/21.jpg)
Growing Importance of Cloud Resources
Slide 21 © 2015 Enterprise Management Associates, Inc.
![Page 22: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/22.jpg)
©2015 RapidMiner, Inc. All rights reserved. - 22 -
- 22 -
Design Once, Deploy ANYWHERE
Leverage Investments in Existing and Future Systems
Design predictive analytics independent of platforms
Seamlessly execute predictive analytics in-memory or in any source, including
data-at-rest or data-in-motion
- 22 -©2015 RapidMiner, Inc. All rights reserved.
![Page 23: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/23.jpg)
Topic #7:
Evolving Role of
the Data Consumer
![Page 24: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/24.jpg)
What We Used to Think
of Analytical Users
Slide 24 © 2015 Enterprise Management Associates, Inc.
![Page 25: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/25.jpg)
Empowering the Line of Business
Slide 25 © 2015 Enterprise Management Associates, Inc.
![Page 26: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/26.jpg)
Topic #8:
Use Cases – Monetizing
Insights Buried In Your
Multi-Structured Data
![Page 27: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/27.jpg)
©2015 RapidMiner, Inc. All rights reserved. - 27 -
Challenge Better understand TV viewing habits to prevent churn and optimize advertising
“RapidMiner allows us to leverage Big Data, in real-time.”
-- Avi BernsteinProfessor at the University of Zurich, Department of Informatics
Drive Broadcast Revenue and Customer Retention
<5stime to generate high value activities based
on predictive analytics
Solution Process Big Data from three million TV viewers, in real-time, to make program recommendations and personalized advertising
![Page 28: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/28.jpg)
©2015 RapidMiner, Inc. All rights reserved. - 28 -
Challenge Monitor corporate performance data in real time to identify correlations, outliers, and economic drivers
“We benefit from the availability of community extensions via the RapidMiner Marketplace. We can easily search for what others have designed in RapidMiner, and use the extensions that are a fit for us.”
-- Tom GattenCEO
Track Data from Millions of Companies to Identify Critical Economic Drivers
4.5 Msubject matter experts’
content analyzed in the United Kingdom
every single day
Solution Use RapidMiner to mashup data of UK businesses, rapidly prototype predictive models & identify outlying, unusual, data
![Page 29: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/29.jpg)
Where To Go From Here?
Slide 29 © 2015 Enterprise Management Associates, Inc.
• Data lakes are an emerging data management architecture
• There are issues fully realizing value from data lakes
• Following best practice/pattern helps
![Page 30: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/30.jpg)
Join the Conversation…
Submit your questions or comments to the panel
using: @wiseanalytics @johnlmyers44 @rapidminer
#predictiveanalytics
Slide 30 © 2015 Enterprise Management Associates, Inc.
![Page 31: Taming the Beast: Extracting Value from Hadoop](https://reader031.vdocuments.net/reader031/viewer/2022030401/58cf09c01a28ab5f2b8b55e5/html5/thumbnails/31.jpg)
Q&A – Please Log Questions in the Q&A Panel
Slide 31 © 2015 Enterprise Management Associates, Inc.
• Visit RapidMiner.com to learn more about
Effortless Predictive Analytics
• Learn more about leading IT analyst firm Enterprise
Management Associates (EMA) at
enterprisemanagement.com