unlocking data science in the enterprise - with oracle and cloudera
Post on 21-Jan-2018
175 Views
Preview:
TRANSCRIPT
1© Cloudera, Inc. All rights reserved.
Unlocking data science in the enterprise withCloudera Data Science Workbench for Oracle Big Data
Jochen Faltermeier | Partner Manager, Central EMEA
Balazs Gaspar | Sales Engineer, Central EMEA / CEE
2CONFIDENTIAL. INTERNAL. ©Cloudera
We believe data can make what is impossible
today, possible tomorrow
3CONFIDENTIAL. INTERNAL. ©Cloudera 3© Cloudera, Inc. All rights reserved.
Cloudera at-a-glance
Customer successLarge enterprises fueling growth
48% 140%+customer growth net expansion
Last 4 years Global 8000 customers
Expansion driven by data and new
use cases
Open partner networkBest of breed solutions
3000+partners
Vast ecosystem of solution &
service providers
First to marketOpen source innovation
2008founded
1600+Clouderans
Global team doing business in 28 countries
Big data innovators from Google,
Yahoo and Oracle
4© Cloudera, Inc. All rights reserved.
Teaming strengths
• Executive sponsorship
• Install base: nearly 500 customers worldwide
• Complementary data management platform
• Simplify decision, total cost of ownership & time
to market
• Architecture and outcome led capabilities
• Customer support interlock
Innovation strengths
• Full stack of platform capabilities
(EDW/EDL/OCS)
• On-premise, hybrid and cloud deployment
options
• Tools for LOB, analysts, data scientists, IT
• Very high performance and data management
and analytics capabilities (EDH+ORAAH)
• Product development and integration across
BDA/BDCS/Public Cloud offerings
Partnership strengths
5© Cloudera, Inc. All rights reserved.
Cost of compute
Data volume
Time
MachineLearning
NOMachineLearning
1950s 1960s 1970s 1980s 1990s 2000s 2010s
Age of machine learning
6© Cloudera, Inc. All rights reserved.
PATTERN
RECOGNITIO
N
ANOMALY
DETECTIO
N
PREDICTION
SELF-SERVICE
INTELLIGENCE
SECURE
REPORTING
REAL-TIME
ANALYTICS
MACHINE LEARNING ANALYTICS
Enterprise-proven machine learning and analytics
700+CUSTOMERS RUN
ON
750+CUSTOMERS RUN
ON
7© Cloudera, Inc. All rights reserved.
The data-driven enterprise
Explosion of data and devices
(IoT)
30Bconnected
devices
440x more data
Transformation of IT infrastructure
open source
cloud
machine learning
$200Btotal
market1
1 IDC Worldwide Big Data and Business Analytics Market Through 2020
8© Cloudera, Inc. All rights reserved.
Data science / machine learning workflowFaster from data to exploration to action in a single platform
Data engineering Data science (Exploratory) Production (Operational)
Data wrangling
Visualization and analysis
Model training & testing
Productiondata pipelines Batch scoring
Online scoringServing
Data GovernanceGovernance
Processing
AcquisitionReports,
dashboards
9© Cloudera, Inc. All rights reserved.
Good news
Data Engineering Data science (Exploratory) Production (Operational)
Data wrangling
Visualization and analysis
Model training & testing
Productiondata pipelines Batch scoring
Online scoringServing
Data GovernanceGovernance
Processing
AcquisitionReports,
dashboards
Data has never been more plentiful
Open source data science and machine learning libraries are rapidly evolving
Commodity (and on-demand) compute makes scalable production machine learning affordable
10© Cloudera, Inc. All rights reserved.
Bad news
Data engineering Data science (Exploratory) Production (Operational)
Data wrangling
Visualization and analysis
Model training & testing
Productiondata pipelines Batch scoring
Online scoringServing
Data GovernanceGovernance
Processing
AcquisitionReports,
dashboards
Data needs to move across multiple different systems
Teams have different, conflicting requests for languages & libraries
Most data science done at small scale, individually, and is difficult to replicate
Very few models reach production
11© Cloudera, Inc. All rights reserved.
Access Scale Developer experience
Additional challenges
12© Cloudera, Inc. All rights reserved.
Our goal is to enable data science and machine learning at scale
13© Cloudera, Inc. All rights reserved.
Open data science in the enterprise
ITdrive adoption while maintaining compliance
Data Scientistexplore, experiment, iterate
14© Cloudera, Inc. All rights reserved.
Our goal: an open platform for data science at scale
Help more data scientistsuse the power of Hadoop
Use a powerful, familiar environment with direct access to
Hadoop data and compute
Data scientistData engineer
Make it easy and secure to add new users, use cases
Offer secure self-service analytics and a faster path to production on common, affordable infrastructure
Enterprise architectHadoop admin
15© Cloudera, Inc. All rights reserved.
Introducing Cloudera Data Science WorkbenchSelf-service data science for the enterprise
Accelerates data science from development to production with:
• Secure self-service environments for data scientists to work against Cloudera clusters
• Support for Python, R, and Scala, plus project dependency isolation for multiple library versions
• Workflow automation, version control, collaboration and sharing
16© Cloudera, Inc. All rights reserved.
Demo
17© Cloudera, Inc. All rights reserved.
With Cloudera Data Science Workbench…
Data scientists can:• Use R, Python, or Scala from a web
browser, with no desktop footprint
• Install any library or framework within isolated project environments
• Directly access data in secure clusters with Spark and Impala
• Share insights with their team for reproducible, collaborative research
• Automate and monitor data pipelines using built-in job scheduling
IT can:• Give their data science team the freedom
to work how they want, when they want
• Stay compliant with out-of-the-box support for full platform security, especially Kerberos
• Run on-premises or in the cloud, wherever data is managed
19CONFIDENTIAL. INTERNAL. ©Cloudera
Customer Data Center
Customer Managed
19
Big Data Appliance
Customer Data Center
Oracle Managed
Oracle Cloud
Oracle Managed
BDA Cloud Service
On-Premises Cloud @ Customer Public Cloud
Big Data Cloud to
Customer
Portfolio and Product Alignmentpowered by Cloudera
20CONFIDENTIAL. INTERNAL. ©Cloudera
Why Oracle and Cloudera?
Oracle Exadata
EDW
Relational/Transactional/Data Mining
Oracle Data Integrator/
Golden Gate<-------------->
Oracle Big Data Appliance
DATA LAKEHadoop/NoSQLSocial/Web/IoT
Oracle Big Data SQL
VALUE DRIVERS
• TTM, Data drive decisions
• Reduces Cost
• Reduce Risk
• TCO
TECHNICAL VALUE
• Query more data with BD SQL; a tool you
know and have invested in
• Easily integrate more data with existing app’s
• Secure, integrated, scalable data platform
USE CASES
• Customer 360
• Digital Transformation/Instrument your
business
• Secure your business (Cyber/Fraud)
Oracle Analytics Cloud & BDCS(Visualization, Automatic, BI, Analytics, Discovery,
Preparation)
21© Cloudera, Inc. All rights reserved.
Thank you
Watch the webinar series:
go.cloudera.com/cdsw-webinar-emea
top related