©talend2014€¦ · data payment mgmt svc credit check fraud detec- tion access service interest...
TRANSCRIPT
© Talend 2014
© Talend 2014
Accelerating you journey to a Data Driven Business Deliver all data where, when, and how the business needs it.
Gavin Targonski Director, Product Management Data Integra@on & Big Data Talend [email protected]
© Talend 2014
Agenda
➜ Businesses Need Data ➜ What is Big Data? ➜ Big Data Challenges ➜ Becoming a Data-‐Driven Organiza;on ➜ How Can Talend Help? ➜ Conclusions
© Talend 2014
Big Data Opportunities
US Healthcare
• $300 billion value per year • ~0.7 percent annual
produc@vity growth
European public sector
administra@on
• €250 billion value per year • ~0.5 percent annual
produc@vity growth
Personal loca@on data
• $100 billion+ revenue for service providers
• Up to $700 billion value to end users
US retail
• 60+% increase in net margin possible
• 0.5 – 1.0 percent annual produc@vity growth
Manufacturing
• Up to 50 percent decrease in product development
• Up to 7 percent reduc@on in working capital
© Talend 2014
What is Big Data?
Big Data is the frontier of a firm’s ability
to store, process, and access all of the data it needs to operate, make decisions,
reduce risks, and serve customers.
Mike Gualtieri, Forrester Research
© Talend 2014
The Three Something or Others
➜ Volume ➜ Velocity ➜ Variety
➜ Speed -‐ real @me data
➜ Scale -‐ unprecedented processing power ➜ Sources -‐ new kinds of data from sensors & social media
➜ Three 4mes the data at a third of the cost
Something or Other -‐ “something whose exact nature you do not know or have forgoYen”
© Talend 2014
Big Data Challenges
© Talend 2014
59%
Do not have an effec4ve informa4on strategy in place *
Challenges Facing IT Departments
Cannot consistently deliver the right informa@on at the right 4me **
98%
Consider data quality and the management of unstructured data to be a challenge to exis@ng prac@ces and tools *
48%
Have insufficient in-‐house exper4se to embrace new big data technologies *
52%
* Source: Talend Big Data Survey, Oct 2012 [231 respondents] ** Source: HP / Coleman Parkes Research Survey, 2011
© Talend 2014
Where Most Companies Are With Big Data
3 -‐ Governance and Security
4 -‐ Skills shortages
2 -‐ Infrastructure and architecture
5 -‐ Costs and jus4fying investment
1 -‐ What data sources do I integrate?
Big Data Challenges
© Talend 2014
Becoming a Data-‐Driven Organiza4on
© Talend 2014
Data-Driven Landscape
Accidental Architecture
Data Quality
Latency & Velocity
Scalability
Master Data Consistency
Talent / Skills
Siloed Data
© Talend 2014
Businesses gain Agility & Reuse
Internet Partner
Credit Data Back-end System
Back-end System
Customer Data
Payment Mgmt Svc
Credit Check
Fraud Detec-
tion
Access Service
Interest Calc
Balance Check
Customer Data
Services
Trade Execution
Services Components
Easily Deploy and Manage Data Services
Component-‐Oriented Architecture
Component Oriented Architecture
Reusable Services
Data Integra4on
Enterprise Service
Bus
Access Any Data, Anywhere
Virtualized Access to Heterogeneous Data Stores
© Talend 2014
But Can I Trust My Data?
➜ Governance ➜ Quality ➜ Seman@cs ➜ Sensor data ➜ Social media ➜ does LOL stand for "lots of love" or "laugh out loud"?
© Talend 2014
How can Talend help?
© Talend 2014
The Talend Platform
© Talend 2014
JAVA
ETL Day-‐to-‐day integra4on
Run everywhere
SQL
ELT DW
appliance
Teradata, Netezza…
MapReduce
Hadoop Highly scalable
Hadoop Grid
CAMEL
CAMEL Message transform-‐
a4on
High Frequency
Code Generator
Talend Key Architectural Differences ➜ No black-‐box engine ➜ Enables light-‐weight
distributed, customizable and parallelizable run @me
➜ Standards-‐based
16
© Talend 2014
APPLICAT
IONS
DATA
SYSTEMS
DATA
SOURC
ES
Tradi@onal Sources (RDBMS, OLTP, OLAP,
Mainframes)
TRADITIONAL REPOS
RDBMS EDW MPP
Talend and the Modern Data Architecture
New Sources (web logs, email, sensors, social media)
HADOOP
Applica@ons
Dashboards Analy@cs Suites
ETL OFFLOAD
© Talend 2014
Tap, Transform, Deliver
© Talend 2014
Data at Rest, Data in Motion, All Data
Tap data • Total data management • Big data, small data, structured, unstructured, new and exis@ng
Transform data
• Offers pre-‐emp@ve issue resolu@on & stronger customer rela@onships
• Cleanse, de-‐duplicate, enrich, validate
Deliver data where it’s needed • Real-‐@me, batch, through an API • Secure, governed and metered across your organiza@on
© Talend 2014
Large Global Retailer Opportunity to
increase revenue by crea@ng a holis@c
shopping experience across store, mobile,
web. Grow e-‐commerce revenue like stores Talend integrates (ETL/ELT) into
Hadoop feeding data into Netezza and
Teradata
Global Telecommunica4ons
equipment manufacturer
Needed a solu@on to move data from Oracle ERP to
Hadoop and merge with engineering and social media data to develop KPI's around
mobile device business.
Global Financial Services
organiza4on Marke@ng ini@a@ve combines social media with
tradi@onal data sources
Exploits Big Data for ETL.
Talend reduced the skills needed for Hadoop and MapReduce
Large Mul4-‐Na4onal specialist retailer
Growing ecommerce business and
improving store performance through beYer decision making
(analy@cs). Talend integrates Hadoop, Apache Tomcat Grid and Teradata EDW
Big Data Success Stories
© Talend 2014
Moving out of the Sand Box
Sandbox Analy@cs Real Time Opera@ons
Value
Break Even Point Planning Stage
Project is Jus@fied
Real Time Opera@ons Begin
Business Is Transformed
Talend Open Studio for Big Data
Talend Pla_orm
for Big Data
© Talend 2014
The Value of Talend for Big Data
Reducing the skills shortage • 800+ connectors to all data sources • 100% standards-‐based • Large community
Big Data Ready
• Na@ve support for Hadoop, MapReduce, and NoSQL • Built-‐in data quality and governance • Easy to use tools
Predictable licensing • a predictable and scalable subscrip@on model • based only on users (not CPUs or connectors)
$
1 -‐ What data sources do I integrate?
2 -‐ Infrastructure and architecture
3 -‐ Governance and Security
4 -‐ Skills shortages
5 -‐ Costs and jus4fying investment
© Talend 2014
5.5GB 6.1GB
Big Data Jumpstart Sandbox
• Talend Jumpstart Sandbox -‐ virtual image installed with: • Apache Hadoop distribu*ons provided Hortonworks or Cloudera • Pre-‐configured Talend PlaGorm for Big Data* • Three analysis scenarios for you to try:
– Clickstream data – Twi@er sen*ment – Apache weblogs
• Demonstra*ons of several NoSQL databases
www.talend.com/products/pla_orm-‐for-‐big-‐data
*Includes Talend Studio (graphical IDE), team working, management, data quality and advanced big data features.
© Talend 2014
Conclusions ➜ It’s a cri@cal make-‐or-‐break year as big data projects move from the sand-‐box to deployment.
➜ Talend can help you gain ac@onable insight: • Na4ve • Open • Predictable • Unified
➜ Achieve Instant Value From All Your Data • Tap, Transform, Deliver
© Talend 2014
Thank You!
e: [email protected] hfp://www.talend.com/download/big-‐data
Download Talend Sandbox for Big Data www.talend.com/download