analytic excellence - saying goodbye to old constraints
DESCRIPTION
The Briefing Room with Dr. Robin Bloor and Actian Live Webcast August 6, 2013 http://www.insideanalysis.com With all the innovations in compute power these days, one of the hardest hurdles to overcome is the tendency to think in old ways. By and large, the processing constraints of yesterday no longer apply. The new constraints revolve around the strategic management of data, and the effective use of business analytics. How can your organization take the helm in this new era of analysis? Register for this episode of The Briefing Room to find out! Veteran Analyst Wayne Eckerson of The BI Leadership Forum, will explain how a handful of key innovations has significantly changed the game for data processing and analytics. He'll be briefed by John Santaferraro of Actian, who will tout his company's unique position in "scale-up and scale-out" for analyzing data.TRANSCRIPT
The Briefing Room
Analytic Excellence: Saying Goodbye to Old Constraints
Twitter Tag: #briefr
The Briefing Room
! Reveal the essential characteristics of enterprise software, good and bad
! Provide a forum for detailed analysis of today’s innovative technologies
! Give vendors a chance to explain their product to savvy analysts
! Allow audience members to pose serious questions... and get answers!
Mission
Twitter Tag: #briefr
The Briefing Room
Topics
This Month: ANALYTIC PLATFORMS
September: ANALYTICS
October: DATA PROCESSING
Twitter Tag: #briefr
The Briefing Room
Analytic Platforms
Analytic Excellence: Saying Goodbye to Old Constraints
Twitter Tag: #briefr
The Briefing Room
Analyst: Wayne Eckerson
Wayne Eckerson has been a thought leader in the data warehousing, business intelligence and performance management fields since 1995. He has conducted numerous in-depth research studies and is the author of the best-selling book “Performance Dashboards: Measuring, Monitoring, and Managing Your Business.” He is a noted keynote speaker and blogger, and he consults and conducts workshops on business analytics, performance dashboards, and business intelligence, among other topics. For many years, Wayne served as director of education and research at The Data Warehousing Institute (TDWI).Wayne is also a principal consultant at BI Leader Consulting and a founder of BI Leadership Forum. Wayne is also director of research at TechTarget. He can be reached at [email protected].
Twitter Tag: #briefr
The Briefing Room
! Actian is a database and software development company that has recently made some tactical acquisitions
! Actian offers the ParAccel Big Data Analytics Platform, which includes an analytic database, an extensibility framework, on-demand integration and integrated analytics
! The database engine behind the Platform is fully columnar and leverages a shared-nothing architecture
Actian
Twitter Tag: #briefr
The Briefing Room
Guest: John Santaferraro
John Santaferraro is the Vice President of Product Marketing at Actian. Prior to joining Actian, Santaferraro was an independent industry analyst in the business intelligence and analytics market. Before that he developed and executed a vertical market strategy for Hewlett Packard's BI group, focusing on energy, communications, retail, healthcare and financial services; he was also instrumental in helping establish HP’s new BI business group with a combination of solutions, products and consulting. In 2000, John founded a marketing and sales consulting company, Ferraro Consulting, providing business acceleration strategy for technology companies.
Unconstrained Analytics ParAccel Big Data Analytics Platform
John Santaferraro
Vice President of Marketing, ParAccel Platform Group
August 5, 2013
Agenda
§ The Age of Data § Unconstrained Analytics § ParAccel Big Data Analytic Platform
10 Confidential © 2013 Actian Corporation
11
The Age of Data - Where Data and Algorithms Collide
Confidential © 2013 Actian Corporation
Free Your Analysts
12 Confidential © 2012 Actian Corporation
Unconstrained Analytics Load and Go
Run Ad Hoc Queries
Query Any Time
Query Any Data
Query All Data
Run Any Analytics
Execute Sophisticated Analytics
Return Results Quickly
Iterate Quickly Through Discovery
Share Workloads With Any Platform
Support All Analysts
Run Many Applications
Create Analytic Services
Support Entire Ecosystems of Analytics
13 Confidential. Do not Distribute.
Customer Churn
Support cost
analytics
Campaign ROI
Profitability
Customer value
Cross-sell
Customer retention
Customer experience
Profiling
Market trends
Up-Sell
Average order size
Segmentation
Channel management
Churn models
Decision management
Social media
analytics
Partner management Customer
satisfaction Communication management
Unconstrained Customer Analytics
Support the Entire Analytic Process
14 Confidential © 2013 Actian Corporation
Connect Prepare Analyze
Optimize
DATA VALUE
ParAccel Big Data Analytics Platform
15 Confidential © 2013 Actian Corporation
› On-demand integration
› In-advance integration
› In-flight preparation
› In-Hadoop preparation
› In-database preparation
› Database optimizations
› Hardware optimizations
› Dataflow optimizations
› In-database analytics
› HDFS-based analytics
Business Intelligence
Analytics
Enterprise
Social
New Data
Applications DW
www Mobile Machine
Data
High-Performance BI
High-Performance Analytics
Connect Prepare Analyze
Optimize
DATA VALUE
The highest performing analytic platform with the most comprehensive analytic capabilities for the full big data landscape
from point applications to enterprise-scale deployments.
Highest Performing Analytic Platform
In-Memory Option: Lock all data and processes to run in-memory
I/O Optimization Intelligent Prefetch, Intelligent Caching of Data In-Memory
Communication Optimization Packet delivery optimized for analytics, low overhead, plus Virtual Hotwire
Parallel Processing Each node processes, pipelines, and leverages both columnar & compression
Workload Management Establish query classes for long, short, and interactive queries
Compiled Queries Queries compiled to run within the database on each individual node
In-Database Analytics Store and run SQL, aggregate, and analytic functions in the database application
Execution Optimization Final optimization based on resources available
Planning Optimization Choose the best from up to competing plans based on costing model
SQL Optimization Extreme SQL Support with breakdown into 2000 segments, MPP and data-aware
Parallel Loading 1TB per node, per hour, up to 100 nodes, without complex data preparation
16 Confidential. Do not Distribute.
Most Cooperative Analytic Processing
Business Intelligence and Repor3ng Tools
Advanced Analy3cs
Analy3c Applica3ons
Machine Data
Opera3onal Data
3rd Party Info
Provider
Streaming Data Logs
ParAccel PlaBorm
On Demand Integra3on
Enterprise Data Warehouse
Hadoop
Big Data Apps
Embedded Analy3cs
Big Data Bottlenecks
18 Confidential © 2012 Actian Corporation
Prepare profile match cleanse
aggregate audit
Analyze sample model
discover visualize predict
Consume report chart
dashboard alert act
Connect monitor
log ingest
event capture decrypt
ParAccel Dataflow Parallelizes Everything
19 Confidential © 2012 Actian Corporation
Prepare profile match cleanse
aggregate audit
Analyze sample model
discover visualize predict
Consume report chart
dashboard alert act
Connect monitor
log ingest
event capture decrypt
ParAccel Dataflow Parallelizes Everything
20 Confidential © 2012 Actian Corporation
Prepare profile match cleanse
aggregate audit
Analyze sample model
discover visualize predict
Consume report chart
dashboard alert act
Connect monitor
log ingest
event capture decrypt
File Support Windows
Linux HDFS
Amazon S3
ParAccel Dataflow Engine and Hadoop Analytics
- a framework that allows you to develop ETL and analytics
visually on Hadoop (or other file systems) but requires no knowledge of MapReduce
or parallel programming to use
21 Confidential © 2013 Actian Corporation
Operator Library
22
Using ParAccel Dataflow Engine
23
What Does ParAccel Platform Mean to You?
24 Confidential © 2013 Actian Corporation
Differentiation • Highest Performing
Analytic Platform
• Most Comprehensive Analytic Capabilities
• Most Cooperative Analytic Processing
• Most Flexible
Deployment Options
Unique Value u Best Price
Performance
u Quickest Time to Analytic Value
u Quickest Time to Analytic Change
u Best Leverage of Existing Investments
u
Envisioning Unconstrained Analytics
§ What are the immediate, pending, and “no constraints” opportunities for analytics?
25 Confidential. Do not Distribute.
Immediate Needs
Weekly Market Basket Analysis
Pending Needs
Daily Market Basket Analysis
No Constraints
On-Demand Market Basket Analysis Demand signaling
Envisioning Unconstrained Analytics
§ What are the immediate, pending, and “no constraints” opportunities for data expansion?
26 Confidential. Do not Distribute.
Immediate Needs
Point of Sale + Loyalty + Credit + Pyschographic 2 Years Data
Pending Needs
Partner Data 6 Years Data Archived, Accessible
No Constraints
Social Media Data
Envisioning Unconstrained Analytics
§ What are the immediate, pending, and “no constraints” opportunities for analyst communities?
27 Confidential. Do not Distribute.
Immediate Needs
Business Analysts Pending Needs
Store Managers No Constraints
Suppliers
Questions and Answers
§ What are the immediate, pending, and “no constraints” opportunities for analytics?
28 Confidential. Do not Distribute.
Immediate Needs Pending Needs
No Constraints
Twitter Tag: #briefr
The Briefing Room
Perceptions & Questions
Analyst: Wayne Eckerson
1. General purpose RDBMS 2. Analy=cal pla?orms 3. Hadoop
Big data plaBorms – Three genera3ons
30 www.bileader.com
2nd genera3on: Analy3cal plaBorms
1010data Aster Data (Teradata) Calpont Datallegro (MicrosoJ) Exasol Greenplum (EMC) IBM SmartAnaly=cs Infobright Kogni=o Netezza (IBM) Oracle Exadata Paraccel Pervasive Sand Technology SAP HANA Sybase IQ (SAP) Teradata Ver=ca (HP)
• Purpose-‐built RDBMS • Quicker to deploy • Faster and more scalable • Lower cost per TB • Built-‐in analy3cs Deployment Op3ons -‐ SoJware only (Paraccel, Ver=ca) -‐ Appliance (HANA, Exadata, Netezza) -‐ Cloud (1010data, Kogni=o, HANA)
www.bileader.com
Mapping big data plaBorms
32
Structured à Semi-‐Structured à Unstructured
Hadoop
Analy=c Pla?orm (RDBMS)
General Purpose RDBMS
Low Latency Summarized Data à
High cost per TB
High Latency ß Detailed Data
Low cost per TB
Adapted with permission from Hortonworks www.bileader.com
Analy3cal Ecosystem
Machine Data
Web Data Hadoop Cluster
Power User
BI Server
Casual User
Operational System
Operational System
Upload & query
Query Free-‐standing Analy3cal sandbox
Logical or
Physical Data Mart
Data Warehouse
Virtual Sandboxes Top-‐down BI Bo+om-‐up BI
External Data
Alerts
Audio/video Data
Streaming/ CEP Engine
Visual discovery
tools
Event-‐driven messaging
Classic BI KEY: New Stuff
ODS
ETL
Interac=ve dashboards
• What are ParAccel’s key differen=ators in a crowded market?
• What types of workloads does ParAccel handle best?
• How does ParAccel integrate with Hadoop? • Who are ParAccel’s key partners? • How does ParAccel fit into the Ac=an database por?olio?
Ques3ons
34 www.bileadership.com
Twitter Tag: #briefr
The Briefing Room
Twitter Tag: #briefr
The Briefing Room
August: ANALYTIC PLATFORMS
September: ANALYTICS
October: DATA PROCESSING
Upcoming Topics
www.insideanalysis.com
Twitter Tag: #briefr
The Briefing Room
Thank You for Your
Attention PHOTO CREDIT: Photo on Slide 5 borrowed from the internet: http://www.dcmemorials.com/