storage for ai applications

23
Storage for AI Applications Live Webcast October 5, 2021 10:00 am PT / 1:00 pm ET

Upload: others

Post on 21-Jan-2022

14 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Storage for AI Applications

1

Storage for AI Applications

Live WebcastOctober 5, 202110:00 am PT / 1:00 pm ET

Page 2: Storage for AI Applications

2

Today’s Presenters

Brien PorterSolutions Lead

Intel

Tom FriendModerator

Chief EngineerIlluminosi

Craig TierneyPrincipal Product

ArchitectNVIDIA

Young PaikSenior Director

Product PlanningSamsung

Page 3: Storage for AI Applications

3

3

Page 4: Storage for AI Applications

4

Page 5: Storage for AI Applications

5

SNIA Legal Notice§ The material contained in this presentation is copyrighted by the SNIA unless otherwise

noted. § Member companies and individual members may use this material in presentations and

literature under the following conditions:§ Any slide or slides used must be reproduced in their entirety without modification§ The SNIA must be acknowledged as the source of any material used in the body of any document containing material

from these presentations.§ This presentation is a project of the SNIA.§ Neither the author nor the presenter is an attorney and nothing in this presentation is

intended to be, or should be construed as legal advice or an opinion of counsel. If you need legal advice or a legal opinion please contact your attorney.

§ The information presented herein represents the author's personal opinion and current understanding of the relevant issues involved. The author, the presenter, and the SNIA do not assume any responsibility or liability for damages arising out of any reliance on or use of this information.NO WARRANTIES, EXPRESS OR IMPLIED. USE AT YOUR OWN RISK.

Page 6: Storage for AI Applications

6

Agenda

§ Introduction to Data Science§ The Data and AI Lifecycle§Mapping Business Questions to Algorithms§Model training and data usage

§Roundtable Q&A

Page 7: Storage for AI Applications

7

Accelerated Data Science

2.2 exabytes (2.2B GB) of data created daily – McKinsey

$260B annual revenue by 2022 for big data and business analytics – IDC

Page 8: Storage for AI Applications

8

Design an Intelligent AI Data Services Platform

Storage Intelligence

Drive efficiencies for traditional architecture

Storage Optimization

Maximize performance of new data center technologies

AutomationArchitecture & workload

unifying cloud & enterprise

OrchestrationSeamless integration across

architecture, media, vendors

Thin-provisioningDe-duplicationCompression

Encryption

Data InventoryData Tiering

Optimize PerformanceAll Flash/NVM

Storage Memory TechnologyHybrid

Legacy SAS

Lower TCO

Scale Out ArchitectureHyperconverged Infrastructure

Software Defined Storage

Network Function VirtualizationSoftware Defined Infrastructure

Shift of the IO BottleneckBusiness Alignment

Modernize Transform

Page 9: Storage for AI Applications

9

Different Workloads Drive Different SolutionsHot

Warm

Cold

Cost-effective solutions are best suited for cold storage

High-performance solutions are best suited for warm and some hot tiers

AI ML/DL

Page 10: Storage for AI Applications

10

The NEED FOR Data Science

Ad PersonalizationClick Through Rate OptimizationChurn Reduction

CONSUMER INTERNET

Claim FraudCustomer Service Chatbots/RoutingRisk Evaluation

FINANCIAL SERVICESRemaining Useful Life EstimationFailure PredictionDemand Forecasting

MANUFACTURING

Detect Network/Security AnomaliesForecasting Network PerformanceNetwork Resource Optimization (SON)

TELECOM

Supply Chain & Inventory ManagementPrice Management / Markdown OptimizationPromotion Prioritization And Ad Targeting

RETAILPersonalization & Intelligent Customer InteractionsConnected Vehicle Predictive MaintenanceForecasting, Demand, & Capacity Planning

AUTOMOTIVE

Sensor Data Tag MappingAnomaly DetectionRobust Fault Prediction

OIL & GAS

Improve Clinical CareDrive Operational EfficiencySpeed Up Drug Discovery

HEALTHCARE

Page 11: Storage for AI Applications

11

The AI Lifecycle

Page 12: Storage for AI Applications

12

Understanding Enterprise Data & Analytics Workflow

Page 13: Storage for AI Applications

13

INPUTSBUSINESS

QUESTIONS AI / DL TASKEXAMPLE OUTPUTS

HEALTHCARE RETAIL FINANCE

Is “it” presentor not?

Detection Cancer Detection Targeted Ads Cybersecurity

What type of thing is “it”? Classification Image Classification Basket Analysis Credit Scoring

To what extent is “it” present? Segmentation

Tumor Size / Shape Analysis

Build 360° Customer View Credit Risk Analysis

What is the likely outcome?

Prediction Survivability Prediction

Sentiment & Behavior Recognition

Fraud Detection

What will likelysatisfy the objective? Recommendations

Therapy Recommendation

Recommendation Engine Algorithmic Trading

WHAT PROBLEM ARE YOU SOLVING?Defining the AI/DL task

Text Data

Images

Audio

Video

Page 14: Storage for AI Applications

14

AI/ML/DL Application Development

Page 15: Storage for AI Applications

15

UntrainedNeural Network

Model

AI/ML/DL Application Development

Page 16: Storage for AI Applications

16

UntrainedNeural Network

Model

Deep LearningFramework

TRAININGLearning a new capability

from existing data

AI/ML/DL Application Development

Page 17: Storage for AI Applications

17

UntrainedNeural Network

Model

Deep LearningFramework

TRAININGLearning a new capability

from existing data

AI/ML/DL Application Development

The dominant IO pattern here is re-read. The full dataset often is read 40, 50, 100 or even more times. This pattern is the same for DL and ML.

Page 18: Storage for AI Applications

18

UntrainedNeural Network

Model

Deep LearningFramework

TRAININGLearning a new capability

from existing data

Trained ModelNew Capability

DEEP LEARNING Application Development

Page 19: Storage for AI Applications

19

UntrainedNeural Network

Model

Deep LearningFramework

TRAININGLearning a new capability

from existing data

Trained ModelNew Capability

Trained ModelOptimized for Performance

AI/ML/DL Application Development

Page 20: Storage for AI Applications

20

UntrainedNeural Network

Model

Deep LearningFramework

TRAININGLearning a new capability

from existing data

Trained ModelNew Capability

App or ServiceFeaturing Capability

INFERENCEApplying this capability

to new data

Trained ModelOptimized for Performance

AI/ML/DL Application Development

Inference can be onllne, like a query from your phone. Or it can be offline where data processed come from existing storage. Offline inference is common for applications where you need to reanalyze data with an improve model, for example, reprocessing image data to find objects of interest.

This method of inference can generate extreme amounts of IO and in this case, data are not reused.

Page 21: Storage for AI Applications

21

Click to edit Master title style

Roundtable Q&A

Page 22: Storage for AI Applications

22

After this Webcast

§Please rate this webcast and provide us with your feedback§This webcast and a copy of the slides will be available at the SNIA

Educational Library https://www.snia.org/educational-library§A Q&A from this webcast, including answers to questions we couldn’t

get to today, will be posted on our blog at https://sniansfblog.org/§Follow us on Twitter @SNIANSF

Page 23: Storage for AI Applications

23 | ©2021 Storage Networking Association. All Rights Reserved.

Thank You