Why you should care about synthetic data


Upload: real-impact-analytics

Post on 24-Jan-2017


Category: Technology



TRANSCRIPT

Page 1: Why you should care about synthetic data

WHY YOU SHOULD CARE ABOUT SYNTHETIC DATA

Presented by Real Impact Analytics

Page 2: Why you should care about synthetic data

QUESTIONS? #SYNTHETICRIA

Page 3: Why you should care about synthetic data

SYNTHETIC DATA OVERVIEW

What is synthetic data?
Why use it?
How to create it?
Who creates it?
Conclusion

Page 4: Why you should care about synthetic data

WHAT IS SYNTHETIC DATA?

Page 5: Why you should care about synthetic data

WHAT IS SYNTHETIC DATA?

Generic and artificial data used to mimic real-world data sets.

Page 6: Why you should care about synthetic data

WHAT IS SYNTHETIC DATA?

Generic and artificial data used to mimic real-world data sets.

Protect people’s privacy: substitutes real data that contains personal information.

Page 7: Why you should care about synthetic data

WHAT IS SYNTHETIC DATA?

Generic and artificial data used to mimic real-world data sets.

Test robustness and accuracy: during software development.

Page 8: Why you should care about synthetic data

WHAT IS SYNTHETIC DATA?

Generic and artificial data used to mimic real-world data sets.

Create an artificial base: with features similar to those of real data sets.

Page 9: Why you should care about synthetic data

WHY USE IT?

Page 10: Why you should care about synthetic data

Use of actual data sets is no longer allowed, to protect everyone’s right to privacy.

Page 11: Why you should care about synthetic data

To develop big data tools, we need realistic data sets for testing algorithms and easy data visualization.

Page 12: Why you should care about synthetic data

Synthetic data, similar to real data sets and shareable with the public, acts as a substitute without invading anyone’s privacy.

Page 13: Why you should care about synthetic data

HOW TO CREATE IT?

Page 14: Why you should care about synthetic data

HOW TO CREATE IT?

1. DRAWING NUMBERS

OR

2. AGENT-BASED MODELLING

Page 15: Why you should care about synthetic data

HOW TO CREATE IT?

1. DRAWING NUMBERS

Observe real-world statistical distributions in the original data, then reproduce artificial bases by drawing numbers from those distributions.
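The idea of drawing numbers can be sketched in a few lines of Python. This is only an illustration, not the presenters' method: the observations below are made-up values standing in for a real data set, and `fit_empirical` and `draw_synthetic` are hypothetical helper names.

```python
import random
from collections import Counter


def fit_empirical(observations):
    """Return (values, weights): each distinct value and how often it was observed."""
    counts = Counter(observations)
    values = sorted(counts)
    weights = [counts[v] for v in values]
    return values, weights


def draw_synthetic(values, weights, n, seed=None):
    """Draw n synthetic values with probabilities proportional to the observed counts."""
    rng = random.Random(seed)
    return rng.choices(values, weights=weights, k=n)


# Made-up "real" observations (assumption): calls per customer per day.
real_calls_per_day = [0, 1, 1, 2, 2, 2, 3, 3, 5, 8]

values, weights = fit_empirical(real_calls_per_day)
synthetic = draw_synthetic(values, weights, n=1000, seed=42)
```

The synthetic sample follows the observed frequencies without copying any individual record, which is the property the slide relies on.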

Page 16: Why you should care about synthetic data

EXAMPLE: TELECOM DATA

DRAWING NUMBERS

Page 17: Why you should care about synthetic data

DRAWING NUMBERS

Observe the real temporal distributions of texts and phone calls from CDR data (call detail records).

Page 18: Why you should care about synthetic data

Create an artificial base of customers.

DRAWING NUMBERS

Page 19: Why you should care about synthetic data

Simulate texts and phone calls with time stamps following the distributions. The goal is to simulate CDRs so they follow the same distribution as real CDRs.

DRAWING NUMBERS
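The three telecom steps above (observe the temporal distribution, create customers, simulate time-stamped events) could look roughly like the sketch below. Everything specific in it is an assumption made for illustration: the hourly weights, the call/text split, the customer ID format, and the record fields are invented, not taken from real CDRs.

```python
import random
from datetime import datetime, timedelta

# Assumed hourly activity weights (index = hour of day), standing in for a
# temporal distribution that would be measured on real CDRs.
HOURLY_WEIGHTS = [1, 1, 1, 1, 1, 2, 4, 6, 8, 9, 9, 10,
                  10, 9, 8, 8, 9, 10, 10, 9, 7, 5, 3, 2]
EVENT_TYPES = ["call", "text"]
EVENT_WEIGHTS = [0.4, 0.6]  # assumed share of calls vs. texts


def make_customers(n):
    """Create an artificial base of customers (hypothetical ID format)."""
    return [f"cust_{i:05d}" for i in range(n)]


def simulate_cdrs(customers, n_events, day, seed=None):
    """Draw synthetic CDRs whose time stamps follow HOURLY_WEIGHTS."""
    rng = random.Random(seed)
    records = []
    for _ in range(n_events):
        caller, callee = rng.sample(customers, 2)  # two distinct customers
        hour = rng.choices(range(24), weights=HOURLY_WEIGHTS, k=1)[0]
        timestamp = day + timedelta(hours=hour, seconds=rng.randrange(3600))
        kind = rng.choices(EVENT_TYPES, weights=EVENT_WEIGHTS, k=1)[0]
        records.append({"timestamp": timestamp, "caller": caller,
                        "callee": callee, "type": kind})
    records.sort(key=lambda r: r["timestamp"])
    return records


customers = make_customers(100)
cdrs = simulate_cdrs(customers, n_events=5000, day=datetime(2017, 1, 24), seed=7)
```

In practice the weights would be estimated from the real CDRs, and further distributions (call duration, contacts per customer) would be drawn the same way.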

Page 20: Why you should care about synthetic data

HOW TO CREATE IT?

2. AGENT-BASED MODELLING

Create physical models that explain the observed behaviour, then use these models to generate generic, random data.

Page 21: Why you should care about synthetic data

EXAMPLE: TELECOM DATA

AGENT-BASED MODELLING

Page 22: Why you should care about synthetic data

Analyze real data from texts and phone calls, identifying temporal and behavioural patterns.

AGENT-BASED MODELLING

Page 23: Why you should care about synthetic data

Create a physical model based on those observations and evolutions over time.

AGENT-BASED MODELLING

Page 24: Why you should care about synthetic data

This model simulates texts and phone calls over time as they would occur in real life.

AGENT-BASED MODELLING
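As a rough illustration of the agent-based approach, the sketch below gives each customer simple behavioural rules and lets events emerge from running the simulation hour by hour. The rules themselves (less activity at night, a chance of replying to a received text, five contacts per customer) and all constants are assumptions chosen for the example, not patterns taken from real data.

```python
import random

REPLY_PROBABILITY = 0.5  # assumed chance of answering a received text
NIGHT_FACTOR = 0.1       # assumed activity reduction outside 08:00-22:00
CALL_SHARE = 0.4         # assumed share of calls among initiated events


class Customer:
    """One agent: a customer with a contact list and a base activity level."""

    def __init__(self, cid, activity, rng):
        self.cid = cid
        self.activity = activity      # probability of initiating an event per daytime hour
        self.contacts = []
        self.pending_reply_to = None  # sender of the last unanswered text
        self.rng = rng

    def step(self, hour):
        """Return one event (hour, caller, callee, type) or None."""
        if self.pending_reply_to is not None:
            target, self.pending_reply_to = self.pending_reply_to, None
            if self.rng.random() < REPLY_PROBABILITY:
                return (hour, self.cid, target, "text")
        daytime = 8 <= hour % 24 < 22
        p = self.activity if daytime else self.activity * NIGHT_FACTOR
        if self.contacts and self.rng.random() < p:
            callee = self.rng.choice(self.contacts)
            kind = "call" if self.rng.random() < CALL_SHARE else "text"
            return (hour, self.cid, callee, kind)
        return None


def simulate(n_customers, n_hours, seed=None):
    """Run the model and return the generated events in time order."""
    rng = random.Random(seed)
    agents = {i: Customer(i, rng.uniform(0.02, 0.3), rng) for i in range(n_customers)}
    for agent in agents.values():
        others = [i for i in agents if i != agent.cid]
        agent.contacts = rng.sample(others, k=min(5, len(others)))
    events = []
    for hour in range(n_hours):
        new_events = []
        for agent in agents.values():
            event = agent.step(hour)
            if event is not None:
                new_events.append(event)
        # A received text may trigger a reply in the following hour.
        for _, caller, callee, kind in new_events:
            if kind == "text":
                agents[callee].pending_reply_to = caller
        events.extend(new_events)
    return events


events = simulate(n_customers=50, n_hours=48, seed=1)
```

Unlike drawing numbers, the distributions here are not imposed directly: they result from the behaviour rules, so the model can be adjusted by changing how agents act.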

Page 25: Why you should care about synthetic data

WHO CREATES IT?

Page 26: Why you should care about synthetic data

WHO CREATES IT?

IN-HOUSE DEVELOPMENT

OR

AD-HOC DEVELOPMENT

DEPENDING ON THE COMPLEXITY OF THE DATA SET

Page 27: Why you should care about synthetic data

CONCLUSION

SYNTHETIC DATA

Page 28: Why you should care about synthetic data

SYNTHETIC DATA: CONCLUSION

Your ability to generate realistic synthetic data is essential to developing algorithms and software that will maximize the value of your big data tools, without transgressing privacy laws.

Page 29: Why you should care about synthetic data

[email protected]

@RIAnalytics

realimpactanalytics.com

@RealImpactAnalytics

Real Impact Analytics

About Us

Real Impact Analytics (RIA) taps into rich telecom data to capture its value. The data is turned into action with big data apps that ease our clients’ day-to-day work.

RIA provides guided and predictive analytics through proprietary software. Five of the top ten global telecom operators trust us to enhance customer experience through Customer Value Management, and optimize daily operations with our Commercial Excellence apps.

To learn how Real Impact Analytics can create the same value for you, visit realimpactanalytics.com.