graphalytics: a big data benchmark for graph-processing platforms › fileadmin › user_upload ›...

77
Graphalytics: A Big Data Benchmark for Graph-Processing Platforms Mihai Capotă , Tim Hegeman, Alexandru Iosup, Arnau Prat-Pérez, Orri Erling, Peter Boncz Delft University of Technology Universitat Politècnica de Catalunya OpenLink Software CWI

Upload: others

Post on 08-Jun-2020

12 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

Graphalytics:A Big Data Benchmark for Graph-Processing Platforms

Mihai Capotă, Tim Hegeman, Alexandru Iosup,Arnau Prat-Pérez, Orri Erling, Peter Boncz

Delft University of TechnologyUniversitat Politècnica de CatalunyaOpenLink SoftwareCWI

Page 2: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

2

The data deluge: large-scale graphs

Tens of billions of edges

Page 3: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

3

Graph processing

The data deluge: large-scale graphs

Tens of billions of edges

Page 4: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

4

Platform diversity

GraphX

Page 5: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

5

Yet another benchmark?

Page 6: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

6

Yet another benchmark?

● Lack of benchmarks for genericgraph processing platforms

Page 7: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

7

Yet another benchmark?

● Lack of benchmarks for genericgraph processing platforms

● Graph500– BFS

– Kroneker graph

Page 8: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

8

Yet another benchmark?

● Lack of benchmarks for genericgraph processing platforms

● Graph500– BFS

– Kroneker graph

● Several academic studies

– Specific to graph or RDF databases– Ad hoc setup, difficult to extend

Page 9: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

9

P-A-D triangle

AAlgorithm

PPlatform

DDataset

Page 10: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

10

P-A-D triangle

AAlgorithm

PPlatform

DDataset

Performance is enabled,portability is disabled

Page 11: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

11

P-A-D triangle

AAlgorithm

PPlatform

DDataset

Performance is enabled,portability is disabled

Algorithms for differentdata types and graphs

Page 12: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

12

P-A-D triangle

AAlgorithm

PPlatform

DDataset

Performance is enabled,portability is disabled

Algorithms for differentdata types and graphs

No systematic findings yet

Page 13: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

13

P-A-D triangle

AAlgorithm

PPlatform

DDataset

Performance is enabled,portability is disabled

Algorithms for differentdata types and graphs

No systematic findings yet

Deployment

Page 14: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

14

Graphalytics

Page 15: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

15

Graphalytics

● The first comprehensive benchmark forbig data graph-processing platforms

Page 16: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

16

Graphalytics

● The first comprehensive benchmark forbig data graph-processing platforms– Advanced benchmark harness

Page 17: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

17

Graphalytics

● The first comprehensive benchmark forbig data graph-processing platforms– Advanced benchmark harness

– Choke-point analysis

Page 18: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

18

Graphalytics

● The first comprehensive benchmark forbig data graph-processing platforms– Advanced benchmark harness

– Choke-point analysis

– Realistic graph generator

Page 19: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

19

Graphalytics

● The first comprehensive benchmark forbig data graph-processing platforms– Advanced benchmark harness

– Choke-point analysis

– Realistic graph generator

Page 20: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

20

Graphalytics

● The first comprehensive benchmark forbig data graph-processing platforms– Advanced benchmark harness

– Choke-point analysis

– Realistic graph generator

● Co-sponsored by Oracle

Page 21: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

21

Choke points

Page 22: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

22

Choke points

● Technological challenges for platforms

Chokepoint

Big datasystem

Timelyresults

Page 23: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

23

Choke points

● Technological challenges for platforms● Identified by experts

Chokepoint

Big datasystem

Timelyresults

Page 24: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

24

Choke points

● Technological challenges for platforms● Identified by experts● Real-world scenarios are enhanced to

stress choke points– Prevent tunnel vision

– Advance state of the art Chokepoint

Big datasystem

Timelyresults

Page 25: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

25

Choke points

● Technological challenges for platforms● Identified by experts● Real-world scenarios are enhanced to

stress choke points– Prevent tunnel vision

– Advance state of the art

Erling et al., The LDBC Social Network Benchmark:Interactive Workload , SIGMOD 2015

Chokepoint

Big datasystem

Timelyresults

Page 26: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

26

Examples of choke points

Page 27: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

27

Examples of choke points

● Network utilization

Page 28: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

28

Examples of choke points

● Network utilization● Memory footprint

Page 29: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

29

Examples of choke points

● Network utilization● Memory footprint● Access locality

Page 30: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

30

Examples of choke points

● Network utilization● Memory footprint● Access locality● Skewed execution

Page 31: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

31

Examples of choke points

● Network utilization● Memory footprint● Access locality● Skewed execution● CPU?

Page 32: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

32

Examples of choke points

● Network utilization● Memory footprint● Access locality● Skewed execution● CPU?

Ousterhout et al., “Making Sense ofPerformance in Data Analytics Frameworks”, NSDI 2015

Page 33: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

33

Examples of choke points

● Network utilization● Memory footprint● Access locality● Skewed execution● CPU?● Others?

Ousterhout et al., “Making Sense ofPerformance in Data Analytics Frameworks”, NSDI 2015

Page 34: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

34

Realistic graph generator

Page 35: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

35

Realistic graph generator

● LDBC Datagen– Synthetic social network similar to Facebook

Page 36: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

36

Realistic graph generator

● LDBC Datagen– Synthetic social network similar to Facebook

● Graphalytics enhancements– Multiple degree distributions

● Zeta and geometric implemented

Page 37: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

37

Realistic graph generator

● LDBC Datagen– Synthetic social network similar to Facebook

● Graphalytics enhancements– Multiple degree distributions

● Zeta and geometric implemented

– Other structural characteristics● Clustering coefficient● Assortativity

Page 38: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

38

Realistic graph generator

● LDBC Datagen– Synthetic social network similar to Facebook

● Graphalytics enhancements– Multiple degree distributions

● Zeta and geometric implemented

– Other structural characteristics● Clustering coefficient● Assortativity

Erling et al., The LDBC Social Network Benchmark:Interactive Workload , SIGMOD 2015

Page 39: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

39

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 40: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

40

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 41: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

41

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 42: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

42

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 43: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

43

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 44: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

44

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 45: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

45

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 46: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

46

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 47: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

47

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 48: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

48

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 49: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

49

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 50: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

50

Advanced benchmark harness

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 51: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

52

Supported algorithms

ID Algorithm Class Use (%)BFS Breadth-first search Traversal 46STATS Local clustering

coefficientStatistics 16

CONN Weakly connected components

Connected components

13

CD Label propagation Community detection

5

EVO Forest fire evolution Evolution 4

Guo et al., How Well Do Graph-Processing Platforms Perform?An Empirical Performance Evaluation and Analysis, IPDPS 2014

Page 52: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

53

Experimental setup

● DAS-4 cluster– Typical big data setup

– 11 nodes, 24 GiB RAM, 2 x 8-core Xeon E5620

– 1 Gbit/s Ethernet

● Single machine– HPC-like setup

– 192 GiB RAM, 2 x 8-core Xeon E5-2450 v2

Page 53: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

54

Runtime

Page 54: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

55

Runtime

Page 55: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

56

Runtime

Page 56: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

57

Runtime

Page 57: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

58

Runtime

Page 58: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

59

Runtime

2 orders of magnitudedifference

Page 59: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

60

Runtime

Neo4j 2x faster than MR

Page 60: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

61

Runtime

MR 2x faster than Neo4j

Page 61: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

62

Runtime

Neo4j fails

Page 62: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

63

Runtime

Page 63: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

64

Edge-normalized performance

Page 64: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

65

Edge-normalized performance

Number of edges in graph / runtime

Page 65: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

66

Edge-normalized performance

20 x differencebecause of dataset structure

Page 66: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

67

Edge-normalized performance

Also for other platforms

Page 67: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

68

Edge-normalized performance

Page 68: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

69

See paper for more results

Page 69: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

70

Datagen scalability

Page 70: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

71

Conclusion

Page 71: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

72

Conclusion

● Use Graphalytics to:– Compare

– Tune

– Re-design

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 72: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

73

Conclusion

● Use Graphalytics to:– Compare

– Tune

– Re-design

● Open source (Apache License 2.0)– Contribute an implementation for your platform

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 73: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

74

Conclusion

● Use Graphalytics to:– Compare

– Tune

– Re-design

● Open source (Apache License 2.0)– Contribute an implementation for your platform

● SPEC RG standardization coming

BenchmarkCore

ReportGenerator

Platform-specificalgorithm

implementation

OutputValidator

SystemMonitor

DatasetGenerator

Datasets

UserResults

Graph processingplatform

Configuration

Page 74: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

75

Performance evaluation – Granular

Page 75: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

76

Deploying Granular

Page 76: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

77

Granular results

Page 77: Graphalytics: A Big Data Benchmark for Graph-Processing Platforms › fileadmin › user_upload › ... · 2015-06-01 · Graphalytics: A Big Data Benchmark for Graph-Processing Platforms

78

Thank you!

[email protected]

github.com/tudelft-atlarge/graphalytics