webinar: mongodb and polyglot persistence architecture

Polyglot Persistence

{ Name: ‘Bryan Reinero’,

Title: ‘Developer Advocate’,

Twitter: ‘@blimpyacht’,

Email: ‘bryan@mongdb.com’ }

What is the Polyglots?

• Using multiple Database Technologies in a Given Application

• Using the right tool for the right job

What is the Polyglots?

• Using multiple Database Technologies in a Given Application

• Using the right tool for the right job

Derived from “polyglot programming”. Applications programmed from a mix of languages.

Why is the Polyglots?

• Relational has been the dominant model• Higher performance requirements• Increasingly large datasets• Use of IaaS and commodity hardware

Vertical Scaling

Horizontal Scaling

Availability

http://avstop.com/ac/flighttrainghandbook/imagel4b.jpg

Availability

Requirements• Maximize uptime• Minimize time to recover

Availability

Requirements• Maximize uptime• Minimize time to recover

Hardware failures

Network partitions

Data center failures

Maintenance Operations

Availability

Business critical systems require automatic fault detection and fail over

Variant Data Models

Key-Value Store

Eratosthenes

Democritus

Hypatia

Euripides

ID Name

Variant Data Models

Eratosthenes

Democritus

Hypatia

Euripides

Graph Databases

Variant Data Models

Document Databases{

maker : ”Agusta",type : sportbike,rake : 7,trail : 3.93,engine : {

type : "internal combustion",layout : "inline"cylinders : 4,displacement : 750,

},transmission : {

type : "cassette",speeds : 6,pattern : "sequential”,ratios : [ 2.7, 1.94, 1.34, 1,

0.83, 0.64 ]}

The Goals of Normalization

• Model data an understandable form

• Reduce fact redundancy and data inconsistency

• Enforce integrity constraints

ApplicationServers MongoDB

Key / Value

Session Data, Shopping Carts

Product Catalog,User Accounts,Domain Objects

PaymentSystems,Reporting

GraphSocial Data,Recommendations

ApplicationServers MongoDB

Key / Value

Session Data, Shopping Carts

Product Catalog,User Accounts,Domain Objects

PaymentSystems,Reporting

GraphSocial Data,Recommendations

What are your requirements?

• Availability• Scalability• Performance• Access Patterns• Data Model

Key Value Stores

Used for• Session data• Cookies• Shopping carts

Eratosthenes

Democritus

Hypatia

Euripides

ID Name

Key Value Stores

• Fast, if in memory• Single access pattern• Complex data parsed

in client

Eratosthenes

Democritus

Hypatia

Euripides

ID Name

Key Value Store

“{maker : ‘Agusta’,type : sportbike,rake : 7,trail : 3.93,engine : {

type : ‘internal combustion’,layout : ‘inline’,cylinders : 4,displacement : 750,

},transmission : {

type : ‘cassette’,speeds : 6,pattern : ‘sequential’,ratios : [ 2.7, 1.94, 1.34, 1, 0.83, 0.64 ]

MongoDB

{ _id: 78234974,maker : ”Agusta",type : sportbike,rake : 7,trail : 3.93,engine : {

},transmission : {

type : "cassette",speeds : 6,pattern : "sequential”,ratios : [ 2.7, 1.94, 1.34, 1, 0.83, 0.64 ]

Self Defining Schema

MongoDB

},transmission : {

Self Defining SchemaNested Objects

MongoDB

},transmission : {

Self Defining SchemaNested ObjectsArray types

MongoDB

},transmission : {

Primary Key,Auto indexed

MongoDB

},transmission : {

Secondaryindexes

MongoDB

},transmission : {

Projectionsdb.vehicles.find ( {_id:78234974 }, { engine:1,_id:0 })

Data Model

RDBMS MongoDBTable, View ➜ CollectionRow ➜ DocumentIndex ➜ IndexJoin ➜ Embedded DocumentForeign Key ➜ ReferencePartition ➜ Shard

Flexible Schemas

{ maker : "M.V. Agusta",type : sportsbike,engine : {

type : ”internal combustion",

cylinders: 4,displacement : 750

},rake : 7,trail : 3.93

}{ maker : "M.V. Agusta",

type : Helicopterengine : {

type : "turboshaft"layout : "axial”,massflow : 1318

},Blades : 4undercarriage : "fixed"

Flexible Schemas

Discriminator column

cylinders: 4,displacement :

750},rake : 7,trail : 3.93

type : "turboshaft"

layout : "axial”,massflow : 1318

Flexible Schemas

Shared indexing strategy

750},rake : 7,trail : 3.93

type : "turboshaft"

Flexible Schemas

Polymorphic Attributes

750},rake : 7,trail : 3.93

type : Helicopter,engine : {

type : "turboshaft”,

},Blades : 4,undercarriage : "fixed"

Tao of MongoDB

• Model data for use, not storage• Avoid ad-hoc queries• Index effectively, index efficiently

Strong Consistency vs.

Eventual Consistency

Availability

Availablity

Fail-over

Strong vs. Eventual Consistency

Node A

Node B

Node C

Node E

Node D

Client 1

Client 2

Node A

Node B

Node C

Node E

Node D

Client 1

Client 2

Node A

Node B

Node C

Node E

Node D

Client 1

Client 2

Node A

Node B

Node C

Node E

Node D

Client 1

Client 2

Node A

Node B

Node C

Node E

Node D

Client 1

Client 2

Analytics

Hadoop

A framework for distributed processing of large data sets• Terabyte and petabyte datasets• Data warehousing• Advanced analytics• Not a database• No indexes• Batch processing

Use Cases

• Behavioral analytics• Segmentation• Fraud detection• Prediction• Pricing analytics• Sales analytics

Data Management

HadoopOffline ProcessingAnalyticsData Warehousing

MongoDBOnline OperationsApplicationOperational

Typical Implementations

Application Server

MongoDB as an Operational Store

Application Server

Data Flows

HadoopConnector

BSON Files

MapReduce & HDFS

Cluster

MONGOS

SHARD A

SHARDB

SHARD C

SHARD D

MONGOS Client

Hadoop / Spark Trade-offs

Plus• Access to Analytics

Libraries• Processes unstructured

data• Handles petabyte data

Minus• Overhead of a separate

distributed system• Writing MapReduce not

for the faint of heart• Designed for batch

oriented processing

Relational for Reporting & Business Intelligence

Plus• Existing ecosystem of BI

tools• Lower overhead than

Hadoop clusters• Large pool of expertise

and talent

RDBMSPrimary ETL

Replication

Integrations & ETL

RDBMSPrimary

LucenePrimaryMongo

Connector

Replication

Integrations with Search Solutions

Considerations

• Increased system complexity

• Operations overhead• Increased expertise

Thanks!

{ Name: ‘Bryan Reinero’,

Title: ‘Developer Advocate’,

Twitter: ‘@blimpyacht’,

Email: ‘bryan@mongdb.com’ }

webinar: mongodb and polyglot persistence architecture

Technology

developing polyglot persistence applications (svcc,...

polyglot persistance con postgresql, couchdb, mongodb, redis...

the rise of nosql and polyglot persistence

polyglot persistence with...

polyglot persistence for enterprise cloud applications

polyglot persistence & multi-model databases

developing polyglot persistence applications (devnexus 2013)

warsaw polyglot persistence presentation

scalable databases - from relational databases to polyglot...

building polyglot persistence java applications

thinking beyond rdbms : building polyglot persistence ......

polyglot persistence - two great tastes that taste great...

towards automated polyglot persistence · 2020. 6. 24. ·...

developing polyglot persistence applications #javaone 2012

mongodb in the middle of a hybrid cloud and polyglot...

mongodb and rdbms: using polyglot persistence at equifax

polyglot persistence with mongodb and neo4j

polyglot persistence in azure

polyglot persistence in the real world: cassandra + s3 +...

thinking beyond rdbms - building polyglot persistence java...