new thoughts on distributed file system...new challenges -cloud native storage •multi-tenancy...

11
New Thoughts On Distributed File System In The Cloud Native Era SpeakerShuoran Liu [email protected] & Maintainer@ChubaoFS

Upload: others

Post on 30-Jun-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: New Thoughts On Distributed File System...New Challenges -cloud native storage •Multi-tenancy •Elasticity, scalability and high availability –Avoid bottlenecks in a cluster •Optimize

New Thoughts On Distributed File System In The Cloud Native Era

Speaker:Shuoran [email protected] & Maintainer@ChubaoFS

Page 2: New Thoughts On Distributed File System...New Challenges -cloud native storage •Multi-tenancy •Elasticity, scalability and high availability –Avoid bottlenecks in a cluster •Optimize

Start with CSI – volume lifecycle https://github.com/container-storage-interface/spec/blob/master/spec.md

Page 3: New Thoughts On Distributed File System...New Challenges -cloud native storage •Multi-tenancy •Elasticity, scalability and high availability –Avoid bottlenecks in a cluster •Optimize

What We Got – A mind map

Page 4: New Thoughts On Distributed File System...New Challenges -cloud native storage •Multi-tenancy •Elasticity, scalability and high availability –Avoid bottlenecks in a cluster •Optimize

New Challenges - cloud native storage

• Multi-tenancy

• Elasticity, scalability and high availability– Avoid bottlenecks in a cluster

• Optimize for small files– High concurrent meta operations

– Data aggregation and deletion

• POSIX compliance and performance trade-off

• One copy of data with various interfaces– Per upstream and downstream user needs

Page 5: New Thoughts On Distributed File System...New Challenges -cloud native storage •Multi-tenancy •Elasticity, scalability and high availability –Avoid bottlenecks in a cluster •Optimize

What does it look in a cluster

Page 6: New Thoughts On Distributed File System...New Challenges -cloud native storage •Multi-tenancy •Elasticity, scalability and high availability –Avoid bottlenecks in a cluster •Optimize

Small Files – meta concurrencyLocality vs Distribution

Page 7: New Thoughts On Distributed File System...New Challenges -cloud native storage •Multi-tenancy •Elasticity, scalability and high availability –Avoid bottlenecks in a cluster •Optimize

Small Files – data aggregation

Page 8: New Thoughts On Distributed File System...New Challenges -cloud native storage •Multi-tenancy •Elasticity, scalability and high availability –Avoid bottlenecks in a cluster •Optimize

POSIX Compliance

Spark

MySQL

HBase

Elasticsearch

Data Consistency Performance

To what extent shall we compromise?

Page 9: New Thoughts On Distributed File System...New Challenges -cloud native storage •Multi-tenancy •Elasticity, scalability and high availability –Avoid bottlenecks in a cluster •Optimize

Fusion Storage

Page 10: New Thoughts On Distributed File System...New Challenges -cloud native storage •Multi-tenancy •Elasticity, scalability and high availability –Avoid bottlenecks in a cluster •Optimize

Some other thoughts

Spark

MySQL

HBase

Elasticsearch

• Benefit from low latency storage

– Used for raft log storage to improve random write performance

• CSI plugin driver dilemma

– Multiple volumes mounted in a single container

Page 11: New Thoughts On Distributed File System...New Challenges -cloud native storage •Multi-tenancy •Elasticity, scalability and high availability –Avoid bottlenecks in a cluster •Optimize

Thanks!

Contact: @shuoranliuGithub: github.com/chubaofs