information centric networking(icn) for delivering big data with …delaat/rp/2013-2014/p63/... ·...

34
Information Centric Networking(ICN) for Delivering Big Data with Persistent Identifiers(PID) Andreas Karakannas Research Project 2 Supervised by: Zhiming Zhao

Upload: others

Post on 23-Apr-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Information Centric

Networking(ICN) for Delivering Big

Data with Persistent Identifiers(PID)

Andreas Karakannas

Research Project 2

Supervised by:

Zhiming Zhao

Page 2: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Background

User at Web Browser

www.resolver.org

1

2

3

4

http://www.resolver.org/<PID>

PID: ark:12345/CIA/DNS_1.pdf

URL: https://www.os3.nl/_media/2013-

2014/courses/cia/dns_1.pdf

Information Centric Networking

A new network concept

Based on the idea that users are

interested in accessing Digital Objects

regardless of their locations.

No end-to-end communication

Digital Objects are uniquely identified

Request for Objects are routed based on

the Digital Object unique name (NO IP

ROUTING!!!)

Objects are cached in the path from

source to destination(In-Network

Caching).

In-Network Caching aims to achieve

efficient & reliable distribution of the

contents among the network

infrastructure.

PIDs in IP Network

Page 3: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

How can PID types be mapped/resolved

to ICNs’ Object Identifiers?

What is the efficiency of ICNs’ caching

algorithms for delivering Big Data?

Research Questions

Page 4: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Approach

Theoretical Studies on latest ICN

Projects and PID Standards.

Propose Mapping Architecture Design

based on the Theoretical study

Evaluate In-Network Caching

Performance for Big Data Objects

Page 5: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

ICN approaches

A Survey of Information-Centric Networking Research

Theoretical

Studies

ICN

Approaches

Page 6: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

ICN approaches

A Survey of Information-Centric Networking Research

Theoretical

Studies

ICN

Approaches

Page 7: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

The most mature ICN approach.

The only approach with published specification.(Packet

Format 0.1a2 published on March 27,2014).

Most research in caching algorithms in ICN is based on

NDN.

Only one with available open source

simulators(ndnSIM,ccnSIM) for evaluating caching

performance under different scenarios.

Theoretical

Studies

ICN

Approaches

NDN

Page 8: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Names in NDN

◦ Based on URI syntax

◦ Have hierarchical structure (e.g.

/NL/Amsterdam/UVA/ComputerScience/OS3/CIA/DNS.pdf)

◦ Names can be anything: a pdf file, a video, an endpoint, a command to

turn on some lights.

◦ Names are used in the Routing procedure.

2 Types of packets

◦ INTEREST(request) packets

Contains the Name of the Request

e.g. INTEREST(/NL/Amsterdam/UVA/ComputerScience/OS3/CIA/DNS.pdf)

◦ DATA(answer) packets

Contains the Name of the Request & the Data

e.g. DATA(NL/Amsterdam/UVA/ComputerScience/OS3/CIA/DNS.pdf, <DATA>)

Theoretical

Studies

ICN

Approaches

NDN

Page 9: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Populating the

Name Prefix

Page 10: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Populating the

Name Prefix

Page 11: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Populating the

Name Prefix

Page 12: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Populating the

Name Prefix

Page 13: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Routing the

INTEREST

packet

Page 14: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Routing the

INTEREST

packet

Page 15: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Routing the

INTEREST

packet

Page 16: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Routing the

DATA

packet

Page 17: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Routing the

DATA

packet

Page 18: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Routing the

DATA

packet

Page 19: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Cache HIT

Page 20: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Named Data Networking(NDN)

Theoretical

Studies

NDN

Cache HIT

Page 21: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Persistent Identifiers(PIDs)

A name with specific syntax that uniquely identifies an

object for a long-lasting period regardless of its’ location

and lifespan.

Different PID types are available for naming digital

objects.

Each PID has three parts:

Unique Identifier of the A Unique Identifier of the A Unique Identifier of the

PID Type(e.g.urn:,ark: ) Authority(e.g. isbn,ietf) Digital Object (e.g. 0-7645-2641-3)

Further Delegation to sub-Authorities

is possible

Example : urn:isbn:0-7645-2641-3

PID

PID Type Authority Name of Dig. Object

Theoretical

Studies

PID

Page 22: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Persistent Identifiers(PIDs)

PID

Types

PID Type

Identifier

Authority Name

URL url: <protocol><host>:<port> [/<path>[?<searchpart>]]

URN urn: <NID>: <NSS>

ARK ark: <NAAN> /”<Name>[<Qualifier>]

HANDLE handle: <Handle Naming Authority> /<Handle Local Name>

PURL purl: <protocol><resolver

address>

/<name>

DOI doi: 10.<Naming Authority> /<doi name syntax>

Most-well known PID Types

Theoretical

Studies

PID Standards

Page 23: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Mapping Architecture Design Goals

Generic

Extensible

Scalable

Easy to Implement, Manage &

Administrate

Mapping

Architecture

Design

Page 24: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Mapping Architecture Name-Space Implementation

Root PID Server

<Root PID NDN Name>

URN Handle Doi Ark . . . . . . . . .

ISBN IETF . . . . 12345 56789 . . . . . . . . . . . . .

Root PID Layer

PID Type Layer

Authority PID Layer

(Further Delegation is Possible)

PID NDN-Name

urn:isbn:0-7645-2641-3 /UvA/NaturalScience/CS/CIA/DNS.pdf

. .

. .

Page 25: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Client 1

1.INTEREST(<PID_Resolver

NDN Name><PID>)

Clients’

PID

Resolver

Server

<PID

Resolver

NDN

Name>

2.INTEREST(<Root_PID_Server NDN Name><PID>)

3.DATA(<Root PID Server NDN Name><PID>,<Answer>)

Root PID Server

<Root PID Server NDN

Name>

PID Type Server

<PID Type Server NDN

Name>

4.INTEREST(<PID Type Server NDN Name> <PID>)

5.DATA(< PID Type Server NDN Name> <PID>,<Answer>)

Authority PID Server

<Authority PID Server

NDN Name>

6.INTEREST(<Authority PID Server NDN Name ><PID>)

7.DATA(<Authority PID Server NDN Name ><PID>,<Answer>) 8.DATA(<PID Resolver NDN

Name><PID><Answer>)

User Interface

9.INTEREST(<PIDs’ NDN Name>) CONTENT ROUTER

NDN 9.INTEREST(<PIDs’ NDN Name>)

10.DATA(<PIDs’ NDN Name>,<Data>) 10.DATA(<PIDs’ NDN Name>,<Data>)

Iterative Resolution of PIDs to NDN names

Page 26: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Caching Strategies

Decision Algorithms(DA) Which Content Router caches what?

LCE,LCD,FIX(P),ProbCache

Replacement Algorithm(RA) How are Content Routers replaced Objects in the Content

Store?

FIFO,RANDOM,LRU,LFU

Evaluate In-

Network

Caching

Performance

Page 27: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Simulation Parameters

Big Data Repository

Parameter Description Values

R Big Data Repository Size 51.2TBytes

|R| Num. of Big Data Objects in R 150

B Size of Big Data Object 350GBytes

c Num. of sub-Objects a Big Data

Object is consisted of

[1,2,4,6..20]

a Popularity of Big Data sets is

based on Zipf Distribution:

P(x=i)=(1/i^a)/C

C= 𝟏/𝒊^𝒂|𝑹|𝒊=𝟏

1

Parameter Description Values

C The Content Store Size in each

Content Router expressed as Size

of a Big Data Object

[0.5B,1B,2B,4B,8B,16B]

CA Caching Algorithm [LCE,LCD,FIX(0,5),FIX

(0.25),ProbCache]

RA Replacement Algorithm LRU

CLIENT

Parameter Description Values

T Indicated the number of Requests

for a Big Data Object the Client has

send so far

-

Evaluate In-

Network

Caching

Performance

Page 28: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Network Topologies

Binary Tree

String

In both Network

Topologies the distance

between the client and

the Big Data Repository

is 4 Hops(Content

Routers)

0 1 2 3 4

4

3

2 2

1 1 1 1

0 0 0 0 0 0 0 0

Evaluate In-

Network

Caching

Performance

Page 29: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Performance Metrics

In ICN the in-network caching aims to:

• From the Customer point of view: Reduce the average time required to download the requested

content.

• From the Publisher point of view: Reduce the number of requests the publisher needs to serve.

• From the Network point of view Reduce the network traffic.

Average Number of Hops per simulation

describes all the above benefits.

Evaluate In-

Network

Caching

Performance

Page 30: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Collection of Measurements

Collection of the Average number of Hops for each simulation starts

when the Average Number of Hops converges for at least 50T.

Evaluate In-

Network

Caching

Performance

00,20,40,60,8

11,21,41,61,8

22,22,42,62,8

33,23,43,63,8

44,2

1

16

32

48

64

80

96

11

2

12

8

14

4

16

0

17

6

19

2

20

8

22

4

24

0

25

6

27

2

28

8

30

4

32

0

33

6

35

2

36

8

38

4

40

0

41

6

43

2

44

8

46

4

48

0

49

6

51

2

52

8

Ave

rage

Nu

mb

er

of

Ho

ps

T-Clients Requests

LCE

Fix(0.5)

Fix(0.25)

ProbCache

LCD

No Cache

Page 31: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Results : String Network (1 Client)

• Number of sub-Objects(c) a Big Data Object is consisted of has neglectable impact on the performance of caching algorithms.

• C:B ≤ 1 Low Caching Algorithms Performance • C:B ≥ 2 Significant Benefits can be gained from this point and onwards.

1,4

1,6

1,8

2,0

2,2

2,4

2,6

2,8

3,0

3,2

3,4

3,6

3,8

4,0

4,2

0,5 1 2 4 8 16

Ave

rage

Ho

ps

Content Router Cache Size/Big Data Object Size (C:B)

LCE

Fix(0.5)

Fix(0.25)

ProbCache

LCD

Indicates the Standard Deviation for different c values[1,2,4..20]. c : The number of sub-Objects a Big Data Object is consisted of.

Page 32: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Results : Binary Tree Network (8 Clients)

• Number of sub-Objects(c) a Big Data Object is consisted of has neglectable impact on the performance of caching algorithms.

• C:B ≤ 1 Low Caching Algorithms Performance • C:B ≥ 2 Significant Benefits can be gained from this point and onwards.

Indicates the Standard Deviation for different c values[1,2,4..20]. c : The number of sub-Objects a Big Data Object is consisted of.

1,4

1,6

1,8

2,0

2,2

2,4

2,6

2,8

3,0

3,2

3,4

3,6

3,8

4,0

4,2

0,5 1 2 4 8 16

Ave

rage

Ho

ps

Content Router Cache Size / Big Data Object Size (C:B)

LCE

Fix(0.5)

Fix(0.25)

ProbCache

LCD

Page 33: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Conclusion

Based on our research in ICN approaches & PID

Standards, mapping PIDs to ICN Names is possible

◦ Decentralized Solution Proposed for NDN approach.

Generic

Extensible

Scalable

Administration & Management is needed on each Layer

Evaluation of Caching Algorithms gave us

◦ Cache Size/Big Data Set Size(C:B), plays critical role on the efficiency of

current caching algorithms.

C:B ≤ 1 Insignificant gain from Caching.

C:B ≥ 2 Significant Benefits can be gained from this point and onwards.

◦ Number of sub-Object the Big Data Object is segmented does not

significantly affect the efficiency of caching algorithms.

Page 34: Information Centric Networking(ICN) for Delivering Big Data with …delaat/rp/2013-2014/p63/... · 2021. 3. 31. · Information Centric Networking(ICN) for Delivering Big Data with

Questions?