an empirical study of mobile network behavior and application … · 2019-04-22 · i an...

22
Introduction Dataset Analysis Performance Degradtion Detection Conclusion An Empirical Study of Mobile Network Behavior and Application Performance in the Wild S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China April 16, 2019 S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Upload: others

Post on 18-Apr-2020

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

An Empirical Study of Mobile Network Behavior and ApplicationPerformance in the Wild

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok

Southern University of Science and Technology, China

April 16, 2019

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 2: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Introduction

I A two-year long dataset conducted by a mobile crowdsourcing app.

I Characterize the performance of different protocols, DNS deployments, IPanycast, etc. in the wild.

I An performance degradation detection method based on Apriori algorithm,tailored for imbalaced and sparse datasets.

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 3: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Data Collection

I VPN-basedI Real trafficI No “root” needed

I Crowdsourcing

I Per-app measurement

Internal connections(raw IP packets)

Relay External connections(socket channel)

Tunnel

Apps TCP/UDP clinets

App

serv

ers

Packet parsing and mapping

Virtual network interface

TCP state machine

Smar

tpho

ne

Measurement points

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 4: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Data Features

I User InformationI country, device model, android version, etc.I collects once per installation

I Network InfromationI type (WiFi or cellular), name (SSID or vendor name), geo-location etc.I collects each time on app enabled or network status changed

I MeasurementI RTT, server IP and port, package name, the domain name etc.I measure each TCP connection or DNS query once.

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 5: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Basic Statistics

I Country Distribution:11,200 users from 173countries, mostly USAand Southeast Asia.

3407

1361

465418407

343

322

310

271181

169

161

153

150

139

2943

USA Indonesia

India Malaysia

UK Russia

Germany Brazil

Italy Australia

Canada Philippines

France Spain

Ukraine Others

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 6: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Basic Statistics

I Device Details: 1,615different smartphonemodels from 226manufacturers

39.08%

9.65%7.08%

5.61%

4.73%

3.24%

3.13%

3.12%

2.73%

1.75%

1.56% 1.52%

1.16%

0.94%

0.48%

14.21%

Samsung LGE

Xiaomi HUAWEI

Motorola Asus

LENOVO Sony

ZTE OnePlus

HTC TCL

OPPO Google

Meizu Others

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 7: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Basic Statistics

I Applications: 17,059 apps with 1,197 apps have >1k measurements

I Measurements: 13,204,649 TCP records and 6,489,646 DNS records, covering286,404 destination IP addresses.

I Network types: 65.42% WiFi, 23.97% LTE, 10.61% other cellular networks.I only 5.94% of WiFi measurements were observed to have >300Mbps PHY rates.I more than one third of the ISPs (238 ISPs) have no 4G measurements observed,

mainly in Africa and Asia.

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 8: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Protocols

Our analysis shows that XMPP traffics experience longer latency than HTTP(s).

0 50 100 150 200 250 300 350 4000

0.2

0.4

0.6

0.8

1

RTT (ms)

CDF

80

443

53

522*

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 9: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

DNS Performance1

Users using DNS server that are located on different countries experience longerlatency to app servers. This suggests the need for IP Anycasting.

0 50 100 150 200 250 300 350 4000

0.2

0.4

0.6

0.8

1

Resolving Time (ms)

CDF

Private LDNS

Same isp

Same country

Diff country

0 50 100 150 200 250 300 350 4000

0.2

0.4

0.6

0.8

1

RTT (ms)

CDF

Private LDNS

Same isp

Same country

Diff country

1servers deployed IP Anycast are considered “diff country” in this chapterS. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 10: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

IP Anycast

We identify Anycast IP using the the list conducted by iGreedy.2 We use rlm() from Rpackage MASS with default parameters to perform robust regression.

0 5 10 15 20 25

−50

0

50r = 0.70

#IP per domain

Perform

ance

gain

(%)

Outlier

2https://anycast.telecom-paristech.fr/dataset/S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 11: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Application Servers

Face

book

TextN

ow

Youtu

be

Clean

mas

ter

ESfil

em

anag

er

Goo

gleSe

arch

Wea

ther

Cha

nnel

Wha

tsap

p

Gm

ail

Sam

sung

Email

Instag

ram

FBM

esse

nger

Sam

sung

Cloud

Wec

hat

Teleg

ram

0

10,000

20,000

30,000

#u

niq

ue

serv

erIP

s

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 12: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Application Servers

The Ad servers and trackers are identified by EasyList.3

3https://easylist.to/S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 13: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Performance Degradtion Detection

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 14: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Challenges

I Imbalanced: For example, 83.5% of the 16,868 HSPAP measurements for ISPMobilis are from one user. If those measurements are excluded, the median RTTcan decrease from 332ms to 219ms.I normal association rules method bias to the performance of the dominating user.

I Sparse: Although the total number of observations is huge, records for eachcombination of features can be very small.I it’s impossible to model the normal performance for all combinations of features

separately.

I Large: We need a scalable method to process the increasingly large data.

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 15: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Our Method4

1. Based on the famous association rules mining method, the Apriori algorithm.

2. We filter each candidate rule to ensure no more than half of the supportingrecords have the same feature.

3. We identify performance degradtion events by comparing the meadian RTT of thesupporting records for one candidate rule and a subset of it.I For example, median RTT of LTE records is 73 in our data, while the RTT of the

records that use LTE and linux kernel 3.10.49 has a median of 340.

4. Use Hypothesis test to verify that the supporting data cannot be split further.

4For more detailed description of our method we refer interested readers to attend IWQoS on 24-25June 2019, Phoenix, AZ, USA or read the proceedings.

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 16: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Evaluation

1. Low false positive rate in random dataI We randomly shuffle the RTT of the records.I We mathmatically proved that the probability of our methods thinking there is

anomalies are very small in our configuration.

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 17: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Evaluation

1. Low false positive rate in random data

2. Real world case of Google Germany

2016-09 2017-01 2017-05 2017-09 2018-01 2018-05

0

50

100

150

21

79RTT

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 18: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Evaluation1. Low false positive rate in random data

2. Real world case of Google Germany

3. Real world case of Microsoft Office Mobile

0 50 100 150 200 250 300 350 4000

0.2

0.4

0.6

0.8

1

RTT (ms)

CDF

Amazon

Microsoft

Cloudflare

Akamai

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 19: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Conclusion

I Though IEEE 802.11ac equipments has become the mainstream in the market,only a small portion (6%) of Wifi exceed PHY rates of 300Mbps.

I Still more than one third of the ISPs do not deploy 4G networks.

I There are many users use external DNS resolvers. IP Anycast may improve themobile app performance in this case.

I Traffics using XMPP protocols experience longer RTT than HTTPS, whichsuggests that IM and VoIP services can be further improved.

I Advertisements servers often have longer latency than application servers.

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 20: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Introduction Dataset Analysis Performance Degradtion Detection Conclusion

Future Works

I 5G deployment and performance

I Actively measure the server when unexpected high RTTs are observed.

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 21: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Any questions?

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild

Page 22: An Empirical Study of Mobile Network Behavior and Application … · 2019-04-22 · I An performance degradation detection method based on Apriori algorithm, tailored for imbalaced

Thank you!

S. Zhang, W. Li, D. Wu, B. Jin, RKC. Chang, D. Gao, Y. Wang, RKP. Mok Southern University of Science and Technology, China

An Empirical Study of Mobile Network Behavior and Application Performance in the Wild