how comcast uses data science to improve the customer experience
TRANSCRIPT
•
•
•
•
•
•
•
•
•
•
•
•
•
•
0
0.5
1
1.5
2
2.5
3
3.5
4
0 168 336 504 672 840
Sim
ilari
ty
Δ Hours
•
•
•
0
0.5
1
1.5
2
2.5
3
3.5
4
0 168 336 504 672 840
Sim
ilari
ty
Δ Hours
Daily pattern: views in similar hours of the days are more similar
•
•
•
0
0.5
1
1.5
2
2.5
3
3.5
4
0 168 336 504 672 840
Sim
ilari
ty
Δ Hours
Weekly pattern: views in similar days of the week are more similar
Daily pattern: views in similar hours of the days are more similar
•
•
•
0
0.5
1
1.5
2
2.5
3
3.5
4
0 168 336 504 672 840
Sim
ilari
ty
Δ Hours
Weekly pattern: views in similar days of the week are more similar
Daily pattern: views in similar hours of the days are more similar
Overall decay: recent views are more similar
•
•
0
0.02
0.04
0.06
0.08
0.1
0 168 336
Sim
ilari
ty
∆ Hours
0
0 168 336 504 672 840Sim
ilari
ty
Δ Hours
VOD Daypart View Similarity
0
1
2
3
4
0 168 336 504 672 840
Sim
ilari
ty
VOD Daypart View Similarity
0
0.1
0.2
0.3
0.4
0 168 336
Rep
eat
per
cen
tage
∆ Hours
Repeated-Search Pattern
0
0.1
0 168 336
Sim
ilari
ty
∆ Hours
Linear(Station-level) View Similarity
•
•
Number of Signals
0.77
= New Signal
•
•
•
•
0%
20%
40%
60%
80%
100%
Top 5 Top 10 Top 20
Only Social
Nielsen + DVR
Nielsen + DVR +Social
•
•
•
•
20.00
25.00
30.00
35.00
40.00
45.00
50.00
55.00
60.00
65.00
70.00
0.40 0.60 0.80 1.00 1.20 1.40 1.60
Re
call
Resolution (Weighted Sum, Normalized by User Study)
Recall vs. Resolution
BaseLine
CoMPASS
CoMPASS++
ViaccessOrca
DigitalSmith
Rovi
ContentWise
Gravity
MortarData
ThinkAnalytics
Compass (Trend)
Various internal and externalAlgorithms
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
Picture courtesy of Cisco
•
•
•
•
Baseline w/100% cachesize
Net
wo
rk B
W S
aved
by
Cac
he
Baseline w/100% cachesize
Prediction w/30% cache size
Net
wo
rk B
W S
aved
by
Cac
he
Baseline w/100% cachesize
Prediction w/30% cache size
Prediction w/60% cache size
Net
wo
rk B
W S
aved
by
Cac
he