CFA A Practical Prediction System for Video Quality

  • Slides: 23
Download presentation
CFA: A Practical Prediction System for Video Quality Optimization Junchen Jiang, Vyas Sekar, Henry

CFA: A Practical Prediction System for Video Quality Optimization Junchen Jiang, Vyas Sekar, Henry Milner, Davis Shepherd, Ion Stoica, Hui Zhang 1

One-Minute Overview Prediction leads to dramatic quality improvement Predicting video quality is very challenging

One-Minute Overview Prediction leads to dramatic quality improvement Predicting video quality is very challenging Persistence of critical features CFA 2

User Engagement Internet Video Quality Matters! 40% sessions Avg Bitrate 13% sessions Buffering ratio

User Engagement Internet Video Quality Matters! 40% sessions Avg Bitrate 13% sessions Buffering ratio Better quality Longer user Engagement More revenues! A significant room of improvement! 3

New Paradigm: Centralized Control Platform Prediction Oracle Answer “What-if” questions. e. g. , What

New Paradigm: Centralized Control Platform Prediction Oracle Answer “What-if” questions. e. g. , What if I use 400 Kbps, Akamai? Local reactive adaptation is too slow A Case for Centralized Control Plane Fundamentally crippled for initial selections Real-time global network view Prediction Oracle e. g. , potentially 50% less re-buffering [SIGCOMM 12, NSDI 15] Local adaptation 400 Kbps Internet 1 Mbps 400 Kbps 1 Mbps 4

Key Missing Piece: How to Build a Prediction Oracle? Prediction CFA Oracle Our contribution:

Key Missing Piece: How to Build a Prediction Oracle? Prediction CFA Oracle Our contribution: Critical Feature Analytics (CFA) Data-Driven Video Quality Prediction System 400 Kbps Internet 1 Mbps 400 Kbps 1 Mbps 5

Outline • Motivation Challenges of Video Quality Prediction System • The CFA Approach •

Outline • Motivation Challenges of Video Quality Prediction System • The CFA Approach • Evaluation 6

Why is Building a Quality Prediction System Challenging? Trains a Quality Prediction Model Pred(quality

Why is Building a Quality Prediction System Challenging? Trains a Quality Prediction Model Pred(quality of other sessions) Quality of other sessions Quality prediction for new sessions Challenge 1: Complex factors affect video quality Need expressive models to capture these factors Challenge 2: Video quality changes quickly Need to refresh predictions in near real-time (e. g. , 30 sec) 7

Challenge 1: Complex relation between video quality and features NY Comcast Level 3 CDN

Challenge 1: Complex relation between video quality and features NY Comcast Level 3 CDN PIT AT&T Akamai Quality depends on combinations of features City ASN NY Comcast PIT Comcast NY AT&T CDN Level 3 Video “Foo” Device “bar” NY Akamai “Foo” “bar” Comcast Quality 8

Such feature combinations differ cross clients & time Time City 3: 00 PM NY

Such feature combinations differ cross clients & time Time City 3: 00 PM NY ASN CDN Video Device Quality Comcast Level 3 “foo” “bar” 3: 00 PM PIT Comcast Akamai “foo” “bar” 7: 00 PM PIT Comcast Akamai “foo” “bar” Combinational effects: Quality depends on combinations of multiple features Spatial diversity: Quality-determining features differ cross clients Model drift: Quality-determining features change over time 9

Challenge 2: Video Quality Changes Quickly Using fresh quality measurement is critical! 10

Challenge 2: Video Quality Changes Quickly Using fresh quality measurement is critical! 10

Needs both model expressiveness & fast update Not expressive enough to model complex factors

Needs both model expressiveness & fast update Not expressive enough to model complex factors Algorithm for Problems that have “Persistent” critical features e. g. , Naive. Bayes, Decision Tree Fast Needs tens of min to update model, Not interpretable Update speed Low e. g. , SVM r tte Be Slow CFA Simple ML Complex ML Expressiveness High 11

Outline • Motivation • Challenges The CFA Approach • Evaluation 12

Outline • Motivation • Challenges The CFA Approach • Evaluation 12

The Basic CFA Workflow: Similar feature values short history similar quality All historical sessions

The Basic CFA Workflow: Similar feature values short history similar quality All historical sessions with observed quality Session under prediction s Similar sessions to s Finding similar sessions Strawman: Matching on all features Curse of dimensionality: Hard to find sessions matching on all features Quality estimate (e. g. , median) Quality prediction for s Accurate Reliable Matching on ✔ ✖ all features 13

Insight to Find Similar Sessions: Critical Features Critical features: subset of features ultimately determines

Insight to Find Similar Sessions: Critical Features Critical features: subset of features ultimately determines video quality City ASN CDN Content NY Comcast Level 3 “foo” Device “bar” Quality F( ) Strawman: Matching on all features in the last minute NY Comcast Level 3 * * F( ) Quality (Failure rate) Match on all features (ground truth) Match on Critical Features Match on all features (ground truth) 0. 6 Use sessions matching “NY, Comcast, Level 3, foo, bar” in last minute 0. 4 Enough sessions matching “NY, 0. 2 of dimensionality: Curse Comcast, Level 3” in last minute Few sessions matching on 0 0 30 60 all features in one minute s 90 Time (min) 120 150 180 14

The CFA Workflow Based on Critical Features Similar sessions to s Finding similar sessions

The CFA Workflow Based on Critical Features Similar sessions to s Finding similar sessions s How to get critical features? Matching on Critical Features Finding Critical feature of s Quality estimate (e. g. , median) Quality prediction for s Accurate Reliable Matching on all features ✔ ✖ Matching on critical features ✔ ✔ 15

Insight to Learn Critical Features: Critical Features are Persistent Quality (Failure rate) 1 Match

Insight to Learn Critical Features: Critical Features are Persistent Quality (Failure rate) 1 Match on all features (ground truth) Match on Critical Features Match on Non-Critical Features 0. 8 0. 6 0. 4 Curse of dimensionality: No data of ground truth 0. 2 0 Strawman: Learn Critical Features over last minute s 0 30 60 Longer history Enough sessions to construct ground-truth quality 90 Time (min) 120 150 180 CFA approach: Learn Critical Features over last hour 16

How to Estimate Quality with Fresh Updates? Similar sessions to s Finding similar sessions

How to Estimate Quality with Fresh Updates? Similar sessions to s Finding similar sessions s Quality estimate (e. g. , median) Matching on Critical Features A few seconds Quality prediction for s A few seconds Finding Critical feature of s Takes 10 s minutes 17

CFA Approach to Fresh Updates Sequential workflow Learn critical features (every tens of min)

CFA Approach to Fresh Updates Sequential workflow Learn critical features (every tens of min) Enabled by Persistence of Critical Features Estimate quality (every tens of sec) Learn critical features (every tens of min) Time Learn critical features (every tens of min) Slow path Decoupled workflow Fast path 18

Putting Everything Together: CFA Implementation The C 3 platform [NSDI’ 15] Backend cluster Learn

Putting Everything Together: CFA Implementation The C 3 platform [NSDI’ 15] Backend cluster Learn critical features + estimate quality Update quality prediction per 10 s of sec Geo-distributed frontend clusters Pick the (CDN, bitrate) of the best predicted quality and return it in 10 s of ms Video clients 19

Outline • Motivation • Challenges • The CFA Approach Evaluation 20

Outline • Motivation • Challenges • The CFA Approach Evaluation 20

32% 2 1. 5 1. 82 red uct ion 1. 23 1 0. 5

32% 2 1. 5 1. 82 red uct ion 1. 23 1 0. 5 0 Baseline (random) CFA 4 Avg bitrate (Mbps) Avg buffering ratio (%) Real-world A/B Testing 3 e as e r c n i 12. 3% 2. 85 3. 21 2 1 0 Baseline (random) CFA Substantial quality improvement by CFA. 21

CFA vs. Strawman Prediction Algorithms 0. 014 0. 012 0. 01 0. 008 0.

CFA vs. Strawman Prediction Algorithms 0. 014 0. 012 0. 01 0. 008 0. 006 0. 004 0. 002 0 23% 0. 013 less e rror 0. 01 Best of strawmen CFA is more accurate 4 Avg bitrate Prediction error (bitrate) (Decision tree, Naïve Bayes, k. NN, Last-mile, ASN-based etc) 3 2 e t a r t i b r e h 16% hig 2. 76 3. 2 1 0 Best of strawmen CFA leads to better quality 22

Conclusion • Higher video quality Long user engagement More revenues! • Prediction has huge

Conclusion • Higher video quality Long user engagement More revenues! • Prediction has huge potential but is also challenging: Quality-determining features are complex, heterogeneous and dynamic. • CFA uses domain-specific insights Video quality depends on a subset of persistent critical features. CFA leads to 30% less buffering ratio and 12% high bitrate • Key takeaway: Prediction Performance improvement Persistence of critical features Accurate prediction 23