CSE 679 Lecture Voice and Video over IP

  • Slides: 79
Download presentation
CSE 679: Lecture “Voice and Video over IP (VVo. IP)” Prasad Calyam, Sr. Systems

CSE 679: Lecture “Voice and Video over IP (VVo. IP)” Prasad Calyam, Sr. Systems Developer/Engineer, Ohio Supercomputer Center 6 th Nov 2007

Topics of Discussion r Introduction r Signaling Protocols Terminology r Introduction to H. 323

Topics of Discussion r Introduction r Signaling Protocols Terminology r Introduction to H. 323 - ITU Standard r Introduction to SIP – IETF Standard r Comparison of H. 323 and SIP r Basics of RTP, RTCP r Factors affecting VVo. IP System Performance r Measuring VVo. IP System Performance m OSC’s H. 323 Beacon Tool m OSC’s Vperf Tool r Conclusion

Voice and Video over IP (VVo. IP) r Large-scale deployments of VVo. IP are

Voice and Video over IP (VVo. IP) r Large-scale deployments of VVo. IP are on the rise m Vo. IP • Skype, Yahoo Messenger, Google Talk m Video streaming (one-way voice and video) • My. Space, Google Video, You. Tube, IPTV, … m Video conferencing (two-way voice and video) • Polycom, Web. Ex, Acrobat Connect, … r VVo. IP popularity reasons m Increased access to broadband m Advances in standardization of H. 323 and SIP protocols r Today’s protocols allow a wide variety of communication devices to talk to each other

VVo. IP Deployment

VVo. IP Deployment

VVo. IP Deployment (2) H. 323 SIP Gatekeeper Corporate LAN Gateway Multipoint Control Unit

VVo. IP Deployment (2) H. 323 SIP Gatekeeper Corporate LAN Gateway Multipoint Control Unit Switched Circuit Network (POTS and ISDN) Router Internet H. 320 (over ISDN) H. 324 (over POTS) Speech-Only (telephones)

Desktop and Room Videoconferencing Systems

Desktop and Room Videoconferencing Systems

3 Ways to Videoconference over the Internet 1. Point-to-Point

3 Ways to Videoconference over the Internet 1. Point-to-Point

3 Ways to Videoconference over the Internet (Contd. ) 2. Multi-Point Star Topology

3 Ways to Videoconference over the Internet (Contd. ) 2. Multi-Point Star Topology

3 Ways to Videoconference over the Internet (Contd. ) 3. Multi-Point Multi-Star Topology

3 Ways to Videoconference over the Internet (Contd. ) 3. Multi-Point Multi-Star Topology

Megaconferences – World’s largest Annual Internet Videoconferences MCU Cascading World sites participation Live/Archive Streaming

Megaconferences – World’s largest Annual Internet Videoconferences MCU Cascading World sites participation Live/Archive Streaming Group Music, 3 G Video, Antartica, Virtual Picnics, …

Signaling Protocols Terminology r Call Establishment and Teardown r Call Control and Supplementary Services

Signaling Protocols Terminology r Call Establishment and Teardown r Call Control and Supplementary Services m Call waiting, Call hold, Call transfer r Capability Exchange r Admission Control r Protocol Encoding (ASN 1, HTTP)

H. 323 – ITU Standard r H. 323 is an umbrella standard that defines

H. 323 – ITU Standard r H. 323 is an umbrella standard that defines how real-time multimedia communications such as Videoconferencing can be supported on packet switched networks (Internet) r Devices: Terminals, Gateways, Gatekeepers and MCUs r Codecs: m m Video: H. 261, H. 263, H. 264 Audio: G. 711, G. 722, G. 723. 1 r Signaling: H. 225, H. 245 r Transport Mechanisms: TCP, UDP, RTP and RTCP r Data collaboration: T. 120 r Many others…

H. 323 Protocol Stack

H. 323 Protocol Stack

H. 323 Call setup and teardown

H. 323 Call setup and teardown

H. 323 Call setup and teardown (Contd. )

H. 323 Call setup and teardown (Contd. )

SIP - IETF Standard r Session Initiation Protocol (SIP) r SIP Elements: User Agent

SIP - IETF Standard r Session Initiation Protocol (SIP) r SIP Elements: User Agent Client (UAC), User Agent Server (UAS) m Easy to locate users due to the flexibility in SIP to contact external location servers to determine user or routing policies (url, email ID, e. g. pcalyam@osc. edu) r Server Types: Redirect Server, Proxy Server and Registrar m m SIP Proxy: perform application layer routing of SIP requests and responses. SIP Registrar: UAC sends a registration message and the Registrar stores registration information in a location service using a non-SIP protocol (E. g. LDAP)

SIP Deployment Architecture OSU You at Hawaii Friend at San Jose

SIP Deployment Architecture OSU You at Hawaii Friend at San Jose

Comparison of H. 323 and SIP r Evolution m m H. 323 evolved from

Comparison of H. 323 and SIP r Evolution m m H. 323 evolved from Telecommunications Community (ITU-T) SIP evolved from Internet Community (IETF) r Protocols m m Differences in the signaling and control procedures Off-the-record: SIP is equivalent to H. 225 and RAS of H. 323 r Feature sets m m m m Functionality Quality of Service Manageability Scalability Flexibility Interoperability Ease of Implementation

Basics of RTP and RTCP r RTP m Provides end-to-end network transport functions suitable

Basics of RTP and RTCP r RTP m Provides end-to-end network transport functions suitable for applications transmitting real-time data • Audio, video or simulation data, over multicast or unicast network services r RTCP m To allow monitoring of data delivery in a manner scalable to large multicast networks m To provide minimal control and identification functionality r RTP and RTCP need best effort delivery m UDP provides this

RTP Packet

RTP Packet

RTCP Packet

RTCP Packet

Ethereal RTP Analysis – Try at Home! r Use the OPENXTRA version of Ethereal!

Ethereal RTP Analysis – Try at Home! r Use the OPENXTRA version of Ethereal! http: //resource. intel. com/telecom/support/appnotes/9008 an. pdf r Steps for analyzing the Traces m Load the packet trace into Ethereal • Trace will contain both forward and reverse direction streams (Check “Source” and “Destination” IP addresses) r Decode streams as RTP (default is UDP) m This will mark all related packets as belonging to a specific audio and video codec streams r Analyze individual audio or video streams m Import various information fields as. csv file (“Save as CSV” option) m Also has wave file generation relating to an audio stream (“Save Payload” option) • Works only for G. 711 Codec streams! • Good for PESQ where you want to compare original and degraded wave files to obtain Objective MOS information

Ethereal RTP Analysis (2) General UDP Stream decoded as an H. 263 payload stream

Ethereal RTP Analysis (2) General UDP Stream decoded as an H. 263 payload stream

Ethereal RTP Analysis (3) Audio Stream Video Stream

Ethereal RTP Analysis (3) Audio Stream Video Stream

Ethereal RTP Analysis (4) Loss Re-ordering; 45413 45415 45414 (Observe Sequence #s; could also

Ethereal RTP Analysis (4) Loss Re-ordering; 45413 45415 45414 (Observe Sequence #s; could also be 2 consecutive packet losses) Pink-marked packets relate to either lost or re-ordered packets!

Ethereal RTP Analysis (5) An Imported CSV File!

Ethereal RTP Analysis (5) An Imported CSV File!

Ethereal RTP Analysis (6) Create interesting visualizations to understand various RTP packet characteristics; can

Ethereal RTP Analysis (6) Create interesting visualizations to understand various RTP packet characteristics; can do the same for both Voice and Video packets!!!

VVo. IP System r End-user Quality of Experience (Qo. E) is mainly dependent on

VVo. IP System r End-user Quality of Experience (Qo. E) is mainly dependent on Network Quality of Service (Qo. S) metrics Qo. S Metrics: bandwidth, delay, jitter, loss m Device factors such as voice/video codecs, peak video bit rate (a. k. a. 256/384/768 dialing speed) also matter Network Qo. S End-user Qo. E m r Research underway to map the network Qo. S to end-user Qo. E

Understanding Delay… SENDER SIDE NETWORK RECEIVER SIDE Compression Delay Propagation Delay Resynchronization Delay Transmission

Understanding Delay… SENDER SIDE NETWORK RECEIVER SIDE Compression Delay Propagation Delay Resynchronization Delay Transmission Delay Processing Delay Decompression Delay Electronic Delay Queuing Delay Presentation Delay r Delay is the amount of time that a packet takes to travel from the sender’s application to reach the receiver’s destination application m Caused by codecs, router queuing delays, … r One-way delay requirement is stringent for H. 323 Videoconferencing to maintain good interaction between both ends

Understanding Jitter… r Jitter is the variation in delay of the packets arriving at

Understanding Jitter… r Jitter is the variation in delay of the packets arriving at the receiving end m Caused by congestion, insufficient bandwidth, varying packet sizes in the network, out of order packets, … r Excessive jitter may cause packet loss in the receiver jitter buffers thus affecting the playback of the audio and video streams

Understanding Loss… r Packet Loss is the packets discarded deliberately (RED, TTL=0) or non-deliberately

Understanding Loss… r Packet Loss is the packets discarded deliberately (RED, TTL=0) or non-deliberately by intermediate links, nodes and end-systems along a given transmission path m Caused by line properties (Layer 1), full buffers (Layer 3) or late arrivals (at the application)

Understanding Bandwidth bottleneck …

Understanding Bandwidth bottleneck …

Voice and Video Packet Streams r Total packet size (tps) – sum of payload

Voice and Video Packet Streams r Total packet size (tps) – sum of payload (ps), IP/UDP/RTP header (40 bytes), and Ethernet header (14 bytes) r Dialing speed is ; = 64 Kbps fixed for G. 711 voice codec m Voice has fixed packet sizes (tpsvoice ≤ 534 bytes) m Video packet sizes are dependent on alev in the content

Video alev r Low alev m Slow body movements and constant background; E. g.

Video alev r Low alev m Slow body movements and constant background; E. g. Claire video sequence r High alev m Rapid body movements and/or quick scene changes; E. g. Foreman video sequence r ‘Listening’ versus ‘Talking’ Talking video alev(i. e. , High) consumes more bandwidth than Listening video alev (i. e. , Low) Foreman Claire m

Factors affecting VVo. IP System Performance r Human Factors m Video alev m Individual

Factors affecting VVo. IP System Performance r Human Factors m Video alev m Individual perception of audio/video quality - qmos r Device Factors m MCUs, Routers, Firewalls, NATs, Modems, Operating System, Processor, memory, … r Network Factors m Bandwidth, Delay, Jitter, Loss

Measuring VVo. IP System Performance r Challenges for monitoring large-scale VVo. IP deployments m

Measuring VVo. IP System Performance r Challenges for monitoring large-scale VVo. IP deployments m Real-time or online monitoring of end-user Quality of Experience (Qo. E) • Traditional network Quality of Service (Qo. S) monitoring not adequate m Need objective techniques for automated network-wide monitoring • Cannot rely on end-users to provide subjective rankings – expensive and time consuming m Objective Qo. E measurements can be used for dynamic resource management to optimize end-user Qo. E

Resource Management Example r Multipoint Control Unit (MCU) bridges three or more videoconference participants

Resource Management Example r Multipoint Control Unit (MCU) bridges three or more videoconference participants r MCUs have limited ports; large videoconferences involve cascaded MCUs

Resource Management Example Site-C Site-A Site-B Best performing path from end-user site Call Admission

Resource Management Example Site-C Site-A Site-B Best performing path from end-user site Call Admission Controller ?

End-user Qo. E Types r Streaming Qo. E m End-user Qo. E affected just

End-user Qo. E Types r Streaming Qo. E m End-user Qo. E affected just by voice and video impairments • Video frame freezing • Voice drop-outs • Lack of lip sync between voice and video r Interaction Qo. E m End-user Qo. E also affected by additional interaction effort in a conversation • “Can you repeat what you just said? ” • “This line is noisy, lets hang-up and reconnect…” r Qo. E is measured using “Mean Opinion Score” (MOS) rankings

Existing Objective Techniques r ITU-T E-Model is a success story for Vo. IP Qo.

Existing Objective Techniques r ITU-T E-Model is a success story for Vo. IP Qo. E estimation m OSC’S H. 323 Beacon tool has E-Model implementation m It does not apply for VVo. IP Qo. E estimation • Designed for CBR voice traffic and handles only voice related impairments • Does not address the VBR video traffic and impairments such as video frame freezing r ITU-T J. 144 developed for VVo. IP Qo. E estimation m “PSNR-based MOS” – PSNR calculation requires original and reconstructed video frames for frame-by-frame comparisons m Not suitable for online monitoring • PSNR calculation is a time consuming and computationally intensive process m Does not consider joint degradation of voice and video i. e. , lack of lip synchronization

PSNR to MOS Mapping

PSNR to MOS Mapping

Scenario: A Researcher and an Industry professional want to Videoconference

Scenario: A Researcher and an Industry professional want to Videoconference

Case 1: Researcher is unable to make a call!

Case 1: Researcher is unable to make a call!

There was a mis-configured firewall blocking necessary ports…

There was a mis-configured firewall blocking necessary ports…

Case 2: Industry professional is unable to make a call!

Case 2: Industry professional is unable to make a call!

His LAN’s Internet connectivity was nonfunctional at that time…

His LAN’s Internet connectivity was nonfunctional at that time…

Case 3: They connected, but of them experienced bad audio & video!

Case 3: They connected, but of them experienced bad audio & video!

There was congestion at one of the intermediate routers along the path…

There was congestion at one of the intermediate routers along the path…

There was congestion at one of the intermediate routers along the path…

There was congestion at one of the intermediate routers along the path…

There was congestion at one of the intermediate routers along the path…

There was congestion at one of the intermediate routers along the path…

The performance problem can be anywhere in the E 2 E Path!!!

The performance problem can be anywhere in the E 2 E Path!!!

An end-to-end troubleshooting tool can help!

An end-to-end troubleshooting tool can help!

Troubleshooting VVo. IP System Performance “OSC H. 323 Beacon” r An application-specific measurement tool

Troubleshooting VVo. IP System Performance “OSC H. 323 Beacon” r An application-specific measurement tool m To monitor and qualify the performance of an H. 323 sessions at the host and in the network (end-to-end) r Useful to an end-user/conference operator/network engineer r Uses Open. H 323 and J 323 Engine libraries r Easy to install and use! r Open source http: //www. itecohio. org/beacon P. Calyam, W. Mandrawa, M. Sridharan, A. Khan, P. Schopis, “H. 323 Beacon: An H. 323 application related end-to-end performance troubleshooting tool”, ACM SIGCOMM 2004 Workshop on Network Troubleshooting (Net. Ts), 2004.

Initial call setup failures and haphazard disconnection detection…

Initial call setup failures and haphazard disconnection detection…

Network Health Status… r Delay, Jitter and Loss m Real-time, offline raw data and

Network Health Status… r Delay, Jitter and Loss m Real-time, offline raw data and test session summary

Network Health Plots… r Watermarks for “Good”, “Acceptable” and “Poor” grade of quality as

Network Health Plots… r Watermarks for “Good”, “Acceptable” and “Poor” grade of quality as experienced by end-user r Delay: (0 -150)ms, (150 -300)ms, > 300 ms r Jitter: (0 -20)ms, (20 -50)ms, > 50 ms r Loss: (0 -0. 5)%, (0. 5 -1. 5)%, > 1. 5 Poor Acceptable Good

Audio and Video Quality Assessments r Audio loopback feature r E-Model-based objective MOS ranking

Audio and Video Quality Assessments r Audio loopback feature r E-Model-based objective MOS ranking r Slider-based subjective MOS ranking

Customization of tests… r Test results data folder, TCP/UDP/RTP port settings, H. 225 and

Customization of tests… r Test results data folder, TCP/UDP/RTP port settings, H. 225 and H. 245 parameters, preferred codec, watermarks for delay, jitter, loss, …

H. 323 Beacon Server

H. 323 Beacon Server

Online VVo. IP Qo. E Measurement Problem r Given: m Video-on-demand (streaming) or Videoconferencing

Online VVo. IP Qo. E Measurement Problem r Given: m Video-on-demand (streaming) or Videoconferencing (interactive) m Voice/video codec m Dialing speed r Problem: m An objective technique that can estimate both streaming and interactive VVo. IP Qo. E in terms of MOS rankings m Estimation has to be real-time without involving actual end-users, video sequences and VVo. IP appliances m An active measurement tool that can: (a) emulate VVo. IP traffic on a network path, and (b) uses the objective VVo. IP estimation technique Vperf Tool

GAP-Model r Earlier studies estimate Qo. E affected by Qo. S metrics in isolation

GAP-Model r Earlier studies estimate Qo. E affected by Qo. S metrics in isolation m E. g. impact due to only bandwidth/delay/loss/jitter We consider network health as a combination of different levels of bandwidth, delay, jitter and loss – hence more realistic r The levels are quantified by well-known “Good”, “Acceptable” and “Poor” (GAP) performance levels for Qo. S metrics r Our strategy r m m Derive “closed-form expressions” for modeling MOS using offline human subject studies under different network health conditions Leverage the GAP-Model in Vperf tool for online Qo. E estimation for a measured set of statistically stable network Qo. S metrics P. Calyam, M. Sridharan, W. Mandrawa, P. Schopis “Performance Measurement and Analysis of H. 323 Traffic”, Passive and Active Measurement Workshop (PAM), Proceedings in Springer-Verlag LNCS, 2004.

Test-cases Reduction Problem r Modeling Qo. E as a function of 4 Qo. S

Test-cases Reduction Problem r Modeling Qo. E as a function of 4 Qo. S metrics in 3 levels (GAP) requires administering 81 test cases per human subject m Test-cases ordering using < bnet dnet lnet jnet > sequence: [<GGGG>, m 81 test cases are burdensome to any human subject <GGGA>, …, <APPP>, <PPPP>] • They involve long hours of testing • Results may be error-prone due to human subject exhaustion r To overcome this problem, we developed novel “Test-cases Reduction” strategies P. Calyam, E. Ekici, C. -G. Lee, M. Haffner, N. Howes, “A ‘GAP-Model’ based Framework for Online VVo. IP Qo. E Measurement”, In Second-round Review - Journal of Communications and Networks (JCN), 2007.

Test-case Reduction Strategy-1 r Reduction based on network condition infeasibility m We conducted a

Test-case Reduction Strategy-1 r Reduction based on network condition infeasibility m We conducted a network emulator (NISTnet) qualification study to identify any practically infeasible network conditions • E. g. there cannot be Good loss levels when there is Poor bandwidth level provisioned in the network path m Reduces the test-cases number from 81 to 42

Test-case Reduction Strategy-2 r Reduction based on human-subjects’ ranking inference m Eliminate more severe

Test-case Reduction Strategy-2 r Reduction based on human-subjects’ ranking inference m Eliminate more severe test cases during the testing based on the Poor rankings for less severe test cases • E. g. If human subject ranked test case <GPPP> with an extremely Poor MOS, it can be inferred that more severe test cases <APPP> and <PPPP> will also receive extremely Poor MOS m Reduces the 42 test-cases further depending on the human subjects’ rankings during the testing NOTE: Our test-case reduction strategies resulted in atmost 90 minutes of testing time per human subject (including training, administering and short-breaks)

Closed-network Testing r r r Test environment setup m Testing was automated as much

Closed-network Testing r r r Test environment setup m Testing was automated as much as possible for repeatability m 21 belonging to 3 categories: (i) Expert, (ii) General, and (iii) Novice Human subjects selection Double stimulus impairment scale method using Streaming-Kelly, Interactive-Kelly video clips m m Human subject compares baseline video sequence with impaired video sequence for MOS ranking In-band chat channel between human subject and test administrator (a) Isolated LAN Testbed (b) MOS Slider

Closed-form Expressions r Polynomial curve fitting on the Streaming and Interactive qmos training data

Closed-form Expressions r Polynomial curve fitting on the Streaming and Interactive qmos training data obtained from closed-network testing m Average qmos – mean of the 21 human subject rankings for a particular network health m r condition Lower and Upper bound qmos – 25 th and 75 th percentile values • To account for the possible qmos variation range influenced by the human subject categories Hence, 6 sets of regression surface model parameters in GAP-Model

Vperf Tool Implementation of GAP-Model r After test duration δt, a set of statistically

Vperf Tool Implementation of GAP-Model r After test duration δt, a set of statistically stable network Qo. S measurements are obtained r When input to GAP-Model, online VVo. IP Qo. E estimates are instantly produced

GAP-Model Validation r GAP-Model Validation with human subjects (V-MOS) and network conditions not tested

GAP-Model Validation r GAP-Model Validation with human subjects (V-MOS) and network conditions not tested during model formulation V-MOS within the lower and upper bounds

GAP-Model Validation (2) r GAP-Model validation with ITU-T J. 144 estimates (P-MOS) and network

GAP-Model Validation (2) r GAP-Model validation with ITU-T J. 144 estimates (P-MOS) and network conditions not tested during model formulation P-MOS within the lower and upper bounds

MAPTs Methodology r “Multi-Activity Packet Trains” (MAPTs) measure Interaction Qo. E in an automated

MAPTs Methodology r “Multi-Activity Packet Trains” (MAPTs) measure Interaction Qo. E in an automated manner m m m They mimic participant interaction patterns and video activity levels as affected by network fault events Given a session-agenda, excessive talking than normal due to unwanted participant interaction patterns impacts Interaction Qo. E “Unwanted Agenda-bandwidth” measurement and compare with baseline (consumption during normal conditions) • Higher values indicate poor interaction Qo. E and caution about potential increase in Internet traffic congestion levels • Measurements serve as an input for ISPs to improve network performance using suitable traffic engineering techniques P. Calyam, M. Haffner, E. Ekici, C. -G. Lee, “Measuring Interaction Qo. E in Internet Videoconferencing”, IEEE/IFIP Management of Multimedia and Mobile Networks and Services (MMNS), Proceedings in Springer-Verlag LNCS, 2007.

Proposed Solution Methodology (2) ‘repeat’ ‘disconnect’ ‘reorient’ Type-I and Type-II fault detection

Proposed Solution Methodology (2) ‘repeat’ ‘disconnect’ ‘reorient’ Type-I and Type-II fault detection

MAPTs Emulation r Emulation of Participant Interaction Patterns (PIPs) using MAPTs for a given

MAPTs Emulation r Emulation of Participant Interaction Patterns (PIPs) using MAPTs for a given session agenda Normal – PIP 1

MAPTs Emulation (2) r Emulation of Participant Interaction Patterns (PIPs) using MAPTs for a

MAPTs Emulation (2) r Emulation of Participant Interaction Patterns (PIPs) using MAPTs for a “Type-I” network fault event m Type-I: Performance of any network factor changes from Good grade to Acceptable grade over a 5 second duration Repeat – PIP 2

MAPTs Emulation (3) r Emulation of Participant Interaction Patterns (PIPs) using MAPTs for a

MAPTs Emulation (3) r Emulation of Participant Interaction Patterns (PIPs) using MAPTs for a “Type-II” network fault event m Type-II: Performance of any network factor changes from Good grade to Poor grade over a 10 second duration Disconnect/Reorient – PIP 3

Vperf Tool Implementation of MAPTs r r Per-second frequency of Interim Test Report generation

Vperf Tool Implementation of MAPTs r r Per-second frequency of Interim Test Report generation Interaction Qo. E reported by Vperf tool - based on the progress of the sessionagenda

MAPTs Performance r Increased the number of Type-I and Type-II network fault events in

MAPTs Performance r Increased the number of Type-I and Type-II network fault events in a controlled LAN testbed for a fixed session-agenda m NISTnet network emulator for network fault generation r Recorded Unwanted Agenda-Bandwidth and Unwanted Agenda-Time measured by Vperf tool (a) Impact of Type-I Network Fault Events on Unwanted Agenda-Bandwidth (b) Impact of Type-I and Type-II Network Fault Events on Unwanted Agenda-Time

Summary r Signaling Protocols Terminology r Introduction to H. 323 and SIP r Comparison

Summary r Signaling Protocols Terminology r Introduction to H. 323 and SIP r Comparison of H. 323 and SIP r Basics of RTP, RTCP r Factors affecting VVo. IP System Performance r Measuring VVo. IP System Performance m OSC’s H. 323 Beacon Tool m OSC’s Vperf Tool

Questions?

Questions?