CS 414 Multimedia Systems Design Lecture 3 Digital
- Slides: 34
CS 414 – Multimedia Systems Design Lecture 3 – Digital Audio Representation Klara Nahrstedt Spring 2012 CS 414 - Spring 2012
Administrative n Form Groups for MPs ¨ Deadline January 23 (today!!) to email TA CS 414 - Spring 2012
Integrating Aspects of Multimedia Image/Video Capture Audio/Video Perception/ Playback Audio/Video Presentation Playback Image/Video Information Representation Transmission Audio Capture Transmission Compression Processing Audio Information Representation Media Server Storage CS 414 - Spring 2012 A/V Playback
Today Introduced Concepts n Analog to Digital Sound Conversion ¨ Sampling, sampling rate, Nyquist Theorem ¨ Quantization, pulse code modulation, differentiated PCM, Adaptive PCM Signal-to-Noise Ratio n Data Rates n CS 414 - Spring 2012
Key Questions n How can a continuous wave form be converted into discrete samples? n How can discrete samples be converted back into a continuous form? CS 414 - Spring 2012
Lifecycle from Sound to Digital to Sound Source: http: //en. wikipedia. org/wiki/Digital_audio
Characteristics of Sound n n n Amplitude Wavelength (w) Frequency ( ) Timbre Hearing: [20 Hz – 20 KHz] Speech: [200 Hz – 8 KHz]
Digital Representation of Audio n Must convert wave form to digital ¨sample ¨quantize ¨compress
Sampling (in time) n Measure amplitude at regular intervals n How many times should we sample? CS 414 - Spring 2012
Nyquist Theorem For lossless digitization, the sampling rate should be at least twice the maximum frequency response. In mathematical terms: fs > 2*fm n where fs is sampling frequency and fm is the maximum frequency in the signal n CS 414 - Spring 2012
Nyquist Limit n max data rate = 2 H log 2 V bits/second, where H = bandwidth (in Hz) V = discrete levels (bits per signal change) n Shows the maximum number of bits that can be sent per second on a noiseless channel with a bandwidth of H, if V bits are sent per signal ¨ Example: what is the maximum data rate for a 3 k. Hz channel that transmits data using 2 levels (binary) ? ¨ Solution: H = 3 k. Hz; V = 2; max. data rate = 2 x 3, 000 xln 2=6, 000 bits/second CS 414 - Spring 2012
Limited Sampling n But what if one cannot sample fast enough? CS 414 - Spring 2012
Limited Sampling n Reduce signal frequency to half of maximum sampling frequency ¨ low-pass filter removes higher-frequencies ¨ e. g. , if max sampling frequency is 22 k. Hz, must low-pass filter a signal down to 11 k. Hz CS 414 - Spring 2012
Sampling Ranges n Auditory range 20 Hz to 22. 05 k. Hz ¨ must sample up to to 44. 1 k. Hz ¨ common examples are 8. 000 k. Hz, 11. 025 k. Hz, 16. 000 k. Hz, 22. 05 k. Hz, and 44. 1 KHz n Speech frequency [200 Hz, 8 k. Hz] ¨ sample up to 16 k. Hz ¨ but typically 4 k. Hz to 11 k. Hz is used CS 414 - Spring 2012
CS 414 - Spring 2012
Quantization CS 414 - Spring 2012
Sampling and 4 -bit quantization Source: http: //en. wikipedia. org/wiki/Digital_audio
Quantization n Typically use ¨ 8 bits = 256 levels ¨ 16 bits = 65, 536 levels n How should the levels be distributed? ¨ Linearly? (PCM) ¨ Perceptually? (u-Law) ¨ Differential? (DPCM) ¨ Adaptively? (ADPCM) CS 414 - Spring 2012
Pulse Code Modulation n Pulse modulation ¨ Use discrete time samples of analog signals ¨ Transmission is composed of analog information sent at different times ¨ Variation of pulse amplitude or pulse timing allowed to vary continuously over all values n PCM ¨ Analog signal is quantized into a number of discrete levels CS 414 - Spring 2012
PCM Quantization and Digitization Quantization CS 414 - Spring 2012
Linear Quantization (PCM) Divide amplitude spectrum into N units (for log 2 N bit quantization) Quantization Index n Sound Intensity CS 414 - Spring 2012
Non-uniform Quantization CS 414 - Spring 2012
Perceptual Quantization (u-Law) Want intensity values logarithmically mapped over N quantization units Quantization Index n Sound Intensity CS 414 - Spring 2012
Differential Pulse Code Modulation (DPCM) n What if we look at sample differences, not the samples themselves? ¨ dt = xt-xt-1 ¨ Differences tend to be smaller n Use 4 bits instead of 12, maybe? CS 414 - Spring 2012
Differential Pulse Code Modulation (DPCM) n n Changes between adjacent samples small Send value, then relative changes ¨ value uses full bits, changes use fewer bits ¨ E. g. , 220, 218, 221, 219, 220, 221, 222, 218, . . (all values between 218 and 222) ¨ Difference sequence sent: 220, +2, -3, 2, -1, -1, +4. . ¨ Result: originally for encoding sequence 0. . 255 numbers need 8 bits; ¨ Difference coding: need only 3 bits CS 414 - Spring 2012
Adaptive Differential Pulse Code Modulation (ADPCM) n n Adaptive similar to DPCM, but adjusts the width of the quantization steps Encode difference in 4 bits, but vary the mapping of bits to difference dynamically ¨ If rapid change, use large differences ¨ If slow change, use small differences CS 414 - Spring 2012
Signal-to-Noise Ratio (metric to quantify quality of digital audio) CS 414 - Spring 2012
Signal To Noise Ratio n Measures strength of signal to noise SNR (in DB)= n Given sound form with amplitude in [-A, A] A n Signal energy = 0 -A CS 414 - Spring 2012
Quantization Error n Difference between actual and sampled value ¨ amplitude between [-A, A] ¨ quantization levels = N n e. g. , if A = 1, N = 8, = 1/4 CS 414 - Spring 2012
Compute Signal to Noise Ratio n Signal energy = ; Noise energy = n Signal to noise = n Every bit increases SNR by ~ 6 decibels CS 414 - Spring 2012 ;
Example n n n Consider a full load sinusoidal modulating signal of amplitude A, which utilizes all the representation levels provided The average signal power is P= A 2/2 The total range of quantizer is 2 A because modulating signal swings between –A and A. Therefore, if it is N=16 (4 -bit quantizer), Δ = 2 A/24 = A/8 The quantization noise is Δ 2/12 = A 2/768 The S/N ratio is (A 2/2)/(A 2/768) = 384; SNR (in d. B) 25. 8 d. B CS 414 - Spring 2012
Data Rates n Data rate = sample rate * quantization * channel n Compare rates for CD vs. mono audio ¨ 8000 samples/second * 8 bits/sample * 1 channel = 8 k. Bytes / second ¨ 44, 100 samples/second * 16 bits/sample * 2 channel = 176 k. Bytes / second ~= 10 MB / minute CS 414 - Spring 2012
Comparison and Sampling/Coding Techniques CS 414 - Spring 2012
Summary CS 414 - Spring 2012
- 01:640:244 lecture notes - lecture 15: plat, idah, farad
- Digital systems testing and testable design
- Digital design: a systems approach
- Digital design a system approach
- Digital system design using verilog
- Multimedia becomes interactive multimedia when
- What is non linear multimedia
- Chapter 1 introduction to multimedia
- Esa multimedia.esa.int./multimedia/virtual-tour-iss
- 0 414
- Mil-std-414
- Mil std 414
- 414 climate change
- Cs 414
- Cmsc414 umd
- Cse 414
- Gcd of 414 and 662
- Cmsc 414
- Cmsc 414
- Cmsc 414
- Operating systems lecture notes
- Lecture sound systems
- Lecture sound systems
- Justin sconza
- Analog and digital video
- Multimedia data representation
- Kata video berasal dari kata
- What is sound in multimedia
- Digital audio in multimedia
- Analog and digital video in multimedia
- Types of multimedia systems
- The term multimedia
- Advanced multimedia systems
- Multimedia management system
- Anchorage length eurocode