11.4 MPEG-2 Audio

  • Slides: 42

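11.4 MPEG-2 Audio
  • Introduction to MPEG-2 Audio
    • The MPEG-2 standards committee defined two audio data compression standards
      • MPEG-2 Audio (ISO/IEC 13818-3) [12]
        • Also called MPEG-2 Multichannel Audio
        • Because it is compatible with MPEG-1 Audio, it is also called the MPEG-2 BC (Backward Compatible) standard
      • MPEG-2 AAC (ISO/IEC 13818-7) [22]
        • Because it is not compatible with the MPEG-1 Audio format, it is usually called the MPEG-2 NBC (Non-Backward Compatible) standard
October 31, 2021    Chapter 11: MPEG Audio    26/42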

11.6 MPEG-4 Audio (cont. 1)
Figure 11-23: MPEG-4 Audio data rates and target applications (from ISO/IEC 14496-3 Subpart 1: 1998)

11.6 MPEG-4 Audio (cont. 3)
  • MC/LSF: multichannel and low sampling frequency
  • AAC: advanced audio coder
  • SBR: spectral band replication
  • SSC: sinusoidal coding
  • SLS: scalable lossless
  • DST: direct stream transfer
Figure 11-24: Overview of MPEG-4 Audio [24]

11.6 MPEG-4 Audio (cont. 4)
  • MPEG-4 Audio tools and documents
    • The audio tools provided fall into 8 categories
      (1) Speech coding tools
      (2) Audio coding tools
      (3) Lossless audio coding tools
      (4) Synthesis tools
      (5) Composition tools
      (6) Scalability tools
      (7) Upstream (upstream data flow control tools)
      (8) Error robustness facilities

11.6 MPEG-4 Audio (cont. 5)
    • The document describing these tools (ISO/IEC 14496-3) has 10 subparts
      Subpart 1: Main
      Subpart 2: Speech coding: HVXC
      Subpart 3: Speech coding: CELP
      Subpart 4: General Audio coding (GA): AAC, TwinVQ, BSAC
      Subpart 5: Structured Audio (SA)
      Subpart 6: Text To Speech Interface (TTSI)
      Subpart 7: Parametric Audio Coding: HILN
      Subpart 8: Parametric coding for high quality audio: SSC
      Subpart 9: MPEG-1/2 Audio in MPEG-4
      Subpart 10: Lossless coding of oversampled audio: DST
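The subpart list above can be read as a simple lookup table from coding tool to the subpart that describes it. The following is only a minimal sketch: the subpart numbers, titles, and tool names are transcribed from the slide, while the dictionary layout and the helper name `subpart_for_tool` are illustrative assumptions and not part of ISO/IEC 14496-3.

```python
# Subpart numbers, titles, and tool names are transcribed from the slide.
# The data layout and the helper function are hypothetical, for illustration only.
SUBPARTS = {
    1: ("Main", []),
    2: ("Speech coding", ["HVXC"]),
    3: ("Speech coding", ["CELP"]),
    4: ("General Audio coding (GA)", ["AAC", "TwinVQ", "BSAC"]),
    5: ("Structured Audio (SA)", []),
    6: ("Text To Speech Interface (TTSI)", []),
    7: ("Parametric Audio Coding", ["HILN"]),
    8: ("Parametric coding for high quality audio", ["SSC"]),
    9: ("MPEG-1/2 Audio in MPEG-4", []),
    10: ("Lossless coding of oversampled audio", ["DST"]),
}


def subpart_for_tool(tool):
    """Return the subpart number whose tool list contains `tool`, or None."""
    for number, (_title, tools) in SUBPARTS.items():
        if tool in tools:
            return number
    return None


if __name__ == "__main__":
    for name in ("HVXC", "AAC", "DST"):
        number = subpart_for_tool(name)
        print(f"{name}: Subpart {number} ({SUBPARTS[number][0]})")
```

A tool name that the slide does not list returns `None`, so a caller can tell "not on this slide" apart from a valid subpart number.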

Chapter 11: MPEG Audio (References)
  • References and sites
    1. The MPEG Home Page, http://www.chiariglione.org/mpeg/
    2. MPEG Industry Forum, http://www.mpegif.org/resources.php
    3. MPEG Audio Resources and Software, http://www.mpeg.org/MPEG/audio.html
    4. The MPEG Audio Web Page, http://sound.media.mit.edu/mpeg4/audio/
    5. J. S. Tobias, Ed., Foundations of Modern Auditory Theory, Vol. 1, Academic Press, New York, 1970.
    6. Hugo Fastl and Eberhard Zwicker, Psychoacoustics: Facts and Models (Springer Series in Information Sciences), 3rd ed., 2007, pp. 149-173.
    7. Ted Painter and Andreas Spanias, Perceptual Coding of Digital Audio, Proceedings of the IEEE, Vol. 88, No. 4, April 2000. http://www.eas.asu.edu/~spanias/papers/paper-audio-tedspanias-00.pdf

Chapter 11: MPEG Audio (References, cont. 1)
    8. Miroslava Raspopovic, Charles Thompson, Donn Clark, Design of Perception Based Audio Codec - Final Report, May 25th, 2001. http://morse.uml.edu/~mira/Research/Codec.pdf
    9. Teddy Surya Gunawan, Eliathamby Ambikairajah, Audio Compression and Speech Enhancement using Temporal Masking Models, thesis submitted for the degree of Doctor of Philosophy, 2007. http://www.library.unsw.edu.au/~thesis/adt-NUN/uploads/approved/adt-NUN20070226.040348/public/01front.pdf
    10. Advanced Television Systems Committee, Inc., Digital Audio Compression Standard (AC-3, E-AC-3), Revision B, Document A/52B, 14 June 2005. http://www.atsc.org/standards.html
    11. ITU Radiocommunication Study Groups, A guide to digital terrestrial television broadcasting in the VHF/UHF bands, 1998. http://happy.emu.id.au/lab/tut/dttbtuti.htm
    12. ISO/IEC 13818-3, ISO/IEC JTC 1/SC 29/WG 11 NO 803, Information Technology - Generic Coding of Moving Pictures and Associated Audio: Audio, 11 November 1994.
    13. P. U. Y. Dehery, M. Lever, A MUSICAM source codec for digital audio broadcasting and storage, in Proceedings of Int. Conf. Acoustic, Speech, Signal Processing, pp. 3605-3608, IEEE, 1991.

Chapter 11: MPEG Audio (References, cont. 2)
    14. K. Brandenburg, J. Herre, J. D. Johnston, Y. Mahieux, and E. Schroeder, ASPEC: Adaptive spectral entropy coding of high quality music signals, in Proc. 90th Convention Aud. Eng. Soc., Feb. 1991.
    15. P. Noll, Wideband Speech and Audio Coding, IEEE Comm. Mag., pp. 34-44, Nov. 1993. http://ieeexplore.ieee.org/iel1/35/6505/00256878.pdf
    16. Davis Pan, A Tutorial on MPEG/Audio Compression, IEEE Multimedia, 1995, pp. 60-74. http://www.ee.columbia.edu/~dpwe/e6820/papers/Pan95-mpega.pdf
    17. Karlheinz Brandenburg, OCF - A New Coding Algorithm for High Quality Sound Signals, 1987. http://ieeexplore.ieee.org/iel6/8363/26345/01169893.pdf
    18. J. Princen, A. Bradley, Analysis/Synthesis Filter Bank Design Based on Time Domain Aliasing Cancellation, IEEE Transactions, ASSP-34, No. 5, Oct. 1986, pp. 1153-1161. http://ieeexplore.ieee.org/iel6/29/26200/01164954.pdf

Chapter 11: MPEG Audio (References, cont. 3)
    19. Ye Wang and Miikka Vilermo, The Modified Discrete Cosine Transform: Its Implications for Audio Coding and Error Concealment, AES 22nd International Conference on Virtual, Synthetic and Entertainment Audio, 2002. http://www.comp.nus.edu.sg/~wangye/papers/00027_aes22.pdf
    20. Hossein Najafzadeh Azghandi, Perceptual Coding of Narrowband Audio Signals, April 2000. http://www-mmsp.ece.mcgill.ca/MMSP/Theses/T1999-2001.html
    21. ISO/IEC 11172-3, Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s, 3-Annex C (informative): The encoding process, 1993.
    22. ISO/IEC 13818-7:2004(E), Information technology - Generic coding of moving pictures and associated audio information - Part 7: Advanced Audio Coding (AAC).
    23. M. Bosi et al., ISO/IEC MPEG-2 Advanced Audio Coding, Journal of the Audio Engineering Society, No. 10, pp. 789-813, October 1997.
    24. Takehiro Moriya, Noboru Harada, Yutaka Kamamoto, and Hiroshi Sekigawa, MPEG-4 ALS: International Standard for Lossless Audio Coding, NTT Technical Review, Vol. 4, No. 8, pp. 40-45, Aug. 2006.

Chapter 11: MPEG Audio (References, cont. 4)
    25. ISO/IEC 14496-3, Third edition, 2005-12-01, Information technology - Coding of audio-visual objects - Part 3: Audio.
    26. Dennis H. Klatt, Review of text-to-speech conversion for English, J. Acoust. Soc. Am. 82(3), September 1987. http://ieeexplore.ieee.org/iel6/8370/26352/01171431.pdf
    27. Stefan Meltzer and Gerald Moser, MPEG-4 HE-AAC v2: audio coding for today's media world, EBU Technical Review, January 2006. http://www.codingtechnologies.com/
    28. Tilman Liebchen, Takehiro Moriya, Noboru Harada, Yutaka Kamamoto, and Yuriy A. Reznik, The MPEG-4 Audio Lossless Coding (ALS) Standard - Technology and Applications, 119th AES Convention, New York, October 7-10, 2005.
    29. MPEG-4 Audio Lossless Coding (ALS) documentation: http://www.nue.tu-berlin.de/forschung/projekte/lossless/mp4als.html
    30. ETSI EN 300 401 V1.3.3 (2001-05), Radio Broadcasting Systems; Digital Audio Broadcasting (DAB) to mobile, portable and fixed receivers. http://www.lrr.in.tum.de/zope/lectures/labcourses/SS03/mikroprakt/files/spec/dab_main.pdf

Chapter 11: MPEG Audio (References, cont. 5)
    31. Arbitron Inc., Critical Band Encoding Technology Audio Encoding System from Arbitron, August 2005. http://www.ccbe.ca/Downloads/Arbitron Encoding white paper intl.pdf
    32. Jong Hwa Kim, Lossless Wideband Audio Compression: Prediction and Transform, Berlin, 2004. http://edocs.tu-berlin.de/diss/2003/kim_jonghwa.pdf
    33. G. Theile, G. Stoll, and M. Link, Low bit-rate coding of high-quality audio signals: An introduction to the MASCAM system, EBU Review, Technical no. 230: 158-81, Aug. 1988.
    34. J. Princen, A. Johnson, and A. Bradley, "Subband/Transform Coding Using Filter Bank Designs Based on Time Domain Aliasing Cancellation," ICASSP 1987 Conf. Proc., May 1987, pp. 2161-2164. http://ieeexplore.ieee.org/iel6/8363/26345/01169405.pdf
    35. Esin Darici Haritaoglu, Wideband Speech and Audio Coding, http://www.umiacs.umd.edu/users/desin/Speech/new.html

END    Chapter 11: MPEG Audio