| 研究生: |
王怡雯 Yi-Wen Wang |
|---|---|
| 論文名稱: |
適合MPEG-2/4 AAC聲學模型之 Design and VLSI Implementation for Psychoacoustic Model in MPEG-2/4 Advanced Audio Coding |
| 指導教授: |
蔡宗漢
Tsung-Han Tsai |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 電機工程學系 Department of Electrical Engineering |
| 畢業學年度: | 92 |
| 語文別: | 英文 |
| 論文頁數: | 58 |
| 中文關鍵詞: | 聲學模型 |
| 外文關鍵詞: | Psychoacoustic Model |
| 相關次數: | 點閱:7 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
資料壓縮技術對於聲音的系統是個必要的任務,它不只可以處理龐大的資料,但是也要求高品質的解析度。有一種聲音編碼的壓縮技術叫做MPEG,MPEG 是一個標準化對於聲音壓縮上是有效率的。它可以有意義的降低在頻寬傳送和資料儲存的需求上而且在失真率上也很低。
這篇論文所要呈現是一個新的低複雜度設計聲學模型,它的重要技術是應用於一個低功率的MPEG-2/4 AAC編碼器。在現今MPEG AAC的計算複雜很高,沒有辦法達到聲音即時播放,是攜帶式裝置的一個瓶頸。為了克服這個問題,必需對聲學模型做分析和最佳化的設計,所以,在演算法上,spreading function的計算方式是用查表方式來取代。除此之外,MDCT-Based 聲學模型也適合關於複雜度的降低和品質的保持,聲學模型的計算雜複度總共被降低到達到80%;在架構上,我們呈現一個專屬MDCT-Based聲學模型硬體設計,所以就可以實現一個即時播放在MPEG-2/4 AAC立體聲編碼器上,在位元速128kbit/sec下,頻率在20MHz都可以保持CD的品質。
Data compression technique is an essential task for audio systems, which not only handles enormous amounts of data, but also requires the high quality resolution. One of theses audio coding techniques, Moving Pictures Experts Group (MPEG) is powerful audio compression standardization. It can significantly reduce the requirements of transmission bandwidth and data storage, but with low distortion.
The paper presents a new low complexity design of Psycho-Acoustic Model (PAM), which is the key technology for a low power MPEG-2/4 Advanced Audio Coding (AAC) encoding. The real-time constraint of MPEG AAC leads to a heavy computational bottleneck on today’s portable devices. To overcome this problem, design analysis and optimization of PAM are addressed. At algorithmic level, the calculation of spreading function was replaced with look-up tables. Besides, Modified-Discrete-Cosine-Transform-based (MDCT-based) PAM was referred to and adopted concerning reducing complexity and maintaining quality. The computational complexity of PAM could be reduced by more than 80% in total. At architectural level, we presented a dedicated hardware design of MDCT-based PAM. The proposed design could be implemented in a real-time MPEG-2/4 AAC stereo encoder at Low Complexity profile and at bitrate 128 kb/s below clock rate 20 MHz while maintaining CD quality.
[1] MPEG. Coding of moving pictures and associated audio for digital storage media at up to 1.5 Mbit/s, part 3: Audio, International Standard IS 11172-3, ISO/IEC JTC1/SC29 WG11, 1992.
[2] MPEG. Information Technology – generic coding of moving pictures and associated audio, part 3: Audio, International Standard IS 13818-3, ISO/IEC JTC1/SC29 WG11, 1994.
[3] MPEG. MPEG-2 Advanced Audio Coding, AAC, International Standard IS 13818-7, ISO/IEC JTC1/SC29 WG11, 1997.
[4] Marina Bosi, Karlheinz Brandenburg, Schuyler Quackenbush, Louis Fielder, Kenzo Akagiri, Hendrik Fuchs, Martin Dietz, Jurgen Herre, Grant Davidson, Yoshiaki Oikawa, “ISO/IEC MPEG-2 Advanced Audio Coding,” J. Audio Eng. Soc., Vol. 45, No. 10, 1997 October.
[5] MPEG. Information technology – Coding of audio-visual objects – Part 3: Audio, International Standard IS 14496-3, ISO/IEC JTC1/SC29 WG11, 1999.
[6] Karlheinz Brandenburg, “MP3 and AAC explained,” AES 17th International Conference on High Quality Audio Coding, Italy, Sep. 2-5, 1999.
[7] ISO/IEC 14496-5 2001 Software Reference. Available: http://www.iso.ch/iso/en/ittf/PubliclyAvailableStandards/ISO_IEC_14496-5_2001_Software_Reference/
[8] Yuichiro Takamizawa, Toshiyuki Nomura, Masao Ikekawa, “High-quality and processor-efficient implementation of an MPEG-2 AAC encoder,” in Proceedings of the 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 2, Page(s): 985 –988.
[9] Do Hyoung Kim, Dong Hyun Kim, Jae Ho Chung, “Optimization of MPEG-4 GA AAC on general PC,” in Proceedings of the 44th IEEE 2001 Midwest Symposium on Circuits and Systems, Vol. 2, pp. 923-925.
[10] Ivan Dimkoviae, Dragorad Milovanoviae, Zoran Bojkoviae, “Fast software implementation of MPEG advanced audio encoder,” 2002 14th International Conference on Digital Signal Processing, Vol. 2, Page(s): 839 –843.
[11] Dong-Yan Huang, Xuesong Gong, Daqing Zhou, Toshio Miki, Sanae Hotani, “Implementation of the MPEG-4 Advanced Audio Coding encoder on ADSP-21060 SHARC,” in Proceedings of the 1999 IEEE International Symposium on Circuits and Systems, Vol. 3, page(s): 544 –547.
[12] Yuichiro Takamizawa, Tsuyoshi Okumura, Toshiyuki Nomura, Masao Ikekawa, and Ichiro Kuroda, “20mW MPEG-2/4 AAC LC stereo encoder on a 16-bit DSP,” Workshop and Exhibition on MPEG-4, San Jose, California, June 25-27 2002.
[13] Marc Gayer, Markus Lohwasser, Manfred Lutzky, “Implementing MPEG Advanced Audio Coding and Layer-3 encoders on 32-bit and 16-bit fixed-point processors,” presented at the AES 115th Convention, New York, Oct. 10-13, 2003.
[14] Tsung-Han Tsai, Shih-Way Huang, Liang-Gee Chen, “Design of a low power psychoacoustic model co-processor for MPEG-2/4 AAC LC stereo encoder,” in Proceedings of the 2003 IEEE International Symposium on Circuits and Systems, Vol. 2, page(s): 552 –555, May 25-28, 2003.
[15] Chi-Min Liu, Chin-Ching Chen, Wen-Chieh Lee, Szu-Wei Lee, “A fast bit allocation method for MPEG layer III,” IEEE International Conference on Consumer Electronics, 1999, pages 22-23.
[16] Chih-Kai Yang, Sau-Gee Chen, “New static and dynamic search algorithms for fast MP3 bit allocations,” in 2003 IEEE International Conference on Multimedia and Expo, Vol. 1, pages 77-80.
[17] Hyen-O Oh, Joon-Seok Kim, Chang-Jun Song, Young-Cheol Park, Dae-Hee Youn, “Low power MPEG/audio encoders using simplified psychoacoustic model and fast bit allocation,” IEEE Transaction of Consumer Electronics, Volume: 47 Issue: 3, Page(s): 613 –621, Aug. 2001.
[18] Vasudev Bhaskaran, Konstantions Konstantinides, “Image and video compresson standards algorithms and architectures,” Hewlett-Packard Laboratories, Second Edition, 2000.
[19] Ted Painter, Andreas Spanias, “Perceptual coding of digital audio,” Proceedings of the IEEE,Vol.88,P.451-515,2000.
[20] Marina Bosi, Richard E. Goldberg, “Introduction to digital audio coding and standards,” Kluwer Academic Publishers,2003.
[21] Miroslava Raspopovic, “Design of Perception Based Audio Codec,” University of Massachusetts Lowell,2001.
[22] H. Fletcher, “Auditory Patterns,” Rev. Mod. Phys., pp. 47-65, Jan. 1940.
[23] Terhardt, E., “Calculating Virtual Pitch,” Hearing Research, pp. 155-182, 1, 1979.
[24] Fengduo Hu, “ITE Technology Incorporated,”2003.
[25] Srinivasan P., Jamieson L.H., “High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling,” IEEE Transactions on,Vol.46,P.1085-1093,1998.
[26] P. Duhamel, Y. Mahieux, J.P. Petit, “A fast algorithm for the implementation of filter banks based on time domain aliasing cancellation,” in Proceedings of the 1991 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 2209-2212.
[27] James D. Johnston, “Transform coding of audio signals using perceptual noise criteria,” IEEE Journal on Selected Area on Communications, Vol. 6, No 2, Feb. 1988.
[28] Mark Kahrs, Karlheinz Brandenburg, Applications of digital signal processing to audio and acoustics. Kluwer Academic Publishers, 1998, p.59.
[29] Winnie Lau, Alex Chwu, “A common transform engine for MPEG & AC3 audio encoder,”
[30] Chichyang Chen, Rui-Lin Chen, Chih-Huan Yang, ”Pipelined Computation of Very Large Word-Length LNS Addition/Subtraction with Polynomial Hardware Cost”, IEEE Transactions on Computers, Vol. 49 Issue: 7, Page(s): 716 -726, July 2000.
[31] ISO/IEC JTC1/SC29 WG11 No.1650 “IS 13818-7 (MPEG-2 Advanced Audio Coding , AAC)”, April 1997.