MPEG-2 AAC編碼器的SoC設計｜國立中央大學博碩士論文系統

簡易檢索 / 詳目顯示

回結果列表

研究生：	沙宇杰 Yu-Jie Sha
論文名稱：	MPEG-2 AAC編碼器的SoC設計 An SoC Design of MPEG-2 AAC Encoder
指導教授：	蔡宗漢 Tsung-Han Tsai
口試委員:
學位類別：	碩士 Master
系所名稱：	資訊電機學院 - 電機工程學系 Department of Electrical Engineering
畢業學年度：	100
語文別：	英文
論文頁數：	95
中文關鍵詞：	SoC 、MDCT 、Encoder 、AAC
外文關鍵詞：	SoC, Encoder, MDCT, AAC
相關次數：	點閱：19 下載：0
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

在日常生活的娛樂與通訊中，數位音訊編碼扮演著一個很重要的角色。自從MPEG Layer-III（MP3）被發佈出來並且在3C產品中變成非常有名，MPEG協會提出了MPEG-2 Advanced Audio Coding（AAC）為新一代的音訊編碼規格。在這個新的規格中，它的性能與壓縮率都比MP3要來的高。可是它的演算法也相對來說更複雜而且包含更高的運算量。因此如何降低AAC編碼器的運算量並且維持低失真率是一個主要的挑戰。
在本論文中，我們提出了一個MPEG-2 AAC編碼器的SoC設計，而且針對MPEG-2 AAC編碼器中最主要的組件做最佳化，也就是聲學模型（PAM）。為了要減少聲學模型的複雜度，我們使用了基於改良式離散餘弦轉換之聲學模型（MDCT-based PAM）來減輕整個系統的運算量。我們使用台積電的0.18μm 1P6M CMOS製程來實現我們所提出的編碼器。為了增加我們所設計的硬體加速器的處理能力（throughput），我們使用了管線與摺疊式的設計。本設計的總邏輯閘數是284K個，而基於改良式離散餘弦轉換之聲學模型與Andes Core N903處理器針對取樣頻率為44100赫茲、立體聲道的音樂可以達到即時編碼，其所需的操作頻率各為10兆赫與100兆赫。

Digital audio coding has played an important role in our daily life for entertainment and communication. Since MPEG Layer-III (MP3) had been published, and became very popular in consumer applications, the MPEG organization proposed MPEG-2 Advanced Audio Coding (AAC) standard as the next generation of audio standard. Both performance and compression ratio of AAC are better than MP3. However, the algorithm is more complex and computation-intensive. Thus, how to reduce the computation and maintain the audio fidelity is the major challenge of AAC encoder.
In this thesis, we proposed an SoC design of MPEG-2 AAC encoder, and optimized the key component of MPEG-2 AAC encoder, which is the psychoacoustic model (PAM). In order to reduce the complexity of psychoacoustic model, we used the MDCT-based PAM to alleviate the loading of the entire system. Our proposed encoder is implemented in TSMC 0.18 μm 1P6M CMOS technology. Pipelining and folding technique are used to increase throughput of the hardware accelerator. The total gate count is 284K and the operating frequency for real-time stereo channel of sampling rate 44.1KHz audio sequence processing is 10 MHz and 100MHz for MDCT-based PAM and Andes Core N903 processor respectively.

摘要	i
Abstract	ix
Table of Contents	x
List of Figures	xii
List of Tables	xiv
Chapter 1 Introduction	1
1  The History and Feature of Audio Application	2
2  The System of MPEG-2/4 AAC, HE-AAC v1/v2	6
2.1  The system of MPEG-2/4 AAC encoder	6
2.2  The System of HE-AAC v1 (AAC+SBR)	7
2.3  The System of HE-AAC v2 (AAC+SBR+PS)	8
3  Overview of SoC Platform-based Design	9
4  Motivation	10
5  Thesis Organization	14
Chapter 2 The Overview of MPEG-2/4 AAC Encoder	15
1  Filterbank	18
1.1  Window Shape Adaptation	19
1.2  Window Type Decision	19
1.3  Modified Discrete Cosine Transform (MDCT)	21
2  Psychoacoustic Model	22
3  The Spectral Processing Part (SPP)	25
3.1  Temporal Noise Shaping (TNS)	25
3.2  Joint Stereo Coding	25
4 Quantization Loop (Q loop)	26
Chapter 3 The Algorithm of Low Complexity MDCT-Based Psychoacoustic Model	29
1  FFT-Based MDCT Algorithm	32
2  Low Complexity MDCT-Based Psychoacoustic Model	35
Chapter 4 SoC Design of Proposed AAC Encoder	42
1  Architecture of Proposed Design	43
1.1  Software/Hardware Partition	43
1.2  System Architecture	45
1.3  Software Development	49
1.4  Hardware Implementation	50
2  The Architecture of MDCT-Based PAM	51
2.1  MDCT	53
2.2  Threshold Generator (TG)	54
2.3  The Log Module	55
Chapter 5 Implementations and Results	58
1  Performance Evaluation	59
2  Power Analysis and Evaluation	60
3  Designs for Testing Strategy Power Analysis and Evaluation	62
4  Comparison and Results	62
Chapter 6 Conclusions and Future Work	71
References	73

                                

[1] MPEG. Coding of moving pictures and associated audio for digital storage media at up to 1.5 Mbit/s, part 3: Audio, International Standard IS 11172-3, ISO/IEC JTC1/SC29 WG11, 1992.
[2] MPEG. Information Technology – generic coding of moving pictures and associated audio, part 3: Audio, International Standard IS 13818-3, ISO/IEC JTC1/SC29 WG11, 1994.
[3] MPEG-2 Advanced Audio Coding, AAC, International Standard IS 13818-7, ISO/IEC JTC1/SC29 WG11, 1997.
[4] MPEG. Information technology – Coding of audio-visual objects – Part 3: Audio, International Standard IS 14496-3, ISO/IEC JTC1/SC29 WG11, 1999.
[5] MPEG. Information technology – Coding of audio-visual objects – Part 3: Audio, Amendment 1: Bandwidth extension. ISO/IEC 14496-3:2001/Amd. 1:2003, Nov. 2003.
[6] MPEG Information technology – Coding of audio-visual objects – part 3: Audio, Amendment 2: Parametric coding for high-quality audio, ISO/IEC 14496-3/Amd. 2: 2004.
[7] MPEG Information technology – Coding of audio-visual objects – part 3: Audio, Amendment 2: Audio Lossless Coding, ISO/IEC 14496-3/Amd. 2: 2005.
[8] MPEG Information technology – Coding of audio-visual objects – part 3: Audio, Amendment 3: Scalable Lossless Coding, ISO/IEC 14496-3/Amd. 3: 2005.
[9] R. Geiger, T. Sporer, J. Koller, and K. Brandenburg, “Audio Coding based on Integer Transform,” in AES 111th Convention, New York, NY, USA Preprint 5471 Sept. 2001.
[10] T. Liebchen, “An introduction to MPEG-4 audio lossless coding,” IEEE Proc. ICSSAP2004, vol. 3, pp. iii – 1012 – 1015, May 2004.
[11] P. Coussy, A. Baganne, and E. Martin. “Virtual component IP re-use in telecommunication systems design: a case study of MPEG-2/JPEG2000 encoder,” IEEE Proc .ICECS2002, vol. 2, pp.733-736, Sept. 2002.
[12] C.N. Liu, and T.H. Tsai, “SoC platform based design of MPEG-2/4 AAC audio decoder,” IEEE Proc. ISCAS2005., vol. 3, pp.2581-2584, May 2005.
[13] Domazet, D.; Kovac, M.; “Advanced software implementation of MPEG-AAC audio encoder”, 4th EURASIP Conference focused on Video/Image Processing and Multimedia Communications, 2003. Volume 2, 2-5 July 2003 Page(s):679 – 684 vol.2.
[14] D. Huang, X. Gong, D. Zhou, T. Miki, S. Hotani, “Implementation of the MPEG-4 Advanced Audio Coding encoder on ADSP-21060 SHARC,” in Proceedings of the 1999 IEEE International Symposium on Circuits and Systems, Vol. 3, page(s): 544 – 547.
[15] D. Alberto, P. Rafael, R. Begona, A. Enrique, and P. Antonio; “A Robust and Efficient
Implementation of MPEG-2/4 AAC Natural Audio Coders” , in AES 112th Convention 2002
May 10-13 Munich,Germany.
[16] P. Antonio, A. Enrique, R. Begona, P. Rafael, and D. Alberto;” Realtime implementations of MPEG-2 and MPEG-4 natural audio coders” , in AES 110th Convention 2001 May 12-15 Amsterdam, The Netherlands.
[17] Y.C. Lu; C.-F. Shen and C.K. Chen; “A novel hardware accelerator architecture for MPEG-2/4 AAC encoder”, 2004 IEEE International Conference on Multimedia and Expo, 2004. ICME ''04. Volume 2, 27-30 June 2004 Page(s):1139 – 1142 Vol.2.
[18] M. Kahrs, K. Brandenburg, Applications of digital signal processing to audio and acoustics. Kluwer Academic Publishers, 1998, p.59.
[19] H.-C. Liu, “A Low Complexity Platform-Based Psychoacoustic Model (PAM) for MPEG-2/4 AAC Encoder,” Department of Electrical Engineering, National Central University, Jhongli City, Taoyuan County, Taiwan (R.O.C.); Master Thesis, 2008.
[20] Fengduo Hu, “ITE Technology Incorporated,” 2003.
[21] Y. Takamizawa, T. Nomura, and M. Ikekawa, “High-quality and processor-efficient
implementation of an MPEG-2 AAC encoder,” in Proceedings of the 2001 IEEE International
Conference on Acoustics, Speech, and Signal Processing, Vol. 2, Page(s): 985 – 988.
[22] J. D. Johnston, “Transform coding of audio signals using perceptual noise criteria,” IEEE
Journal on Selected Areas in Communications, Vol. 6, No 2, pp. 314-323, Feb., 1988.
[23] I. Dimkoviae, D. Milovanoviae, Z. Bojkoviae, “Fast software implementation of MPEG
advanced audio encoder,” 2002 14th International Conference on Digital Signal Processing,
Vol. 2, Page(s): 839 –843.
[24] S.W Huang; T.H. Tsai; L.G. Chen; “A low complexity design of psycho-acoustic model for MPEG-2/4 advanced audio coding”, IEEE Transactions on Consumer Electronics, Volume 50, Issue 4, Nov. 2004 Page(s):1209 - 1217.
[25] S.W. Huang; L.G. Chen; T.H. Tsai; “Memory and Computationally Efficient Psychoacoustic Model for MPEG AAC on 16-bit Fixed-point Processors” Circuits and Systems, 2005. ISCAS 2005. Symposium on IEEE International 23-26 May 2005 Page(s):3155 – 3158.
[26] P.S. Wu, and Y.T. Hwan; “Efficient IMDCT core designs for audio signal processing”, IEEE Workshop on Signal Processing Systems, 2003. SIPS 2003. 27-29 Aug. 2003 Page(s):275 –280
[27] P. Duhmel, Y. Mahieux, and J.P. Petit, “A fast algorithm for the implementation of filter banks based 1on ‘time domain aliasing cancellation’ ”, International Conference on Acoustics, Speech, and Signal Processing, Vol. 3, Page(s): 2209 - 2212, Apr, 1991.
[28] Britanak, V.; Rao, K.R.; “An efficient implementation of the forward and inverse MDCT in MPEG audio coding”, IEEE Signal Processing Letters, Volume 8, Issue 2, Feb. 2001 Page(s): 48 - 51.
[29] Y.H. Fan, Madisetti, V.K. and Mersereau, R.M.; “On fast algorithms for computing the inverse modified discrete cosine transform”, IEEE Signal Processing Letters, Volume 6, Issue 3, March 1999 Page(s): 61 - 64.
[30] M.-H Cheng, Y.-H. Hsu, “Fast IMDCT and MDCT algorithms-a matrix approach”, IEEE Trans. On Signal Processing, Jan 2003, pp. 221 - 229.

簡易檢索 / 詳目顯示

相關論文