| 研究生: |
童閏煜 Run-yu Tong |
|---|---|
| 論文名稱: |
參數立體音訊編碼之低複雜解碼方法 Low Complexity Decoding in Parametric Stereo Audio Coding Scheme |
| 指導教授: |
張寶基
Pao-chi Chang |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 通訊工程學系 Department of Communication Engineering |
| 畢業學年度: | 98 |
| 語文別: | 中文 |
| 論文頁數: | 96 |
| 中文關鍵詞: | 空間音訊編碼 、參數立體聲 、空間參數 、冗餘訊號編碼 、解相關濾波器 |
| 外文關鍵詞: | residual coding, decorrelation filter, spatial parameter, spatial audio coding, Parametric Stereo |
| 相關次數: | 點閱:7 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
參數立體聲 (Parametric Stereo, PS) 音訊編碼為2004年在ISO/MPEG 制定的HE-AAC v2標準下定義的一音訊編解碼物件。傳統音訊編碼標準MP3 (MPEG-1 Layer 3)、AAC (Advanced Audio Coding)的壓縮技術主要利用聲響心理模型 (Psycoaustic Model)、遮蔽效應(Masking Effect)來提高資料壓縮率,編碼過程仍以個別聲道獨立處理。PS則利用聲道間音訊的特徵關係,可節省將近一半的資料量,使傳統音訊利於以更低位元率在網路傳輸或儲存。
由於傳統PS解碼端複雜度比編碼端複雜度高出幾近兩倍,對使用者的可攜式裝置電力是一很大負擔。因此本論文提出一低複雜解碼的PS編解碼架構以降低解碼端複雜度為目標,先在編碼端利用對冗餘訊號抽取空間參數的方法傳送冗餘訊號參數,然後解碼端利用單聲道訊號重建冗餘訊號後,會進行偵測與評估其暫態訊號對重建冗餘訊號的影響,並使用能量補償演算法,對重建的冗餘訊號補償暫態訊號
所造成的失真。
最後使得解碼後的音訊品質客觀評分工具(Objective Difference Grade,ODG )分數上升約0.6分,並討論與MPEG-4 AAC結合的效能,其低複雜解碼的PS架構也利於對AAC在低位元率時傳輸。
The Parametric Stereo (PS) audio coding is an audio coding object of High Efficiency Advanced Audio Coding version 2 (HE-AAC v2) which was standardized by ISO/MPEG in 2004. Traditional audio codec, e.g. MP3 or AAC, utilize “Psycoaustic Model” and “Masking Effect” to achieve high compression efficiency. However, they mainly process the signal with single channel. Different from traditional audio codec, the PS audio coding incorporates the characteristics of two channels, to extract spatial parameters and to down-mixes stereo signals into a mono signal. The PS can save almost half data size which provides great help in storage and transmission.
Nevertheless, the complexity of PS decoder is nearly twice larger than that of PS encoder, which causes a serious problem in implementing PS on portable devices. Therefore, this thesis proposes a modified PS coding scheme to reduce the complexity of decoder. The encoder extracts and transmits the additional residual parameters from the residual signal and the mono signal. On the contrary, the decoder reconstructs the residual signal by the mono signal and the transmitted residual parameters. In addition, we detect the existence of transient signal and measure the artifact of reconstructed residual signal. Finally, “Energy compesated algorithm” is proposed to reduce the artifact produced by the transient signal.
The proposed scheme can improve the Objective Difference Grade (ODG) of audio quality measurement “EAQUAL” with 0.6 score. Combining with audio coder AAC, the modified PS coding scheme still maintains a good performance at low coding bitrates.
[1] Johnston and Ferreira, “Sum-Difference Stereo Transform Coding, ” Proc. ICASSP, pp.569- 571, May 1992
[2] Chi-Min Liu, Wen-Chieh Lee, and Yo-Hua Hsiao, “M/S coding based on allocation entropy, ” Proc. of the 6th Int. Conference on Digital Audio Effects, London, UK, Sep. 11~18, 2003.
[3] Lord Rayleigh (J.W. Strutt), “On our perception of sound direction,”
Philosophical Magazine, 13:214–232, 1907.
[4] C. Faller, “Parametric coding of spatial audio,” Proc. of the 7th International Conference on Digital Audio Effects (DAFx’04), Naples, Italy, October 5-8, 2004.
[5] ISO/IEC JTC1/SC29/WG11, “Text of ISO/IEC 14496-3:2001/FPDAM2 (parametric coding for high quality audio),” ISO/IEC JTC1/SC29/WG11 N5713, July 2003.
[6] E. Schuijers, J. Breebaart, H. Purnhagen, and J. Engdeg˙ard, “Low complexity parametric stereo coding,” in Proc. 116th AES Convention, Berlin, Germany, May 2004.
[7] J. Hall and M. Fernandes, “The role of monaural frequency selectivity in binaural analysis,” in J. Acoust. Soc. Amer., 1984, vol. 76, pp. 435 – 439.
[8] J. Engdegard, H. Purnhagen, J. Roden, and L. Liljeryd, “Sythetic ambience in parametric stereo coding,” in Proc. 116th AES Convention, Berlin, Germany, May 2004, pp.1-12.
[9] M. R. Schroeder and B. F. Logan, “Colorless Artificial Reverberation”, J. Audio Eng. Soc., vol. 9, no. 3, pp. 192-197, July 1961.
[10] J. Breebaart, S. van de Par, A. Kohlrausch, and E. Schuijers, “High-quality parametric spatial audio coding at low bitrates,” in Proc. 116th AES Convention, Berlin, Germany, May 2004.
[11] R. Irwan and R. M. Aarts, “Two-to-five channel sound processing,” Journal of the Audio Engineering Society, vol. 50, no. 11, pp. 914–926, 2002.
[12] T.C. Li, and P.C. Chang, “Spatial characteristic based scalable audio coding structure, “ in Proc. of National Symposium on Tele- communications (NST), Taipei, Taiwan, pp. 452-456, Dec. 2009.
[13] J. Breebaart, S. van de Par, A. Kohlrausch, and E. Schuijers “Parametric coding of stereo audio”, EURASIP Journal, Applied Signal Processing 9:1305-1322, 2005.
[14] J. Breebaart and C. Faller, Spatial Audio Processing, ed.1,WILEY, 2007,p82-84
[15] ITU-R Recommendation BS.1387, Method for Objective Measurements of Perceived Audio Quality, Dec. 1998.