跳到主要內容

簡易檢索 / 詳目顯示

研究生: 吳宗紘
Tsung-Hung Wu
論文名稱: 運用混合小波封包與離散餘弦轉換及
Hybrid Wavelet Packet and Discrete Cosine Transform with Optimum Bit Allocation Applied to High-Quality Audio Coding
指導教授: 張寶基
Pao-Chi Chang
口試委員:
學位類別: 碩士
Master
系所名稱: 資訊電機學院 - 通訊工程學系
Department of Communication Engineering
畢業學年度: 92
語文別: 中文
論文頁數: 85
中文關鍵詞: 壓縮小波離散餘弦轉換音訊
外文關鍵詞: wavelet, DCT, audio, compression
相關次數: 點閱:13下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 以小波分頻的訊號壓縮技術已被廣泛地應用在音視訊編碼系統中,其優越的性能充分顯現在靜態影像之壓縮技術上。本論文提出混合小波與離散餘弦轉換之音訊壓縮系統,以小波封包分頻方式,將樂音訊號經由濾波器群組分成26個次頻帶,再根據時域與頻域之平坦程度,決定是否要進一步執行離散餘弦轉換。本系統並採用非理想合成濾波器之最佳位元配置演算法,將人耳聲學模型所得出的頻域最小遮蔽臨界值,轉換成小波域上的遮蔽臨界值,以提供精良的量化準則。其後以均勻量化器配合小波域的遮蔽臨界值,大幅降低資料量並仍保有極高的音質,最後再以算術編碼將量化後的係數做進一步的熵編碼並封裝成位元流。實驗結果顯示,本系統僅需52 kbps即可達到MP3 64 kbps的音質;另外,在同樣64 kbps之位元率下,本系統所提供的音質不但優於MP3、AAC低複雜度規格,更可超越AAC高效率規格。


    The wavelet filter bank analysis-synthesis technique has been widely applied to many areas of digital signal processing, especially in image and video coding. In this thesis, we propose a hybrid Wavelet Packet and DCT audio compression system, which divides the audio signal into 26 subbands via Wavelet Packet analysis and selectively performs DCT in each subband according to the flatness measure of time and frequency of this subband. The proposed coder adopts optimum bit allocation with nonideal reconstruction filters to transform the minimum masking threshold in frequency domain obtained from psychoacoustic model into the masking threshold in Wavelet domain. The WP or DCT coefficients are then quantized with uniform quantizers according to masking threshold, so that we can reduce the data rate but still have high quality. Finally, the quantized coefficients are encoded with arithmetic coding and encapsulated with other side information. The experiments show that, only 52 kbps is needed for proposed audio coder to achieve MP3 64-kbps quality. At the same bit rate of 64 kbps, the proposed audio coding system can provide not only better quality than MP3 and AAC LC profile but also superior to AAC HE profile!

    目錄 目錄 III 圖目 VI 表目 VIII 第一章 緒論 1 1.1 音訊壓縮簡介 1 1.2 研究動機與目的 3 1.3 系統架構 4 1.4 論文架構 5 第二章 小波分析技術 7 2.1 小波轉換 ( Wavelet Transform ) 7 2.1.1小波分解與離散小波轉換 8 2.1.2多重解析度分析 9 2.2小波轉換與數位訊號處理 11 2.2.1 小波濾波器 11 2.2.2 Daubechies緊密時間涵蓋小波 17 2.2.3 GBCW雙正交小波 19 2.3小波封包 ( Wavelet Packet ) 20 第三章 人耳聲學模型及其應用實例 24 3.1 一般音訊壓縮編解碼器結構 24 3.2 人耳聲學模型 26 3.2.1 基本原理與其應用 26 3.2.2 雜訊對單頻音的遮蔽效應 28 3.2.3頻音對單頻音的遮蔽效應 32 3.2.4 時間軸上的遮蔽效應 33 3.2.5模型公式 34 3.3 MPEG音訊編碼器家族 38 3.3.1 MPEG-1第三層(MP3) 40 3.3.2 先進音訊編碼(AAC) 43 3.4 算術編碼 46 第四章 小波音訊壓縮系統 49 4.1 轉換 50 4.1.1 Daubechies緊密時間涵蓋小波 50 4.1.2 GBCW雙正交小波 54 4.1.3小波封包 55 4.1.4離散餘弦轉換 57 4.2 最佳位元配置 58 4.3熵編碼 61 第五章 實驗結果與討論 65 5.1 客觀評量工具 – EAQUAL 65 5.2 CDF與GBCW雙正交小波之比較 68 5.3 調適性小波與非調適性小波之比較 69 5.4 正交小波與雙正交小波之比較 .72 5.5 混合小波轉換與離散餘弦轉換 73 5.6 複雜度分析 77 第六章 結論與未來展望 81 6.1 結論 81 6.2 未來展望 81 參考文獻 82

    [1] ISO/IEC 11172-3 : “Information technology - Coding of moving pictures and associatedaudio for digital storage media at up to about 1.5 Mbit/s - Part 3: Audio".1992 (“MPEG-1”).
    [2] ISO/IEC 13818-3 : “Information technology – Generic Coding of Moving Pictures and Associated Audio , Part 3: Audio".1994 (“MPEG-2 BC”).
    [3] ISO/IEC, Final Draft International Standard 14496-3: MPEG-4 Audio, ISO/IEC JTC1/SC29/WG11 N2503, Oct. 1998. (“MPEG-4”)
    [4] T. Painter and A. Spanias, “A review of algorithms for perceptual coding of digital audio signals,” Digital Signal Processing Proceedings, 1997. DSP 97., 1997 13th International Conference, VOL 1 , 1997 , pp 179 –208.
    [5] M. Sablatash and T. Cooklev, “Compression of High-Quality Audio Signals, Including Recent Methods Using Wavelet Packets,” Digital Signal Processing, vol. 6, no. 10, 1996, pp. 96-107.
    [6] D. Sinha and A. H. Tewfik, “Low Bit Rate Transparent Compression using Adapted Wavelets,” IEEE Trans. on Signal Processing, vol. 41, no. 12, pp. 3463-3479, Dec. 1993.
    [7] P. Srinivasan and L. H. Jamieson, “High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling,” IEEE Trans. on Signal Processing, vol. 46, no. 4, pp. 1085-1093, April 1998.
    [8] S. Boland and M. Deriche, “Audio Coding Using The Wavelet Packet Transform and A combined Scalar-Vector Quantization,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 1041-1044.
    [9] X. Xiong and Z. Eryuan, “Digital Audio Codec Based on the Improved Optimization Algorithm of Adaptive Wavelets and Dynamic Bit Allocation Scheme,” proceeding of ICSP’96, pp. 1523-1526.
    [10] P. Philippe, F. Moreau de Saint-Martin, M. Lever, and J. Soumagne, “Optimal Wavelet Packets for Low-Delay Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 550-553.
    [11] D. Y. Pan, “A Tutorial on MPEG/Audio Compression,” IEEE Multimedia pp. 60-74, 1995.
    [12] C. S. Burrus, R. A. Gopinath, and H. Guo, “Introdution to Wavelets and Wavelet Transforms,” 1998.
    [13] P. E. Kudumakis and M. B. Sandler, “Wavelet Packet Based Scalable Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 41-44.
    [14] W. K. Dobson, J. J. Yang, K. J. Smart, and F. K. Guo, “High Quality Low Complexity Scalable Wavelet Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1997, pp. 327-330.
    [15] P. Philippe, F. Moreau de Saint-Martin, and L. Mainard, “On The Choice of Wavelet Filters for Audio Compression,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1995, pp. 1045-1048.
    [16] P. E. Kudumakis and M. B. Sandler, “On The Performance of Wavelets for Low Bit Rate Coding of Audio Signals,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1995, pp. 3087-3090.
    [17] I. Daubechies, "Ten Lectures on Wavelets," no. 61 in CBMS-NSF Series in Applied Mathematics, SIAM, Philadelphia, 1992.
    [18] C Todd, “A Digital Audio System for Broadcast and Prerecorded Media”, in Proc. 75th Conv. Aud. Eng. Soc., preprint #, Mar. 1984.
    [19] E.F. Schroder and W. Voessing, “High Quality Digital Audio Encoding With 3.0 Bits/Sample Using Adaptive Transform Coding”, in Proc. 80th Conv. Aud. Eng. Soc., preprint #2321, Mar. 1986.
    [20] G. Theile, et al., “Low-Bit Rate Coding of High Quality Audio Signals”, in Proc. 82nd Conv. Aud. Eng. Soc., preprint #2423, Mar. 1987.
    [21] K. Brandenburg, “OCF – A New Coding Algorithm for High Quality Sound Signals”, in Proc. ICASSP-87, May 1987, pp. 5.1.1-5.1.4.
    [22] J. Johnston, “Transform Coding of Audio Signals Using Perceptual Noise Criteria”, IEEE J. Sel. Areas in Comm., Feb. 1988, pp. 314-23.
    [23] W-Y Chan and A. Gersho, “High Fidelity Audio Transform Coding With Vector Quantization”, in Proc. ICASSP-90, May 1990, pp. 1109-1112.
    [24] K. Brandenburg and J.D. Johnston, “Second Generation Perceptual Audio Coding: The Hybrid Coder”, in Proc. 88th Conv. Aud. Eng. Soc., preprint #2937, Mar. 1990.
    [25] K. Brandenburg, et al, “Adaptive Spectral Entropy Coding of High Quality Music Signals”, in Proc. 90th Conv. Aud. Eng. Soc., preprint #3011, Feb. 1991.
    [26] Y.F. Dehery, et al, “A MUSICAM Source Codec for Digital Audio Broadcasting and Storage”, in proc. ICASSP-91, May 1991, pp. 3605-3608.
    [27] M. Iwadare, et al., “A 128 kb/s Hi-Fi Audio CODEC Based on Adaptive Transform Coding With Adaptiv Block Size MDCT”, IEEE J. Sel. Areas in Comm., Jan. 1992, pp. 138-144.
    [28] ISO/IEC 13818-7 : “MPEG-2 Advanced Audio Coding, AAC,” 1997.
    [29] D. Wei, J. Tian, R. O. Wells, Jr., and C. S. Burrus, “A New Class of Biorthogonal Wavelet System for Image Transform Coding,” IEEE Trans. on Image Processing, vol. 7, no. 7, pp. 1000-1013, July. 1998.
    [30] C. Caini and A. V. Coralli, “Optimum Bit Allocation in Subband Coding with Nonideal Reconstruction Filters,” IEEE Signal Processing Latters, vol. 8, no. 6, pp. 157-159, June. 2001.
    [31] V. Britanak and K. R. Rao, “An efficient implementation of the forward and inverse MDCT in MPEG audio coding,” IEEE Signal Processing Letter, vol. 8, pp. 48-51, Feb. 2001.
    [32] I. Daubechies and W. Sweldens, “Factoring wavelet transforms into lifting steps,” J. Fourier Anal. Appl., 4 (no. 3), pp. 247-269, 1998.
    [33] S.K. Tang, S.C. Chan, K.L. Ho, and F.K. Lam, “Implementation of fast cosine transform on the Motorola DSP 96002 digital signal processor,” IEEE International Sympoisum on Circuits and Systems, Vol. 1, pp.73-76, June 1991.
    [34] K. Konstantinides, “Fast Subband Filtering in MPEG Audio Coding,” IEEE Signal Processing Letter, vol. 1, pp. 26-28, Feb. 1994.

    QR CODE
    :::