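| Graduate student: | 何恭岳 Kong-Yueh Ho |
|---|---|
| Thesis title: | 高品質切換式離散餘弦與小波封包 High Quality Switched Discrete Cosine Transform and Wavelet Packet Audio Coding Technique |
| Advisor: | 張寶基 Pao-Chi Chang |
| Committee members: | |
| Degree: | Master |
| Department: | College of Electrical Engineering and Computer Science - Department of Communication Engineering |
| Graduation academic year: | 93 (ROC calendar) |
| Language: | Chinese |
| Number of pages: | 76 |
| Keywords (Chinese): | 最佳化位元分配 (optimal bit allocation), 離散餘弦轉換 (discrete cosine transform), 小波濾波器 (wavelet filter) |
| Keywords (English): | optimal bit-allocation, DCT, Wavelet Packet |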
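Transform coding has been used in audio processing systems for many years, and in recent years the most popular subband coding approach has been the wavelet transform. Its multiresolution property allows a wider range of architectures for subband processing. This thesis proposes a hybrid high-quality audio compression system. For each audio frame, the system first uses a frequency-domain flatness measure to decide whether the frame's primary transform is the discrete cosine transform (DCT), which offers better frequency resolution, or the wavelet transform, which carries rich temporal information. Frames that use the DCT are quantized with a nonlinear quantizer guided by the frequency-domain masking threshold computed from a psychoacoustic model. If wavelet packets are chosen for subband decomposition, the music signal passes through the wavelet filter bank and is split into 26 fixed subbands; within each subband, a DCT is then selectively applied, based on flatness measures in the wavelet and frequency domains, to improve frequency resolution. Together with an optimal bit-allocation algorithm for non-ideal filter banks, the frequency-domain auditory masking curve is converted into a wavelet-domain masking curve, which provides a precise quantization criterion and preserves very high audio quality. Finally, entropy coding packs the quantized coefficients into a bitstream. Experimental results show that at a bit rate of 64 kbps, the audio quality of the proposed system is not only better than MP3 but also exceeds that of the AAC Low Complexity profile.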
We propose a hybrid coding system that uses both Wavelet Packet (WP) and DCT techniques. For each audio frame, the system selects either WP or DCT based on frame flatness measures in the wavelet domain and the frequency domain. If DCT is chosen, all DCT coefficients are quantized by a non-uniform quantizer according to the frequency masking curve. If WP is chosen, the frame data are split into 26 fixed subbands, and the system then selectively applies DCT to improve the frequency resolution of each subband, based on the subband flatness measure. By applying optimal bit allocation for non-ideal filter banks, the masking threshold from the psychoacoustic model is translated into specific quantization criteria in the wavelet domain. Experimental results show that the proposed system is superior to MP3 and the AAC LC profile at 64 kbps.
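The frame-level switch described above can be illustrated with a spectral flatness measure, that is, the geometric mean of the power spectrum divided by its arithmetic mean. The following is a minimal sketch, not the thesis implementation. The function names, the 0.1 threshold, the naive DCT-II, and the rule that a peaky (low-flatness) spectrum selects DCT while a flat spectrum selects the wavelet packet path are all assumptions made for illustration; the abstract does not specify them.

```python
import math


def dct_ii(frame):
    """Naive O(N^2) unnormalized DCT-II, adequate for a short illustration."""
    n = len(frame)
    return [
        sum(x * math.cos(math.pi * (i + 0.5) * k / n) for i, x in enumerate(frame))
        for k in range(n)
    ]


def spectral_flatness(coeffs, eps=1e-12):
    """Geometric mean over arithmetic mean of the power spectrum, in (0, 1]."""
    power = [c * c + eps for c in coeffs]  # eps avoids log(0)
    log_mean = sum(math.log(p) for p in power) / len(power)
    return math.exp(log_mean) / (sum(power) / len(power))


def choose_transform(frame, threshold=0.1):
    """Hypothetical rule: low flatness (tonal) -> DCT, otherwise wavelet packet."""
    flatness = spectral_flatness(dct_ii(frame))
    return ("DCT" if flatness < threshold else "WP"), flatness


if __name__ == "__main__":
    n = 64
    # A frame that matches a single DCT basis function: energy in one bin.
    tone = [math.cos(math.pi * (i + 0.5) * 8 / n) for i in range(n)]
    # A single impulse: energy spread across all bins.
    impulse = [1.0] + [0.0] * (n - 1)
    print(choose_transform(tone))
    print(choose_transform(impulse))
```

A tonal frame concentrates its energy in a few coefficients, so its flatness is close to zero and the finer frequency resolution of the DCT is useful. A transient spreads energy across the spectrum and is a natural candidate for the better time localization of the wavelet packet path. A real encoder would tune the threshold empirically.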
[1] ISO/IEC 11172-3: “Information technology - Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s - Part 3: Audio,” 1992 (“MPEG-1”).
[2] ISO/IEC 13818-3: “Information technology - Generic Coding of Moving Pictures and Associated Audio, Part 3: Audio,” 1994 (“MPEG-2 BC”).
[3] ISO/IEC 13818-7, “Information Technology - Generic Coding of Moving Pictures and Associated Audio, Part 7: Advanced Audio Coding”, 1997.
[4] ISO/IEC, Final Draft International Standard 14496-3: MPEG-4 Audio, ISO/IEC JTC1/SC29/WG11 N2503, Oct. 1998. (“MPEG-4”)
[5] M. Sablatash and T. Cooklev, “Compression of High-Quality Audio Signals, Including Recent Methods Using Wavelet Packets,” Digital Signal Processing, vol. 6, no. 10, 1996, pp. 96-107.
[6] Y. Karelic and D. Malah, “Compression of High-Quality Audio Signals Using Adaptive Filterbanks and A Zero-Tree Coder,” Electrical and Electronics Engineers in Israel, 1995.
[7] P. Srinivasan and L. H. Jamieson, “High-Quality Audio Compression Using an Adaptive Wavelet Packet Decomposition and Psychoacoustic Modeling,” IEEE Trans. on Signal Processing, vol. 46, no. 4, pp. 1085-1093, April 1998.
[8] S. Boland and M. Deriche, “Audio Coding Using The Wavelet Packet Transform and A combined Scalar-Vector Quantization,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 1041-1044.
[9] X. Xiong and Z. Eryuan, “Digital Audio Codec Based on the Improved Optimization Algorithm of Adaptive Wavelets and Dynamic Bit Allocation Scheme,” in Proc. ICSP’96, pp. 1523-1526.
[10] P. Philippe, F. Moreau de Saint-Martin, M. Lever, and J. Soumagne, “Optimal Wavelet Packets for Low-Delay Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 550-553.
[11] D. Y. Pan, “A Tutorial on MPEG/Audio Compression,” IEEE Multimedia pp. 60-74, 1995.
[12] C. S. Burrus, R. A. Gopinath, and H. Guo, “Introduction to Wavelets and Wavelet Transforms,” 1998.
[13] P. E. Kudumakis and M. B. Sandler, “Wavelet Packet Based Scalable Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1996, pp. 41-44.
[14] W. K. Dobson, J. J. Yang, K. J. Smart, and F. K. Guo, “High Quality Low Complexity Scalable Wavelet Audio Coding,” in Proc. Int. Conf. Acoust., Speech, Signal Process. 1997, pp. 327-330.
[15] C. Todd, “A Digital Audio System for Broadcast and Prerecorded Media”, in Proc. 75th Conv. Aud. Eng. Soc., preprint #, Mar. 1984.
[16] C. Todd, “A Digital Audio System for Broadcast and Prerecorded Media”, in Proc. 75th Conv. Aud. Eng. Soc., preprint #, Mar. 1984.
[17] E. F. Schroder and W. Voessing, “High Quality Digital Audio Encoding With 3.0 Bits/Sample Using Adaptive Transform Coding”, in Proc. 80th Conv. Aud. Eng. Soc., preprint #2321, Mar. 1986.
[18] G. Theile, et al., “Low-Bit Rate Coding of High Quality Audio Signals”, in Proc. 82nd Conv. Aud. Eng. Soc., preprint #2423, Mar. 1987.
[19] K. Brandenburg, “OCF – A New Coding Algorithm for High Quality Sound Signals”, in Proc. ICASSP-87, May 1987, pp. 5.1.1-5.1.4.
[20] J. Johnston, “Transform Coding of Audio Signals Using Perceptual Noise Criteria”, IEEE J. Sel. Areas in Comm., Feb. 1988, pp. 314-23.
[21] W-Y Chan and A. Gersho, “High Fidelity Audio Transform Coding With Vector Quantization”, in Proc. ICASSP-90, May 1990, pp. 1109-1112.
[22] K. Brandenburg and J.D. Johnston, “Second Generation Perceptual Audio Coding: The Hybrid Coder”, in Proc. 88th Conv. Aud. Eng. Soc., preprint #2937, Mar. 1990.
[23] K. Brandenburg, et al, “Adaptive Spectral Entropy Coding of High Quality Music Signals”, in Proc. 90th Conv. Aud. Eng. Soc., preprint #3011, Feb. 1991.
[24] Y. F. Dehery, et al., “A MUSICAM Source Codec for Digital Audio Broadcasting and Storage”, in Proc. ICASSP-91, May 1991, pp. 3605-3608.
[25] M. Iwadare, et al., “A 128 kb/s Hi-Fi Audio CODEC Based on Adaptive Transform Coding With Adaptive Block Size MDCT”, IEEE J. Sel. Areas in Comm., Jan. 1992, pp. 138-144.
[26] Huber, D. M., Runstein, R. E., 2001. Modern Recording Techniques 5th Edition, Focal Press.
[27] ISO/IEC 13818-7 : “MPEG-2 Advanced Audio Coding, AAC,” 1997.
[28] E. Zwicker and H. Fastl, Psychoacoustics, Facts and Models (Springer, Berlin, Heidelberg, 1990).
[29] D. Sinha and A. H. Tewfik, “Low Bit Rate Transparent Compression using Adapted Wavelets,” IEEE Trans. on Signal Processing, vol. 41, no. 12, pp. 3463-3479, Dec. 1993.
[30] I. Daubechies, "Ten Lectures on Wavelets," no. 61 in CBMS-NSF Series in Applied Mathematics, SIAM, Philadelphia, 1992.
[31] C. Caini and A. V. Coralli, “Optimum Bit Allocation in Subband Coding with Nonideal Reconstruction Filters,” IEEE Signal Processing Letters, vol. 8, no. 6, pp. 157-159, June 2001.
[32] ITU-R Recommendation BS.1387, Method for Objective Measurements of Perceived Audio Quality, Dec. 1998.
[33] SQAM - Sound Quality Assessment Material, website: http://www.tnt.uni-hannover.de/project/mpeg/audio/sqam/
[34] http://psplab.csie.nctu.edu.tw/modules.php?op=modload&name=Web_Links&file=index
[35] J. D. Johnston, “Transform coding of audio signals using perceptual noise criteria,” IEEE Trans. Select. Areas Commun., to be published.
[36] T. H. Wu and P. C. Chang, “Hybrid Wavelet Pack and Discrete Cosine Transform with Optimum Bit allocation Applied to High-Quality Audio Coding”, Master's thesis, National Central University, 2004.