| 研究生: |
朱復興 Fu-Shing Ju |
|---|---|
| 論文名稱: |
無線傳輸及網際網路環境下之G.729與G.723.1語音傳輸 G.723.1 and G.729 Speech Transmission over Wireless and Internet Environments |
| 指導教授: |
張寶基
Pao-Chi Chang |
| 口試委員: | |
| 學位類別: |
碩士 Master |
| 系所名稱: |
資訊電機學院 - 電機工程學系 Department of Electrical Engineering |
| 畢業學年度: | 88 |
| 語文別: | 中文 |
| 論文頁數: | 114 |
| 中文關鍵詞: | G.729 、G.723.1 、語音壓縮 、錯誤保護 、錯誤隱藏 |
| 外文關鍵詞: | G.729, G.723.1, speech compression, error protect, error concealment |
| 相關次數: | 點閱:8 下載:0 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
G.729語音編碼器為ITU-T於1996所訂定的高品質低延遲8kbit/s語音編碼標準,而G.723.1亦為ITU-T於1996所訂定的6.3kbit/s及5.3kbit/s雙速元語音編碼標準,這兩種語音編碼器均是針對語音與多媒體的音訊訊號之壓縮處理所設計的,主要的應用方向為網際網路環境下之語音傳輸系統的語音編碼標準,並且為H.323網際網路音視訊會議系統下之音訊訊號的編碼標準。
由於G.729的語音編碼方式,並無錯誤更正之機制,然而在無線的傳輸通道中,雜訊或是訊號的干擾是無法避免的,因此我們分析封包中各位元之重要性,並提出與標準相容的BCH(31, 26)碼的重要位元保護模式,使得重建語音在低通道錯誤率下有較佳之語音品質;此外,在網際網路即時性的應用中,經常會有封包遺失的情形發生,因此我們將錯誤隱藏的處理方法應用於G.729及G.723.1中,使得重建語音在不同的封包遺失情形下,皆能有最佳之語音品質。
G.723.1 and G.729 are both ITU-T speech coding standards. These two speech coders are designed for the compression of speech and audio signals in multimedia applications. The target applications are the voice transmission over Internet and the voice coding of H.323 based Internet video conferences.
To enhance the error robustness of G.729, we propose an error protection scheme by applying BCH(31, 26) to protect significant bits in a frame. Furthermore, the BCH(31, 26) scheme is compatible to G.729 standard. In Internet environment, the packets often get lost and the error propagation effect degrades the speech quality substantially. Therefore, we apply the error concealment techniques to both G.729 and G.723.1. In simulations, the objective and subjective evaluation tests all show that the G.729 BCH(31, 26) error protect scheme and the error concealment techniques of G.729 and G.723.1 can improve speech quality significantly.
[1]R. E. Ziemer and W. H. Tranter, Principles of Communications, Systems, Modulation, and Noise,fourth edition. Boston:Houghton Mifflin, 1995.
[2]A. S. Spanias, "Speech Coding: A Tutorial Review", Proceedings of the IEEE, vol. 82, no. 10, pp. 1541-82, October 1994.
[3]A. Gersho, "Advances in Speech and Audio Compression", Proceedings of the IEEE, vol. 82, no. 6, pp. 900-918, June 1994.
[4]B. Gold and C. Rader, "The channel vocoder," IEEE Trans. Audio , vol. AU-15, pp. 148-161, Dec. 1967.
[5]I. Gibson, "Vector sum excited linear prediction (VSCELP) speech coding for Japan digital cellular," presented at the Meeting of IEICE, paper RCS90-26, Nov. 1990.
[6]J. P. Campbell, Jr., T. E. Tremain and V. C. Welch, "The DOD 4.8 kbps Standard (Proposed Federal Standard 1016)", Advances in Speech Coding, Kluwer Academic Publishers, pp. 121-133, 1991.
[7]R. V. Cox and P. Kroon, “Low bit-rate speech coders for multimedia communication,” IEEE Commun. Magazine, vol.34, no.12, pp.34-41, Dec. 1996.
[8]ITUT Recommendation G.729, Coding of speech at 8 kbit/s using Conjugate-Structure Algebraic-Code-Excited-Linear-Prediction (CS-ACELP), March 1996.
[9]ITU-T Rec. H.323, “Visual Telephone Systems and Equipment for Local Area Networks Which Provice a Non-guaranteed Quality of Service,” Geneva, Switzerland.
[10]ITU-T Recommendation G.723.1, Dual Rate Speech Coder for Multimedia Comunications Transmitting at 5.3 and 6.3 Kbits, March 1996.
[11]S.Yeldener, J.C. de Martin, and V. Viswanathan, "A Mixed Sinusoidally Excited Linear Prediction Coder at 4 KB/S and Below," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, vol. 2, pp. 589-592, 1998.
[12]S.R. Quackenbush, T.P. Barnwell and M.A. Clements, Objective Measures of Speech Quality, Prentice Hall, 1988.
[13]R.V. Cox, “Three New Speech Codecs from the ITU Cover a Range of Application,” IEEE Commun. Mag., Sept. 1997.
[14]Shu Lin and Danie J. Constello, Error Control Coding Fundamentals and Applications, Prentice-Hall, 1983.
[15]A. Papoulis, Probability, Random Variables, and Stochastic Processes, third edition, McGraw-Hill, 1991.
[16]Lan Juan, Lin Biqin, and Fu Qiuliang, " An 8-kb/s conjugate-structure algebraic CELP (CS-ACELP) speech coding," in Signal Processing Proceedings, 1998. ICSP ''98. 1998 Fourth International Conference, vol. 2, pp. 1729-1732, 1998.
[17]D. Rizzetto and C. Catania, “A Voice over IP Service Architecture for Integrated Communications,” IEEE Internet Computing Mag. May/June 1999.
[18]S. Blake et al., “An Architecture for Differentiated Services,” RFC 2475, IETF DiffServ Working Group, December 1998.
[19]G. Barberis, “Buffer Sizing of a Packet Voice Receiver,” IEEE Trans. On Communications, vol. Com-29, NO. 2, pp. 1522-156, Feb. 1981.
[20]S. Caser and V. Jacobson, “Compressing IP/UDP/RTP Hraders for Low-Speed Serial Links,” RFC 2508, IETF, Feb. 1999.
[21]L. Dong, A.R. Kaye and S.A. Mahmoud, “Transmission of compressed voice over integrated services frame relay networks: priority service and adaptive buildout delay,” IEE Proceedings Communications, vol. 141, issue 4, pp. 265-274, Aug. 1994.
[22]T. Fingscheidt, P. Vary and J.A. Andonegui, “Robust speech decoding: can error concealment be better than error correction?,” Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference, vol. 1, pp. 373-376, May 1998.
[23]廖瑞祥, 無線傳輸環境下G.723.1語音編碼之位元保護與錯誤隱藏處理, 碩士論文, 中央大學, 1998.
[24]黃能富, 區域網路與高速網路, 維科出版社, 1998.
[25]David J. Kruglinski, Inside Visual C++ 4.0, first edition, Microsoft Press, 1996.
[26]李孟書, WINDOWS SOCKETS網路程式設計經典, 碁峰, 1997.