Student Number 975203024 Author Run-yu Tong(童閏煜) Author's Email Address email@example.com Statistics This thesis had been viewed 545 times. Download 437 times. Department Communication Engineering Year 2009 Semester 2 Degree Master Type of Document Master's Thesis Language zh-TW.Big5 Chinese Title Low Complexity Decoding in Parametric Stereo Audio Coding Scheme Date of Defense 2010-07-07 Page Count 96 Keyword decorrelation filter Parametric Stereo residual coding spatial audio coding spatial parameter Abstract The Parametric Stereo (PS) audio coding is an audio coding object of High Efficiency Advanced Audio Coding version 2 (HE-AAC v2) which was standardized by ISO/MPEG in 2004. Traditional audio codec, e.g. MP3 or AAC, utilize “Psycoaustic Model” and “Masking Effect” to achieve high compression efficiency. However, they mainly process the signal with single channel. Different from traditional audio codec, the PS audio coding incorporates the characteristics of two channels, to extract spatial parameters and to down-mixes stereo signals into a mono signal. The PS can save almost half data size which provides great help in storage and transmission.
Nevertheless, the complexity of PS decoder is nearly twice larger than that of PS encoder, which causes a serious problem in implementing PS on portable devices. Therefore, this thesis proposes a modified PS coding scheme to reduce the complexity of decoder. The encoder extracts and transmits the additional residual parameters from the residual signal and the mono signal. On the contrary, the decoder reconstructs the residual signal by the mono signal and the transmitted residual parameters. In addition, we detect the existence of transient signal and measure the artifact of reconstructed residual signal. Finally, “Energy compesated algorithm” is proposed to reduce the artifact produced by the transient signal.
The proposed scheme can improve the Objective Difference Grade (ODG) of audio quality measurement “EAQUAL” with 0.6 score. Combining with audio coder AAC, the modified PS coding scheme still maintains a good performance at low coding bitrates.
Table of Content 摘 要 I
誌 謝 III
目 錄 IV
2.4 冗餘訊號編解 35
Reference Johnston and Ferreira, “Sum-Difference Stereo Transform Coding, ” Proc. ICASSP, pp.569- 571, May 1992
Chi-Min Liu, Wen-Chieh Lee, and Yo-Hua Hsiao, “M/S coding based on allocation entropy, ” Proc. of the 6th Int. Conference on Digital Audio Effects, London, UK, Sep. 11~18, 2003.
Lord Rayleigh (J.W. Strutt), “On our perception of sound direction,”
Philosophical Magazine, 13:214–232, 1907.
C. Faller, “Parametric coding of spatial audio,” Proc. of the 7th International Conference on Digital Audio Effects (DAFx’04), Naples, Italy, October 5-8, 2004.
ISO/IEC JTC1/SC29/WG11, “Text of ISO/IEC 14496-3:2001/FPDAM2 (parametric coding for high quality audio),” ISO/IEC JTC1/SC29/WG11 N5713, July 2003.
E. Schuijers, J. Breebaart, H. Purnhagen, and J. Engdeg˙ard, “Low complexity parametric stereo coding,” in Proc. 116th AES Convention, Berlin, Germany, May 2004.
J. Hall and M. Fernandes, “The role of monaural frequency selectivity in binaural analysis,” in J. Acoust. Soc. Amer., 1984, vol. 76, pp. 435 – 439.
J. Engdegard, H. Purnhagen, J. Roden, and L. Liljeryd, “Sythetic ambience in parametric stereo coding,” in Proc. 116th AES Convention, Berlin, Germany, May 2004, pp.1-12.
M. R. Schroeder and B. F. Logan, “Colorless Artificial Reverberation”, J. Audio Eng. Soc., vol. 9, no. 3, pp. 192-197, July 1961.
J. Breebaart, S. van de Par, A. Kohlrausch, and E. Schuijers, “High-quality parametric spatial audio coding at low bitrates,” in Proc. 116th AES Convention, Berlin, Germany, May 2004.
R. Irwan and R. M. Aarts, “Two-to-five channel sound processing,” Journal of the Audio Engineering Society, vol. 50, no. 11, pp. 914–926, 2002.
T.C. Li, and P.C. Chang, “Spatial characteristic based scalable audio coding structure, “ in Proc. of National Symposium on Tele- communications (NST), Taipei, Taiwan, pp. 452-456, Dec. 2009.
J. Breebaart, S. van de Par, A. Kohlrausch, and E. Schuijers “Parametric coding of stereo audio”, EURASIP Journal, Applied Signal Processing 9:1305-1322, 2005.
J. Breebaart and C. Faller, Spatial Audio Processing, ed.1,WILEY, 2007,p82-84
ITU-R Recommendation BS.1387, Method for Objective Measurements of Perceived Audio Quality, Dec. 1998.
Advisor Pao-chi Chang(張寶基)
975203024.pdf Date of Submission 2010-07-26