SciELO - Scientific Electronic Library Online

 
vol.11 número4Integral Imaging Based 3-D Image Encryption Algorithm Combined with Cellular AutomataFuzzy Analytic Hierarchy Process for Risk Assessment to General-assembling of Satellite índice de autoresíndice de materiabúsqueda de artículos
Home Pagelista alfabética de revistas  

Servicios Personalizados

Revista

Articulo

Indicadores

Links relacionados

  • No hay artículos similaresSimilares en SciELO

Compartir


Journal of applied research and technology

versión On-line ISSN 2448-6736versión impresa ISSN 1665-6423

J. appl. res. technol vol.11 no.4 Ciudad de México ago. 2013

 

Bilateral Waveform Similarity Overlap-and-Add Based Packet Loss Concealment for Voice over IP

 

J.F. Yeh*1, P.C. Lin2, M.D. Kuo1,3, Z.H. Hsu1

 

1 Department of Computer Science and Information Engineering, National Chiayi University, Taiwan. (R.O.C.). *ralph@mail.ncyu.edu.tw.

2 Department of Computer Science and Information Engineering.

3 Department of Digital Design and Management, Far East University, Taiwan (R.O.C.).

 

ABSTRACT

This paper invested a bilateral waveform similarity overlap-and-add algorithm for voice packet lost. Since Packet lost will cause the semantic misunderstanding, it has become one of the most essential problems in speech communication. This investment is based on waveform similarity measure using overlap-and-Add algorithm and provides the bilateral information to enhance the speech signal reconstruction. Traditionally, it has been improved that waveform similarity overlap-and-add (WSOLA) technique is an effective algorithm to deal with packet loss concealment (PLC) for real-time time communication. WSOLA algorithm is widely applied to deal with the length adaptation and packet loss concealment of speech signal. Time scale modification of audio signal is one of the most essential research topics in data communication, especially in voice of IP (VoIP). Herein, the proposed the bilateral WSOLA (BWSOLA) that is derived from WSOLA. Instead of only exploitation one direction speech data, the proposed method will reconstruct the lost voice data according to the preceding and cascading data. The related algorithms have been developed to achieve the optimal reconstructing estimation. The experimental results show that the quality of the reconstructed speech signal of the bilateral WSOLA is much better compared to the standard WSOLA and GWSOLA on different packet loss rate and length using the metrics PESQ and MOS. The significant improvement is obtained by bilateral information and proposed method. The proposed bilateral waveform similarity overlap-and-add (BWSOLA) outperforms the traditional approaches especially in the long duration data loss.

Keywords: Packet loss concealment, waveform similarity overlap-and-add, VoIP, speech communication.

 

DESCARGAR ARTÍCULO EN FORMATO PDF

 

References

[1] A. Mantilla-Caeiros et al., "Pattern Recognition Based Esophageal Speech Enhancement System," Journal of Applied Research and Technology, vol. 8, no. 1, pp. 56-71, 2010.         [ Links ]

[2] A. L. Padilla-Ortíz, and F. Orduña-Bustamante, "Binaural Speech Intelligibility and Interaural Cross-Correlation Under Disturbing Noise and Reverberation," Journal of Applied Research and Technology, vol. 10, no. 3, pp. 347-360, 2012.         [ Links ]

[3] J.-F. Yeh et al., "Speech Enabling Services for Wireless Access in Vehicular Environments" ICIC Express Letters, Part B: Applications- An International Journal of Research and Surveys, Vol. 2, No. 3, 2011, pp.705-710.         [ Links ]

[4] H. Sanneck H. et al, "A New Technique for Audio Packet Loss Concealment," Global Telecommunications Conference, 1996, pp. 48-52, 1996.         [ Links ]

[5] C. Perkins et al., "A Survey of Paket Loss Recovery techniques for Streaming Audio," IEEE Network, Vol. 12, No. 5, pp. 40-48, 1998.         [ Links ]

[6] H. Svensson et al., "Implementation Aspects of a Novel Speech Packet Loss Concealment Method," IEEE International Symposium on Circuits and Systems, Kobe, Japan ,Vol. 3, pp. 2867-2870, 2005.         [ Links ]

[7] S. Grofit and Y. Lavner, "Time-Scale Modification of Audio Signals Using Enhanced WSOLA With Management of Transients," IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no.1, 2008, pp. 106-115, 2008.         [ Links ]

[8] H.-P. Shen et al., "Speaker Clustering Using Decision Tree-based Phone Cluster Models with Multi-Space Probability Distributions," IEEE Trans. Audio, Speech, and Language Processing, Vol. 19, Issue 5, pp.1289-1300, 2011.         [ Links ]

[9] J.-F. Yeh and M.-C. Yen, "Speech Recognition with Word Fragment Detection Using Prosody Features for Spontaneous Speech," International Journal of Applied Mathematics & Information Sciences, V 6 No. 2S pp. 669S-675S, 2012.         [ Links ]

[10] M. Li et al., "Packet Loss Concealment Using Enhanced Waveform Similarity OverLap-and-Add Technique with Management of Gains," International Conference on Wireless Communications, Networking and Mobile Computing 2009 (WiCom '09), Beijing, China, 2009.         [ Links ]

[11] H. G. Ilk and S. Güler, "Adaptive Time Scale Modification of Speech for Graceful Degrading Voice Quality in Congested Networks for VoIP Applications," Signal Process, 86(1), pp. 127-139, 2006.         [ Links ]

[12] K. Sreenivasa Rao and B. Yegnanarayana, "Duration Modification using Glottal Closure Instants and Vowel Onset Points," Speech communication, Vol. 51, issue 21, pp.1263-1269, 2009.         [ Links ]

[13] A. Ito and T. Nagano, "Packet Loss Concealment of VoIP under severe loss conditions," 15th International Symposium on Wireless Personal Multimedia Communications (WPMC), pp.489-490, 2012.         [ Links ]

[14] Jagla et al. "Sample-based engine noise synthesis using an enhanced pitch-synchronous overlap-and-add method," J. Acoust. Soc. Am., vol. 132, no. 5, pp. 3098-3108, 2012        [ Links ]

[15] L. Wang et al, "Waveform Similarity Over-and-add Technique with Gain Control," IEEE International Conference on Broadband Network & Multimedia Technology, Beijing, China, pp. 735-739, 2009.         [ Links ]

[16] L. Wang et al., "A Packet Loss Concealment Method Base on GWSOLA algorithm and Signification Transient Detect," IEEE International Conference on Network Infrastructure and Digital Content 2009, Beijing, China, pp.745-749, 2009.         [ Links ]

[17] W. Verhelst and M. Roelands, "An overlap-add technique based on waveform similarity (WSOLA) for high quality time-scale modifications of speech," in Proc. of International Conference on Acoustics Speech and Signal Processing, Minneapolis, MN, 2, pp. 554-557, 1993.         [ Links ]

[18] Y. Sun et al., "A modified weighted overlap and add-based spectral subtraction method," J. Acoust. Soc. Am. Volume 131, Issue 4, pp. 3444-3444, 2012.         [ Links ]

[19] J.-H. Chen, "Packet Loss Concealment Based on Extrapolation of Speech Waveform," IEEE International Conference on Acoustics, Speech and Signal Processing 2009, Taipei, Taiwan, R.O.C., pp.4129-4132, 2009.         [ Links ]

[20] K. Kondo and K. Nakagawa, "A Speech Packet Loss Concealment Method Using Linear Prediction," IEICE Transaction on Information & Systems, Vol. E89-D, No. 2, pp.806-813, 2006.         [ Links ]

[21] N. Linenberg et al., "Packet Loss Concealment Using Adaptive Lattice Modeling," 15th IEEE Mediterranean Electrotechnical Conference, pp.378-382, 2010.         [ Links ]

[22] J.-F.Wang et al, "A Voicing Driven Packet Loss Recovery Algorithm for Analysis-by-synthesis Predictive Speech Coders Over Internet," IEEE Transactions on Multimedia, vol. 3, pp. 98-107, 2001.         [ Links ]

[23] J.-F. Yeh and C.-H. Hsu, "Sub-Syllable Segment-based Voice Conversion Using Spectral Block Clustering Transformation Functions," Journal of the Chinese Institute of Engineers. Vol. 33, No. 7, pp. 1059-1067, 2010.         [ Links ]

[24] A. Mantilla Caeiros et al., "A Pattern Recognition based Esophageal Speech Enhancement System," vol.8, no. 1, pp. 56-71, 2010.         [ Links ]

[25] M.-Y. Zhu et al., "An Accurate Low Complexity Algorithm for Frequency Estimation in MDCT Domain," IEEE Transactions on Consumer Electronics, Vol. 54, No.3, pp. 1022-1028, 2008.         [ Links ]

[26] H.-Y. Gu and Z.-S. Chen., "A Packet Loss Concealment Method for Voice over IP," 18th Conference on Computational Linguistics and Speech Processing, 2006.         [ Links ]

[27] W.-T. Liao et al., "Adaptive Recovery Techniques for Real-time Audio Streams," Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE INFOCOM), vol. 2, , Anchorage, USA, pp.815-823, 2001.         [ Links ]

Creative Commons License Todo el contenido de esta revista, excepto dónde está identificado, está bajo una Licencia Creative Commons