Speech Feature Extraction and Data Visualisation - Vowel recognition and phonology analysis of four Asian ESL accents

David Tien, Yung Liang, Ayushi Sisodiya


This paper presents a signal processing approach to analysing and identifying accent-discriminative features of four groups of English as a second language (ESL) speakers: Chinese, Indian, Japanese, and Korean. The features used for speech recognition include pitch, stress, formant frequencies, the Mel frequency coefficient, the log frequency coefficient, and the intensity and duration of spoken vowels. The study uses the MATLAB Speech Analysis Toolbox and highlights how data processing can be automated and results visualised. The proposed algorithm achieved an average success rate of 57.3% in identifying vowels in speech by the four non-native English speaker groups.
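As a rough illustration of two of the features listed above (pitch and intensity), the following minimal sketch estimates them for a single frame. It is not the paper's method or the toolbox's API: the function names `estimate_pitch` and `rms_intensity`, the autocorrelation approach, the 75-400 Hz search range, and the synthetic 200 Hz test tone are all assumptions made for illustration.

```python
import math


def rms_intensity(frame):
    """Root-mean-square amplitude of a frame (a simple intensity measure)."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


def estimate_pitch(frame, sample_rate, f_min=75.0, f_max=400.0):
    """Estimate pitch in Hz as the lag with the highest autocorrelation.

    f_min and f_max bound the search range; the defaults are assumed
    values for adult speech, not figures taken from the paper.
    """
    lag_min = int(sample_rate / f_max)  # shortest period considered
    lag_max = int(sample_rate / f_min)  # longest period considered
    best_lag, best_score = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        # Unnormalised autocorrelation of the frame at this lag.
        score = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag


# Synthetic stand-in for a voiced vowel frame: a pure 200 Hz tone
# sampled at 8 kHz (hypothetical test data, not speech from the study).
sample_rate = 8000
f0 = 200.0
frame = [math.sin(2 * math.pi * f0 * n / sample_rate) for n in range(400)]

pitch = estimate_pitch(frame, sample_rate)
intensity = rms_intensity(frame)
print(pitch, intensity)
```

Real vowel recordings would need framing, windowing, and voicing detection before a step like this; the sketch only shows the core idea of reading a period off the autocorrelation peak.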


Speech Recognition; Feature Extraction; Data Visualisation; Vowel Recognition; Phonology Analysis




This work is licensed under a Creative Commons Attribution 3.0 License.


IT in Industry (2012 - ) http://www.it-in-industry.com ISSN (Online): 2203-1731; ISSN (Print): 2204-0595