Document Type
Article
Publication Title
Speech Communication
Abstract
Automatic Speech Recognition (ASR) systems can be trained to achieve remarkable performance given large amounts of manually transcribed speech, but large labeled data sets can be difficult or expensive to acquire for all languages of interest. In this paper, we review the research literature to identify models and ideas that could lead to fully unsupervised ASR, including unsupervised sub-word and word modeling, unsupervised segmentation of the speech signal, and unsupervised mapping from speech segments to text. The objective of the study is to identify the limitations of what can be learned from speech data alone and to understand the minimum requirements for speech recognition. Identifying these limitations would help optimize the resources and efforts in ASR development for low-resource languages. © 2022 The Author(s)
First Page
76
Last Page
91
DOI
10.1016/j.specom.2022.02.005
Publication Date
April 2022
Keywords
Mapping, Speech, Automatic speech recognition, Automatic speech recognition system, Cross-modal, Cross-modal mapping, Data set, Labeled data, Large amounts, Performance, Speech segmentation, Unsupervised automatic speech recognition, Speech recognition
Recommended Citation
H. Aldarmaki, A. Ullah, S. Ram, and N. Zaki, “Unsupervised automatic speech recognition: A review,” Speech Communication, vol. 139, pp. 76–91, 2022. doi:10.1016/j.specom.2022.02.005
Comments
Hybrid Gold Open Access
Archived, thanks to Elsevier ScienceDirect
License: CC BY-NC-ND 4.0
Uploaded 29 November 2023