Date of Award

5-2018

Document Type

Thesis

Degree Name

Master of Science (MS)

Department

Computer Engineering and Sciences

First Advisor

Veton Z. Kepuska

Second Advisor

Samuel P. Kozaitis

Third Advisor

Josko Zec

Fourth Advisor

Marius Silaghi

Abstract

The Automatic speech recognition (ASR) system is one of most important technologies that used for human-machine interaction. The main goal of the ASR system is to recognize different natural languages that are spoken by humans. The difficulties of these recognition systems depend on many factors, such as noises, variability of the speakers, and problems of continuous speech. For that reason, many researchers and foundations have designed different kinds of licensed toolkits and software that are specialized in building speech recognition systems, including , Julius, Sphinx-4, RWTH ASR, and HTK toolkits. In this thesis, Kaldi toolkit, which is one of the most notable speech recognition tools that is written in C++ and released under the Apache License v2.0, is used to build, train, and evaluate a digital ASR system. First, the speech recognition system has been explained in detail and built using the TIDIGITS corpus. Second, different training approaches(including discriminative training methods) have been studied and applied to improve the accuracy of the speech recognition system. The ASR system accuracy has been evaluated using two evolution metrics: the word error rate(WER) and the sentence error rate (SER). The overall obtained system performance is ranged from 99.05% to 99.55% depending on the training methods that have been applied.

Comments

Recommended Citation

Alyousefi, Sarah Habeeb, "Digital Automatic Speech Recognition using Kaldi" (2018). Theses and Dissertations. 711.
https://repository.fit.edu/etd/711

Download

Included in

Computer Engineering Commons

COinS

Theses and Dissertations

Digital Automatic Speech Recognition using Kaldi

Date of Award

Document Type

Degree Name

Department

First Advisor

Second Advisor

Third Advisor

Fourth Advisor

Abstract

Comments

Recommended Citation

Included in

Search

Browse

Author Corner

Theses and Dissertations

Digital Automatic Speech Recognition using Kaldi

Author

Date of Award

Document Type

Degree Name

Department

First Advisor

Second Advisor

Third Advisor

Fourth Advisor

Abstract

Comments

Recommended Citation

Included in

Share

Search

Browse

Author Corner