Abstract
In this paper, a self-defined wake-up-word (WUW) recognition system and its embedded implementation are proposed. The system operates in two phases: a training phase and a testing (comparison) phase. In the training phase, a wake-up word in any language is recorded, and the speech segment is extracted using Voice Activity Detection (VAD). Mel-Frequency Cepstral Coefficients (MFCCs) are then computed as pre-processing to extract features of the input speech signal for subsequent use. The Expectation-Maximization algorithm is used to train a Gaussian Mixture Model, and the Baum-Welch algorithm is used to train a Hidden Markov Model; these two models are combined into a data model of the speaker's speech dataset. In the testing (comparison) phase, an unknown speech segment is input, and VAD and MFCC extraction are applied for the same purpose as in the training phase. The log-likelihood of the extracted features under the Gaussian Mixture Model is then computed to identify the corresponding speaker, and the Viterbi algorithm is used to calculate the state sequence of the unknown speech through the Hidden Markov Model. Finally, Gaussian Mixture Model similarity is computed, and the Levenshtein distance algorithm is used to compare the dataset state sequence with the state sequence of the unknown speech. The system works well with a small amount of training data, and it is implemented on an embedded board to evaluate performance, where recognizing the wake-up word takes 1.4 seconds.
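To make the comparison step concrete, the following is a minimal Python sketch of the two scoring operations named above: GMM log-likelihood scoring of MFCC features (assuming diagonal covariances) and the Levenshtein distance between HMM state sequences. All function names, array shapes, and parameter values here are illustrative assumptions, not the paper's actual implementation.

```python
# Sketch of the testing-phase scoring, assuming diagonal-covariance GMMs and
# integer-valued HMM state sequences (illustrative, not the paper's code).
import numpy as np

def gmm_log_likelihood(features, weights, means, variances):
    """Average per-frame log-likelihood of MFCC features under a diagonal-covariance GMM.

    features:  (T, D) MFCC vectors; weights: (K,) mixture weights;
    means:     (K, D) component means; variances: (K, D) diagonal covariances.
    """
    diff = features[:, None, :] - means[None, :, :]                     # (T, K, D)
    exponent = -0.5 * np.sum(diff ** 2 / variances, axis=2)             # (T, K)
    log_norm = -0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)   # (K,)
    log_comp = np.log(weights) + log_norm + exponent                    # (T, K)
    # log-sum-exp over mixture components, then average over frames
    return float(np.mean(np.logaddexp.reduce(log_comp, axis=1)))

def levenshtein(seq_a, seq_b):
    """Edit distance between two HMM state sequences (insert/delete/substitute cost 1)."""
    m, n = len(seq_a), len(seq_b)
    dist = np.zeros((m + 1, n + 1), dtype=int)
    dist[:, 0] = np.arange(m + 1)
    dist[0, :] = np.arange(n + 1)
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if seq_a[i - 1] == seq_b[j - 1] else 1
            dist[i, j] = min(dist[i - 1, j] + 1,         # deletion
                             dist[i, j - 1] + 1,         # insertion
                             dist[i - 1, j - 1] + cost)  # substitution
    return int(dist[m, n])

# Example with synthetic data: score random "MFCC" frames under a toy GMM and
# compare a stored wake-up-word state sequence with an unknown utterance.
rng = np.random.default_rng(0)
mfcc = rng.normal(size=(50, 13))          # 50 frames of 13-dimensional features
w = np.full(4, 0.25)                      # 4 equally weighted components
mu = rng.normal(size=(4, 13))
var = np.ones((4, 13))
print("Avg log-likelihood:", gmm_log_likelihood(mfcc, w, mu, var))

reference_states = [0, 0, 1, 1, 2, 3, 3, 4]
unknown_states = [0, 1, 1, 2, 2, 3, 4, 4]
print("Levenshtein distance:", levenshtein(reference_states, unknown_states))
```

In practice, the speaker with the highest GMM log-likelihood would be selected, and a small Levenshtein distance between the stored and decoded state sequences would indicate that the unknown utterance matches the enrolled wake-up word.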