IMPLEMENTATION OF SPEECH RECOGNITION USING LONG SHORT-TERM MEMORY METHOD WITH MFCC FOR DYNAMIC PRESENTATIONS

ADHTAMA, SATRIYA (2024) IMPLEMENTATION OF SPEECH RECOGNITION USING LONG SHORT-TERM MEMORY METHOD WITH MFCC FOR DYNAMIC PRESENTATIONS. Tugas Akhir thesis, Informatics.

[img] Text
5200411545_SATRIYA ADHITAMA_ABSTRAK.pdf

Download (11kB)

Abstract

ABSTRACT Presentation is a method for communicating an idea or idea that is presented in such a way that the audience can easily understand what the speaker is conveying. Effective communication can be improved by using interactive presentation media such as PowerPoint. Operating this presentation software sometimes becomes an obstacle for speakers when they want to go to the desired section because it requires an operator or other supporting equipment. Speech recognition can be used to assist presenters in giving commands to operate presentation display software that has been prepared dynamically. This voice recognition system for dynamic presentations uses Long Short-Term Memory (LSTM) which is a development of Recurrent Neural Network (RNN) to handle sequential data such as voice. This LSTM implementation is carried out by extracting MFCC features so that sound signals can represent human hearing. The best LSTM model produces quite good performance, namely 95.29% for training, 94.54% for validation, and 94.28% for testing. Keywords: Speech Recognition, LSTM, MFCC, Presentation Devices, Presentations

Item Type: Thesis (Skripsi, Tugas Akhir or Kerja Praktek) (Tugas Akhir)
Subjects: T Technology > T Technology (General)
Divisions: Fakultas Sains Dan Teknologi > S1 Informatika
Depositing User: Kaprodi S1 Informatika UTY
Date Deposited: 07 Aug 2024 02:35
Last Modified: 07 Aug 2024 02:35
URI: http://eprints.uty.ac.id/id/eprint/15818

Actions (login required)

View Item View Item