A computer controlled system by Thai speech using spectrum analysis and an artificial neural network

พงษ์ศักดิ์ ชูงาม

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/65372

Title:	ระบบควบคุมคอมพิวเตอร์ด้วยเสียงพูดภาษาไทย โดยใช้เทคนิคการวิเคราะห์สเปกตรัมและโครงข่ายประสาทเทียม
Other Titles:	A computer controlled system by Thai speech using spectrum analysis and an artificial neural network
Authors:	พงษ์ศักดิ์ ชูงาม
Advisors:	สาธิต วงศ์ประทีป
Other author:	Chulalongkorn University. Faculty of Engineering
Subjects:	Automatic speech recognition; Neural networks (Computer science); Spectrum analysis
Issue Date:	2001 (B.E. 2544)
Publisher:	Chulalongkorn University
Abstract:	This research aims to develop a speech recognition method that uses frequency-domain analysis to extract the distinctive features of speech in the form of frequency bands, together with a backpropagation neural network that takes those frequency bands as its input. A prototype program was developed to demonstrate the system in operation. The test data set consists of 50 sounds, each assigned to represent a command or a key on the keyboard. When the program receives a sound, it determines the starting point of the sound and computes its frequency bands. The frequency bands are the input to the neural network, which finds the pattern that matches the data it was trained on. In the experiments, the system recognized sounds correctly 87.7 percent of the time. The problem found lies in the audio input system: the frequency bands are computed over fixed time intervals, so the input frame sometimes fails to cover the whole speech signal, and the frequency bands are wrong when the speech signal is incomplete. The example program serves as a prototype for developing continuous speech recognition.
Other Abstract:	The purpose of this research is to develop a speech recognition algorithm that uses frequency-domain analysis to identify spectral patterns, combined with a backpropagation neural network. The results of the spectrum analysis are fed to the neural network. An example program was developed to show the algorithm working in a real system. A set of 50 utterances is used; these utterances stand for window commands or keys. When the program receives speech, it finds the starting point of the speech and calculates its frequency spectrum. The frequency spectrum is the input pattern for the neural network, and the network's output is matched against the training patterns. In the experiments, the system recognized speech with 87.7% accuracy. The main problem was found to be in the signal input stage: the short-time spectrum calculation cannot always cover the whole speech signal, and the frequency spectrum is lost if the speech signal is incomplete. The example program is intended as a model for developing continuous speech recognition.
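The front end described in the abstract (locate the start of the utterance, then compute a short-time spectrum and reduce it to frequency bands for the network input) can be sketched as follows. This is only an illustrative sketch, not the thesis implementation: the frame length, number of frames, number of bands, energy threshold, Hamming window, and log compression are all assumptions, and the function names `find_start` and `band_features` are hypothetical.

```python
import numpy as np

def find_start(signal, frame_len=256, threshold_ratio=0.1):
    """Return the sample index of the first frame whose energy exceeds
    a fraction of the peak frame energy (assumed start-point rule)."""
    n_frames = len(signal) // frame_len
    if n_frames == 0:
        return 0
    frames = signal[: n_frames * frame_len].reshape(n_frames, frame_len)
    energy = (frames ** 2).sum(axis=1)
    above = np.nonzero(energy > threshold_ratio * energy.max())[0]
    return int(above[0]) * frame_len if above.size else 0

def band_features(signal, start, frame_len=256, n_frames=8, n_bands=16):
    """Short-time spectrum of a fixed window after `start`, summed into
    frequency bands and flattened into one network input vector."""
    needed = frame_len * n_frames
    segment = signal[start : start + needed]
    # Zero-pad when the fixed window runs past the end of the recording.
    segment = np.pad(segment, (0, needed - len(segment)))
    frames = segment.reshape(n_frames, frame_len) * np.hamming(frame_len)
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2
    # Drop the DC bin, then split the remaining bins into equal-width bands.
    bands = np.array_split(power[:, 1:], n_bands, axis=1)
    feats = np.stack([b.sum(axis=1) for b in bands], axis=1)
    feats = np.log1p(feats)                      # compress dynamic range
    return (feats / (feats.max() + 1e-12)).ravel()

# Synthetic check: silence followed by a 1 kHz tone sampled at 8 kHz.
rate = 8000
t = np.arange(rate // 2) / rate
tone = 0.5 * np.sin(2 * np.pi * 1000 * t)
signal = np.concatenate([np.zeros(1024), tone])

start = find_start(signal)
x = band_features(signal, start)
print(start, x.shape)
```

Because the feature window has a fixed length, an utterance that begins before the detected start point or runs past the last frame is only partly captured, which is the limitation the abstract reports for the input stage.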
Description:	Thesis (M.Sc.)--Chulalongkorn University, 2001 (B.E. 2544)
Degree Name:	Master of Science
Degree Level:	Master's degree
Degree Discipline:	Computer Science
URI:	http://cuir.car.chula.ac.th/handle/123456789/65372
ISBN:	9740316395
Type:	Thesis
Appears in Collections:	Eng - Theses

Files in This Item:

File	Description	Size	Format
Pongsak_ch_front_p.pdf	Cover and abstract	730.63 kB	Adobe PDF
Pongsak_ch_ch1_p.pdf	Chapter 1	658.23 kB	Adobe PDF
Pongsak_ch_ch2_p.pdf	Chapter 2	1.9 MB	Adobe PDF
Pongsak_ch_ch3_p.pdf	Chapter 3	1.15 MB	Adobe PDF
Pongsak_ch_ch4_p.pdf	Chapter 4	1.14 MB	Adobe PDF
Pongsak_ch_ch5_p.pdf	Chapter 5	635.91 kB	Adobe PDF
Pongsak_ch_back_p.pdf	Bibliography and appendices	663.78 kB	Adobe PDF
