การรู้จำหน่วยเสียงสระภาษาไทยโดยใช้โครงข่ายประสาทเทียม

เอกฤทธิ์ มณีน้อย

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/9874

Title:	การรู้จำหน่วยเสียงสระภาษาไทยโดยใช้โครงข่ายประสาทเทียม
Other Titles:	Thai vowel phoneme recognition using artificial neural networks
Authors:	เอกฤทธิ์ มณีน้อย
Advisors:	สมชาย จิตะพันธ์กุล สุดาพร ลักษณียนาวิน
Other author:	จุฬาลงกรณ์มหาวิทยาลัย. บัณฑิตวิทยาลัย
Advisor's Email:	Somchai.J@chula.ac.th Sudaporn.L@chula.ac.th
Subjects:	นิวรัลเน็ตเวิร์ค (คอมพิวเตอร์) การรู้จำเสียงพูดอัตโนมัติ ภาษาไทย -- สระ
Issue Date:	2541
Publisher:	จุฬาลงกรณ์มหาวิทยาลัย
Abstract:	วิทยานิพนธ์นี้มีจุดมุ่งหมายเพื่อพัฒนาระบบรู้จำหน่วยเสียงสระภาษาไทยโดยใช้โครงข่ายประสาทเทียม โครงข่ายประสาทเทียมที่ใช้ในงานวิจัยนี้มีโครงสร้างแบบ MLP ที่มีจำนวนชั้นซ่อนตัวหนึ่งชั้นและใช้ขั้นตอนวิธีการส่งค่าย้อนกลับในการฝึกฝน โครงข่ายประสาทเทียมจะถูกแบ่งออกเป็นโครงข่ายย่อยสำหรับรู้จำหน่วยเสียงสระเดี่ยวจากชุดคำศัพท์ที่ประกอบด้วยเสียงสระเดี่ยวภาษาไทย 9 หน่วยเสียงเพื่อการสร้างต้นแบบหน่วยเสียงที่มีประสิทธิภาพ นอกจากการรู้จำหน่วยเสียงสระเดี่ยวโดยโครงข่ายประสาทเทียมแล้วงานวิจัยนี้ได้สร้างแบบจำลองทางสถิติของช่วงความยาว เสียงสระเพื่อทำการรู้จำสระเสียงสั้น-ยาวกับชุดคำศัพท์ที่ประกอบด้วยสระเสียงสั้นยาวจำนวนน 6 เสียง ค่าลักษณะสำคัญที่ใช้ในงานวิจัยนี้ได้แก่ สัมประสิทธิ์การประมาณพันธะเชิงเส้น สัมประสิทธิ์ cepstral และความถี่ฟอร์แมนท์ ผลการทดสอบอัตราการรู้จำมีค่าร้อยละ 90.34 เมื่อใช้การแบ่งกลุ่มหน่วยเสียงสระตามความถี่ฟอร์แมนท์และใช้ค่าสัมประสิทธิ์ cepstral เป็นค่าลักษณะสำคัญ สูงกว่ากรณีที่ไม่มีการแบ่งกลุ่มหน่วยเสียงสระตามความถี่ฟอร์มแมนท์ที่มีอัตราการรู้จำร้อยละ 85.92 โดยที่ผู้พูดในชุดฝึกฝนมีจำนวน 30 คน และชุดทดสอบแบบไม่ขึ้นกับผู้พูด 20 คน
Other Abstract:	The objective of this research is to develop the Thai vowel-phoneme recognition system using an artificial neural network. The single hidden layer MLP neural network with the backpropagation algorithm, is employed in this research. The artificial neural network is divided into subnetworks for recognizing the 9 vowel phonemes from the vocabulary set. Furthermore, this thesis has also constructed the statistical model for recognizing short and long vowels. The features used in this research are linear prediction coefficient, cepstral coefficient, and formant frequency. The recognition rate of this system is 90.34 percent when the subdivided artificial neural networks correspond to formant frequency and cepstral coefficient. The recognition rate of this subdivided network is higher than the single network of which the recognition rate is 85.92 percent. Thirty training speakers and twenty test speakers are employed in this research.
Description:	วิทยานิพนธ์ (วศ.ม.)--จุฬาลงกรณ์มหาวิทยาลัย, 2541
Degree Name:	วิศวกรรมศาสตรมหาบัณฑิต
Degree Level:	ปริญญาโท
Degree Discipline:	วิศวกรรมไฟฟ้า
URI:	http://cuir.car.chula.ac.th/handle/123456789/9874
ISBN:	9743314229
Type:	Thesis
Appears in Collections:	Grad - Theses

Files in This Item:

File	Size	Format
Ekkarit_Ma_front.pdf	1.33 MB	Adobe PDF	View/Open
Ekkarit_Ma_ch1.pdf	945.44 kB	Adobe PDF	View/Open
Ekkarit_Ma_ch2.pdf	1.86 MB	Adobe PDF	View/Open
Ekkarit_Ma_ch3.pdf	1.24 MB	Adobe PDF	View/Open
Ekkarit_Ma_ch4.pdf	2.21 MB	Adobe PDF	View/Open
Ekkarit_Ma_ch5.pdf	822.22 kB	Adobe PDF	View/Open
Ekkarit_Ma_back.pdf	2.15 MB	Adobe PDF	View/Open

Show full item record