ระบบการรับรู้เสียงพูดแบบต่างบุคคล

ไพศาล ธรรมโพธิทอง

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/30016

Title:	ระบบการรับรู้เสียงพูดแบบต่างบุคคล
Other Titles:	Multispeaker recognition system
Authors:	ไพศาล ธรรมโพธิทอง
Advisors:	วีระ ริ้วพิทักษ์
Other author:	จุฬาลงกรณ์มหาวิทยาลัย. บัณฑิตวิทยาลัย
Issue Date:	2533
Publisher:	จุฬาลงกรณ์มหาวิทยาลัย
Abstract:	ระบบการรับรู้เสียงพูดที่ทำการวิจัยนี้เป็นแบบคำเดี่ยวต่างบุคคล ซึ่งขั้นตอนในการวิจัยแบ่ง 3 ส่วน คือขั้นตอนแรกจะทำการหาค่าสัมประสิทธิ์ของสัญญาณ และค่าคาบเสียงสูงต่ำของกลุ่มคำ 2 กลุ่ม เพื่อจะนำมาสร้างเป็นแบบอ้างอิงจากผู้พูด 30 คน ขั้นตอนที่สองกลุ่มของแบบอ้างอิงทั้ง 2 กลุ่ม จะถูกนำมาผ่านการคัดเลือกให้ได้คำละ 10 แบบ โดยอาศัยเทคนิคการจัดกลุ่ม และทำการให้เลเบล ขั้นที่สามจะทำการทดสอบโดยอาศัยแบบทดสอบจากผู้พูด 5 คน มาทำการเปรียบเทียบกับแบบอ้างอิง การคำนวณหา ค่าความแตกต่างระหว่างแบบทดสอบกับแบบอ้างอิง ได้ใช้ เทคนิคของไดนามิคโปรแกรมมิ่ง และกฎการตัดสินใจได้อาศัยเทคนิคของ K-Nearest Neighbour (KNN) การกำหนดค่าความแตกต่างที่ยอมรับได้ ถูกนำมาใช้เพื่อปรับปรงอัตราความถูกต้องในการรับรู้ การทำ Post Processing โดยใช้ค่า Zero Crossing เพื่อปรับปรุงอัตราความถูกต้องของการรับรู้ระหว่างคำว่า "ปาก" และ "มาก" ผลการวิจัยจากคำทดสอบกลุ่มแรกไต้อัตราความถูกต้องร้อยละ 85.8 และคำทดสอบกลุ่มที่สอง ได้อัตราความถูกต้องร้อยละ 94.3
Other Abstract:	This multispeaker voice recognition system is an isolated word : speaker independent system. The procedure in developing this system is of 3 steps 1) The analysis of 2 sets of reference words from 30 speakers in terms of the reflection coefficient and pitch period values, 2) the reference patterns of these 2 sets of words are created and 10 desired patterns are created by clustering Technique and then labeled, 3) the test patterns from 5 subjects are compared with the reference patterns. The distance between the test and the reference patterns is formed by the Dynamic Programming Technique. The decision rules are created by using the K-nearest neighbor (KNN) technique. A range of acceptable distance value has been set to improve recognition rate. Post processing using Zero crossing was also added to improve the recognition rate in the word pair /pa:k/ and /ma:k/. 85.8% of accurate recognition is formed in the first set of words and 94.3 % is formed in the second set of words.
Description:	วิทยานิพนธ์ (วท.ม.)--จุฬาลงกรณ์มหาวิทยาลัย, 2533
Degree Name:	วิทยาศาสตรมหาบัณฑิต
Degree Level:	ปริญญาโท
Degree Discipline:	วิทยาศาสตร์คอมพิวเตอร์
URI:	http://cuir.car.chula.ac.th/handle/123456789/30016
ISBN:	9745778346
Type:	Thesis
Appears in Collections:	Grad - Theses

Files in This Item:

File	Size	Format
Paisan_th_front.pdf	4.69 MB	Adobe PDF	View/Open
Paisan_th_ch1.pdf	2.07 MB	Adobe PDF	View/Open
Paisan_th_ch2.pdf	14.58 MB	Adobe PDF	View/Open
Paisan_th_ch3.pdf	4.65 MB	Adobe PDF	View/Open
Paisan_th_ch4.pdf	6.42 MB	Adobe PDF	View/Open
Paisan_th_ch5.pdf	3.13 MB	Adobe PDF	View/Open
Paisan_th_ch6.pdf	1.44 MB	Adobe PDF	View/Open
Paisan_th_back.pdf	12.42 MB	Adobe PDF	View/Open

Show full item record