การจำแนกเสียงพยัญชนะกักภาษาไทยในตำแหน่งต้นพยางค์โดยอัตโนมัติเมื่อทราบฐานกรณ์

บุศมาส พลกุล

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/36084

Title:	การจำแนกเสียงพยัญชนะกักภาษาไทยในตำแหน่งต้นพยางค์โดยอัตโนมัติเมื่อทราบฐานกรณ์
Other Titles:	Automatic classification of Thai syllable-initial stop consonants with known place of articulation
Authors:	บุศมาส พลกุล
Advisors:	อติวงศ์ สุชาโต
Other author:	จุฬาลงกรณ์มหาวิทยาลัย. คณะวิศวกรรมศาสตร์
Advisor's Email:	Atiwong.S@Chula.ac.th
Subjects:	ภาษาไทย -- สัทศาสตร์ ภาษาไทย -- การออกเสียง Thai language -- Phonetics Thai language -- Pronunciation
Issue Date:	2550
Publisher:	จุฬาลงกรณ์มหาวิทยาลัย
Abstract:	วิทยานิพนธ์ฉบับนี้ได้ทำการศึกษาเกี่ยวกับการจำแนกประเภทเสียงพยัญชนะกักภาษาไทยในตำแหน่งต้นพยางค์โดยอัตโนมัติเมื่อทราบฐานกรณ์ โดยอาศัยหลักการสวนสัทศาสตร์ มาหาค่าสวนลักษณ์ของพยัญชนะต้นเสียงกักภาษาไทย ซึ่งได้แก่ ความยาวช่วงกักลม ความยาวช่วงปล่อยลม ค่าพลังงานเฉลี่ยในช่วงความถี่ต่ำที่ 60 ถึง 300 เฮิรตซ์ ของสัญญาณเสียงช่วงกักลม ค่าพลังงานของช่วงปล่อยลม และจำนวนครั้งของเวลาที่สัญญาณคลื่นเสียงแกว่งเกินค่าศูนย์ในช่วงกักลม โดยอาศัยวิธีการแบ่งเซกเมนต์เพื่อหาช่วงกักลม และช่วงปล่อยลมของสัญญาณเสียงพยัญชนะกักที่ถูกบันทึกในรูปแบบ ของ “สระ - พยัญชนะกัก – สระ” โดยที่ สระตัวแรกเป็นสระอะแบบไม่เน้นพยางค์ พยัญชนะเป็นเสียงของพยัญชนะกักต้นพยางค์ มีด้วยกันเป็น 8 เสียงหลัก บ ป พ ด ต ท ก ข และสระตัวหลังเป็นเสียงสระ 8 เสียง คือ สระอะ อา อิ อี อุ อู แอะ แอ หลังจากนั้นจะนำสัญญาณเสียงช่วงกักลม และช่วงปล่อยลมที่ได้ไปสกัดค่าสวนลักษณ์ทั้ง 5 ค่า เพื่อใช้เป็นค่าคุณลักษณะของการวิเคราะห์แบบดิสคริมิแนนต์เชิงเส้น ในการวิเคราะห์จะใช้วิธีวัดระยะห่างตำแหน่งระหว่างสัญญาณเสียงทดสอบกับสัญญาณเสียงต้นแบบตามวิธีของยุคลิด และผลที่ได้จากการจำแนกประเภทเสียงพยัญชนะกักแบ่งเป็น 3 ประเภท ได้แก่ เสียงโฆษะ เสียงอโฆษะแบบสิถิล และเสียงอโฆษะแบบธนิต หลังจากนำสัญญาณเสียงของคนปกติไปทดสอบการจำแนกประเภทเสียงตามวิธีการที่ได้ศึกษา พบว่าความถูกต้องเฉลี่ยที่ได้มีค่าร้อยละ 83 89 และ 88 ตามฐานริมฝีปาก ฐานเพดานอ่อน และฐานปุ่มเหงือก ตามลำดับ และเมื่อสัญญาณเสียงที่เข้ามาทดสอบการจำแนกประเภทเสียงเป็นเสียงของผู้ป่วยที่เป็นโรคปากแหว่งเพดานโหว่ ผลการจำแนกประเภทเสียงที่ได้ตรงกับผลการจำแนกเสียงของผู้เชี่ยวชาญทางด้านสวนลักษณ์ถึงร้อยละ 71 จากผลดังกล่าวพบว่ามีความถูกต้องมากกว่าผลการจำแนกประเภทเสียงของวิธีของฮิดเดนมาร์คอฟถึงร้อยละ 18
Other Abstract:	The goal of this thesis is to create an automatic classification of Thai syllable-initial stop consonants with known place of articulation. The baseline algorithm makes use of acoustic-phonetic knowledge. This work focuses on acoustic discrimination among voiced, voiceless unaspirated and voiceless aspirated stop consonants with the same place of articulation. Region of interest associated with a stop consonant in the vowel-consonant-vowel (V [subscript 1]C[subscript s] V [subscript 2]) where V1 was a schwa, C [subscript s] was one of the eight Thai stop consonants, /b/ /p/ /p [superscript h]/ /d/ /t/ /t [superscript h]/ /k/ and /k [superscript h]/, and V [subscript 2] was either one of eight Thai corner vowels. Since the interested region is in the middle of the speech, the segmentation algorithm is also applied for justifying the stop consonant part. Acoustic measurements based on the Voicing Onset Time (VOT), the Closure Duration, which is the duration from the point where the closure is formed to the point where it is released, the time-averaged Energy in 60-300 Hz frequency range of the speech signal in the closure interval (E [subscript lo] ), the Energy in the interval starting from the release to the point where voicing of V [subscript 2] start (E [subscript rel]) and the Zero Crossing Rate in the Closure Duration were used to form the classification feature vectors of Linear Discriminant Analysis algorithm. Experiment reports were done on cleft lips and palates (CLP) utterances and non-CLP utterances. The classification accuracies of non-CLP tokens are 83%, 89% and 88% for labial, alveolar and velar cases respectively and the overall accuracy of CLP tokens is 71%, while the Hidden Markov Models algorithm can classify CLP data with only 53% which is 18% less than the classification accuracy from the acoustic-phonetic algorithm.
Description:	วิทยานิพนธ์ (วท.ม.)-- จุฬาลงกรณ์มหาวิทยาลัย, 2550
Degree Name:	วิทยาศาสตรมหาบัณฑิต
Degree Level:	ปริญญาโท
Degree Discipline:	วิทยาศาสตร์คอมพิวเตอร์
URI:	http://cuir.car.chula.ac.th/handle/123456789/36084
URI:	http://doi.org/10.14457/CU.the.2007.671
metadata.dc.identifier.DOI:	10.14457/CU.the.2007.671
Type:	Thesis
Appears in Collections:	Eng - Theses

Files in This Item:

File	Description	Size	Format
budsamas_ph.pdf		2.79 MB	Adobe PDF	View/Open

Show full item record