การประสานเวลาอัตโนมัติแบบทันทีระหว่างเสียงและข้อความ

ณัฏฐ์ เลิศวงศ์คณากูล

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/44227

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	โปรดปราน บุณยพุกกณะ	-
dc.contributor.advisor	อติวงศ์ สุชาโต	-
dc.contributor.author	ณัฏฐ์ เลิศวงศ์คณากูล	-
dc.contributor.other	จุฬาลงกรณ์มหาวิทยาลัย. คณะวิศวกรรมศาสตร์	-
dc.date.accessioned	2015-08-03T09:03:58Z	-
dc.date.available	2015-08-03T09:03:58Z	-
dc.date.issued	2555	-
dc.identifier.uri	http://cuir.car.chula.ac.th/handle/123456789/44227	-
dc.description	วิทยานิพนธ์ (วศ.ม.)--จุฬาลงกรณ์มหาวิทยาลัย, 2555	en_US
dc.description.abstract	การประสานเวลาอัตโนมัติระหว่างเสียงและข้อความนั้น เป็นวิธีการที่แสดงเนื้อหาเดียวกันจากสื่อที่แตกต่างกัน ซึ่งในที่นี้คือเสียงและข้อความ ซึ่งโปรแกรมประยุกต์ส่วนใหญ่จะเป็นการประสานเวลาในระดับประโยค และใช้ข้อมูลของเสียงและข้อความทั้งหมดในการประสานเวลา แต่เนื่องด้วยความต้องการของโปรแกรมประยุกต์บางประเภท เช่น โปรแกรมการสร้างหนังสือเสียงซึ่งมีข้อความทั้งหมด และต้องการที่จะประสานเวลาในทันทีที่เสียงเข้ามาในระบบ อย่างไรก็ตาม ด้วยลักษณะของภาษาไทยซึ่งมีการแบ่งประโยคและคำไม่ชัดเจน ทำให้การประสานเวลานั้นมีความท้าทาย ดังนั้นวิทยานิพนธ์นี้จึงเสนอขั้นตอนวิธีในการประสานเวลาอัตโนมัติแบบทันทีระหว่างเสียงและข้อความในระดับพยางค์ ขั้นตอนวิธีที่นำเสนอนั้นใช้หลักการในการตรวจหาพยางค์และตรวจหาความไม่ตรงกันของการถอดเสียง การทดลองได้ศึกษาการใช้ลักษณะเด่นต่าง ๆ และการปรับค่าพารามิเตอร์อย่างละเอียด ขั้นตอนวิธีที่นำเสนอถูกนำมาเปรียบเทียบกับระบบอ้างอิง 2 ระบบ ซึ่งได้ผลลัพธ์ดีกว่าระบบอ้างอิง 75% และ 41% ตามลำดับ และในแง่ของเวลาสามารถคำนวณได้ในทันที	en_US
dc.description.abstractalternative	Most of the researches in synchronization of audio and text have been focusing on the synchronization at the level of utterance. However, to generate audio books in unstructed language like Thai from live speech, a finer lever of synchronization is necessary. We propose an algorithm to synchronize live speech with its corresponding transcription in real time at syllabic unit. The proposed algorithm employs the syllable detection concept and the transcription errors detection concept. The experiment was studied the features and the parameters empirically. The result were compared with 2 baselines and found that the proposed algorithm was better than 2 baselines 75% and 41% respectively. In term of processing time, the proposed algorithm was able to give the results in real-time.	en_US
dc.language.iso	th	en_US
dc.publisher	จุฬาลงกรณ์มหาวิทยาลัย	en_US
dc.relation.uri	http://doi.org/10.14457/CU.the.2012.436	-
dc.rights	จุฬาลงกรณ์มหาวิทยาลัย	en_US
dc.subject	การถอดเสียง	en_US
dc.subject	ระบบแปลงเสียงเป็นข้อความ	en_US
dc.subject	ระบบประมวลผลเสียงพูด	en_US
dc.subject	การรู้จำเสียงพูดอัตโนมัติ	en_US
dc.subject	ภาษาศาสตร์คอมพิวเตอร์	en_US
dc.subject	Transcription	en_US
dc.subject	Speech-to-text systems	en_US
dc.subject	Speech processing systems	en_US
dc.subject	Automatic speech recognition	en_US
dc.subject	Computational linguistics	en_US
dc.title	การประสานเวลาอัตโนมัติแบบทันทีระหว่างเสียงและข้อความ	en_US
dc.title.alternative	Real-Time automatic Speech-Text Alignment	en_US
dc.type	Thesis	en_US
dc.degree.name	วิศวกรรมศาสตรมหาบัณฑิต	en_US
dc.degree.level	ปริญญาโท	en_US
dc.degree.discipline	วิศวกรรมคอมพิวเตอร์	en_US
dc.degree.grantor	จุฬาลงกรณ์มหาวิทยาลัย	en_US
dc.email.advisor	proadpran.p@chula.ac.th	-
dc.email.advisor	atiwong.s@chula.ac.th	-
dc.identifier.DOI	10.14457/CU.the.2012.436	-
Appears in Collections:	Eng - Theses

Files in This Item:

File	Description	Size	Format
Nat_le.pdf		2.21 MB	Adobe PDF	View/Open

Show simple item record