Automatic selection of online text for building a text corpus with a specifiable phoneme distribution

สุรพล วรภัทราทร

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/22155

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	โปรดปราน บุณยพุกกณะ	-
dc.contributor.advisor	อติวงศ์ สุชาโต	-
dc.contributor.author	สุรพล วรภัทราทร	-
dc.contributor.other	Chulalongkorn University. Faculty of Engineering	-
dc.date.accessioned	2012-09-21T10:38:02Z	-
dc.date.available	2012-09-21T10:38:02Z	-
dc.date.issued	2554	-
dc.identifier.uri	http://cuir.car.chula.ac.th/handle/123456789/22155	-
dc.description	Thesis (M.Eng.)--Chulalongkorn University, 2554	en
dc.description.abstract	The performance of automatic speech recognition and speech synthesis systems depends on the phonetic coverage of an appropriate text corpus. This thesis proposes the automatic construction of a text corpus from a specified phoneme distribution. The specified distribution can be defined by the type of phonetic unit, the corpus size, the minimum count threshold for each phonetic unit, and the shape of the target distribution. Text is selected from internet data, which is collected continuously by a process that extracts articles from web pages until an appropriate text corpus is obtained. The thesis also applies a greedy method to select the sentences that bring the phoneme distribution toward the target. In the experiments, text from the Large Vocabulary Continuous Speech Recognition (LVCSR) corpus for Thai language was used to build the target phoneme distribution. The results show that as the amount of text drawn from the internet increases, the phoneme distribution can meet the target, with diphone coverage reaching 99.13%. The resulting text corpus can therefore be used to build a speech corpus effectively.	en
dc.description.abstractalternative	The performance of Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) systems depends on an appropriate text corpus. This thesis describes a method for automatically generating a text corpus that follows a custom phonetic distribution. The distribution is defined by the phoneme type, the corpus size, the minimum required count of each phoneme, and the target phonetic distribution. The system selects text from the internet, downloading it continuously with a web crawler. A greedy algorithm is applied to extract the sentences that fit the target phonetic distribution until an appropriate text corpus is established. In the experiment, text from the Large Vocabulary Continuous Speech Recognition (LVCSR) corpus for Thai language is used to generate the target phonetic distribution. The results show that increasing the amount of data drawn from the internet makes it possible to reach the target phonetic distribution and yields diphone coverage of 99.13%. The resulting text corpus can then be used to build a speech corpus efficiently.	en
dc.format.extent	3256631 bytes	-
dc.format.mimetype	application/pdf	-
dc.language.iso	th	es
dc.publisher	Chulalongkorn University	en
dc.relation.uri	http://doi.org/10.14457/CU.the.2011.819	-
dc.rights	Chulalongkorn University	en
dc.subject	Automatic speech recognition	en
dc.subject	Thai language	en
dc.subject	Text processing (Computer science)	en
dc.title	Automatic selection of online text for building a text corpus with a specifiable phoneme distribution	en
dc.title.alternative	Automatic online text selection for constructing text corpus with custom phoneme distribution	en
dc.type	Thesis	es
dc.degree.name	Master of Engineering	es
dc.degree.level	Master's Degree	es
dc.degree.discipline	Computer Engineering	es
dc.degree.grantor	Chulalongkorn University	en
dc.email.advisor	Proadpran.Pu@Chula.ac.th	-
dc.email.advisor	Atiwong.S@Chula.ac.th	-
dc.identifier.DOI	10.14457/CU.the.2011.819	-
Appears in Collections:	Eng - Theses
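
The greedy sentence selection described in the abstract can be illustrated with a minimal sketch. This is not the thesis's implementation: the function names, the L1 distance used as the selection score, and the assumption that each candidate sentence already comes with its list of phonetic units (phonemes or diphones) are all hypothetical choices made for illustration.

```python
from collections import Counter


def distance(counts, target):
    """L1 distance between the relative frequencies in `counts` (a Counter)
    and the `target` distribution (unit -> probability).
    The L1 score is an assumption; the abstract does not name a metric."""
    total = sum(counts.values())
    if total == 0:
        return float("inf")
    keys = set(counts) | set(target)
    return sum(abs(counts[k] / total - target.get(k, 0.0)) for k in keys)


def coverage(counts, target, min_count=1):
    """Fraction of target units seen at least `min_count` times."""
    covered = sum(1 for k in target if counts[k] >= min_count)
    return covered / len(target)


def greedy_select(candidates, target, max_sentences):
    """Repeatedly add the candidate sentence that moves the corpus
    distribution closest to `target`; stop when nothing improves it.

    `candidates` is a list of (text, units) pairs, where `units` is the
    sentence's sequence of phonetic units (assumed to be precomputed).
    """
    selected = []
    counts = Counter()
    remaining = list(candidates)
    best = float("inf")
    while remaining and len(selected) < max_sentences:
        # Score every remaining sentence as if it were added to the corpus.
        scored = [
            (distance(counts + Counter(units), target), i)
            for i, (_, units) in enumerate(remaining)
        ]
        score, i = min(scored)
        if score >= best:
            break  # no remaining sentence brings the corpus closer
        text, units = remaining.pop(i)
        selected.append(text)
        counts.update(units)
        best = score
    return selected, counts
```

In a crawler-driven setup like the one the abstract describes, `candidates` would grow as new pages are fetched and the selection would be rerun (or continued) until the distance and coverage are acceptable; the sketch covers only the selection step.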

Files in This Item:

File	Description	Size	Format
surapol_vo.pdf		3.18 MB	Adobe PDF	View/Open
