การรู้จำการอ่านริมฝีปากโดยการใช้เทคนิคการวิเคราะห์สัญญาณแปรตามเวลาและนิวรอลเน็ตเวิร์ก

ปกิต ศีลประชาวงศ์

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/5464

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	บุญเสริม กิจศิริกุล	-
dc.contributor.author	ปกิต ศีลประชาวงศ์	-
dc.contributor.other	จุฬาลงกรณ์มหาวิทยาลัย. คณะวิศวกรรมศาสตร์	-
dc.date.accessioned	2008-01-15T11:54:58Z	-
dc.date.available	2008-01-15T11:54:58Z	-
dc.date.issued	2543	-
dc.identifier.isbn	9743464425	-
dc.identifier.uri	http://cuir.car.chula.ac.th/handle/123456789/5464	-
dc.description	วิทยานิพนธ์ (วท.ม.)--จุฬาลงกรณ์มหาวิทยาลัย, 2543	en
dc.description.abstract	งานวิจัยนี้เสนอวิธีการสำหรับการรู้จำการอ่านริมฝีปาก (Lipreading Recognition) โดยใช้ข้อมูลที่เป็นลำดับภาพที่ได้จากภาพเทาของริมฝีปากของผู้พูด โดยในขั้นตอนการรู้จำมีการดึงข้อมูลของแต่ละภาพนำมาเข้าโมเดลโดยการใช้ การเปลี่ยนแปลงของความเข้มของแต่ละจุดเทียบกับเวลาเป็นสัญญาณหลัก และมีการใช้การแปลงแบบฟูเรียร์ (fourier transform) เพื่อแทนสัญญาณ จากนั้นจะดึงค่าสัมประสิทธิ์ฟูเรียร์ (Fourier coefficients) เพื่อใช้เป็นคุณลักษณะ (feature) ให้กับนิวรอลเน็ตเวิร์ก (Neural Networks) สำหรับขั้นตอนการรู้จำ เราทำการทดลองกับฐานข้อมูล 2 ชุด คือชุดตัวเลขและชุดตัวอักษรภาษาอังกฤษ ผลการทดลองแสดงให้เห็นถึงประสิทธิผลของวิธีที่นำเสนอ	en
dc.description.abstractalternative	This thesis presents an approach for lipreading recognition based on visual features extracted from gray level image sequences of the speaker's lips. The recognition is done by extracting visual information from each image, and the extracted information is modeled by using the intensity curve of pixels along the time axis as the primary signal. Fourier transform is then applied to this signal. Therefore, the fourier coefficients of a signal curve encode the motion information in a compact manner and are used as features to neural networks for the recognition. We run experiments on two databases of English digits and letters. The results show the effectiveness of our method.	en
dc.format.extent	749399 bytes	-
dc.format.mimetype	application/pdf	-
dc.language.iso	th	es
dc.publisher	จุฬาลงกรณ์มหาวิทยาลัย	en
dc.rights	จุฬาลงกรณ์มหาวิทยาลัย	en
dc.subject	การรู้จำเสียงพูดอัตโนมัติ	en
dc.subject	การอ่านริมฝีปาก	en
dc.subject	แบคพรอพาเกชัน (ปัญญาประดิษฐ์)	en
dc.subject	นิวรัลเน็ตเวิร์ค (คอมพิวเตอร์)	en
dc.title	การรู้จำการอ่านริมฝีปากโดยการใช้เทคนิคการวิเคราะห์สัญญาณแปรตามเวลาและนิวรอลเน็ตเวิร์ก	en
dc.title.alternative	Lipreading recognition using time-varying signal analysis and neural networks	en
dc.type	Thesis	es
dc.degree.name	วิทยาศาสตรมหาบัณฑิต	es
dc.degree.level	ปริญญาโท	es
dc.degree.discipline	วิทยาศาสตร์คอมพิวเตอร์	es
dc.degree.grantor	จุฬาลงกรณ์มหาวิทยาลัย	en
dc.email.advisor	boonserm@cp.eng.chula.ac.th, Boonserm.K@chula.ac.th	-
Appears in Collections:	Eng - Theses

Files in This Item:

File	Description	Size	Format
Pakit.pdf		731.83 kB	Adobe PDF	View/Open

Show simple item record