Open-Set Speaker Identification Using a Multiple-Codebook Hidden Markov Model

พงศ์ไท ทาสระคู

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/4186

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	สมชาย จิตะพันธ์กุล	-
dc.contributor.advisor	จุฬารัตน์ ตันประเสริฐ	-
dc.contributor.author	พงศ์ไท ทาสระคู	-
dc.contributor.other	Chulalongkorn University. Faculty of Engineering	-
dc.date.accessioned	2007-09-18T10:15:29Z	-
dc.date.available	2007-09-18T10:15:29Z	-
dc.date.issued	2542	-
dc.identifier.isbn	9743340386	-
dc.identifier.uri	http://cuir.car.chula.ac.th/handle/123456789/4186	-
dc.description	Thesis (M.Eng.)--Chulalongkorn University, 2542 B.E. (1999)	en
dc.description.abstract	This thesis presents an open-set speaker identification system that uses hidden Markov models together with vector quantization with multiple codebooks. The system is text-dependent and operates on continuous speech. For the speaker verification step, which is the final stage of an open-set speaker identification system, a difference function is proposed. The experiments use a speech database of the continuous digit string "สาม-ห้า-สอง-เก้า-สี่" (three-five-two-nine-four), with 10 in-set speakers and 17 out-of-set speakers. Each speaker was recorded in 2 sessions held 1 month apart, with 10 utterances per speaker in each session. Of the three features studied (LPC, CEP, and MFCC), MFCC gave the best results: an average identification error rate of 0.40 percent, an average false acceptance rate of 0.71 percent, and an average false rejection rate of 9.40 percent.	en
dc.description.abstractalternative	This thesis develops an open-set speaker identification system using hidden Markov models and vector quantization with multiple codebooks. The system is text-dependent and operates on continuous speech. In the final verification stage, a difference function is proposed to improve system performance. The speech database used in the experiments contains the digit string "3-5-2-9-4", or /sa:2 s@:ng ka:w2 si:1/, spoken by 10 speakers and 17 impostors. Each speaker was recorded in two sessions, the second held one month after the first, with 10 recordings per session. Three features were tested (LPC, CEP, and MFCC); MFCC gave the best results, with a 0.40% average identification error rate, a 0.71% average false acceptance rate, and a 9.40% average false rejection rate.	en
dc.format.extent	10603793 bytes	-
dc.format.mimetype	application/pdf	-
dc.language.iso	th	en
dc.publisher	Chulalongkorn University	en
dc.rights	Chulalongkorn University	en
dc.subject	Automatic speech recognition	en
dc.subject	Hidden Markov models	en
dc.title	Open-Set Speaker Identification Using a Multiple-Codebook Hidden Markov Model	en
dc.title.alternative	Open set speaker identification using multiple codebook HMM	en
dc.type	Thesis	en
dc.degree.name	Master of Engineering	en
dc.degree.level	Master's degree	en
dc.degree.discipline	Electrical Engineering	en
dc.degree.grantor	Chulalongkorn University	en
dc.email.advisor	Somchai.J@chula.ac.th	-
dc.email.advisor	mook@notes.nectec.or.th	-
Appears in Collections:	Eng - Theses

Files in This Item:

File	Description	Size	Format
pongthai.pdf		7.33 MB	Adobe PDF
