การเปรียบเทียบสถิติบูทสแตรปของการออกแบบการปรับเทียบสำหรับแบบสอบรูปแบบผสมระหว่างวิธีการที่ต่างกัน

พนิดา พานิชวัฒนะ

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/43132

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	ณัฏฐภรณ์ หลาวทอง	en_US
dc.contributor.author	พนิดา พานิชวัฒนะ	en_US
dc.contributor.other	จุฬาลงกรณ์มหาวิทยาลัย. คณะครุศาสตร์	en_US
dc.date.accessioned	2015-06-24T06:24:21Z
dc.date.available	2015-06-24T06:24:21Z
dc.date.issued	2556	en_US
dc.identifier.uri	http://cuir.car.chula.ac.th/handle/123456789/43132
dc.description	วิทยานิพนธ์ (ค.ม.)--จุฬาลงกรณ์มหาวิทยาลัย, 2556	en_US
dc.description.abstract	การวิจัยครั้งนี้มีวัตถุประสงค์เพื่อ 1) เพื่อเปรียบเทียบสถิติบูทสแตรป (bootstrapped statistics) ของการออกแบบการปรับเทียบคะแนนแบบสอบรูปแบบผสมโดยใช้ข้อสอบร่วมสำหรับกลุ่มไม่เท่าเทียมกัน (NEAT) ระหว่างวิธีการตรวจที่แตกต่างกัน 2) เพื่อเปรียบเทียบค่ารากที่สองของค่าเฉลี่ยความแตกต่างยกกำลังสอง (RMSD) ของการออกแบบการปรับเทียบคะแนนแบบสอบรูปแบบผสมโดยใช้ข้อสอบร่วมสำหรับกลุ่มไม่เท่าเทียมกัน (NEAT) ระหว่างวิธีของเลอวิน (Levine method) และวิธี CLE (Chained Linear Equating) กลุ่มตัวอย่างเป็นนักเรียนระดับชั้นมัธยมศึกษาปีที่ 4 สำนักงานเขตพื้นที่การศึกษามัธยมศึกษาเขต 1 กรุงเทพมหานคร จำนวน 1,609 คน โดยสุ่มกลุ่มตัวอย่างด้วยวิธีการสุ่มแบบหลายขั้นตอน เครื่องมือที่ใช้ใน การวิจัยครั้งนี้ ได้แก่ 1) แบบสอบวิชาฟิสิกส์รูปแบบผสม เรื่องงานและพลังงาน ระดับชั้นมัธยมศึกษาปีที่ 4 จำนวน 2 ฉบับ ซึ่งเป็นแบบสอบคู่ขนาน ได้แก่ แบบสอบฉบับใหม่ ประกอบด้วยข้อสอบแบบหลายตัวเลือกจำนวน 16 ข้อ และข้อสอบแบบสร้างคำตอบจำนวน 8 ข้อ ที่มีค่าความเที่ยงเป็น .634 และ .824 ตามลำดับ และแบบสอบฉบับอ้างอิง ประกอบด้วยข้อสอบแบบหลายตัวเลือกจำนวน 16 ข้อ และข้อสอบแบบสร้างคำตอบจำนวน 8 ข้อ ที่มีค่าความเที่ยงเป็น .602 และ .872 ตามลำดับ ซึ่งแบบสอบทั้งสองฉบับมีข้อสอบร่วมจำนวน 12 ข้อ 2) เกณฑ์การให้คะแนนที่ใช้ในการตรวจข้อสอบแบบสร้างคำตอบในแบบสอบฉบับใหม่และฉบับอ้างอิง ที่มีความเที่ยงระหว่างผู้ประเมิน มีค่าสัมประสิทธิ์สหสัมพันธ์แบบเพียร์สัน (r) คือ .96 และ .97 อย่างมีนัยสำคัญทางสถิติที่ระดับ .05 ตามลำดับ 3) ผู้ตรวจให้คะแนน 2 คน ที่มีความสอดคล้องในการให้คะแนนระหว่าง ผู้ประเมินเป็นรายข้อสำหรับแบบสอบทั้ง 2 ฉบับ มีค่าสัมประสิทธิ์สหสัมพันธ์แบบเพียร์สัน (r) ในแต่ละข้ออยู่ระหว่าง .86 - .99 และ .84 – 1.00 อย่างมีนัยสำคัญทางสถิติที่ระดับ .05 ตามลำดับ โดยวิธีการตรวจทั้ง 3 วิธี คือ 1) วิธีตรวจข้อสอบร่วมแบบสร้างคำตอบ โดยไม่มีการตรวจให้คะแนนข้ามกลุ่ม 2) วิธีตรวจข้อสอบร่วมแบบสร้างคำตอบ โดยมีการตรวจให้คะแนนข้ามกลุ่มทั้งหมดของข้อสอบร่วม 3) วิธีตรวจข้อสอบร่วมแบบสร้างคำตอบ โดยมีการตรวจให้คะแนนข้ามกลุ่มครึ่งหนึ่งของข้อสอบร่วม การปรับเทียบคะแนนด้วยวิธีการปรับเทียบเชิงเส้นตรงทั้ง 2 วิธี และการวิเคราะห์ค่าสถิติบูทสแตรปจะใช้โปรแกรม R ส่วนการวิเคราะห์ค่ารากที่สองของค่าเฉลี่ยความแตกต่างยกกำลังสองใช้โปรแกรม Microsoft excel ผลการวิจัยพบว่า 1) วิธีตรวจข้อสอบร่วมแบบสร้างคำตอบ โดยมีการตรวจให้คะแนนข้ามกลุ่มครึ่งหนึ่งของข้อสอบร่วม เมื่อใช้วิธี CLE (Chained Linear Equating) มีค่ารากที่สองของค่าเฉลี่ยความคลาดเคลื่อนยกกำลังสอง (RMSE) ค่าความคลาดเคลื่อนในการปรับเทียบ (equating error) และ ค่าความลำเอียง (bias) ต่ำที่สุด คือ .802, .251 และ .840 ตามลำดับ 2) ในบริบทของการวิจัยนี้วิธี CLE (Chained Linear Equating) มีค่ารากที่สองของค่าเฉลี่ยความแตกต่างยกกำลังสอง (RMSD) จากวิธีการตรวจวิธีที่ 1, 2 และ 3 คือ 1.421, 1.481 และ 1.481 ตามลำดับ ซึ่งเป็นค่าที่ต่ำกว่าวิธีของเลอวิน (Levine method) จากวิธีการตรวจวิธีที่ 1, 2 และ 3 คือ 1.717, 1.804 และ 1.793 ตามลำดับ	en_US
dc.description.abstractalternative	The purposes of this research were 1) to compare bootstrapped statistics of equating mixed-format test of nonequivalent groups with anchor test equating design among scoring methods 2) to compare root mean squared difference (RMSD) of equating mixed-format test of nonequivalent groups with anchor test equating design between Levine method and CLE (Chained linear equating) method. The sample were 1,607 students obtained by multi-stage random sampling from Matayomsuksa 4 student of Office of Bangkok Educational Service Area Zone I. They were used for data collecting with 1) two parallel mixed - format tests of the Physics, the work and energy for Matayomsuksa 4. There were new form and reference form. New form consisted 16 multiple-choice items and 8 construct-response items, their reliability values were .634 and .824. Reference form consisted 16 multiple-choice items and 8 construct-response items, with reliability value of .602 and .872. Those form had 8 multiple-choice items and 4 construct-response items in anchor test. 2) rubric scoring used for constructed-response items in Physics test, new and reference forms, had inter-rater reliability with Pearson’s product moment correlation were .96 and .97 at the .05 level of significance. 3) two raters, who had consistency between two raters of two tests with Pearson’s product moment correlation were .86 - .99 and .84 – 1.00 at the .05 level of significance. Three scoring methods were 1) constructed-response anchor items scoring methods with no trend scoring method 2) constructed-response anchor items scoring methods with full anchor test trend scoring method 3) constructed-response anchor items scoring methods with half anchor test trend scoring method. Equating score with linear equating method and analyzing bootstrapped statistics employed program R and analyzing Root Mean Squared Difference employed Microsoft excel. The results of the study revealed that 1) constructed-response anchor items scoring methods with half anchor test trend scoring method with CLE (Chained linear equating) method resulted in smallest Root Mean Squared Error (RMSE), Equating error and Bias were .802, .251 and .840. 2) For this study, CLE (Chained linear equating) method had Root Mean Squared Difference (RMSD) from scoring methods of 1, 2 and 3 method were 1.421, 1.481 and 1.481, which lower than those of equating with Levine method were 1.717, 1.804 and 1.793.	en_US
dc.language.iso	th	en_US
dc.publisher	จุฬาลงกรณ์มหาวิทยาลัย	en_US
dc.relation.uri	http://doi.org/10.14457/CU.the.2013.603	-
dc.rights	จุฬาลงกรณ์มหาวิทยาลัย	en_US
dc.subject	สถิติ -- การประมวลผลข้อมูล
dc.subject	การประเมินผลทางการศึกษา
dc.subject	Statistics -- Data processing
dc.subject	Educational evaluation
dc.title	การเปรียบเทียบสถิติบูทสแตรปของการออกแบบการปรับเทียบสำหรับแบบสอบรูปแบบผสมระหว่างวิธีการที่ต่างกัน	en_US
dc.title.alternative	A COMPARISION OF BOOTSTRAPPED STATISTICS IN EQUATING DESIGNS FOR MIXED FORMAT TEST AMONG DIFFERENT METHODS	en_US
dc.type	Thesis	en_US
dc.degree.name	ครุศาสตรมหาบัณฑิต	en_US
dc.degree.level	ปริญญาโท	en_US
dc.degree.discipline	การวัดและประเมินผลการศึกษา	en_US
dc.degree.grantor	จุฬาลงกรณ์มหาวิทยาลัย	en_US
dc.email.advisor	Nuttaporn.l@chula.ac.th	en_US
dc.identifier.DOI	10.14457/CU.the.2013.603	-
Appears in Collections:	Edu - Theses

Files in This Item:

File	Description	Size	Format
5583327827.pdf		12.43 MB	Adobe PDF	View/Open

Show simple item record