การวิเคราะห์อภิมานคุณภาพของแบบสอบหลายตัวเลือก ที่มีรูปแบบของแบบสอบบริบทของแบบสอบ และวิธีการตรวจให้คะแนนต่างกัน

ไชยยศ ไพวิทยศิริธรรม

dc.contributor.advisor	นงลักษณ์ วิรัชชัย
dc.contributor.advisor	ศิริชัย กาญจนวาสี
dc.contributor.author	ไชยยศ ไพวิทยศิริธรรม
dc.contributor.other	จุฬาลงกรณ์มหาวิทยาลัย. คณะครุศาสตร์
dc.date.accessioned	2020-03-20T03:18:50Z
dc.date.available	2020-03-20T03:18:50Z
dc.date.issued	2544
dc.identifier.isbn	9740306187
dc.identifier.uri	http://cuir.car.chula.ac.th/handle/123456789/64373
dc.description	วิทยานิพนธ์ (ค.ม.)--จุฬาลงกรณ์มหาวิทยาลัย, 2544	en_US
dc.description.abstract	การวิจัยครั้งนี้มีวัตถุประสงค์เพื่อสังเคราะห์งานวิจัยเกี่ยวกับคุณภาพของแบบสอบหลายตัวเลือก ประกอบด้วยค่าความเที่ยง ค่าความตรง ค่าความยากง่ายและค่าอำนาจจำแนก ที่มีรูป แบนของแบบสอบ บริบทของแบบสอบ วิธีการตรวจให้คะแนนและประเภทของแบบสอบต่างกันด้วยวิธีการวิเคราะห์อภิมานตามแนวคิดของ Hunter งานวิจัยที่นำมาสังเคราะห์เป็นงานวิจัยที่พิมพ์เผยแพร่ระหว่างปี 2511 ถึง 2542 จำนวน 96 เล่ม จากห้องสมุด 3 แห่ง คือ จุฬาลงกรณ์มหาวิทยาลัย มหาวิทยาลัยศรีนครินทรวิโรฒและมหาวิทยาลัยเกษตรศาสตร์ แบ่งเป็นงานวิจัยเชิงทดลองจำนวน 92 เล่ม และงานวิจัยเชิงสหสัมพ์นธ์จำนวน 4 เล่ม ฐานข้อมูลของการวิจัยครั้งนี้ประกอบด้วยค่าดัชนีมาตรฐานจำนวน 1,837 ค่าแบ่งเป็นค่าดัชนีมาตรฐานของค่าความเที่ยง 777 ค่า ค่าความตรง 461 ค่า ค่าความยากง่าย 283 ค่า และค่าอำนาจจำแนก 316 ค่า การวิเคราะห์ข้อมูลใช้สถิติบรรยายและการวิเคราะห์ความแปรปรวน ผลการวิจัยพบว่า 1. ค่าเฉลี่ยในภาพรวมของค่าความเที่ยงมีค่าเฉลี่ยเท่ากับ .713 ค่าตํ่าสุดและค่สูงสุดเป็น .020 และ .999 ตามลำดับ ตัวแปรปรับที่อธิบายความแตกต่างของค่าความเที่ยงอย่างมีนัยสำคัญและค่าเฉลี่ยของกลุ่มที่มีค่าเฉลี่ยสูงสุด ได้แก่ ตัวแปรการเรียงตัวเลือก: แบบสอบที่มีการเรียงตัวเลือกแบบค่ามากไปค่าห้อย (.775) การเรียงแบบสอบ: แบบสอบที่มีการเรียงจากพฤติกรรมการเรียนรู้ขั้นตํ่าไปขั้นสูง (.818) วิธีการตรวจให้คะแนน: แบบสอบที่มีวิธีการตอบและตรวจให้คะแนนด้วยวิธีให้คะแนนรายข้อต่างกันตามความสามารถของผู้สอบ (.951) และประเภทของแบบสอบ: แบบสอบเติมคำ (.852) 2. ค่าเฉลี่ยในภาพรวมของค่าความตรงเท่ากับ .528 ค่าตํ่าสุดและค่าสูงสุดเป็น .005 และ .950 ตามลำดับ ตัวแปรปรับที่อธิบายความแตกต่างของค่าความตรงอย่างมีนัยสำคัญและค่าเฉลี่ยของกลุ่มที่มีค่าเฉลี่ยสุงสุด ได้แก่ ตัวแปรรูปแบบตัวลวง: แบบสอบที่มีรูปแบบตัวลวงจากคำตอบผิดของนักเรียน (.691) การเรียงแบบสอบ: แบบสอบที่มีการเรียงจากพฤติกรรมการเรียนรู้ขั้นตํ่าไปขั้นสูง (.623) และวิธีการตรวจให้คะแนน: แบบสอบที่มีวิธีการตอบและตรวจ ให้คะแนนด้วยวิธีของสำราญ มีแจ้ง (.756) 3. ค่าเฉลี่ยในภาพรวมของค่าความยากง่ายเท่ากับ .474 ค่าตํ่าสุดและค่าสูงสุดเป็น .100 และ .871 ตามลำดับ ตัวแปรปรับที่อธิบายความแตกต่างของค่าความยากง่ายอย่างมีนัยสำคัญและค่าเฉลี่ยของกลุ่มที่มีค่าเฉลี่ยปานกลาง ได้แก่ ตัวแปรการเรียงแบบสอบ: แบบสอบที่มีการเรียงโดยการสุ่ม (.484) วิธีการตรวจให้คะแนน: แบบศอบที่มีวิธีการตอบและตรวจให้คะแนนด้วยวิธีแก้การเดา (.497) และประเภทของแบบสอบ: แบบสอบหลายตัวเลือก (.538) 4. ค่าเฉลี่ยในภาพรวมของค่าอำนาจจำแนกเท่ากับ .433 ค่าตํ่าสุดและค่าสูงสุดเป็น .125 และ .838 ตามลำดับ ตัวแปรปรับที่อธิบายความแตกต่างของค่าอำนาจจำแนกอย่างมีนัยสำคัญและค่าเฉลี่ยของกลุ่มที่มีค่าเฉลี่ยสูงสุด ได้แก่ ตัวแปรรูปแบบตัวลวง: แบบสอบที่มีรูปแบบตัวลวงจากคำตอบผิดของนักเรียน (.550) และวิธีการตรวจให้คะแนน: แบบสอบที่มีวิธีการตอบและตรวจให้คะแนนด้วยวิธีของอนันต์ ศรีโศภา (.693)
dc.description.abstractalternative	The purpose of this research was to synthesize research about the quality of multiple-choice tests, comprising of reliability, validity, item difficulty and power of discrimination with different test formats, test contexts, scoring methods and test type by using Hunter’s meta-analysis method. Ninety six research reports to be synthesized consisted of 92 experimental research and 4 correlational research, published in 1968-1999 from three libraries: namely Chulalongkorn university, Srinakharinwirot university and Kasetsart university. The database of this research consisted of 1,837 standard indices with were divided into 777 reliability coefficients, 461 validity coefficients, 283 indices of item difficulty and 316 indices of power of discrimination. The data analysis employed descriptive statistic and ANOVA. The research results were as follows: 1. The average of reliability was .713 and the minimum and maximum were .020 and .999 respectively. The moderator variables which could significantly explain the difference in reliability and its value that gave maximum subgroup means were as follow: choice arrangement -high to low value arrangement (.775); item ordering -low to high learning behavior ordering (.818); scoring method -optimal scoring weighted by examinee’s ability (.951) and test type -completion type.(.852) 2. The average of validity was .528 and the minimum and maximum were .005 and .950 respectively. The moderator variables which could significantly explain the difference in validity and its value that gave maximum subgroup means were as follow: distractor construction -student’s mistake based distractor (.691); item ordering -low to high learning behavior ordering (.623) and scoring method -Samran Mejang’s scoring method.(.756) 3. The average of item difficulty was .474 and the minimum and maximum were .100 and .871 respectively. The moderator variables which could significantly explain the difference in item difficulty and its value that gave middle subgroup means were as follow: item ordering -random ordering (.484); scoring method -scoring adjusted for guessing (.497) and test type -multiple-choice type.(.538) 4. The average of power of discrimination was .433 and the minimum and maximum were .125 and .838 respectively. The moderator variables which could significantly explain the difference in power of discrimination and its value that gave maximum subgroup means were as follow: distractor construction -student’s mistake based, distractor (.550) and scoring method -Anan Srisopa’s scoring method.(.693)
dc.language.iso	th	en_US
dc.publisher	จุฬาลงกรณ์มหาวิทยาลัย	en_US
dc.relation.uri	http://doi.org/10.14457/CU.the.2001.262
dc.rights	จุฬาลงกรณ์มหาวิทยาลัย	en_US
dc.subject	ข้อสอบแบบเลือกตอบ	en_US
dc.subject	การวิเคราะห์อภิมาน	en_US
dc.title	การวิเคราะห์อภิมานคุณภาพของแบบสอบหลายตัวเลือก ที่มีรูปแบบของแบบสอบบริบทของแบบสอบ และวิธีการตรวจให้คะแนนต่างกัน	en_US
dc.title.alternative	A meta-analysis of the quality of multiple-choice tests with different test formats, test contexts and scoring methods	en_US
dc.type	Thesis	en_US
dc.degree.name	ครุศาสตรมหาบัณฑิต	en_US
dc.degree.level	ปริญญาโท	en_US
dc.degree.discipline	การวัดและประเมินผลการศึกษา	en_US
dc.degree.grantor	จุฬาลงกรณ์มหาวิทยาลัย	en_US
dc.email.advisor	Nonglak.W@chula.ac.th
dc.email.advisor	Sirichai.K@Chula.ac.th
dc.identifier.DOI	10.14457/CU.the.2001.262