A Comparison of Random-Split and Bootstrap Methods for Adjusting P-Values of High-Dimensional Regression Coefficients

บงกชพร เนาวนัติ

dc.contributor.advisor	วิฐรา พึ่งพาพงศ์	en_US
dc.contributor.author	บงกชพร เนาวนัติ	en_US
dc.contributor.other	Chulalongkorn University. Faculty of Commerce and Accountancy	en_US
dc.date.accessioned	2015-06-24T06:45:56Z
dc.date.available	2015-06-24T06:45:56Z
dc.date.issued	2556	en_US
dc.identifier.uri	http://cuir.car.chula.ac.th/handle/123456789/43950
dc.description	Thesis (M.Sc.)--Chulalongkorn University, 2556 B.E. (2013)	en_US
dc.description.abstract	This research aims to study and compare approaches to choosing between the Random Split method and the bootstrap method for adjusting the p-values of high-dimensional regression coefficients, and to study and compare the variable-selection performance of the two methods when used for that adjustment. The comparison criteria are the number of false positives, the number of false negatives, and the number of nonzero regression coefficients identified by hypothesis tests on the individual coefficients. The data were simulated with ratios of sample size to number of independent variables of 10:20, 10:50, 10:100, 100:200, 100:500, 100:1,000, 200:400, 200:1,000 and 200:2,000, with the number of true nonzero coefficients equal to 0.1, 0.25 and 0.45 times the sample size, and with correlation levels among the independent variables of 0, 0.5 and 0.9. Comparing the number of false positives, the results show that data splitting with the Random Split method adjusts the p-values of high-dimensional regression coefficients more efficiently than data splitting with the bootstrap method. In terms of the number of false negatives and the number of nonzero regression coefficients identified by the individual hypothesis tests, however, data splitting with the bootstrap method is more efficient than the Random Split method in most cases.	en_US
dc.description.abstractalternative	The objective of this research is to study and compare the Random-Split and Bootstrap methods for p-value adjustment in high-dimensional regression, including the efficiency of variable selection under each method. Three criteria are used for comparison: the number of false positives, the number of false negatives, and the number of nonzero coefficients. The data were simulated under several scenarios in which the ratio of sample size to the number of independent variables is 10:20, 10:50, 10:100, 100:200, 100:500, 100:1,000, 200:400, 200:1,000 and 200:2,000, the number of true nonzero coefficients is 0.1, 0.25 and 0.45 times the sample size, and the correlation level among the independent variables is 0, 0.5 and 0.9. In terms of the number of false positives, the simulation results show that data splitting with the Random-Split method is more efficient than the Bootstrap method for p-value adjustment in high-dimensional regression. In terms of the number of false negatives and the number of nonzero coefficients, however, the Bootstrap method is overall more efficient than the Random-Split method.	en_US
dc.language.iso	th	en_US
dc.publisher	Chulalongkorn University	en_US
dc.relation.uri	http://doi.org/10.14457/CU.the.2013.1403
dc.rights	Chulalongkorn University	en_US
dc.subject	Random sets
dc.subject	Statistical analysis
dc.title	A Comparison of Random-Split and Bootstrap Methods for Adjusting P-Values of High-Dimensional Regression Coefficients	en_US
dc.title.alternative	A COMPARISON ON P-VALUE ADJUSTMENT BETWEEN RANDOM – SPLIT AND BOOTSTRAP METHODS IN HIGH DIMENSIONAL REGRESSION	en_US
dc.type	Thesis	en_US
dc.degree.name	Master of Science	en_US
dc.degree.level	Master's Degree	en_US
dc.degree.discipline	Statistics	en_US
dc.degree.grantor	Chulalongkorn University	en_US
dc.email.advisor	vitara@cbs.chula.ac.th	en_US
dc.identifier.DOI	10.14457/CU.the.2013.1403
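
The abstract compares the two methods by the number of false positives, the number of false negatives, and the number of coefficients declared nonzero. A minimal sketch of how such counts could be tallied for one simulated data set is given below. It is not taken from the thesis: the function name `selection_errors`, the 0.05 significance level, and the toy inputs are all assumptions made for illustration, and the step that produces the adjusted p-values (Random-Split or Bootstrap) is not shown.

```python
def selection_errors(true_beta, adjusted_pvalues, alpha=0.05):
    """Count false positives, false negatives, and selected coefficients.

    true_beta        -- true coefficients used to simulate the data
    adjusted_pvalues -- one adjusted p-value per coefficient
    alpha            -- significance level (0.05 is an assumption here,
                        not a value stated in the abstract)
    """
    if len(true_beta) != len(adjusted_pvalues):
        raise ValueError("need one adjusted p-value per coefficient")

    false_positives = 0
    false_negatives = 0
    selected_count = 0
    for beta, p_value in zip(true_beta, adjusted_pvalues):
        is_selected = p_value <= alpha   # coefficient declared nonzero
        is_nonzero = beta != 0           # coefficient truly nonzero
        if is_selected:
            selected_count += 1
        if is_selected and not is_nonzero:
            false_positives += 1
        if is_nonzero and not is_selected:
            false_negatives += 1
    return false_positives, false_negatives, selected_count


# Hypothetical toy input: five coefficients, two of them truly nonzero.
toy_beta = [1.5, 0.0, -2.0, 0.0, 0.0]
toy_pvalues = [0.01, 0.03, 0.20, 0.90, 1.00]
toy_counts = selection_errors(toy_beta, toy_pvalues)
```

In a simulation study of this kind, such counts would typically be averaged over repeated data sets for each scenario before the methods are compared.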