การศึกษาความสัมพันธ์ระหว่างสัมประสิทธิ์สหพันธ์กับค่าทดสอบไคสแควร์ โดยการจำลองแบบ

วีณา เตชะพนาดร

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/25793

Title:	การศึกษาความสัมพันธ์ระหว่างสัมประสิทธิ์สหพันธ์กับค่าทดสอบไคสแควร์ โดยการจำลองแบบ
Other Titles:	A simulation study on the relationship between correlation coefficient and chi-square
Authors:	วีณา เตชะพนาดร
Advisors:	อนุชิต ล้ำยอดมรรคผล
Other author:	จุฬาลงกรณ์มหาวิทยาลัย. บัณฑิตวิทยาลัย
Issue Date:	2529
Publisher:	จุฬาลงกรณ์มหาวิทยาลัย
Abstract:	การทดสอบสมมติฐานที่นักวิจัยทางสังคมศาสตร์นิยมใช้กันมากวิธีหนึ่งคือ การทดสอบความเป็นอิสระระหว่างสองตัวแปร โดยการบันทึกค่าความถี่ลงในตารางแจกแจงความถี่สองทางหรือที่เรียกว่า ตารางการณ์จร ค่าสถิติสำหรับการทดสอบจะอาศัยผลรวมของค่าผลต่างกำลังสองระหว่างความถี่จากการทดลองและความถี่ตามข้อสมมติฐาน หารด้วยความถี่ตามข้อสมมติฐาน ค่าสถิติสำหรับการทดสอบนี้จะมีการแจกแจงโดยประมาณเป็นไคสแควร์ภายใต้ข้อสมมติฐานดังกล่าว ค่าสถิติสำหรับการทดสอบดังกล่าว จะนำมาสรุปผลการทดสอบสมมติฐานว่าควรยอมรับหรือปฏิเสธสมมติฐานว่าสองตัวแปรเป็นอิสระต่อกัน โดยอาศัยการเปรียบเทียบกับค่าวิกฤติที่กำหนดขึ้นโดยระดับนัยสำคัญและค่าชั้นความเป็นอิสระเท่านั้น การสรุปผลโดยวิธีการดังกล่าวนี้ยังเป็นข้ออภิปรายกันโดยทั่วไปว่าเหมาะสมหรือไม่เพียงใด ค่าสถิติเพื่อการทดสอบที่ได้จะสามารถนำไปอธิบายขนาดของความสัมพันธ์เชิงเส้นตรงระหว่างสองตัวแปรกรณีที่มีการแจกแจงแบบต่อเนื่องได้หรือไม่ ขนาดตัวอย่าง ขนาดตาราง และลักษณะการแบ่งกลุ่มข้อมูลจะมีผลต่อการแจกแจงของค่าสถิติเพื่อการทดสอบหรือไม่ ข้ออภิปรายต่างๆ นี้อาจจะสังเกตได้จากที่ได้มีการคำนวณค่าสถิติเพิ่มเติม เพื่อให้เกิดความน่าเชื่อถือในข้อสรุปมากยิ่งขึ้น เช่น การคำนวณค่าสัมประสิทธิ์ความมีเงื่อนไข ที่นำขนาดตัวอย่างมาประกอบการพิจารณา c = √w / w+n และค่าเครเมอร์ วี ที่นำทั้งขนาดตัวอย่างและขนาดตารางมาพิจารณา คือ v² = w / n . min (r-l, c-l) เมื่อ w คือ ค่าสถิติเพื่อการทดสอบ n คือ ขนาดตัวอย่าง r คือ จำนวนกลุ่มแบ่งตามแนวนอน c คือ จำนวนกลุ่มแบ่งตามสดมภ์ เพื่อตอบปัญหาดังกล่าว จึงกำหนดการศึกษาโดยใช้วิธีการจำลองแบบซึ่งจะผลิตข้อมูลเชิงสุ่มสองชนิดคือ ตัวแปรปกติสองตัวแปร และตัวแปรพหุนามสองตัวแปร ตัวอย่างกำหนดจะศึกษาในขนาด 20 30 40 50 75 และ 100 ขนาดตารางระดับ 2x2 2x3 2x4 2x5 3x3 3x4 3x5 4x4 4x5 และ 5x5 ในกรณีตัวแปรปกติสองตัวแปรจะกำหนดให้มีค่าสัมประสิทธิ์สหสัมพันธ์เฉพาะค่าบวก ตั้งแต่ 0.00 ถึง 0.98 โดยแบ่งช่วงดังนี้ 0.00 ถึง 0.40 จะมีความห่าง 0.02 0.40 ถึง 0.60 จะมีความห่าง 0.01 และ 0.60 ถึง 0.98 จะมีความห่าง 0.02 กรณีการแจกแจงแบบปกติสองตัวแปร ได้ศึกษาอิทธิพลของขนาดตัวอย่าง ขนาดตารางและการจัดแบ่งกลุ่มข้อมูล ที่มีต่อค่าทดสอบไคสแควร์ ถ้าพิจารณาในแง่ของการทดสอบสมมติฐาน พบว่าขนาดตัวอย่างและขนาดตารางจะทำให้ระดับนัยสำคัญจากการจำลองแบบสูงกว่าระดับนัยสำคัญจากทฤษฎี ในแต่ละขนาดตัวอย่างจะได้ค่าวิกฤติจากการจำลองแบบแตกต่างกัน ณ ระดับนัยสำคัญ และขั้นแห่งความเป็นอิสระเดียวกัน ถ้าพิจารณาในแง่ของความสัมพันธ์ระหว่างค่าคาดหวังของสัมประสิทธิ์สหสัมพันธ์กับค่าทดสอบไคสแควร์ พบว่าเมื่อขนาดตัวอย่างและขนาดตารางเพิ่มขึ้นค่าคาดหวังของสัมประสิทธิ์สหสัมพันธ์จะลดลง ส่วนการจัดกลุ่มข้อมูลที่แตกต่างกันจะทำให้ค่าเฉลี่ยและความแปรปรวนของค่าไคสแควร์แตกต่างกันในแต่ละกลุ่มด้วย กรณีตัวแปรมีการแจกแจงแบบพหุนามสองตัวแปร และตัวแปรทั้งสองเป็นอิสระต่อกันพบว่าในทุกขนาดตัวอย่างและขนาดตาราง ค่าเฉลี่ยของค่าไคสแควร์จากการจำลองแบบจะสอดคล้องกับค่าเฉลี่ยตามทฤษฎี ส่วนค่าความแปรปรวนมีแนวโน้มจะมีค่าน้อยกว่าค่าตามทฤษฎี เมื่อขนาดตัวอย่างและขนาดตารางเพิ่มขึ้น สำหรับระดับนัยสำคัญจากการจำลองแบบจะต่ำกว่าระดับนัยสำคัญตามทฤษฎี แต่ถ้าตัวแปรทั้งสองมีความสัมพันธ์กัน พบว่าค่าเฉลี่ยและความแปรปรวนของค่าไคสแควร์จะเพิ่มขึ้นเมื่อขนาดตัวอย่างเพิ่มขึ้น และระดับนัยสำคัญจากการจำลองแบบจะสูงกว่าระดับนัยสำคัญตามทฤษฎีในทุกขนาดตัวอย่าง เฉพาะกรณีการแจกแจงแบบปกติสองตัวแปร จากค่าไคสแควร์ที่คำนวณได้จะสามารถบอกช่วงความเชื่อมั่นและค่าคาดหวังของสัมประสิทธิ์สหสัมพันธ์ได้จากตารางที่สร้างขึ้นสำหรับขนาดตัวอย่างและขนาดตารางที่กำหนดในการศึกษา นอกจากนี้ถ้าขนาดตัวอย่างแตกต่างออกไปจากขนาดตัวอย่างที่ใช้ศึกษาครั้งนี้ อาจประมาณค่าคาดหวังของสัมประสิทธิ์สหสัมพันธ์ได้จากการแทนค่าขนาดตัวอย่างลงในสมการความถดถอยเชิงเส้นอย่างง่ายที่มีค่าคงที่ (a) และสัมประสิทธิ์ความถดถอย (b) ณ ค่าไคสแควร์ และขนาดตารางที่ต้องการศึกษาที่สร้างขึ้นเพื่อนำไปใช้ได้ทันที
Other Abstract:	One of the test of hypothesis which is popular to the Sociologist is the test of independent between two variables. This method can be applied by recording the frequencies in the two way frequency distribution or which is called the contingency table. The statistical value for the test will be the total of the squared difference between the frequency from the test and the frequency of the assumption devided by the frequency of the assumption. The statistical value for this test will approximately distribute as chi-square distribution under the above assumptions. The statistical value will be concluded the test of hypothesis whether the result will be accept or reject the independent hypothesis of the two variables. This can be done only by comparing with the critical value which is determined only by level of significance and the degrees of freedom. The solution from this method is still debatable whether it is appropriate or not. It is doubted that the statistical value which is the result can explain linear relationship between two continuous variables, a case of the objectors. The size of samples, tables and the classification of data whether or not will effect the suitabilits of the test. These issue can be regcognized by more statistical calculation so that it can be reliable. For example, the calculation of contingency coefficient which bring sample size to the consideration expressed by c = √w / w+n and the Cramer’s V which bring both sample sizes and size of tables to the consideration expressed by คือ v² = w / n . min (r-l, c-l) where w is the statistical value for the test n is sample sizes r is member of data classify by row c is member of data classify by column For answering the problem, we determine the study by simulation method which will generate random numbers which are bivariate normal and bivariate multinomial. The sample sizes have been set up for studying are 20, 30, 40, 50, 75 and 100 and sizes of table level are set up at 2x2 2x3 2x4 2x5 3x3 3x4 3x5 4x4 4x5 and 5x5. In case of bivariate normal, we shall determine only absolute value correlation coefficient from 0.00 to 0.98 by classify level 0.00 to 0.40 with difference 0.02, 0.40 to 0.60 with difference 0.01 and 0.60 to 0.98 with difference 0.02. In case of the bivariate normal distribution the influence to the value of Chi-square of the sample sizes, sizes of table and classification data has been studied. In the aspect of testing hypothesis, it is found that sample sizes and sizes of table make simulation significance level higher than theoretical significance level and simulation critical value differs from each other sample sizes at the same significance level and the same degrees of freedom. In the case of identifying the relation between correlation coefficient and the value of Chi-square, it is found that, when sample size and size of table increase, the expected value of correlation coefficient will diminish. Different grouping in some cases of study will make the means and variances of Chi-square values differ in each grouping. In case of the independent multinomial distribution, it is found that all sample size and all size of table, the means of Chi-square values are consistent with of theoretical value and the variances tend to be less than the theoretical values. As when the sample size and size of table increases, the values of simulation significance level are likely to be less than theoretical significance levels. But, if both variables are correlated, it is found that the mean and variance of Chi-square value tend to increase. When sample size increases, the simulation significance levels appear to be higher than theoretical significance levels in each size. Only in the case of bivariate normal, Chi-square value calculated from models and table mentioned above may be used to indicate the confidence interval and the expected value of the correlation coefficient. Moreover, if the size of sample varie, estimation of the expected value of correlation coefficient may also be found from the simple linear regression equation which has the constant value (a) and regression coefficient (b) provided at the specified values of Chi-square and size of table.
Description:	วิทยานิพนธ์ (สต.ม.)--จุฬาลงกรณ์มหาวิทยาลัย, 2529
Degree Name:	สถิติศาสตรมหาบัณฑิต
Degree Level:	ปริญญาโท
Degree Discipline:	สถิติ
URI:	http://cuir.car.chula.ac.th/handle/123456789/25793
ISBN:	9745667552
Type:	Thesis
Appears in Collections:	Grad - Theses

Files in This Item:

File	Size	Format
Weena_Ta_front.pdf	546.93 kB	Adobe PDF	View/Open
Weena_Ta_ch1.pdf	329.93 kB	Adobe PDF	View/Open
Weena_Ta_ch2.pdf	520.67 kB	Adobe PDF	View/Open
Weena_Ta_ch3.pdf	584.25 kB	Adobe PDF	View/Open
Weena_Ta_ch4.pdf	1.16 MB	Adobe PDF	View/Open
Weena_Ta_ch5.pdf	366.67 kB	Adobe PDF	View/Open
Weena_Ta_back.pdf	6.26 MB	Adobe PDF	View/Open

Show full item record