• Nem Talált Eredményt

8.2 Interval estimations, confidence intervals

9.1.4 F-test

9.1.4 F-test

An important chapter in the investigation of relations between h random variables is based on the analysis of standard deviation or variance (see ANOVA, Section 9.1.6).

During its application we usually have to assume the normal distribution of h random variables and the equality of their standard deviations. In statistics the application of the F-test is considerably widespread.

Consider the normally distributed and independent random variablesXandY. Their expected values and standard deviations σX := D(X), σY := D(Y) are unknown. We wish to decide about the hypothesis H0 : σXY at a significance level of 1−α. We choose the two-sided hypothesis H1 : σX 6=σY as alternative hypothesis, in other cases the one-sided hypothesis H10Y < σX (or H100X < σY).

It is known that in case the null hypothesis holds, the test statistic F := M SSX

M SSY

has an F-distribution with parameters nX −1 , nY −1, where nX and nY are the two sample sizes (for details see [VIN][page 125]. (The notation F is used locally for the test statistic.)

The question of critical regions is more complicated than before, partly because the density function of the test statistic F is not symmetrical (cf. Fig. 9.7).

We keep from the previous practice – at a significance level of 1−αand seemingly in a straightforward way – that we specify such critical values a1 and a2 as boundaries of the region (interval) of acceptance that the area under the density function on the ”left of”a1 be equal to the area on the ”right of”a2 (cf. Fig.9.7):

P(F < a1) =P(a2 < F) = α 2 (here we ignore the notation of the parameters for simplicity).

Finally, when testing the hypothesis at a significance level of 1−αwe in fact examine if the observation

f := mssX mssY

for the random variable F falls into the region of acceptance (a1, a2). However, it is known from the literature that the relation a2 = a1

1 happens to be valid (which implies

Figure 9.7: Graph of the probability density function of the test statistic F of a given parameter with indication of the critical values a1 and a2 = a1

1 at a significance level of 1−α.

and its reciprocal is greater than a2”. From this follows that the probability of this latter event isα.

So, a good decision strategy for the originally two-sided hypothesis may be the fol-lowing one-sided test. Suppose that the ratio which is greater than 1 exceeds the value Ff1,f2, where f1, f2 are the parameters belonging to the numerator and denominator of the ratio, respectively. Then the hypothesis H0 : σX = σY is rejected against the alternative hypothesis H1 : σX 6= σY in an actually one-sided test test at a significance level of 1−α, otherwise we accept it. The critical value Ff1,f2 can be looked up from the corresponding table (Table III). For example, if f1 = 5, f2 = 7, then at a significance level of α = 0.05 the critical value is F5,7,0.05= 3.97.

In a one-sided test, if the significance level is 1−α and the null hypothesis is for instance H00X < σY, and the alternative hypothesis is H10Y < σX, then the routine procedure is as follows. If mssY < mssX, then we rejectH00 and acceptH10. However, if mssX ≤mssY, then we accept H00 if the ratio mssmssY

X

exceeds the critical valueFf1,f2 in Table III, where the parameters f1 and f2 belong to Y and X, respectively.

The F-test is by far not a robust test, i.e., it cannot be applied to distributions significantly different from a normal one.

Example 9.6. 10female and 8male cockroaches of the same age were deprived of food, and their times of survival x and y (in days) were measured. The mean survival times were x = 8.5 and y = 4.8. For the observation of the females the corrected empirical

variance was mssx = 3.6, while for the males mssy = 0.9. By assuming approximately normal distribution of the survival times within each sex (!), let us make a two-sided F-test at a significance level of 1−α = 0.95 in order to check the null hypothesis that the standard deviations are the same, i.e., H0 : σX = σY by the two-sided alternative hypothesis H1x 6=σy.

Solution: The critical value depends on whether the relation 1 < mss

y

mssx or 1 < mssmssx y

holds. Since now the latter one is true, the pair of parameters fx, fy is 9, 7. The corresponding critical value for a one-sided test and for 1−α = 0.95 is F9,7,0.05 = 3.68 (see Table III).

The observed value of the test statistic is 3.60.9 = 4.0. The latter one exceeds the critical value of 3.68. According to this, we reject the null hypothesis that the standard deviations are the same at the given significance level 0.95.

If the null hypothesis and the alternative hypothesis were H00x < σy, H10y < σx then due to mssy < mssx, we would at once reject H00 and accept H10. However, for the hypotheses H000 : σy < σx, H100 : σx < σy at the same significance level H000 would be accepted, since mssy < mssx and mssmssx

y = 4.0>3.68.

Example 9.7. When examining the glutamic acid concentration in the urine of two primate species (gorilla (x) and chimpanzee (y)), the results are contained in Table 9.3.

sample size (number of individuals)

n

corrected empirical variance

mss

chimpanzee nx = 37 mssx = 0.0107

gorilla ny = 6 mssy = 0.124

Table 9.3: Some intermediate results of the calculation when applying the F-test, see Example 9.7.

We assume the normality of the measurements within each species. Let us check the null hypothesis that the standard deviations are the same for the two species by using the two-sided F-test at a significance level of 1−α= 0.99 against the alternative hypothesis H1x 6=σy.

Solution: We can find the required critical value at the pair of parameters (5, 36).

According to Table III, F5,36,0.01 = 3.60. The observation with regard to the test statistic F is: f = 0.01070.124 = 11.589. This value is greater than the critical value 3.60. Therefore, at a significance level of 0.99, we reject the null hypothesis that the standard deviations are equal.