Diff for "FAQ/ChiTrend" - CBU statistics Wiki
location: Diff for "FAQ/ChiTrend"
Differences between revisions 11 and 35 (spanning 24 versions)
Revision 11 as of 2006-08-17 14:19:46
Size: 2700
Editor: PeterWatson
Comment:
Revision 35 as of 2013-08-28 10:47:50
Size: 4338
Editor: PeterWatson
Comment:
Deletions are marked like this. Additions are marked like this.
Line 1: Line 1:
== Linear trend test on proportions ==
Line 11: Line 13:
The chi-square testing the presence of a linear trend is outputted by SPSS CROSSTABS as the Linear-by-Linear association term. The lack of fit is the difference between the Pearson chi-square value and the linear-by-linear one. The chi-square testing the presence of a linear trend is outputted by SPSS CROSSTABS as the Linear-by-Linear association term ( also called the Mantel-Haenszel statistic).

Linear-by-linear association = $$r^text{2} (N-1)$$

where r is the Pearson correlation of the rows (correct/incorrect) with the columns (group) and N is the total sample size. Bruce Weaver has shown that provided all expected cell counts are greater than 1 the Linear-by-Linear association is the most powerful preferred chi-square for 2x2 tables (see [[https://sites.google.com/a/lakeheadu.ca/bweaver/Home/statistics/notes/chisqr_assumptions | here]]).

In particular for a 2x2 table Bruce shows that the linear-by-linear chi-square has the special form equal to N(ad-bc)^2 / (mnrs)

where:
* N is the total number of observations
* a, b, c, and d are the observed counts in the 4 cells
* ^2 means "squared"

* m, n, r, s are the 4 marginal totals

For a 2x2 table (only) the regular Pearson chi-square (e.g., in the output from statistical software), can be converted to the 'N - 1' chi-square as follows:

{{{
'N -1' chi-square = Linear-by-Linear chi-square = Pearson chi-square x (N -1) / N
}}}
 

The lack of fit is the difference between the Pearson chi-square value and the linear-by-linear one.
Line 19: Line 43:
So there is a linear trend providing a reasonable fit. So there is a linear trend providing a reasonable fit. 
Line 22: Line 46:
Denoting the time points by –1,0 and 1 and regressing these on the observed proportions correct give regression estimates of the above linear trend. Denoting the time points by –1,0 and 1 and regressing these on the observed proportions correct give regression estimates of the above linear trend. The Pearson chi-square lack of fit term is (O-E)*(O-E)/E where O are observed table frequencies and E are expected frequencies using the expected proportions from the linear regression.
Line 28: Line 52:

You can also compare linear trends of proportions in [[FAQ/poly| SPSS LOGISTIC.]]

'''References:'''

Agresti, A (2013) Categorical Data Analysis. Third Edition. Wiley:New York. Pages 86-87 mention the above testing for linear trend.

Everitt, BS and Wykes T.(1999) A Dictionary for Psychologists. Arnold:London. (See page 31).

Linear trend test on proportions

A more powerful form of chi-square specifically tests for a linear trend in proportions across groups. For example, proportion remembered correctly using a memory aid.

Example

Time 1

Time 2

Time 3

Correct

3

6

10

Incorrect

9

6

2

Does the proportion correct change linearly over time?

The chi-square testing the presence of a linear trend is outputted by SPSS CROSSTABS as the Linear-by-Linear association term ( also called the Mantel-Haenszel statistic).

Linear-by-linear association = $$r^text{2} (N-1)$$

where r is the Pearson correlation of the rows (correct/incorrect) with the columns (group) and N is the total sample size. Bruce Weaver has shown that provided all expected cell counts are greater than 1 the Linear-by-Linear association is the most powerful preferred chi-square for 2x2 tables (see here).

In particular for a 2x2 table Bruce shows that the linear-by-linear chi-square has the special form equal to N(ad-bc)^2 / (mnrs)

where: * N is the total number of observations * a, b, c, and d are the observed counts in the 4 cells * ^2 means "squared"

* m, n, r, s are the 4 marginal totals

For a 2x2 table (only) the regular Pearson chi-square (e.g., in the output from statistical software), can be converted to the 'N - 1' chi-square as follows:

'N -1' chi-square = Linear-by-Linear chi-square = Pearson chi-square x (N -1) / N

The lack of fit is the difference between the Pearson chi-square value and the linear-by-linear one.

Model

Chi-square

Df

p-value

Linear

7.96

1

0.005

Lack of Fit

0.29

1

0.130

Total

8.25

2

0.004

(Pearson Chi-square)

So there is a linear trend providing a reasonable fit.

Denoting the time points by –1,0 and 1 and regressing these on the observed proportions correct give regression estimates of the above linear trend. The Pearson chi-square lack of fit term is (O-E)*(O-E)/E where O are observed table frequencies and E are expected frequencies using the expected proportions from the linear regression.

Observed proportion correct

0.33

0.50

0.83

Expected proportion correct

0.30

0.55

0.80

(Fitting a linear trend)

You can also compare linear trends of proportions in SPSS LOGISTIC.

References:

Agresti, A (2013) Categorical Data Analysis. Third Edition. Wiley:New York. Pages 86-87 mention the above testing for linear trend.

Everitt, BS and Wykes T.(1999) A Dictionary for Psychologists. Arnold:London. (See page 31).

None: FAQ/ChiTrend (last edited 2013-08-28 10:47:50 by PeterWatson)