15.7.9.1 Missing Values for Cell-Defining Variables ¶
For variables that divide tables into cells, per-variable category
options, as described in Per-Variable Category Options,
determine which data is analyzed. If any of the categories for such a
variable would exclude a case, then that case is not included.
As an example, consider the following entirely artificial dataset, in
which ‘x’ and ‘y’ are categorical variables with missing
value 9, and ‘z’ is scale:
Data List
x |
y |
z |
1 |
1 |
1 |
1 |
2 |
10 |
1 |
9 |
100 |
2 |
1 |
1000 |
2 |
2 |
10000 |
2 |
9 |
100000 |
9 |
1 |
1000000 |
9 |
2 |
10000000 |
9 |
9 |
100000000 |
|
Using ‘x’ and ‘y’ to define cells, and summarizing ‘z’,
by default PSPP omits all the cases that have ‘x’ or ‘y’ (or both)
missing:
CTABLES /TABLE x > y > z [SUM].
Custom Tables
|
|
|
|
|
Sum |
x |
1 |
y |
1 |
z |
1 |
2 |
z |
10 |
2 |
y |
1 |
z |
1000 |
2 |
z |
10000 |
|
If, however, we add CATEGORIES
specifications to include
missing values for ‘y’ or for ‘x’ and ‘y’, the output
table includes them, like so:
CTABLES /TABLE x > y > z [SUM] /CATEGORIES VARIABLES=y MISSING=INCLUDE.
CTABLES /TABLE x > y > z [SUM] /CATEGORIES VARIABLES=x y MISSING=INCLUDE.
Custom Tables
|
|
|
|
|
Sum |
x |
1 |
y |
1 |
z |
1 |
2 |
z |
10 |
9 |
z |
100 |
2 |
y |
1 |
z |
1000 |
2 |
z |
10000 |
9 |
z |
100000 |
Custom Tables
|
|
|
|
|
Sum |
x |
1 |
y |
1 |
z |
1 |
2 |
z |
10 |
9 |
z |
100 |
2 |
y |
1 |
z |
1000 |
2 |
z |
10000 |
9 |
z |
100000 |
9 |
y |
1 |
z |
1000000 |
2 |
z |
10000000 |
9 |
z |
100000000 |
|