Measuring dependencies between discrete variables
You can pivot, or cross-tabulate two discrete variables. You measure counts, or absolute frequencies, of each combination of pairs of values of the two variables. You can compare the actual with the expected values in the table. So, what are the expected values? You start with the null hypothesis again—there is no association between the two variables you are examining. For the null hypothesis, you would expect that the distribution of one variable is the same in each class of the other variable, and the same as the overall distribution in the dataset. For example, if you have half married and half single people in the dataset, you expect such a distribution for each level of education. The tables where you show the actual and the expected frequencies are called contingency tables.
Contingency tables show you only visual dependencies. The numerical measure for the association of two discrete variables is the chi-squared value. You calculate...