Scoring

degrees of freedom

Degrees of freedom (df) is a statistical concept referring to the number of independent observations minus the number of parameters being estimated. In any data set, all values can be freely selected except the last, which is determined. Degrees of freedom are essential for calculating significan...

What Are Degrees of Freedom?

Degrees of freedom (df) is a fundamental statistical concept that refers to the number of independent values in a dataset that are free to vary when calculating a statistic. In any dataset of n observations where the mean is known, only n − 1 values can vary freely — the final value is mathematically determined by the others and the constraint imposed by the mean. Degrees of freedom appear throughout the statistical methods used to evaluate polygraph research, and understanding the concept is essential for interpreting published validation studies.

Why Degrees of Freedom Matter

Degrees of freedom serve a critical function in statistical testing: they determine which probability distribution should be used to evaluate whether an observed result is statistically significant. For example, when a researcher conducts a t-test comparing the accuracy of two polygraph scoring methods, the degrees of freedom determine the shape of the t-distribution used to calculate the p-value. With fewer degrees of freedom (smaller samples), the distribution has heavier tails, meaning a larger observed difference is required to reach statistical significance.

This relationship between sample size, degrees of freedom, and statistical power is particularly important in polygraph research, where obtaining large samples of confirmed cases (cases with independently verified ground truth) is often difficult and expensive.

Common Applications in PDD Research

T-Tests

When comparing mean accuracy rates between two techniques or two scoring methods, the independent-samples t-test uses df = n₁ + n₂ − 2, where n₁ and n₂ are the sample sizes of the two groups. A paired t-test (e.g., comparing two algorithms applied to the same set of charts) uses df = n − 1.

Chi-Square Tests

The test">chi-square test is frequently used in polygraph research to compare classification outcomes (DI, NDI, INC) across techniques or examiner groups. Here, df = (rows − 1) × (columns − 1). For a simple 2 × 2 comparison (e.g., deceptive vs. truthful × correct vs. incorrect), df = 1.

Regression and Discriminant Analysis

In discriminant analysis and logistic regression — the statistical methods underlying many computerised scoring algorithms — degrees of freedom are consumed by each predictor variable (physiological feature) included in the model. Adding more features without a proportional increase in sample size risks overfitting, where the model performs well on the training data but poorly on new cases.

Interpreting Research with Degrees of Freedom

When reading a published polygraph validation study, the reported degrees of freedom provide immediate insight into the study’s statistical power. A study reporting t(14) has only 16 participants — potentially insufficient for detecting moderate differences between techniques. A study reporting χ²(1) = 12.5 with df = 1 indicates a straightforward two-group comparison with a strong effect. Understanding these values helps practitioners and researchers evaluate the reliability of published findings and the strength of the evidence supporting different validated techniques.

For access to published polygraph validation research, visit the polygraph research database.