Comprehensive definitions of statistical terms, concepts, and methods.
The entire group of individuals, objects, or measurements of interest for a statistical study.
A subset of the population selected for study.
A characteristic or attribute that can be measured or categorized.
Categorical data without any inherent order.
Categorical data with a meaningful order but no fixed intervals.
Numeric data with meaningful intervals but no true zero.
Numeric data with meaningful intervals and a true zero.
The arithmetic average of a set of numbers, calculated by summing all values and dividing by the count of values.
x̄ = (Σx) / n
The middle value when data is arranged in order; with an even number of values, the average of the two middle values.
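As a minimal sketch of the mean and median definitions above, the following Python computes both by hand. The data values and the helper name `manual_median` are illustrative assumptions, not part of the glossary.

```python
from statistics import mean, median

values = [2, 4, 4, 5, 10]  # illustrative data, not from the source

# Mean: sum of all values divided by the count of values.
manual_mean = sum(values) / len(values)


def manual_median(data):
    """Return the middle value of the sorted data.

    With an even number of values, return the average of the two middle values.
    """
    ordered = sorted(data)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


print(manual_mean)            # 5.0
print(manual_median(values))  # 4
```

The standard library functions `statistics.mean` and `statistics.median` give the same results. The mean (5.0) is pulled above the median (4) by the single large value 10.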
A measure of variability that indicates the typical distance between data points and their mean, calculated as the square root of the variance.
s = √[Σ(x - x̄)² / (n-1)]
A method of collecting data from a sample in order to draw inferences about the population.
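The sample standard deviation formula s = √[Σ(x - x̄)² / (n-1)] can be sketched in Python as follows. The function name `sample_std` and the data are hypothetical, chosen only for illustration.

```python
import math
from statistics import stdev


def sample_std(data):
    """Sample standard deviation with the n - 1 denominator."""
    n = len(data)
    if n < 2:
        raise ValueError("need at least two values")
    x_bar = sum(data) / n
    # Sum of squared deviations from the mean.
    squared_dev = sum((x - x_bar) ** 2 for x in data)
    return math.sqrt(squared_dev / (n - 1))


values = [2, 4, 4, 4, 5, 5, 7, 9]  # illustrative data, not from the source
print(sample_std(values))
print(stdev(values))  # standard library equivalent
```

Both calls should agree up to floating-point rounding, since `statistics.stdev` also uses the n - 1 denominator.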
A controlled study where variables are manipulated to determine effects.
A study where data is collected without intervention, observing naturally occurring variables.
A numerical measure of the likelihood of an event occurring.
Two events are independent if the occurrence of one does not affect the probability of the other.
A formula for updating probabilities based on new information.
P(A|B) = [P(B|A) * P(A)] / P(B)
The theorem that the distribution of sample means approximates a normal distribution as the sample size increases, regardless of the shape of the population distribution (provided it has finite variance).
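Bayes' theorem, P(A|B) = [P(B|A) * P(A)] / P(B), can be illustrated with a small diagnostic-test calculation. All probabilities below are made-up inputs for the sketch, and the function name `bayes_posterior` is hypothetical.

```python
def bayes_posterior(prior_a, p_b_given_a, p_b_given_not_a):
    """Return P(A|B) using Bayes' theorem.

    P(B) is expanded with the law of total probability:
    P(B) = P(B|A) * P(A) + P(B|not A) * P(not A).
    """
    p_b = p_b_given_a * prior_a + p_b_given_not_a * (1 - prior_a)
    return p_b_given_a * prior_a / p_b


# Illustrative inputs (assumptions, not from the source):
# A = has the condition, B = test is positive.
posterior = bayes_posterior(
    prior_a=0.01,          # P(A): 1% prevalence
    p_b_given_a=0.95,      # P(B|A): true positive rate
    p_b_given_not_a=0.05,  # P(B|not A): false positive rate
)
print(round(posterior, 3))  # roughly 0.161
```

Even with an accurate test, the updated probability stays fairly low here because the prior P(A) is small.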
A symmetric, bell-shaped distribution characterized by its mean and standard deviation.
A measure of the asymmetry of a probability distribution.
A statistical method for making decisions using data, involving null and alternative hypotheses.
A range of values that likely contains the true population parameter with a specified level of confidence.
CI = point estimate ± (critical value × standard error)
A statistical test used to determine if there is a significant difference between the means of two groups.
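The confidence interval formula, CI = point estimate ± (critical value × standard error), can be sketched for a sample mean as below. This sketch assumes a normal (z) critical value for simplicity; for small samples a t critical value is usually more appropriate. The function name and data are hypothetical.

```python
import math
from statistics import NormalDist, mean, stdev


def mean_confidence_interval(data, confidence=0.95):
    """Approximate confidence interval for the mean, using a z critical value."""
    n = len(data)
    point_estimate = mean(data)
    # Standard error of the mean: sample standard deviation / sqrt(n).
    standard_error = stdev(data) / math.sqrt(n)
    # Two-sided critical value, e.g. about 1.96 for 95% confidence.
    critical_value = NormalDist().inv_cdf(0.5 + confidence / 2)
    margin = critical_value * standard_error
    return point_estimate - margin, point_estimate + margin


data = [4.8, 5.1, 5.0, 4.9, 5.2]  # illustrative data, not from the source
low, high = mean_confidence_interval(data)
print(low, high)
```

Raising `confidence` increases the critical value and therefore widens the interval.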
Analysis of Variance - a statistical test used to analyze differences among means of three or more groups.
A test for relationships between categorical variables, often used in contingency tables.
A false positive, rejecting the null hypothesis when it is actually true.
A false negative, failing to reject the null hypothesis when it is actually false.
The probability of correctly rejecting the null hypothesis (1 - Type II error probability).
A measure of the strength and direction of the relationship between two variables.
A method to predict a response variable based on one predictor variable.
y = a + bx
A regression model with multiple predictor variables to predict an outcome.
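The line y = a + bx from simple linear regression can be estimated in several ways. The sketch below assumes ordinary least squares, which the glossary does not name explicitly. The function name `fit_line` and the data are hypothetical.

```python
def fit_line(xs, ys):
    """Fit y = a + b*x by ordinary least squares; return (a, b)."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    # Slope: covariation of x and y divided by variation of x.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b = sxy / sxx
    # Intercept: the fitted line passes through (x_bar, y_bar).
    a = y_bar - b * x_bar
    return a, b


xs = [1, 2, 3, 4]  # illustrative predictor values
ys = [3, 5, 7, 9]  # illustrative responses lying exactly on y = 1 + 2x
a, b = fit_line(xs, ys)
print(a, b)  # 1.0 2.0
```

With the fitted coefficients, a prediction for a new x is simply `a + b * x`.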
A technique for reducing dimensionality of large data sets while retaining most variation.
A method for grouping data points into clusters based on similarities.
A measure of the strength or magnitude of an effect, useful for understanding practical significance.
An effect size measure for the difference between two means.
d = (M1 - M2) / SD, where M1 and M2 are the two group means and SD is typically the pooled standard deviation
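The effect size formula d = (M1 - M2) / SD can be sketched as follows. The glossary does not say which SD to use; this sketch assumes the pooled standard deviation of the two groups, a common convention. The function name `cohens_d` and the data are hypothetical.

```python
import math
from statistics import mean, variance


def cohens_d(group1, group2):
    """Difference between two means in units of the pooled standard deviation."""
    n1, n2 = len(group1), len(group2)
    # Pooled variance: sample variances weighted by their degrees of freedom.
    pooled_var = (
        (n1 - 1) * variance(group1) + (n2 - 1) * variance(group2)
    ) / (n1 + n2 - 2)
    return (mean(group1) - mean(group2)) / math.sqrt(pooled_var)


group1 = [2, 4, 6]  # illustrative data, not from the source
group2 = [1, 3, 5]
print(cohens_d(group1, group2))  # 0.5
```

Here both groups have a sample variance of 4, so the pooled SD is 2 and the one-unit difference in means gives d = 0.5.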