BIOMEDICAL STATISTICS-SIMPLIFIED

Guide

Data types, test selection, p values, and 95% confidence intervals.

This page explains the concepts used by the dashboard so the results are easier to interpret.

Download companion book

BASIC BIOSTATISTICS is a colorful PDF handbook with index pages and short chapters on descriptive statistics, analytic statistics, and sample size calculations. Author: Manjunath Kulkarni, Nephrologist, Mangaluru, India.

Download BASIC BIOSTATISTICS

Types of data

Choosing the test starts with understanding the variable type.

Nominal data are categories without numeric order, such as blood group, dialysis modality, or outcome status.
Ordinal data have order but not equal spacing, such as symptom grades or staged severity labels.
Continuous data are numeric measurements such as hemoglobin, creatinine, age, BMI, or blood pressure.

Nominal tests

Use contingency tables when comparing frequencies across categories.

Chi-square tests independence between categorical variables.
For a 2x2 table with small expected counts, Fisher exact is preferable or should be reviewed alongside chi-square.
For 2x2 tables, the odds ratio with a 95% confidence interval provides a clinically useful point estimate.

Continuous tests

Normality helps determine whether a parametric or non-parametric summary is more suitable.

Shapiro-style testing and QQ plots help judge whether a group is approximately normal.
Student t test compares two approximately normal groups and yields a mean difference with 95% confidence interval.
Mann-Whitney compares two groups when a rank-based non-parametric approach is more suitable.
ANOVA compares three or more approximately normal groups, while Kruskal-Wallis is the non-parametric alternative.

Correlation and regression

Use paired numeric data when asking whether two variables move together.

Pearson correlation measures linear association.
Spearman correlation measures rank-based monotonic association.
Simple linear regression estimates the expected change in the outcome for a one-unit change in the predictor.

What a p value means

Interpret p values carefully and in context.

A p value is the probability of seeing data this extreme, or more extreme, if the null hypothesis were true.
A small p value suggests the observed data are less compatible with the null hypothesis.
A large p value does not prove there is no effect; it may reflect limited sample size or variability.

What a 95% confidence interval means

Confidence intervals provide more context than p values alone.

A 95% confidence interval gives a range of plausible values for the population effect under the model used.
For ratios such as odds ratios, an interval crossing 1 often suggests no clear association.
For differences such as mean difference, an interval crossing 0 often suggests no clear difference.
Narrower confidence intervals suggest greater precision.