Forest Plot Maker

Created:March 5, 2026

Last Updated:March 5, 2026

Create publication-ready forest plots for meta-analysis and systematic reviews. Visualize effect sizes with confidence intervals, pooled estimates, and heterogeneity statistics across multiple studies.

Data Format Tips

Your data needs four columns at minimum:

Study column:Study identifiers (e.g., "Smith et al. 2020")
Effect size column: Point estimates (OR, RR, HR, MD, or SMD)
Lower CI column: Lower bound of the 95% confidence interval
Upper CI column: Upper bound of the 95% confidence interval

Optionally, include a weight column. If not provided, inverse-variance weights are calculated automatically from the confidence intervals.

Not sure how to format your data? (10 studies with odds ratios) to see how it works!

Calculator

1. Load Your Data

2. Select Columns & Options

Study Column *

Effect Size Column *

Lower CI Column *

Upper CI Column *

Weight Column (optional)

Effect Measure

Show Pooled Effect

Show Weights

Null Value

1 for ratio measures (OR/RR/HR), 0 for differences (MD/SMD)

Related Calculators

Kaplan-Meier Survival Analysis

Cox Regression

Odds Ratio / Effect Size Calculator

Logistic Regression

Learn More

What is a Forest Plot?

A forest plot is a graphical display of estimated results from multiple studies in a meta-analysis. Each study is represented by a square (effect size) with a horizontal line extending on both sides (confidence interval). A diamond at the bottom typically shows the pooled (combined) effect across all studies.

The name "forest plot" comes from the visual appearance of the plot, which resembles a forest of lines. It was first used in medical research to synthesize findings from randomized controlled trials but is now widely used across social sciences, education, ecology, and business research.

How to Read a Forest Plot

Squares: Represent the effect size (point estimate) for each study. Larger squares indicate studies with greater weight (typically more participants or precision).
Horizontal lines: Show the 95% confidence interval for each study. Shorter lines mean more precise estimates.
Vertical dashed line: The line of no effect (1 for ratio measures like OR/RR/HR, 0 for difference measures like MD/SMD). If a CI crosses this line, the result is not statistically significant.
Diamond: The pooled (combined) effect from all studies. The center is the pooled estimate, and the width represents the pooled CI.
Weights (%): How much each study contributes to the pooled result. Larger, more precise studies have higher weights.

Understanding Heterogeneity

Heterogeneity measures how much variation exists between study results beyond what would be expected by chance.

I² Statistic

0-25%: Very low heterogeneity
25-50%: Low heterogeneity
50-75%: Moderate heterogeneity
>75%: High heterogeneity

Cochran's Q Test

Tests whether observed differences between studies are due to chance. A significant p-value (typically < 0.10) suggests true heterogeneity exists.

When Heterogeneity is High

Consider: subgroup analysis, meta-regression, using a random-effects model, or examining potential sources of variation (study design, population, intervention).

Effect Size Measures Explained

OR (Odds Ratio)

Ratio of odds of outcome in treatment vs. control group. Used in case-control studies. Null value = 1.

RR (Risk Ratio)

Ratio of risk of outcome in treatment vs. control group. Used in cohort studies and RCTs. Null value = 1.

HR (Hazard Ratio)

Ratio of hazard rates between groups from survival analysis. Used in time-to-event studies. Null value = 1.

MD (Mean Difference)

Difference in means between groups on the original scale. Used when studies use the same outcome measure. Null value = 0.

SMD (Standardized Mean Difference)

Difference in means divided by pooled standard deviation. Used when studies use different scales to measure the same outcome. Also known as Cohen's d or Hedges' g. Null value = 0.

Common Uses of Forest Plots

Medical research: Synthesize results from clinical trials comparing treatments
Epidemiology: Combine risk estimates from observational studies
Public health: Evaluate effectiveness of interventions across populations
Psychology: Summarize effect sizes across behavioral studies
Education: Compare educational intervention outcomes across schools/districts
Cochrane Reviews: The gold standard for evidence-based medicine uses forest plots extensively

How to Create a Forest Plot in R

Using the metafor package to create a forest plot.

library(metafor)

# Sample meta-analysis data
dat <- data.frame(
  study = c("Smith 2018", "Johnson 2019", "Williams 2019",
            "Brown 2020", "Davis 2020", "Miller 2021",
            "Wilson 2021", "Moore 2022", "Taylor 2022", "Anderson 2023"),
  yi = log(c(1.52, 0.85, 1.23, 1.78, 1.05, 1.45, 0.92, 1.67, 1.31, 1.12)),  # log OR
  lower = log(c(1.12, 0.62, 0.89, 1.21, 0.76, 1.05, 0.68, 1.15, 0.95, 0.82)),
  upper = log(c(2.06, 1.16, 1.70, 2.62, 1.45, 2.00, 1.25, 2.42, 1.81, 1.53))
)

# Calculate SE from CI
dat$sei <- (dat$upper - dat$lower) / (2 * 1.96)

# Fit random-effects model
res <- rma(yi = yi, sei = sei, data = dat, method = "REML",
           slab = study)

# Create forest plot
forest(res,
       atransf = exp,         # Back-transform to OR scale
       xlab = "Odds Ratio",
       refline = 0,           # null line at log(1) = 0
       header = c("Study", "OR [95% CI]"),
       cex = 0.9)

# Add heterogeneity info
text(-4, -1, pos = 4, cex = 0.8,
     bquote(paste("I"^2, " = ", .(formatC(res$I2, digits = 1, format = "f")), "%",
                  ", p = ", .(formatC(res$QEp, digits = 3, format = "f")))))

How to Create a Forest Plot in Python

Using Matplotlib and Seaborn to create a forest plot.

Python

import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

# Sample meta-analysis data
studies = [
    "Smith et al. (2018)", "Johnson et al. (2019)", "Williams et al. (2019)",
    "Brown et al. (2020)", "Davis et al. (2020)", "Miller et al. (2021)",
    "Wilson et al. (2021)", "Moore et al. (2022)", "Taylor et al. (2022)",
    "Anderson et al. (2023)"
]
odds_ratios = [1.52, 0.85, 1.23, 1.78, 1.05, 1.45, 0.92, 1.67, 1.31, 1.12]
lower_ci = [1.12, 0.62, 0.89, 1.21, 0.76, 1.05, 0.68, 1.15, 0.95, 0.82]
upper_ci = [2.06, 1.16, 1.70, 2.62, 1.45, 2.00, 1.25, 2.42, 1.81, 1.53]

n = len(studies)
y_pos = np.arange(n)

sns.set_theme(style="whitegrid")
fig, ax = plt.subplots(figsize=(10, max(5, 0.6 * n)))

# CI lines
for i in range(n):
  ax.plot([lower_ci[i], upper_ci[i]], [y_pos[i], y_pos[i]],
      color="#333", linewidth=1.5)

# Effect size points
ax.scatter(odds_ratios, y_pos, s=80, color="#1565C0", marker="s",
       label="Study OR", zorder=3)

# Null line (OR = 1)
ax.axvline(x=1, linestyle="--", color="gray")

ax.set_xscale("log")
ax.set_xlabel("Odds Ratio")
ax.set_title("Forest Plot - Odds Ratios")
ax.set_yticks(y_pos)
ax.set_yticklabels(studies)
ax.invert_yaxis()
ax.grid(axis="y", visible=False)

plt.tight_layout()
plt.show()

Frequently Asked Questions

What data do I need for a forest plot?

At minimum, you need the study name, effect size (OR, RR, HR, MD, or SMD), and 95% confidence interval bounds for each study. Optionally, you can provide study weights.

Why is the x-axis on a log scale for OR/RR/HR?

Ratio measures (OR, RR, HR) are asymmetric around 1. For example, OR=2 and OR=0.5 are equally "far" from the null in opposite directions, but on a linear scale they appear different distances from 1. A log scale makes them symmetric.

Fixed-effect vs. random-effects model?

This tool uses a fixed-effect (inverse-variance weighted) model. If heterogeneity is high (I²> 50%), a random-effects model may be more appropriate. For formal meta-analysis, consider using R's metafor or Python's statsmodels packages.

What does the diamond represent?

The diamond shows the pooled (combined) effect estimate across all studies. The center of the diamond is the pooled point estimate, and its horizontal extent shows the 95% confidence interval for the pooled effect.

How are weights calculated?

If no weight column is provided, weights are calculated using the inverse-variance method: w = 1/SE², where SE is derived from the confidence interval. Studies with narrower CIs (more precision) receive higher weights.

Why do square sizes differ?

Square sizes are proportional to study weights. Larger squares represent studies that contribute more to the pooled estimate, typically because they have larger sample sizes or more precise estimates.

Forest Plot Maker

Created:March 5, 2026

Last Updated:March 5, 2026

Data Format Tips

Your data needs four columns at minimum:

Study column:Study identifiers (e.g., "Smith et al. 2020")
Effect size column: Point estimates (OR, RR, HR, MD, or SMD)
Lower CI column: Lower bound of the 95% confidence interval
Upper CI column: Upper bound of the 95% confidence interval

Optionally, include a weight column. If not provided, inverse-variance weights are calculated automatically from the confidence intervals.

Not sure how to format your data? (10 studies with odds ratios) to see how it works!

Calculator

1. Load Your Data

2. Select Columns & Options

Study Column *

Effect Size Column *

Lower CI Column *

Upper CI Column *

Weight Column (optional)

Effect Measure

Show Pooled Effect

Show Weights

Null Value

1 for ratio measures (OR/RR/HR), 0 for differences (MD/SMD)

Learn More

What is a Forest Plot?

How to Read a Forest Plot

Squares: Represent the effect size (point estimate) for each study. Larger squares indicate studies with greater weight (typically more participants or precision).
Horizontal lines: Show the 95% confidence interval for each study. Shorter lines mean more precise estimates.
Vertical dashed line: The line of no effect (1 for ratio measures like OR/RR/HR, 0 for difference measures like MD/SMD). If a CI crosses this line, the result is not statistically significant.
Diamond: The pooled (combined) effect from all studies. The center is the pooled estimate, and the width represents the pooled CI.
Weights (%): How much each study contributes to the pooled result. Larger, more precise studies have higher weights.

Understanding Heterogeneity

Heterogeneity measures how much variation exists between study results beyond what would be expected by chance.

I² Statistic

0-25%: Very low heterogeneity
25-50%: Low heterogeneity
50-75%: Moderate heterogeneity
>75%: High heterogeneity

Cochran's Q Test

Tests whether observed differences between studies are due to chance. A significant p-value (typically < 0.10) suggests true heterogeneity exists.

When Heterogeneity is High

Consider: subgroup analysis, meta-regression, using a random-effects model, or examining potential sources of variation (study design, population, intervention).

Effect Size Measures Explained

OR (Odds Ratio)

Ratio of odds of outcome in treatment vs. control group. Used in case-control studies. Null value = 1.

RR (Risk Ratio)

Ratio of risk of outcome in treatment vs. control group. Used in cohort studies and RCTs. Null value = 1.

HR (Hazard Ratio)

Ratio of hazard rates between groups from survival analysis. Used in time-to-event studies. Null value = 1.

MD (Mean Difference)

Difference in means between groups on the original scale. Used when studies use the same outcome measure. Null value = 0.

SMD (Standardized Mean Difference)

Difference in means divided by pooled standard deviation. Used when studies use different scales to measure the same outcome. Also known as Cohen's d or Hedges' g. Null value = 0.

Common Uses of Forest Plots

Medical research: Synthesize results from clinical trials comparing treatments
Epidemiology: Combine risk estimates from observational studies
Public health: Evaluate effectiveness of interventions across populations
Psychology: Summarize effect sizes across behavioral studies
Education: Compare educational intervention outcomes across schools/districts
Cochrane Reviews: The gold standard for evidence-based medicine uses forest plots extensively

How to Create a Forest Plot in R

Using the metafor package to create a forest plot.

library(metafor)

# Sample meta-analysis data
dat <- data.frame(
  study = c("Smith 2018", "Johnson 2019", "Williams 2019",
            "Brown 2020", "Davis 2020", "Miller 2021",
            "Wilson 2021", "Moore 2022", "Taylor 2022", "Anderson 2023"),
  yi = log(c(1.52, 0.85, 1.23, 1.78, 1.05, 1.45, 0.92, 1.67, 1.31, 1.12)),  # log OR
  lower = log(c(1.12, 0.62, 0.89, 1.21, 0.76, 1.05, 0.68, 1.15, 0.95, 0.82)),
  upper = log(c(2.06, 1.16, 1.70, 2.62, 1.45, 2.00, 1.25, 2.42, 1.81, 1.53))
)

# Calculate SE from CI
dat$sei <- (dat$upper - dat$lower) / (2 * 1.96)

# Fit random-effects model
res <- rma(yi = yi, sei = sei, data = dat, method = "REML",
           slab = study)

# Create forest plot
forest(res,
       atransf = exp,         # Back-transform to OR scale
       xlab = "Odds Ratio",
       refline = 0,           # null line at log(1) = 0
       header = c("Study", "OR [95% CI]"),
       cex = 0.9)

# Add heterogeneity info
text(-4, -1, pos = 4, cex = 0.8,
     bquote(paste("I"^2, " = ", .(formatC(res$I2, digits = 1, format = "f")), "%",
                  ", p = ", .(formatC(res$QEp, digits = 3, format = "f")))))

How to Create a Forest Plot in Python

Using Matplotlib and Seaborn to create a forest plot.

Python

import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

# Sample meta-analysis data
studies = [
    "Smith et al. (2018)", "Johnson et al. (2019)", "Williams et al. (2019)",
    "Brown et al. (2020)", "Davis et al. (2020)", "Miller et al. (2021)",
    "Wilson et al. (2021)", "Moore et al. (2022)", "Taylor et al. (2022)",
    "Anderson et al. (2023)"
]
odds_ratios = [1.52, 0.85, 1.23, 1.78, 1.05, 1.45, 0.92, 1.67, 1.31, 1.12]
lower_ci = [1.12, 0.62, 0.89, 1.21, 0.76, 1.05, 0.68, 1.15, 0.95, 0.82]
upper_ci = [2.06, 1.16, 1.70, 2.62, 1.45, 2.00, 1.25, 2.42, 1.81, 1.53]

n = len(studies)
y_pos = np.arange(n)

sns.set_theme(style="whitegrid")
fig, ax = plt.subplots(figsize=(10, max(5, 0.6 * n)))

# CI lines
for i in range(n):
  ax.plot([lower_ci[i], upper_ci[i]], [y_pos[i], y_pos[i]],
      color="#333", linewidth=1.5)

# Effect size points
ax.scatter(odds_ratios, y_pos, s=80, color="#1565C0", marker="s",
       label="Study OR", zorder=3)

# Null line (OR = 1)
ax.axvline(x=1, linestyle="--", color="gray")

ax.set_xscale("log")
ax.set_xlabel("Odds Ratio")
ax.set_title("Forest Plot - Odds Ratios")
ax.set_yticks(y_pos)
ax.set_yticklabels(studies)
ax.invert_yaxis()
ax.grid(axis="y", visible=False)

plt.tight_layout()
plt.show()

Frequently Asked Questions

What data do I need for a forest plot?

At minimum, you need the study name, effect size (OR, RR, HR, MD, or SMD), and 95% confidence interval bounds for each study. Optionally, you can provide study weights.

Forest Plot Maker

Data Format Tips

Calculator

1. Load Your Data

2. Select Columns & Options

Related Calculators

Kaplan-Meier Survival Analysis

Cox Regression

Odds Ratio / Effect Size Calculator

Logistic Regression

Learn More

What is a Forest Plot?

How to Read a Forest Plot

Understanding Heterogeneity

I2 Statistic

Cochran's Q Test

When Heterogeneity is High

Effect Size Measures Explained

OR (Odds Ratio)

RR (Risk Ratio)

HR (Hazard Ratio)

MD (Mean Difference)

SMD (Standardized Mean Difference)

Common Uses of Forest Plots

How to Create a Forest Plot in R

How to Create a Forest Plot in Python

Frequently Asked Questions

What data do I need for a forest plot?

Why is the x-axis on a log scale for OR/RR/HR?

Fixed-effect vs. random-effects model?

What does the diamond represent?

How are weights calculated?

Why do square sizes differ?

Forest Plot Maker

Data Format Tips

Calculator

1. Load Your Data

2. Select Columns & Options

Related Calculators

Kaplan-Meier Survival Analysis

Cox Regression

Odds Ratio / Effect Size Calculator

Logistic Regression

Learn More

What is a Forest Plot?

How to Read a Forest Plot

Understanding Heterogeneity

I2 Statistic

Cochran's Q Test

When Heterogeneity is High

Effect Size Measures Explained

OR (Odds Ratio)

RR (Risk Ratio)

HR (Hazard Ratio)

MD (Mean Difference)

SMD (Standardized Mean Difference)

Common Uses of Forest Plots

How to Create a Forest Plot in R

How to Create a Forest Plot in Python

Frequently Asked Questions

What data do I need for a forest plot?

Why is the x-axis on a log scale for OR/RR/HR?

Fixed-effect vs. random-effects model?

What does the diamond represent?

How are weights calculated?

Why do square sizes differ?

I² Statistic

I² Statistic