Clinical resources

NTOG Study Design & Simulation Builder

Training tool for moving from clinical idea to PICOTS, codebook, simulated data, design checks, and starter R code.

This tool is for training and early study design. It does not replace protocol development, ethics review, data permissions, statistical review, or clinical judgement.

Author: Heidi Andersén, MD, PhD — Docent, Tampere University; Clinical Lecturer, University of Turku · Clinical lead, Finnish lung cancer registry · Last reviewed 10 June 2026

Training notice

Prototype version: v0.9. Educational use only.

This educational prototype supports early study design training. It helps users practise how to structure a research question, define variables, identify common design conflicts, and separate simulated teaching examples from real study analysis.

This tool teaches:

This tool is for education and early study design only. It does not replace protocol development, ethics review, data permissions, statistical review, patient-level data governance, or clinical judgement.

Do not enter patient-level, identifiable, confidential, or unpublished sensitive data.

Study concept form

Variable codebook section

No preset loaded yet.
Format: variable_name | label | type | role | coding | registry_source | registry_variable | transformation_rule

Analysis plan section

Endpoint text is exported to JSON. Use a matching variable for data exports.

Estimand & modern design considerations

Define what you are estimating before choosing how to analyse it. These are standard expectations for contemporary lung cancer study design.

Estimand framework (ICH E9(R1))

State the estimand as five attributes, so the objective and the analysis match:

Non-proportional hazards & RMST

The hazard ratio and the log-rank test assume proportional hazards. With immunotherapy, survival curves often separate late and plateau, or cross — the assumption fails, and a single HR (and a sample size powered on it) can mislead. Consider restricted mean survival time (RMST), landmark or milestone survival, or weighted log-rank approaches, and power the study accordingly.

Registry / observational designs: target trial emulation

For non-randomised, registry-based questions, specify the protocol of the target trial you are emulating — eligibility, treatment strategies, assignment, time zero, outcome, and estimand. This is the standard way to avoid time-zero and immortal-time bias in registry studies.

Education and early design only; this does not replace formal statistical review.

Study design checks

Design checks will update as assumptions change.
What these checks mean

Warnings are teaching prompts. They do not approve or reject a real study.

Power scenario section

Formula

required_events = ((z_alpha + z_beta)^2) / (p1 * p2 * log(HR)^2). Teaching estimate only. Not final statistical justification.

Generate outputs

Cite this tool: Andersén HH. NTOG Study Design & Simulation Builder (v0.9). Zenodo; 2026. doi:10.5281/zenodo.20632738. Licensed under CC BY-NC 4.0.