Introduction to Biostatistics and Epidemiology

A.Y. 2025/2026
9
Max ECTS
60
Overall hours
SSD
MED/01
Language
English
Learning objectives
The purpose of the course Introduction to Biostatistics and Epidemiology is to develop student's critical ability to analyze biomedical data through the interaction with biomedical researchers. The course consists of two main sections aiming at introducing students to a general understanding of the analysis of medical data. In the epidemiology section students will learn the principles of planning observational studies and the study of the relationship between exposure and health in the population. In the biostatistics section, the focus will be the application of statistical methods to biology and medicine, through the evaluation of diagnostic procedures and the application of statistical inference and regression models to the planning and analysis of clinical studies.
Expected learning outcomes
The acquisition of the abilities related to Biostatistics for a data scientist, contributes to the formation of a methodological habitus suitable to integrate, in daily practice, the clinical knowledge coming from biomedical researchers with the evidence framework provided by the inferential reasoning.
The student is expected to understand the principles of study design, measures of disease frequency and measures of risk, the concept of causality, bias, confounding, effect modification and latency period, the evaluation of diagnostic tests, precision and accuracy, reference limits.
The student is also expected to understand and use inferential statistics through hypothesis testing, p-values and confidence intervals, the difference between statistical significance and clinical importance, correlation and regression, the difference between parametric and non-parametric tests, categorical tests (Chi-square and Fisher), sample size calculations and statistical power, adjustment for multiple tests, censored data and the use of regression models in biosciences.
Single course

This course cannot be attended as a single course. Please check our list of single courses to find the ones available for enrolment.

Course syllabus and organization

Single session

Responsible
Lesson period
Third four month period
Course syllabus
The program is composed of 4 interconnected modules.
TOPIC 1: Module of Epidemiology (3 CFU) (Risk and prognostic factors)
Most diseases are not the product of a single cause, but a complex concatenation multiple causes. In vitro and in animals' experiments are not always adequate to identify all the factors that may be responsible for a disease or accelerate its adverse outcome. On the other hand, the identification of determinants of diseases cannot, for obvious ethical reasons, be based on studies involving the experimental exposure of humans to suspected risk factors. Hence, we recognize the need for planned observational studies that do not require the intervention on the subject, but due to their nature, make it complex to interpret associations between exposure to risk factors and disease occurrence in terms of cause effect relationship.
The purpose of this module is to develop the student ability to carefully evaluate the role and importance of risk and prognostic factors, by learning the principles of planning observational studies. With this aim the students will develop skills to evaluate the measures of disease occurrence and association between risk factors and occurrence of the disease, or between prognostic factors and outcome of the disease, with particular attention to the problem of confounding.
- Introductory Lectures (10 hours):
· Measure of disease occurrence: prevalence and incidence. (2h)
· Types of epidemiologic studies and measures of association between risk factors and disease: relative risks, odds ratios, risks differences. (6h)
· Type of bias, including confounding and effect modification. (2h)
- Practical session (6 hours plus 2 hours of discussion of the final projects): Group work to disentangle and discuss the essential elements of a study, including design, inclusion/exclusion criteria, exposure and outcome assessment, statistical analysis, main results of the analysis, strengths and limitations of the study. Each group (up to four participants) may select one article from those provided by the instructor, covering cross-sectional, case-control, and cohort study designs.
- Take home points: revising theory after the practice (2 hours): Critical evaluation of study designs and risk of bias in the epidemiological practice.

TOPIC 2: Biomedical test evaluation: metrics and applications (1 CFU, 6 hours)
Screening and diagnostic tests are important tools for identifying individuals at risk of developing a disease or confirming a diagnosis based on clinical suspicion. These tests play a crucial role in the early detection of medical conditions, where timely intervention can greatly improve health outcomes.
However, these tests are not error-free, and enhancing diagnostic accuracy remains a central goal of biomedical research. As new technologies emerge, it is increasingly important to assess their performance and compare them to established standards. A comprehensive understanding of both the strengths and limitations of diagnostic tools is essential for evidence-based decision-making in clinical and public health settings.
This module provides students with the theoretical foundations and practical skills necessary to evaluate the accuracy and clinical utility of screening and diagnostic tests. Students will work with real data and use the statistical software R to calculate the diagnostic metrics introduced in lectures. Guided assignments will offer hands-on experience in test evaluation and data-driven decision-making.
Topics covered:
· Introduction to screening and diagnostic tests
· Reference intervals and test thresholds
· Measures of diagnostic accuracy: sensitivity and specificity
· Predictive values, pre- post-test probabilities: application of Bayes' theorem , the Fagan's Nomogram as a clinical tool. (2h)
· ROC curves: assessing diagnostic accuracy across different cut-off points

TOPIC 3: Drawing conclusions from samples (2 CFU, 14 hours)
Health science research relies on data collected from samples, even though the ultimate goal is to make informed decisions that impact the entire population of interest. This is possible through statistical inference, a process that allows researchers to draw conclusions about a population based on sample data. However, relying on samples instead of the entire population introduces uncertainty in the sample estimates, known as sampling error. Understanding and managing this uncertainty is essential for drawing valid conclusions on disease burden, the efficacy of a potential treatments, or public health interventions.
This module provides students with the knowledge and skills to manage uncertainty in biomedical studies through lectures and guided practical sessions with the statistical software R.
Topics covered:
· Estimation techniques: point and interval estimates
· The logic of hypothesis testing: the power and the sample size. Statistical significance and clinical relevance.
· Comparing group means: hypothesis testing with continuous data
· Comparing proportions: hypothesis testing with categorical data
· Nonparametric tests

TOPIC 4: Introduction to the use of regression models in medicine (3 CFU)
Regression models for different outcomes: the analysis of change and the linear regression model; the case-control study and logistic regression; the cohort study and the study of incidence using Cox regression. (10h)
Regression in context: descriptive, predictive or causal purpose? (4h)
Variable selection and the true model myth. (6h)
Prerequisites for admission
No prerequisites.
Teaching methods
Synchronous learning: Lectures as well as use of the AMBOSS platform (Epidemiology, Statistical analysis of data) with audio-video lectures available online. Group works will be used for practical activities.
Teaching Resources
TEXT BOOKS:
· Interpretation and Uses of Medical Statistics" by Daly and Bourke (5th edition). Available online in Unimi library.
· Marcello Pagano, Kimberlee Gauvreau, "Principles of Biostatistics", 2000, Duxbury Press. Available online in Unimi library.
· R. Bonita, R. Beaglehole, T. Kjellström, "Basic epidemiology". 2nd edition. World Health Organization (free)
· J. Mark Elwood, "Critical Appraisal of Epidemiological Studies and Clinical Trials", 3rd Edition, Oxford University Press
· Douglas G. Altman, "Practical statistics for medical research". Chapman and Hall
SUPPLEMENTAL LEARNING MATERIAL:
Additional material, in particular online content, will be made available during the course.
J. B. Carlin and M. Moreno-Betancur, " On the Uses and Abuses of Regression Models: A Call for Reform of Statistical Practice and Teaching," Statistics in Medicine 44, no. 13-14 (2025): e10244, https://doi.org/10.1002/sim.10244
Assessment methods and Criteria
The exam is a written test.
The written test will be based on the Moodle platform with multiple items questions and short answers numerical questions. The grades are on a scale of 30 and a minimum of 18/30 is required to pass the written test. In the event of a full grade (30/30) honors (lode) may be granted.
Registration to the exam through SIFA is mandatory.
MED/01 - MEDICAL STATISTICS - University credits: 9
Lessons: 60 hours
Professor(s)
Reception:
On appointment (email)
Laboratorio di Statistica Medica, Biometria ed Epidemiologia "G.A. Maccacaro", Via Celoria 22, Milano
Reception:
For meetings, please write an email.
via Celoria, 22, 20133 Milano