Ph.D students will be brought, starting from the simpler concepts of probability and statistics, to enough advanced methods and techniques.
In this way the course aims to cover students with different/non-homogeneous entry knowledge level.
A final exam (homework type) will be organized as learning assessment.
The course will be provided on the Zoom platform because of the Ph.D. students of the Dottorato Nazionale are geographically spread.
Basic concept of the theory of Probability. Axiomatic probability and the role of Bayes theorem.
Histograms: sampling and binning. Hystograms' comparison: absolute and relative normalization, stacked plots, data-to-simulation comparison, data-to-data comparison.
Histograms ratio and uncertainties.
Probability density functions and their features. Joint and conditional probabilities.
Dependence and correlation between observables. Covariance matrix. Variance propagation.
Generation of distributions. Binomial distribution and efficiency. Stochastic (Poissonian) processes and applicability of the Poissonian distribution.
Gaussian function and its role in the Central Limit Theorem. Gaussian resolution function.
Other important distributions (Crystal Ball, Breit-Wigner, chi-squared).
Hypothesis testing: test statistics, discrimination of signal against background, ROC curva and choice of a suitable Working Point.
Point estimation theory. Maximum Likelihood fitting, binned and unbinned, extended.
Symmetric and asymmetric uncertainties, Profile Likelihood.
Fitting tasks within a Jupyter notebook. Background modelization with different polynomia; sidebands subtraction method.
Python framework and Jupyter notebook. Uproot and RDataFrame to handle big data.
Extraction of a physical signal from big data with a classical cut-based selection; evaluation of signal significance, signal purity and signal-to-noise ratio.
Extraction of physical signal from big data with a machine-learning approach (XGBoost) optimized by Optuna. Comparison with the cut-based selection. [U. Sozbilir]
Note: all items are covered by hands-on examples/exercises - executed on Google COLAB platform - borrowed by High Energy Physics best practices.
July 7th: 2 hours (15-17)
July 8th: 5 hours (10-13, 15-17)
July 9th: 4 hours (11-13, 15-17)
July 10th: 5 hours (10-13, 15-17)
July 11th: 4 hours (11-13, 15-17)
July 16th: 4 hours (9.30-11.30 / 12.00-14.00) [with U.Sozbilir]
Zoom coordinates will be the same and sent by email the first day of lessons.