The course provides an introduction to modern statistics and data analysis. It addresses the question, “What should I do if these are my data and this is what I want to know”? The course adopts a model based, largely Bayesian, approach. It introduces the computational means and software packages to explore data and infer underlying parameters from them. An emphasis will be put on streamlining model specification and evaluation by leveraging probabilistic programming frameworks. The topics are exemplified by real-world applications drawn from across the sciences.
- Principled Data Analysis: signal model and error model, likelihood and priors
- Probability Distributions
- Generative Clustering and Classification
- Gaussian Processes
- Fitting your own model: gradient-based optimization
- Automatic differentiation
- Error Estimation
- Sampling Methods: MCMC and variants
- Advanced Sampling: Hamiltonian MC, ensemble and nested methods
- Hierarchical Models
- Likelihood-free Methods
- Hypothesis testing
Lecture notes with code examples will be made available online.
- Machine Learning: A Probabilistic Perspective
free to read online through the PU library