5. Computational Statistics Lectures

Author

Raymond Bisdorff, Emeritus Professor of Applied Mathematics and Computer Science

Copyright
  1. Bisdorff © 2013-2021

5.1. Introduction

From 2007 to 2011 the Algorithmic Decision Theory COST Action IC0602, coordinated by Alexis Tsoukiàs, gathered researchers coming from different fields such as Decision Theory, Discrete Mathematics, Theoretical Computer Science and Artificial Intelligence in order to improve decision support in the presence of massive data bases, combinatorial structures, partial and/or uncertain information and distributed, possibly interoperating decision makers.

A positive result a.o. of this COST action was the organisation from 2010 to 2020 of a Semester Course on Computational Statistics at the University of Luxembourg in the context of its Master in Information and Computer Science.

Below are gathered 2x2 reduced copies of the presentation slides for 8 Lectures from the Winter Semester 2019.

5.2. Lectures

  1. Generating random numbers for simulations

    1. On numbers “chosen at random”, computer generated random numbers and multiple recursive random number generators over F2.

    2. Recommendations and traps to watch for with home brewed generators.

    3. Combining random number generators and testing randomness.

  2. Introduction to statistical computing

    1. On generating simulation data with Python and exploring simulation data with gretl. Getting started with R. Introducing R objects: vectors, matrices, lists, data frames. Reading CSV data files into data frames.

    2. Doing linear algebra in R. Constructing matrice objects, matrix operations and inversion, solving linear systems, eigen-values and -vectors, singular value decomposition, Choleski and QR decompositions.

    3. Principal component analysis (PCA) and discrete Markov chain simulating.

  3. Continuous Random Variables

    1. Probability distributions in R-core, simulating a continuous uniform random distribution, the spectral test for random number generators.

    2. Simulating random variables by a continuous inverse transform, standard exponential law based generators.

    3. The Gaussian random variables, important properties, simulating Gaussian random varaibles.

  4. Simulating from Discrete Random Variables

    1. Simulating Bernoulli and Binomial random variables. The CLT for binomial distributions.

    2. Simulating a Poisson random variable and Poisson processes with exponential time intervals.

    3. Simulating Gamma variables, integer alpha parameter and the sum rule for Gamma Variables.

  5. Simulating from arbitrary empirical random distributions

    1. Single pass estimation of arbitrary quantiles: computing sample quantiles, quantiles via selecting algorithms, tracking the M-largest element in a single pass.

    2. Computing quantiles from binned data: equally binned observation data, linear integration formulas, regular binned data quantiles.

    3. Incremental quantiles estimation with the IQ-agent: using the IQ-agent for Monte-Carlo simulations.

  6. Two distributions, are they of the same kind?

    1. Methodology: comparing statistical distributions, methodological approach and statistical tests.

    2. Comparing histograms: Chi-square test against a known distribution, comparing two binned data sets, testing uniform randomness.

    3. Comparing continuous distributions with the Kolmogorov-Smirnov test.

  7. On Averaging

    1. The benfit from averaging: the law of large numbers, estimating distribution parameters, how to reduce noise.

    2. Convergence of the averaging: Convergence of the mean for a standrad Gaussian, and if there are outliers? Non convergence of a Cauchy mean.

    3. Comparing two empiric means: robustness of the t sdatistic, estimating t statistics, Monte Carlo simulation of the H0 rejection.

  8. Accept-Reject Simluation Methods

    1. Classical Monte Carlo Integration: principles and applications.

    2. Accept-reject simulation methods.

    3. Accept-reject simulation applications: pi estimation, Box-Muller transform, Ratio-Of-Uniforms method.