Introduction to statistic with R [ISR]
Context | Training cours |
Organisation | 5 * 4 hours of lecture and exercice with R |
Material | Slides, exercices, data and R-scripts |
Introduction to descriptive and parametric statistics for univariate and multivariate data with the software R.
1. Descriptive statistic for univariate and bivariate data
- Repartition of the data (histogram, kernel density, distribution function)
- Order statistic and quantile
- Statistics for location and variability, boxplot
- Scatter plot, QQplot
- Covariance and correlation
- Simple linear regression
2. Descriptive statistic for multivariate data
- Least squares and multiple linear and non-linear regression models
- Principal component analysis and principal component regression
- Clustering methods (K-means, hierarchical, density-based)
- Linear discriminant analysis
- Bootstrap technique
- Artificial neural network
3. Parametric statistic
- Notion of Likelihood
- Estimator: Definition and main properties (bias, convergence)
- Punctual estimate (maximum likelihood estimation, Bayesian estimation)
- Confidence and credible intervals
- Information criteria
- Test of hypothesis
- Parametric clustering
-
ID Montgomery and G Runger: Applied statistics and probability for engineers, Wiley, 2014.
-
P Congdon: Bayesian statistical modelling (2nd edition), Wiley, 2006.
-
G Saporta: Probabilités, analyse des données et statistique, BMS, 2010.
-
R codes (uni and multivariate data analysis)
-
The R project for statistical computing: r-project.org
-
R statistics tutorials: youtube.com/watch?v=eDrhZb2onWY
-
Python & R codes for machine learning algorithms: analyticsvidhya.com
-
R vs Python: blog.dominodatalab.com