Course description:
“It is easy to lie with statistics; it is easier to lie without them.” /Frederick Mosteller/
Nowadays, thanks to digitalization, we are bombarded with vast amounts of data and information. It is often difficult to navigate through this information and decide which data and statistical values we can trust and which we cannot. The aim of this course is to provide a statistical foundation for data analysis. We will move from simple descriptive analyses to advanced regression models. An important feature of the course is that there are no black boxes. We look at each methodology with an appropriate depth of mathematical statistics background and then apply them in practice. A secondary goal of the course is to enable participants to apply the methods they have learned in an appropriate IT environment. To achieve this goal, we use R and RStudio software throughout the course. We use R not only for calculations, but also for testing various theories through simulation. R is the market- leading software for data analysis alongside Python. This allows participants to gain competitive data analysis knowledge and skills in the course. Participants are requested to bring their own laptops to the sessions.
Topics: