The workshop will take place tentatively on August 14, 2023, on Zoom and Youtube Livestreaming

About The Data Science Summer School

The Data Science Summer School is a series of theoretical and practical workshops on the exciting methods and technologies currently employed by industry, government, and civil society to address the world's most complex problems today. It is organized by the Hertie School Data Science Lab with funding and support from the Hertie School and the Dieter Schwarz Foundation

Workshop Details

This workshop focuses on the very beginnings of a great journey ahead of you: learning how to use and be comfortable with the statistical programming language R. Together we will explore the basics, from the working environment itself, creating functions for simplifying your tasks, to data management with the tidyverse package. The overarching goal of the workshop is for you to receive the necessary skill set that will enable you to soon embark on your own data science adventures. That being said, the most important aspect of the workshop will be to have fun along the way so that your journey can begin as smoothly and easily as possible.

To do list before the class

Intruction on how to install R

Content Licensing

All workshop materials and recording are under Creative Commons Attribution-NonCommercial-ShareAlike 2.0 license. You are free to share — copy and redistribute the material in any medium or format, and adapt — remix, transform, and build upon the material. However, you must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. You may not use the material for commercial purposes. If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.

Workshop Materials


Fabio Votta
Fabio Votta

Fabio Votta is a second-year Ph.D. student in Political Communication at the University of Amsterdam. His dissertation focuses on the impact of microtargeted political advertisements on citizens and society. He is also very passionate about reproducible open science, data visualization, and extracting insights from data. In line with his enthusiasm for learning and teaching R he is very active in the R community and has contributed with a series of R packages, tutorials and blog posts

Schedule (Central European Summer Time - CEST)

R basics

Operators and objects


Exercise I

Data frames and access

Short break

R packages

Intro to tidyverse

Tidy data manipulation

Exercise II

Best practices/open questions

Session Recording