    This course focuses on the concepts and techniques that allow statisticians and data analysts to ensure, as far as possible, that analyses performed in R are reproducible.

    Reproducible research in statistical analysis is the idea that other people should be able to reproduce a published data analysis and obtain the same results ("other people" also includes the person who did the initial analysis, 6 months later). While most data analysts agree that reproducible research is important, very few of them take active steps to ensure that their own work is reproducible.


    The course will consist of three main parts:

    • Introduction (about 1 hour): why reproducible research is important, and what concrete problems we face
    • Revision control and Git (rest of the morning): how to keep track of the different versions of the files used during a data analysis
    • Preparing reports with knitr: how to prepare reports that automatically include the statistical analysis, while fully documenting the analysis and reducing the risk of manual errors


    Participants should have experience using R to analyze data. Participants should bring a laptop on which they are able to install custom applications. A list of required applications (including R, RStudio and Git) will be provided before the course.


    Schedule and location

    Schedule: Monday, January 26, 2015, 9:00 - 17:30

    Location: Lounge, Biozentrum Annex, Klingelbergstr. 61, Basel

    The Biozentrum Annex is across the road from the main building. The lounge is on the 13th floor. Directions can be found here.

    Additional information

    The course will be taught by Frédéric Schütz, PhD, from the Bioinformatics Core Facility at SIB, Lausanne.

    We will recommend 0.25 ECTS credit for this workshop.

