This site contains the materials for an R course run by Peter Mac.
The materials were adapted from the course run by the Bioinformatics Core at the Cancer Research UK Cambridge Institute (Contributions: Matthew Eldridge, Chandra Chilamakuri, Mark Fernandes, Ashley Sawle, Kamal Kishore, Sergio Martinez Cuesta, Rory Stark).
April-May 2022
R is one of the leading programming languages in Data Science and the most widely used within Peter Mac for interacting with, analyzing and visualizing cancer biology datasets. In this training, we aim to provide a friendly ‘next-steps’ R course for beginners who have completed an introductory R training and would like to consolidate their skills. It is an opportunity to build skills through supported practice, using cancer biology data, as part of a class.
By the end of this course you should be able to:
The course will be run over 4 sessions with the following structure:
Course setup
Installing R and RStudio
Week 1
Introduction to working with data in RStudio
Interacting with R using RStudio, importing and viewing data, generating and editing a reproducible report.
Live recap/Q&A session Tuesday April 26th 3.30-4.30pm
Week 2
Data visualization with ggplot2
A common grammar to create scatter plots, bar charts, boxplots, histograms and line graphs for time series data.
Live recap/Q&A session Tuesday May 3rd 3.30-4.30pm
Week 3
Data manipulation using dplyr
Filtering and modifying tabular data, computing summary values, faceting with ggplot2.
Live recap/Q&A session Tuesday May 10th 3.30-4.30pm
Week 4
Grouping, combining, and restructuring data for analysis
Advanced grouping and summarization operations, joining data from different tables, the concept of ‘tidy data’, pivoting and separating operations, customizing ggplot2 plots.
Live recap/Q&A session Tuesday May 17th 3.30-4.30pm
Competition!
Submit an Rmd report showing some of the skills you’ve learned, using the course data or your own data, as simple or as complex as you like. Submit after the course ends, by May 24th. The best report will win a small prize.
Refer to the R for Data Science book for more information on the topics covered in this course.