Introduction to R: Online - (3 days)
This masterclass offers a step-by-step, interactive introduction to R and RStudio for participants with no experience with these software packages.
Instructor
Dr Joanna Dipnall is an applied statistician with particular interests in the advanced statistical methods and machine and deep learning techniques. She completed her Honours in Econometrics with Monash University and PhD with IMPACT SRC, School of Medicine, Deakin University. Joanna works extensively with registry and linked medical data and collaborates extensively with the Faculty of IT at Monash to supervise Masters and PhD students to integrate AI within health research. Joanna teaches within the Monash Biostatistics Unit and is the Unit Coordinator for the Monash Masters of Health Data Analytics course. Joanna has taught advanced statistical methods for many years at universities and for ACSPRI.
One of the key skills in data science is making effective use of software for manipulating data and generating results. R is an established software environment used in the world of data science. In this course, you will be introduced to basic data wrangling, descriptive statistics, visualisation and reporting of results. Key R data science libraries such as dplyr and ggplot will be introduced.
Upon completion of this master class, you will have the skills required to load different types of data files into R, manage and manipulate your data, build visualisations and produce a basic report. The workshop is relevant to researchers and data analysts in any area of research that want to use R for their research work.
This masterclass is part of the ACSPRI suite of courses in social data science is specially designed for those who want to learn how to use R for data manipulation and statistical analysis. It aims to introduce the foundations of R and build confidence in the use of R.
Day 1
- Introduction to R
- Installing and loading libraries
- Data structures in R (vectors, matrices, data frames)
- Descriptive statistics
- Tabulations
- Exercises
Day 2
- Introduction to data wrangling
- Recoding variables
- Generating new variables
- Filtering data frames (rows and/or columns)
- Merging and appending data
- Exercises and homework
Day 3
- Review of homework and Quiz
- Basic graphs
- Extending graphs with ggplot
- Creating your first report of your analysis using Markdown files (tables, graphs)
- Exercises
This course assumes that participants have:
- A basic understanding of statistical concepts including descriptive statistics (mean, median and interquartile range),
- Some familiarity with a PC/Mac environment including keyboard skills,
- An understanding of folder and file structures in the PC/Mac environment, and
- Some experience in using Microsoft Word and Excel or their equivalent.
Data Analysis and Graphics Using R by John Maindonald and W. John Braun.
Discovering Statistics Using R by Andy Field and Jeremy Miles.
The extra day a week later provided the chance to re-group my thoughts and work through the homework which re-enforced the information.