This 3-day course is an introduction to supervised machine learning techniques for data science. It provides an interactive, step-by-step guide to running some of the standard statistical regression and classification machine learning models that every data scientist should know. This course will use the R software.
The course runs over three days: Days 1 and 2 in the first week, and Day 3 in the following week.
Please check the syllabus below for details.
Friday 27th - Saturday 28th September: Days 1 & 2
Friday 4th October: Day 3
This course is being held online via Zoom and runs on Australian Eastern Standard Time (UTC+10), that is, Canberra, Sydney, Melbourne and Brisbane time.
This masterclass is part of the ACSPRI suite of courses in social data science and is specially designed for those who want a gentle introduction to supervised machine learning models in data science.
This course will be run over 3 days (2 days in Week 1 and 1 day in Week 2), using the following timetable:
Day 1
- 9.30 am - 10.00 am - Introductions and setup check
- 10.00 am - 11.30 am - Instructional Zoom Session
- 12.30 pm - 2.00 pm - Instructional Zoom Session
- 3.00 pm - 5.00 pm - Instructional Zoom Session and Exercises
Days 2 and 3
- 10.00 am - 11.30 am - Instructional Zoom Session
- 12.30 pm - 2.00 pm - Instructional Zoom Session
- 3.00 pm - 5.00 pm - Instructional Zoom Session and Exercises
Dr Joanna Dipnall is an applied statistician with interests in advanced statistical methods, including machine learning and deep learning techniques. She completed her Honours in Econometrics at Monash University and her PhD with IMPACT SRC, School of Medicine, Deakin University. Joanna works extensively with registry and linked medical data and collaborates with the Faculty of IT at Monash to supervise Masters and PhD students integrating artificial intelligence within health research. Joanna teaches within the Monash Biostatistics Unit and is the Unit Co-coordinator for the Monash Masters of Health Data Analytics course. She has taught advanced statistical methods for many years at universities and for ACSPRI.
Machine learning techniques are becoming increasingly popular across a broad range of research areas and are now a necessary skill for the serious data analyst. This branch of artificial intelligence relates to algorithms that learn from data based on performance measures. Supervised machine learning algorithms are trained using "labelled" data to predict the output of interest. Once the training process is complete, the model is used to predict the output on a separate test data set, and these predictions are used to measure the performance of the algorithm. Based on the nature of the output, these models can be classed as either regression or classification algorithms. This is an introductory course with a primary focus on the application of specific machine learning techniques, rather than the complex mathematical and statistical theory behind the algorithms.
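As a rough illustration of the train-then-test workflow described above, the following is a minimal sketch in base R. It is not taken from the course materials: the built-in mtcars data set, the choice of predictors, the 70/30 split and the random seed are all assumptions made purely for illustration. It shows a regression task, where the output is numeric; a classification task follows the same pattern with a categorical output and a different performance measure.

```r
# Illustrative sketch only: base R and the built-in mtcars data set.
set.seed(42)                                   # make the random split reproducible

# Split the labelled data into a training set and a held-out test set.
n <- nrow(mtcars)
train_idx <- sample(n, size = round(0.7 * n))  # roughly 70% of rows for training
train <- mtcars[train_idx, ]
test  <- mtcars[-train_idx, ]

# Train: fit a linear regression predicting fuel efficiency (mpg)
# from weight (wt) and horsepower (hp) using the training rows only.
fit <- lm(mpg ~ wt + hp, data = train)

# Test: predict the output for rows the model has not seen.
pred <- predict(fit, newdata = test)

# Performance measure: root mean squared error on the test set.
rmse <- sqrt(mean((test$mpg - pred)^2))
print(rmse)
```

A lower test-set RMSE indicates predictions closer to the observed values. Scoring on held-out rows rather than on the training rows is what guards against an over-optimistic assessment of the model.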
Upon completion of this masterclass, you will have the skills required to confidently run a set of standard supervised machine learning tasks using the R software platform. You will have gained an understanding of when each type of model is appropriate and be able to justify the use of your model using key machine learning performance measures. The workshop is relevant to researchers and data analysts in any area of research who want to use machine learning algorithms in their research work.
Day 1:
- Fundamentals of Machine Learning
- Machine Learning workflow
- Different Machine Learning algorithms
- Feature engineering
- Common Machine Learning Metrics
- Model tuning and over-fitting
Day 2:
- Linear regression methods
- Classification methods
- Tree-based methods
- Resampling techniques
- Ensemble methods
- Exercises
Day 3:
- Review of homework and quiz
- Market basket analysis
- Neural networks
- Support vector machines
- Use and reporting of supervised machine learning models in publications
- Exercises
This course will be run online over 2 weeks, with Days 1 and 2 in the first week and Day 3 in the following week. Homework will be provided for participants to complete in the intervening week, with a quiz to be completed prior to Day 3.
Participants will require their own computers and must have R and RStudio installed on their machines. They will also need internet access to download R libraries. This course will be taught in the PC environment, but Mac users are welcome.
Please note that due to the short 3-day structure, there will not be any time set aside for analysing participants' own data.
This course assumes that participants have:
- A basic understanding of statistical concepts, including descriptive statistics (mean, median and interquartile range) and regression analysis.
- A sound knowledge of using the R and RStudio software.
- Some familiarity with a PC/Mac environment including keyboard skills.
- An understanding of folder and file structures in the PC/Mac environment, and
- Some experience in using Microsoft Word and Excel or their equivalent.
The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Second Edition, by Trevor Hastie, Robert Tibshirani and Jerome Friedman.
An Introduction to Statistical Learning with Applications in R, by Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani.
Mastering Machine Learning with R: Advanced Prediction, Algorithms, and Learning Methods with R 3.x, Second Edition, by Cory Lesmeister.
1. BOOKING - ACSPRI does not accept ‘expressions of interest’ for course places, i.e. all bookings are considered firm, and a cancellation fee is charged if you cancel your booking after the early-bird date.
2. DISCOUNT RATE - The discounted rate for ACSPRI members is available to all staff and students of member organisations. To be eligible for this rate:
The course fee must be paid by either the member organisation or by you (where fees are paid by a non-member organisation, the non-member rate applies); and
You must either have a valid email address issued by the member organisation; or you must hold, or have a right to hold, a current staff or student identity card from the member organisation.
In addition, to be eligible for a full-time student discount, the participant must:
Hold, or have a right to hold, a current student identity card from the member organisation;
Be enrolled as a full-time student;
Make payment in full with your application, arrange electronic funds transfer (EFT), or contact ACSPRI to advise credit card details for payment, by the early-bird closing date;
Provide ACSPRI with contact details of your supervisor, so we can ask them to confirm your eligibility for the full-time student rate.
The early-bird rate applies to all bookings paid in full by the early-bird closing date; otherwise you will be charged the standard rate.
3. REFUNDS & CANCELLATIONS - Course fees are not refundable unless:
we cancel the course in which you have enrolled; or
you cancel your enrolment before the early-bird closing date.
A cancellation fee of $250 will be charged if you cancel in the period between the early-bird closing date and one week prior to the commencement of the program. The full course fee will be charged if you cancel within 1 week of the beginning of your course.
4. PRE-REQUISITES - Course descriptions specify course pre-requisites. You must undertake to meet the pre-requisites of the course(s) in which you enrol. If in any doubt, you should contact ACSPRI prior to enrolling.
Delivery of this course is online - via Zoom.
Please ensure you have the following:
- Reliable internet connection with at least 5 GB of data available per day (e.g. a 5-day course will use about 25 GB of data on the Zoom application alone)
- A computer/laptop with the Zoom application installed (free)
- A webcam (built into most laptops)
- A headset with a microphone (not required but ideal)
- A second monitor/screen if possible
Please also check the course page for specific software requirements (if any).