Machine Learning Techniques using Stata: Online - (2 days)

This course is designed as an applied introduction to the use of the Stata software for Machine Learning (ML) techniques.

 

 

This course will be offered online via Zoom And will run to the following timetable:

9.30am -11.00am: Instructional Zoom session
11.00am-11.30am: Break
11.30am-12.30pm: Instructional Zoom session
12:30pm-1.30pm: Lunch
1.30pm-3.00pm: Instructional Zoom session
3.00pm-3.30pm: Break
3.30pm-4.30pm:  Exercises

 

Please note: Courses will run on Australian Eastern Daylight Time (GMT +11)

(ie Melbourne, Sydney, Canberra daylight savings time)

 

 

 
Master Class - runs over 2 days
Instructor: 

Dr Joanna Dipnall is an applied statistician with interests in the advanced statistical methods, including machine learning and deep learning techniques. She completed her Honours in Econometrics with Monash University and her PhD with IMPACT SRC, School of Medicine, Deakin University. Joanna works extensively with registry and linked medical data and collaborates extensively with the Faculty of IT at Monash to supervise Masters and PhD students to integrate artificial intelligence within health research. Joanna teaches within the Monash Biostatistics Unit and is the Unit Co-coordinator for the Monash Masters of Health Data Analytics course. Joanna has taught advanced statistical methods for many years at universities and for ACSPRI.

Course dates: Friday 27 November 2020 - Saturday 28 November 2020
Course status: Course completed (no new applicants)
Venue: 
Online
Week: 
Week 1
About this course: 

Machie Learning techniques are becoming increasingly popular across areas of research from computer science to various disciplines of medicine. This branch of artificial intelligence relates to algorithms that learn from data based on specific tasks and performance measures. This course is an introductory applied course, using Stata software to run various ML algorithms. This course will use some Stata commands that are built into the base system and others that have been specially designed user-written commands that have evolved from the increasing use of ML. Classification, prediction and model selection issues will be discussed. Detailed notes with worked examples and references will be provided as a basis for both the lecture and hands-on computing aspect of the course.

Please note that this course will use Stata V16.

Course syllabus: 

This course primarily focusses on the application of specific ML techniques rather than the complex mathematics behind the ML algorithms and is broken up into Five Parts:
 

Part I: Fundamentals of Machine Learning

Part II: Machine Learning Techniques and Work Flow

Part III: Decision Trees & Random Forests

Part IV: Boosted regression

Part V: Support Vector Machines

Part VI: Lasso regression

 

At the end of each day, participants will be given time to do some ML exercises on their own to practise what they have learned.

Course format: 

This workshop will take place in a classroom. You will need to bring your own laptop with Stata. If you don't have a copy of Stata, please let us know in advance and we will organise a trial version for the course.

Recommended Background: 

This course assumes that participants have (1) Familiarity with the Stata command language (2) sufficient understanding of statistics to be able to comprehend the material covered in the course outline, such as a basic grounding in multiple regression (e.g. linear, logistic, Poisson) and clustering techniques (e.g. Principal components analysis, k-means clustering) (3) access to Stata V16 (4) some experience in using Microsoft Word and Excel or their equivalent (5) experience using a text editor such as Notepad.

Recommended Texts: 

Course notes will be supplied.

 

No specific references are suggested but a number will be supplied with the notes handed out for the course.

Course fees
Member: 
$1,480
Non Member: 
$2,280
Full time student Member: 
$1,280
Supported by: 

Stata is distributed in Australia and New Zealand by Survey Design and Analysis Services.