COMP 6321 MACHINE LEARNING PROJECT PRESENTATION ANH TUAN

  • Slides: 10
Download presentation
COMP 6321 MACHINE LEARNING PROJECT PRESENTATION ANH TUAN, TRAN Msc. Computer Science Concordia University,

COMP 6321 MACHINE LEARNING PROJECT PRESENTATION ANH TUAN, TRAN Msc. Computer Science Concordia University, Fall 2017

OUTLINE PROJECT OVERVIEW MACHINE LEARNING BOOTSTRAP

OUTLINE PROJECT OVERVIEW MACHINE LEARNING BOOTSTRAP

PROJECT OVERVIEW (1) ROLLING-SHIFT WORKERS’ LEVEL OF FATIGUE AFFECTED BY WORK SCHEDULE SLEEP PATTERNS

PROJECT OVERVIEW (1) ROLLING-SHIFT WORKERS’ LEVEL OF FATIGUE AFFECTED BY WORK SCHEDULE SLEEP PATTERNS LEVEL OF FATIGUE MEASURED BY (AMONG OTHERS) PVT (PSYCHOMOTOR VIGILANCE TEST) DATA COLLECTED BY SUBJECTIVE MEASURES: QUESTIONNAIRES (5 TIMES DAILY) OBJECTIVE MEASURES: ACTIWATCH (WEARABLE DEVICE) Sleep measurements in time series

PROJECT OVERVIEW (2) OBJECTIVES PREDICT THE LEVEL OF FATIGUE AS A RESULT OF SLEEP

PROJECT OVERVIEW (2) OBJECTIVES PREDICT THE LEVEL OF FATIGUE AS A RESULT OF SLEEP DEPRIVATION HOW WE DO IT? Decision Tree Random Forests

MACHINE LEARNING # X Y 1 1. 5 2 2 1. 7 2. 2

MACHINE LEARNING # X Y 1 1. 5 2 2 1. 7 2. 2 3 2 2. 5 Cross-Validation Model Bootstrap

Bootstrap How-to? (1) # X Y 1 1. 5 2 2 1. 7 2.

Bootstrap How-to? (1) # X Y 1 1. 5 2 2 1. 7 2. 2 3 2 2. 5 Original Data set (Z) # X Y 1 1. 5 2 3 2 2. 5 # X Y 1 1. 5 2 2 1. 7 2. 2 3 2 2. 5

Bootstrap How-to? (2)

Bootstrap How-to? (2)

Using Bootstrap in Error prediction Bootstrap data sets as training data Original sample as

Using Bootstrap in Error prediction Bootstrap data sets as training data Original sample as validation data Problems? Yes! Observations appear both in bootstrap AND validation data This will underestimate true prediction error

A little bit comparison (1) Data set 497 records, in 3 classes 479 in

A little bit comparison (1) Data set 497 records, in 3 classes 479 in Green class 13 in Yellow class 5 in Red class Decision Tree gives: 93. 8% accuracy 21 Green classified as Yellow 10 Green classified as Red

A little bit comparison (2) Random Forests gives: 96. 8% accuracy 14 Green classified

A little bit comparison (2) Random Forests gives: 96. 8% accuracy 14 Green classified as Yellow 2 Green classified as Red Random Forests with Bootstrap gives: 99. 2% accuracy 4 Green classified as Yellow 0 Green classified as Red