Action Classification An Integration of Randomization and Discrimination

  • Slides: 32
Download presentation
Action Classification: An Integration of Randomization and Discrimination in A Dense Feature Representation Bangpeng

Action Classification: An Integration of Randomization and Discrimination in A Dense Feature Representation Bangpeng Yao, Aditya Khosla, and Li Fei-Fei Computer Science Department, Stanford University {bangpeng, aditya 86, feifeili}@cs. stanford. edu 1

Outline • Action Classification & Intuition • Our Method • Our Results • Conclusion

Outline • Action Classification & Intuition • Our Method • Our Results • Conclusion 2

Outline • Action Classification & Intuition • Our Method • Our Results • Conclusion

Outline • Action Classification & Intuition • Our Method • Our Results • Conclusion 3

Action Classification Phoning Riding. Bike Running Object classification: Presence of parts and their spatial

Action Classification Phoning Riding. Bike Running Object classification: Presence of parts and their spatial configurations. [Lazebnik et al, 2006] [Fergus et al, 2003] 4 …

Action Classification • All images contain humans; Object classification: Presence of parts and their

Action Classification • All images contain humans; Object classification: Presence of parts and their spatial configurations. [Lazebnik et al, 2006] [Fergus et al, 2003] 5 …

Action Classification • All images contain humans; • Objects small or absent; Object classification:

Action Classification • All images contain humans; • Objects small or absent; Object classification: Presence of parts and their spatial configurations. [Lazebnik et al, 2006] [Fergus et al, 2003] 6 …

Action Classification • All images contain humans; • Objects small or absent; • Large

Action Classification • All images contain humans; • Objects small or absent; • Large pose variation & occlusion; • Background clutter; Challenging… Object classification: Presence of parts and their spatial configurations. [Lazebnik et al, 2006] [Fergus et al, 2003] 7 …

Our Intuition Focus on image regions that contain the most discriminative information. 8

Our Intuition Focus on image regions that contain the most discriminative information. 8

Our Intuition Focus on image regions that contain the most discriminative information. How to

Our Intuition Focus on image regions that contain the most discriminative information. How to represent the features? How to explore this feature space? Dense feature space Randomization & Discrimination 9

Outline • Action Classification & Intuition • Our Method • Our Results • Conclusion

Outline • Action Classification & Intuition • Our Method • Our Results • Conclusion 10

Dense Feature Space . . . . Region Height . . . Region Width

Dense Feature Space . . . . Region Height . . . Region Width Normalized Image Size of image region Center of image region 11

Dense Feature Space . . . . Region Height . . . Region Width

Dense Feature Space . . . . Region Height . . . Region Width Normalized Image Size of image region Center of image region 12

Dense Feature Space . . . . Region Height . . . Region Width

Dense Feature Space . . . . Region Height . . . Region Width Normalized Image Size of image region Center of image region 13

Dense Feature Space . . . . Region Height . . . Region Width

Dense Feature Space . . . . Region Height . . . Region Width Normalized Image Size of image region Center of image region Image size: N×N Image regions: O(N 6) How can we identify the discriminative regions efficiently and effectively? 14

Dense Feature Space . . . . Region Height . . . Region Width

Dense Feature Space . . . . Region Height . . . Region Width Normalized Image Size of image region Center of image region Apply randomization to sample a subset of image patches Random Forests (RF) 15

Dense Feature Space . . . . Region Height . . . Region Width

Dense Feature Space . . . . Region Height . . . Region Width This class Other classes Normalized Image Size of image region Center of image region Random Forests (RF) 16

Dense Feature Space . . . . Region Height . . . Region Width

Dense Feature Space . . . . Region Height . . . Region Width This class Other classes Normalized Image Size of image region Center of image region RF with discriminative classifiers 17

Generalization Ability of RF • Generalization error of a RF: : correlation between decision

Generalization Ability of RF • Generalization error of a RF: : correlation between decision trees : strength of the decision trees • Dense feature space • Discriminative classifiers decreases increases Better generalization 18

RF with Discriminative Classifiers … … 19

RF with Discriminative Classifiers … … 19

RF with Discriminative Classifiers … 1 2 3 … … 0 4 5 1

RF with Discriminative Classifiers … 1 2 3 … … 0 4 5 1 1 1 0 … Train a binary SVM Bo. W or SPM of SIFT-LLC features 20

RF with Discriminative Classifiers … 1 2 3 … … 0 4 5 1

RF with Discriminative Classifiers … 1 2 3 … … 0 4 5 1 1 1 0 … Train a binary SVM Biggest information gain 21

RF with Discriminative Classifiers … 1 2 3 … … 0 4 5 1

RF with Discriminative Classifiers … 1 2 3 … … 0 4 5 1 1 1 0 … Train a binary SVM 22

RF with Discriminative Classifiers … … • We stop growing the tree if: -

RF with Discriminative Classifiers … … • We stop growing the tree if: - The maximum depth is reached; - There is only one class at the node; - The entropy of the training data at the node is low 23

Classification With RF … … … Class Label … Number of trees 24

Classification With RF … … … Class Label … Number of trees 24

Outline • Action Classification & Intuition • Our Method • Our Results • Conclusion

Outline • Action Classification & Intuition • Our Method • Our Results • Conclusion 25

Results on VOC 2011 Actions Action Others’ Best Our Method Jumping 71. 6 66.

Results on VOC 2011 Actions Action Others’ Best Our Method Jumping 71. 6 66. 0 Phoning 50. 7 41. 0 Playing instrument 77. 5 60. 0 Reading 37. 8 41. 5 Riding bike 88. 8 90. 0 Riding horse 90. 2 92. 1 Running 87. 9 86. 6 Taking photo 25. 7 28. 8 Using computer 58. 9 62. 0 Walking 59. 5 65. 9 Our method ranks the first in six out of ten classes. 26

Results on VOC 2011 Actions Action Others’ Best Our Method Jumping 71. 6 66.

Results on VOC 2011 Actions Action Others’ Best Our Method Jumping 71. 6 66. 0 Phoning 50. 7 41. 0 Playing instrument 77. 5 60. 0 Reading 37. 8 41. 5 Riding bike 88. 8 90. 0 Riding horse 90. 2 92. 1 Running 87. 9 86. 6 Taking photo 25. 7 28. 8 Using computer 58. 9 62. 0 Walking 59. 5 65. 9 27

Results on VOC 2011 Actions Action Others’ Best Our Method Jumping 71. 6 66.

Results on VOC 2011 Actions Action Others’ Best Our Method Jumping 71. 6 66. 0 Phoning 50. 7 41. 0 Playing instrument 77. 5 60. 0 Reading 37. 8 41. 5 Riding bike 88. 8 90. 0 Riding horse 90. 2 92. 1 Running 87. 9 86. 6 Taking photo 25. 7 28. 8 Using computer 58. 9 62. 0 Walking 59. 5 65. 9 28

Generalization Ability of RF • Dense feature space Tree correlation decreases • Discriminative classifiers

Generalization Ability of RF • Dense feature space Tree correlation decreases • Discriminative classifiers Tree strength increases Better generalization dense feature Vs. (spatial pyramid) SPM feature Train discriminative SVM classifiers Generate feature weights randomly strong classifier Vs. weak classifier (Results on PASCAL VOC 2010) 29

Outline • Action Classification & Intuition • Our Method • Our Results • Conclusion

Outline • Action Classification & Intuition • Our Method • Our Results • Conclusion 30

Conclusion • Exploring dense image features can benefit action classification; • Combining randomization and

Conclusion • Exploring dense image features can benefit action classification; • Combining randomization and discrimination is an effective way to explore the dense image representation; • Achieves very good performance based on only one type of image descriptor; • Code will be available soon. 31

Acknowledgement … … Thanks to Su 0 Hao, Olga Russakovsky, and Carsten Rother. 1

Acknowledgement … … Thanks to Su 0 Hao, Olga Russakovsky, and Carsten Rother. 1 Train a 4 1 Reference: binary 2 1 SVM 5 0 Khosla, Bangpeng Yao, Aditya and Li Fei-Fei. “Combining 3 1 and Discrimination for Fine-Grained Image Randomization Categorization. ” CVPR 2011. 32