Local Affine Feature Tracking in FilmsSitcoms Chunhui Gu

  • Slides: 19
Download presentation
Local Affine Feature Tracking in Films/Sitcoms Chunhui Gu CS 294 -6 Final Presentation Dec.

Local Affine Feature Tracking in Films/Sitcoms Chunhui Gu CS 294 -6 Final Presentation Dec. 13, 2006

Objective • Automatically detect and track local affine features in film/sitcom frame sequences. –

Objective • Automatically detect and track local affine features in film/sitcom frame sequences. – Current Dataset: Sex and the City – Why sitcom? • Simple daily environment • Few or no special effects • Repeated scenes

Outline • Preprocessing • Tracking Algorithm – Pairwise local matching – Robust features •

Outline • Preprocessing • Tracking Algorithm – Pairwise local matching – Robust features • Feature Matching across Shots • Results – Feature matching vs baseline color histogram – Time complexity – When does tracking fail

Preprocessing (i-1)’th shot Frame Extraction SIFT Feature Extraction i’th shot Shot Detection MSER Interest

Preprocessing (i-1)’th shot Frame Extraction SIFT Feature Extraction i’th shot Shot Detection MSER Interest Point Detection

Tracking Algorithm • Basic: Pairwise Matching Frame i Frame j=i+1

Tracking Algorithm • Basic: Pairwise Matching Frame i Frame j=i+1

Tracking Algorithm • Basic: Pairwise Matching Frame i Frame j=i+1

Tracking Algorithm • Basic: Pairwise Matching Frame i Frame j=i+1

Tracking Algorithm • Basic: Pairwise Matching Frame i Frame j=i+1 Thresholding on both minimum

Tracking Algorithm • Basic: Pairwise Matching Frame i Frame j=i+1 Thresholding on both minimum distance and ratio

Tracking Algorithm • Basic: Pairwise Matching Frame i Frame j=i+1

Tracking Algorithm • Basic: Pairwise Matching Frame i Frame j=i+1

Tracking Algorithm • Basic: Pairwise Matching Frame i Frame j=i+1

Tracking Algorithm • Basic: Pairwise Matching Frame i Frame j=i+1

Tracking Algorithm • Problem of Pairwise Matching – Sensitive to occlusion and feature misdetection

Tracking Algorithm • Problem of Pairwise Matching – Sensitive to occlusion and feature misdetection • Solutions: – Use multiple overlapping windows – Backward Matching • Match features in current frame to features in all previous frames within the shot • Pruning process (reduce computation time) • Select a proportion of features that have longer tracking length as robust features

Shot grouping/Scene Retrieval Shot 49 10746 10747 10772 10933 10934 10968 11393 11394 11435

Shot grouping/Scene Retrieval Shot 49 10746 10747 10772 10933 10934 10968 11393 11394 11435 11533 11534 11560 Shot 53 Scene 5 Shot 56 Shot 60

Inter-Shot Matching Shot I Shot J

Inter-Shot Matching Shot I Shot J

“Confusion Table”

“Confusion Table”

ROC

ROC

When Does Tracking Fail? • Tracking feature outside local window – Rare when continuous

When Does Tracking Fail? • Tracking feature outside local window – Rare when continuous tracking – Happens when occlusion occurs • Same feature splitting to two or more groups – Long occlusion – Multiple matching in a single frame Frame i Frame j=i+1

Computation Complexity • Everything except for MSER and SIFT algorithms are implemented in Matlab

Computation Complexity • Everything except for MSER and SIFT algorithms are implemented in Matlab (slow…) Complexity Time Frame Extraction O(N) ~0. 3 s/frame Shot Detection O(N*f(B)) ~0. 07 s/frame (B=16) MSER Detection O(N) ~0. 3 s/frame SIFT Detection O(N) ~0. 9 s/frame Feature Tracking O(N*F*W*L) ~0. 5 s/frame Matching across shots O(S 2*T 2) ~1 s/shot pair N: # of frames; (30, 000) B: # of bins for color hist (16) F: ave. # of features per frame; (400) W: Local window size; (15) L: tracking length; (20) T: ave. # of robust trackers per shot; (300) S: # of shots; (35)

Conclusion • We successfully implemented local affine feature tracking in sitcom “sex and the

Conclusion • We successfully implemented local affine feature tracking in sitcom “sex and the city”. The tracking method is robust to occlusion and feature misdetection. • Although no quantitative precision/recall curve (hard to find ground truth), the demonstration shows that precision is almost perfect with good recall performance. • We show one successful application of using robust features to associate similar shots together for scene retrieval.

Future Work • Implement algorithm in real-time (C/C++) • Search unique shots in films/sitcoms

Future Work • Implement algorithm in real-time (C/C++) • Search unique shots in films/sitcoms • Separate indoor scenes from outdoor scenes • Determine context of the scene

Acknowledgement

Acknowledgement