Challenges to Computer Vision Larry Davis Computer Science

  • Slides: 18
Download presentation
Challenges to Computer Vision Larry Davis Computer Science Department Institute for Advanced Computer Studies

Challenges to Computer Vision Larry Davis Computer Science Department Institute for Advanced Computer Studies University of Maryland College Park, MD 20742

Obstacles to human-level machine vision Segmentation – Finding what to see n Object representation

Obstacles to human-level machine vision Segmentation – Finding what to see n Object representation – What things look like n Visual learning – Of the What and How n Interface to cognition – Reasoning about what is seen n

Segmentation n Identification of “significant” groups of pixels – edge and local feature detection

Segmentation n Identification of “significant” groups of pixels – edge and local feature detection – edge and local feature grouping – region analysis texture n color n – motion analysis From Adelson, MIT

Segmentation of moving objects against “fixed” backgrounds

Segmentation of moving objects against “fixed” backgrounds

Space shuttle example

Space shuttle example

Segmentation challenges n Integration – Scales in space and time – Cues n Attention

Segmentation challenges n Integration – Scales in space and time – Cues n Attention

Representing 3 -D objects n Object versus viewer centered representations – shape + texture

Representing 3 -D objects n Object versus viewer centered representations – shape + texture

Appearance based matching example (Columbia, msoft)

Appearance based matching example (Columbia, msoft)

Outdoor example – traffic signs

Outdoor example – traffic signs

Pedestrian detection results

Pedestrian detection results

Representing 3 -D objects n Articulated and deformable objects

Representing 3 -D objects n Articulated and deformable objects

Object-centered deformable objects n Metaxas and Terzopoulos

Object-centered deformable objects n Metaxas and Terzopoulos

Representing 3 -D objects n Object versus viewer centered representations – shape + texture

Representing 3 -D objects n Object versus viewer centered representations – shape + texture Articulated and deformable objects n General object classes n

Representing 3 -D objects n Object versus viewer centered representations – shape + texture

Representing 3 -D objects n Object versus viewer centered representations – shape + texture Articulated and deformable objects n General object classes n Form and function n

Visual Learning n Learning representations – recovery – coding n Learning control information –

Visual Learning n Learning representations – recovery – coding n Learning control information – process parameters – rules of engagement

Recovery of 3 -D models Stereo n Structure from Motion n Prior shape models

Recovery of 3 -D models Stereo n Structure from Motion n Prior shape models as constraints on recovery n

Multi-view structure recovery (Torr)

Multi-view structure recovery (Torr)

Interface to cognition n Knowledge based vision systems

Interface to cognition n Knowledge based vision systems