The perception of Shading and Reflectance E H

The “Intrinsic Image” the underlying physical properties of a scene. ¡ Looking at a

What makes an image? ¡ A combination of three factors: l l l Lighting

Lighting ¡ Variables: l l Number of light sources Intensity Position Distribution (Spot-light or

Reflectance ¡ How a surface’s material changes the light: l l Color Absorbance Transparency

Shading ¡ A change to the angle of incidence of light based on the

a simple formulation of an image in terms of reflectance and shading ¡ I(x,

The bad news ¡ Any 2 D image can be described by infinitely many

The good news Humans are easily able to reason about which intrinsic 3 D

How do we find the best intrinsic image? ¡ ¡ A perception should correspond

The “workshop” metaphor ¡ ¡ A generative model for shading, reflectance, and lighting. We

The painter Can paint polygons with certain colors. ¡ Works on the reflectance component

The metal-worker Can cut out new pieces of metal ¡ Can bend pieces of

The Lighting Designer Can position lights to illuminate a scene. ¡ Can chose between

What does this give us? ¡ ¡ A fairly complete generative model to create

The pricelist ¡ ¡ ¡ Painter Fees: l Paint rectangular patch: l Paint general

Each worker can create an entire image with a minimum of help from the

We need a supervisor ¡ His role: l ¡ Coordinate three workers to find

The supervisor’s solution: ¡ Supervisor's solution: l l l ¡ Cut 1 rectangle Paint

Tweaking the price-list: Discouraging naïve solutions Make naïve solutions expensive. ¡ We don’t want

Is there an optimal pricelist? Price-list values can be determined experimentally and tweaked in

The main problem with this workshop theory The search space for cooperative solutions of

Their solution Instead of a simultaneous cooperative model, we use a simplified, multi-stage generative

Stage 1: The Shape Specialist ¡ Assumptions: l l ¡ image was made by

Shape Specialist Contd. Simple solutions are enforced by assigning higher costs to non-right angles.

Stage 2: Lighting Specialist ¡ ¡ Given the shape from the previous specialist, find

Stage 3: Reflectance specialist ¡ Given the shape and lighting from the previous two

The problem with this approach ¡ Real world scenes don’t look like this:

The problem with this approach ¡ Instead, they look more like this:

Some Other Shortcomings ¡ ¡ ¡ Tuning the cost-factors is done manually. There will

Slides: 31

Download presentation

The perception of Shading and Reflectance E. H. Adelson, A. P. Pentland Presenter: Stefan Zickler

The “Intrinsic Image” the underlying physical properties of a scene. ¡ Looking at a 2 D image, what does its 3 -dimensional source model look like? ¡

What makes an image? ¡ A combination of three factors: l l l Lighting Shading Reflectance

Lighting ¡ Variables: l l Number of light sources Intensity Position Distribution (Spot-light or Global)

Reflectance ¡ How a surface’s material changes the light: l l Color Absorbance Transparency Etc…

Shading ¡ A change to the angle of incidence of light based on the surface normal.

a simple formulation of an image in terms of reflectance and shading ¡ I(x, y) = r(x, y) s(x, y) l l r(x, y) is the reflectance image s(x, y) is the shading image / luminance image ¡ where s(x, y) = λ N(x, y)·L l l l N(x, y) is the surface normal L is the illumination direction λ is the “luminous flux”, meaning intensity of light.

The bad news ¡ Any 2 D image can be described by infinitely many 3 D models of shading and reflectance (the most simple being a flat 2 D screen, colored with the image).

The good news Humans are easily able to reason about which intrinsic 3 D model is likely to be the correct one. ¡ Therefore, a computer should be able do the same… ¡

How do we find the best intrinsic image? ¡ ¡ A perception should correspond to the simplest or likeliest explanation. One way to define simplicity is by introducing a cost-function.

The “workshop” metaphor ¡ ¡ A generative model for shading, reflectance, and lighting. We have three workers: l l l Painter Sheet Metal Worker Lighting Designer

The painter Can paint polygons with certain colors. ¡ Works on the reflectance component of our image. ¡

The metal-worker Can cut out new pieces of metal ¡ Can bend pieces of metal ¡ This is the shading component of our image. ¡

The Lighting Designer Can position lights to illuminate a scene. ¡ Can chose between flood lights and spot lights. ¡

What does this give us? ¡ ¡ A fairly complete generative model to create any arbitrary 3 D scene How do we enforce simplistic solutions? l Through a cost-function.

The pricelist ¡ ¡ ¡ Painter Fees: l Paint rectangular patch: l Paint general polygon: Sheet Metal Worker Fees: l Right angle cuts l Odd angle cuts l Right angle bends l Odd angle bends Lighting Designer Fees: l Flood light l Custom spot light $5 each $2 $5 each $5 each $30 each

Each worker can create an entire image with a minimum of help from the other workers. ¡ Painter’s solution: l Paint 9 polygons: l Setup 1 flood light l Cut 1 rectangle l Total $180 $5 $8 $193 ¡ Sheet metal worker's solution: l Cut 24 odd angles $120 l Bend 6 odd angles $30 l Set up 1 flood light $5 l Total $155 ¡ Lighting Designer's solution: l Cut 1 Rectangle $8 l Set up 9 spot lights $270 l Total $278

We need a supervisor ¡ His role: l ¡ Coordinate three workers to find a cooperative solution with the minimum overall cost. In more scientific terms: l To perform a search through the entire solution space and find the point of minimum overall cost.

The supervisor’s solution: ¡ Supervisor's solution: l l l ¡ Cut 1 rectangle Paint 3 rectangles Bend 2 right angles Supervisor's fee Total $8 $5 $4 $30 $47 Compare to: l l l Painter’s solution: Metal Worker’s solution: Lighting Worker’s solution: $193 $155 $278

Tweaking the price-list: Discouraging naïve solutions Make naïve solutions expensive. ¡ We don’t want our algorithm to simply create a painted 2 D screen. ¡ On the other hand we don’t want to make things like paint too expensive so that they never get used. ¡ Cooperative solutions should be cheaper than single workers ¡

Is there an optimal pricelist? Price-list values can be determined experimentally and tweaked in a way that they deliver the most likely solution for most images. ¡ However, there is no universal price list that correctly describes all possible images. ¡

The main problem with this workshop theory The search space for cooperative solutions of our workers is enormous, as there are infinitely many ways of combining their skills ¡ Even for small scenes, there exists no efficient search algorithm to solve this problem in a simultaneous fashion. ¡

Their solution Instead of a simultaneous cooperative model, we use a simplified, multi-stage generative model. ¡ Where have we seen this before? ¡

Stage 1: The Shape Specialist ¡ Assumptions: l l ¡ image was made by orthographic projection. We are given the observed x, y coordinates of all edges and vertices in the image. Operations: l We can move vertices among the z axis

Shape Specialist Contd. Simple solutions are enforced by assigning higher costs to non-right angles. ¡ Compactness (shorter edges) and planarity (less angle-variance) are rewarded. ¡ This cost-metric works for most figures, but not all of them. ¡

Stage 2: Lighting Specialist ¡ ¡ Given the shape from the previous specialist, find the lighting direction that best explains the observed luminance variation in terms of shading. This can be estimated linearly by solving for the light direction L of two connected surfaces: I 1 = r 1 λ N 1·L I 2 = r 2 λ N 2·L Where r(x, y) is an estimated average, and λ=1

Stage 3: Reflectance specialist ¡ Given the shape and lighting from the previous two specialists, explain any left-over differences by painting the surfaces.

An example:

The problem with this approach ¡ Real world scenes don’t look like this:

The problem with this approach ¡ Instead, they look more like this:

Some Other Shortcomings ¡ ¡ ¡ Tuning the cost-factors is done manually. There will never be a single set of parameters that will correctly describe all scenes. A psychologist’s approach to computer science: not much information on how far this approach can scale up to more complex scenes, not much work on coming up with a better search algorithm or parameter learning. How well this approach works on random, real-world scenes is questionable.