Slides2 - Tamara L Berg

Why

Categorize in Computer

Vision?

Why Use Categories?

People love categories!

Why Use Categories?

What if we didn’t have categories?

Humuhumunukunukuapua'a –

“fish that grunts like a pig”

Why Use Categories?

Our minds work very intimately with categories

– Every common noun in English is a category

– Proper nouns name object instances

– “this,” “that,” “the,” “my,” “yours,” etc. refer to object instances anonymously

The Categorization Problem


Categorization/Classification:

Given a set of pre-defined categories, “bin” this image

Does not necessarily require object detection

Vertical Dimension:

1.

General: “Animal”

2.

Basic: “Bird”

3.

Specific: “Robin”


What kinds of categorization are computers good at?

• Basic -- especially when using context clues

• Specific -- due to low intra-class variation


Bad at?

• General, due to high intra-class variation and a lack of visual cues


Bad at?

• Categories defined by non-visual characteristics

(like chairs)

Summary

• Semantic categories allow humans to convey a large amount of information concisely

• We want computers to be able to do the same

• What work has been done on this problem?

Has it been successful?

Uses of

Categorizati on

Two Examples

1. Using Context in Categorization

2. Fine-Grain Object Classification

Caltech 101 (2003)

• Dataset for basic-level categorization

• Objects from 101 classes

• Famously difficult

Categorization with Context

Goal: Resolve ambiguity between similarlooking objects of different classes using the

semantic context of an object

Rabinovich et al. (UC San Diego):

Objects in Context

First paper to attempt to use context at the object level

PASCAL 2007 dataset



Approach

1. Segment image to preserve some spatial data

2. Perform Bag-of-Features to give an initial ranked list of labels for each segment

3. Use a Conditional Random Field (CRF) framework to find agreement between segment labels


Bag-of-Features with Segmentation

Labeling Segments:

Confidence:

Conditional Random Field

Way to assign joint probabilities to elements without considering every possible combination in the training set


Idea

• Given set of segments S, set of labels C

• Want to find p(C | S) without knowing p(S)

• Associate a special graph with C that obeys the “Markov Property” (uses S)

• The ordered pair (S, C) is a CRF conditioned on

S


Results

Results

False correction

Fine-Grain Classification

Fine-Grain Image Categorization

Challenge: need good classifiers that capture detail well


Yao et al. (Stanford): Combining Randomization and

Discrimination for Fine-Grained Image Categorization

Approach

Random forest with discriminative classifiers

This is a kind of machine learning framework that allows us to handle the fine detail in this problem.


Random Discriminative Tree

Approach

• For each tree node, train an SVM classifier for a randomly sampled image region

• At each node, make a yes-or-no decision

• Uses grayscale SIFT descriptors

Random Discriminative Tree

Results

Conclusion

• Semantic categories allow humans to convey a large amount of information concisely

• Categorization has been used for basic-level object detection and scene recognition

• Fine-grain categorization can provide us with expert-level classification of objects

• Not all categories are defined by visual characteristics!

Questions?

Slides2 - Tamara L Berg

Why

Categorize in Computer

Vision?

Uses of

Categorizati on

Two Examples

Related documents

Products

Support

Slides2 - Tamara L Berg

Why

Categorize in Computer

Vision?

Uses of

Categorizati on

Two Examples

Related documents

Add this document to collection(s)

Add this document to saved

Suggest us how to improve StudyLib