Computer Vision (comp256)

Computer Vision (following Tomaso Poggio, MIT): Computer Vision, formerly an almost esoteric corner of research and regarded as a field of research still in its infancy, has emerged to a key discipline in computer science. Vision companies have emerged and commercial applications become available, ranging from industrial inspection and measurements to security database search, surveillance, multimedia and computer interfaces. Computer Vision is still far from being a solved problem, and most exciting developments, discoveries and applications still lie ahead of us. Understanding the principles of vision has implications far beyond engineering, since visual perception is one of the key modules of human intelligence.

Goal and Objectives

To introduce the fundamental problems of computer vision.

To introduce the main concepts and techniques used to solve those.

To enable participants to implement solutions for reasonably complex problems.

To enable the reader to make sense of the literature of computer vision.

Who should attend this course?
Graduate students that are interested to learn about the fundamental concepts of computer vision or desire to use compute vision techniques in their research.
Prerequisites: Most of the knowledge required should be part of the normal background in Computer Science and undergraduate/graduate Mathematics and Geometry. Comp235 or equivalent is required.

Textbook
The textbook for this course is "Computer Vision: a modern approach" by David Forsyth and Jean Ponce (should be available at Student Stores). This is one of the most recent books on computer vision, authored by two very well respected researchers in the field. This book has become one of the most popular textbooks for computer vision.

Topics

Introduction

Shape from Stereo, Motion, Shading, ...

Segmentation

Tracking

Recognition

Learning approach

Students should preferably read the relevant chapters of the books and/or reading assignements before the class.

In the course the material will then be covered in detail and motivated with real world examples and applications.

There will be assignements to provide students with practical experience of some computer vision techniques.

There will also be a final project where students will solve a real world problem using computer vision techniques.

Grade distribution

Assignments: 40%

Class participation: 10%

Project proposal: 10%

Final project: 40%

Some usefull links

The Computer Vision Homepage

CVonline

Middlebury Stereo Vision Page

Slides

Class 1 introduction

Class 2 cameras, lenses and sensors

Class 3 Radiometry

Class 4 Sources, shading and photometric stereo

Class 5 Color

Class 6 Linear filters & edges

Class 7 Pyramids and textures

Class 8 Multiple View Geometry

Class 9 Stereo

Class 10 Project Proposal Presentation

Class 11 Tracking

Class 12 Optical flow

Class 13 Silhouette and space carving approaches

Class 14 Structure from motion (affine)

Class 15 Structure from motion (projective)

Class 16 Calibration (and self-calibration)

Class 17 Segmentation (foreground/background, K-means, graph-based)

Class 18 Fitting (Hough, RANSAC, ... )

Class 19 Probablistic segmentation (EM and model selection)

Class 20 Template matching and object recognition (Classifiers, Neural Nets, SVM,...)

Class 21 Recognition by relations

Class 22 Range (Structured light, Registration, Merging, Recognition, ...)

Assignments

Assignment 1 Photometric stereo (additional files)

Assignment 2 Stereo (data files)

Assignment 3 Line fitting (datagen.m, linefit.m)

Class project

Template for proposal

Marc Pollefeys, October 4, 2003.