COMP 776 Project Ideas

Project proposal due: February 28

The goal of the class project is to produce a rudimentary working vision system. The simplest way to accomplish this is to pick one paper from the list below, to distill the essential ideas of the approach presented in that paper, and to create a basic implementation of those ideas. I also encourage you to define your own problem or to combine ideas from several papers, but please try to keep the scope of your project as simple and well-defined as possible! As part of your project, you may use any code or executables that are freely available on the web, as long as your project goes beyond that code and implements something sufficiently significant on top of it or modifies it in some interesting way. Your first task is to write a project proposal. This proposal should be brief (around one page) and contain the following things:
  1. Problem definition: Identify the paper(s) you plan to follow or the problem you plan to solve, and give a basic outline of what you propose to implement.
  2. Resources: Specify what data you plan to use or whether you plan to acquire your own (e.g., for panorama stitching). Also specify whether you plan to use any outside code and how you plan to build on it.
  3. Reservations: Try to anticipate which part of the implementation or testing may prove the most difficult. Possible stumbling blocks shouldn't necessarily prevent you from attempting a more ambitious project, but you should talk to me early on to make sure that you can still define the project in a satisfactory manner.
Please contact me in advance if you want to define your own project instead of following one of the options below!

Stereo

Resources: Middlebury stereo page

Geometry

  • Panorama stitching: Matthew Brown and David G. Lowe, Recognising panoramas, International Conference on Computer Vision (ICCV 2003), Nice, France (October 2003), pp. 1218-25. Project page
    Also see project description from a CMU class on Computational Photography: Part 1, Part 2

  • Space carving: K. N. Kutulakos and S. M. Seitz, A Theory of Shape by Space Carving, International Journal of Computer Vision, 2000, 38(3), pp. 199-218.

  • Structure from motion (advanced): Implement a rudimentary structure-from-motion system using techniques described in Chapters 12 and 13 of the Forsyth & Ponce textbook. Another possible reference is:

    M. Pollefeys, L. Van Gool, M. Vergauwen, F. Verbiest, K. Cornelis, J. Tops, R. Koch, Visual modeling with a hand-held camera, International Journal of Computer Vision 59(3), 207-232, 2004.

Resources: 3D Photography dataset, MATLAB functions for multiple view geometry, multi-view datasets from Oxford

Recognition Using Local Features

Resources: ICCV/CVPR short course on recognition (including pointers to data and code)

Misc. Image Classification

Object Detection

Resources: Face detection resources (including data)

Image Segmentation

Video

Resources: action recognition datasets, CVPR 2006 Human Motion tutorial

Other

  • Feel free to propose your own project idea! If you do this, please contact me in advance of submitting your project proposal.