Projects should contribute to research in some way. Possibilities include engaging in a potentially publishable research direction, working on research infrastructure, or building a tool for some research project. Projects may be done individually or in groups. The scope of the project should be a function of the experience of the team, but all teams should feel challenged. Project directions need to be approved by Kobus, and my incur modifications as a consequence of proposal review by ones peers and Kobus.
All project will require substantive background reading. Likely you will want to integrate what you are reading for your project with what you are presenting to others as class lectures, but this is not required.
In creating a project, you should consider ways in which you can simplify the problem, at least for intermediate goals. Working with real raw data is often difficult. Perhaps you can start with synthetic data? Perhaps you can use image data that is manually marked, instead of extracting features.
Please treat these documents as confidential!
Link to summaries for two research research proposals: CDI and SLIC
Link to descriptions for two research research proposals: CDI and SLIC
The following projects are real research projects, and all of them are challenging. The goal is to make a real contribution to ongoing research, and/or open up opportunities for further research after the course ends. However, it is not necessary to choose a project in the domains below. Conversely If more project ideas are needed, they will be forthcoming. Finally, if you need advice regarding creating a project in a particular domain, please contact Kobus.
Most of these projects are "encumbered" in the sense that they are both course
projects and collaborations, and thus are subject to associated constraints
which do not necessarily apply to standard course projects. In particular, your
contribution is implicitly shared with your collaborators, and you respect that
your collaborators part ownership of the intellectual property and artifacts
(code, data, and papers). Of course, you can assume the reciprocal relationship
with your collaborators.
For many applications, it is important to be able to automatically de-identify people in images, while still maintaining the overall gist of the scene. This is a need of Google street view, and Nirav Merchant has a real live application that also needs such a capability. There has been some published work in this area, and the first step is an exhaustive literature survey. However, a basic system should not be too hard to implement. One can imagine using the OpenCV to find faces relatively reliably, then modifying the image so that faces are hard to recognize, but humans do not strongly object to the resulting image. Even if a new method for doing this does not emerge, the project will contribute to the vision lab's collaboration with Nirav. In this project it is required that the implementation is available to the vision lab and Nirav, and that the implementation is modular and the modules themselves are integrated into the vision lab software system.
This project requires collaboration with several people and working within collaboratively developed code bases.
This project requires collaboration with several people and working within existing code bases.
The current state of words and pictures suggests several directions. First, we have work underway to test a number of algorithms, particularly on the region labeling task (most researchers focus on the image labeling task). A possible project is to contribute to this task by implementing some recent algorithms and running experiments.
Another direction is to develop an algorithm along the lines of our CVPR 2008 paper. This will likely require some contribution to the preceding idea as it would be necessary to compare new algorithms with recent approaches, especially if the new algorithm shares some ideas with them.
A third direction is to contribute to the funded pipeline framework for image annotation. Areas where help are needed include image feature extraction, word feature extraction, and integrating object identification (e.g., face identification).
Other projects are possible.
This project requires collaboration with several people and working within the code base which is in C.