Due: Tuesday, February 3, 2004, Midnight
Credit (U-grad): Approximately 5 points (Relative, and very rough absolute weighting)
Credit (Grad): Approximately 5 points (Relative, and very rough absolute weighting)
This assignment should be done individually
For those using the first assignment to judge the workload in the course: This assignment is not meant to be particularly difficult. Future assignments will be harder, and the project work will be yet more challenging.
(Much debt is owed to David Martin for parts of this assignment)
The purpose of this assignment is to become familiar with Matlab. Matlab is useful for exploring ideas and prototyping programs. It is a popular programming environment for computer vision, especially for those who do not have lots of experience in C/C++ because it allows one to focus more on the problem and less on the code. It has significant drawbacks for writing large programs or when performance is critical. It is also a "product" which needs to be purchased and installed before it can be used. Nonetheless, it is a useful tool which computer vision students should be exposed to. Some future assignments may either require Matlab, or be optionally done in Matlab. Matlab may also be a good tool for certain projects, and some projects may require recoding programs that are available in Matlab into C.
Matlab is installed on the UA CS linux machines (the path is /usr/local/bin). Normally, to begin, you just type "Matlab" at the shell prompt. For example, you may want to use the machines gr01 through gr10 in BSE, either on-site, or remotely. If your DISPLAY environment is properly set, the default behavior is the GUI interface. Otherwise, you will get the the command line interface. This GUI is required for much of this assignment. You may find that the GUI is slow to start up, and painfully slow to use over a modem.
Become familiar with Matlab.
Read this short Matlab
tutorial and be aware of this longer Matlab
primer. The
complete
documentation for Matlab is also available on the web.
All Matlab commands are well documented with
Matlab's help system. For example, if you want to know how the
svd
function works, type help
svd
. You can also get help on all the built in operators
and even the language itself. Typing help
gives a list of help topics.
Place a color JPEG (or TIFF) image in your working directory and load it
into a variable using imread
. Type
help imread
to find out how to do
this. Create a new figure using figure(1)
and display the image in it using
imshow
. Some images will be made available
via the
class web page
for those who don't have a handy image.
Tip: You may want to turn on paging (more on
) for reading help pages.
Tip: Make sure you put a semicolon at the end of your
imread
command. The semicolon at the
end of a line prevents Matlab from displaying the result of an
expression. Most of the time, you want the semicolon.
What data structure is used to represent the image? Type whos
to get a list of active variables
along with their types and sizes. You can see that your image (if it
is an RGB image) is a 3D array of bytes.
What is the range of values contained in the image array? Use the
min
and max
functions to do this.
Hint: These commands by default operate on only one dimension of their argument. Convert your image into a 1D array (a vector) using the(:)
notation when you pass it tomin
andmax
.
Convert the image to grayscale using rgb2gray
. What now is the representation
of the image? Use size
or whos
to find this out. You will see that
we now have a 2D array, or a matrix.
Convert the grayscale image into a double-precision type, and scale the values so that they lie in the range [0,1].
Hint: Use the double
function to do the type conversion, and then divide by the maximum
allowable value for a byte, which is 255.
Create a new figure using figure(2)
, and use imagesc
to display the grayscale image in
it. Why does it look so strange?! Type colorbar
to find out. You can see that
0 maps to blue, 0.5 to green, and 1 to red. This is the default
jet
colormap that is useful for data visualization. For this example, we
might prefer the grayscale colormap. Type colormap(gray)
to do this.
Tip: Type help
gray
to see what default colormaps are available.
Note that you can also make your own colormaps.
Tip: Theimagesc
command takes a second argument that lets you specify the range of values. By default,imagesc
scales the data to use the full colormap, but this is not always what you want. In this example, we could add the argument[0 1]
to theimagesc
command so that the gray values are displayed faithfully.
The image is probably distorted, i.e. the pixels aren't square.
Use the axis
command to fix this.
Tip: When you display matrices as images, you usually want the axes to be scaled so that the pixels are square and the image not distorted. Seehelp axis
to determine how to do this. Theaxis image
command is a useful one. You can also turn the display of the axes on and off with theaxis
command.
The interactivity of Matlab is great for debugging and
experimenting, but usually one wants to type code into a file. Create
a file hw1.m
and put the commands in
it to read and display your image in
figure(1)
, convert the image to
grayscale, convert it to double-precision scaled to
[0,1], and then display the grayscale image
in
figure(2)
with the colorbar, grayscale colormap, and square pixels.
Now type hw1
at the Matlab prompt to
execute the commands in the file. A file of this sort is known as a
script.
Tip: Matlab haspwd
,ls
, andcd
commands like the shell.
Tip: If your script is in some other directory thanpwd
, then you can add that other directory to Matlab's search path with theaddpath
command. The current directory is in Matlab's search path by default.
In this section, we will work with the converted image that you
should have displayed in figure(2)
.
Tip: Array indices in Matlab start at 1, not at 0.
Tip: As in standard mathematical notation, the first index of a matrix is the row, and the second index the column. When viewing a matrix as an image, this means that the first index is the y direction, and the second the x direction. As is usual with image, the origin is at the top left corner, the positive x axis points right, and the positive y axis points down.
Get the width and height of the image using size
.
The size
function, as is common in Matlab, can
return multiple arguments.
Hint: To store both the width and height into variables in one go, try[h,w]=size(im)
. You could alternately doh=size(im,1)
andw=size(im,2)
.
Write a couple of nested for
loops to set every 5th pixel, horizonatally and vertically, to 1.
This should set 1/25th of the pixels to
white in a square lattice pattern.
Hint: Use the colon operator to define the limits of thefor
loops. Seehelp colon
andhelp for
to see how to do this. Specifically, you want theminval:interval:maxval
form.
Set all the same pixels to 0, making the ones that you just set to white become black. Do it this time without using any for loops.
Hint: You can index arrays in Matlab with vectors as well as with scalars, soim(Y,X)=0
will set multiple entries ofim
to zero when eitherX
orY
are vectors, such as the vectors returned by the colon operator.
Set all the pixels whose values are greater than 0.9
to zero.
The command to do this is
im(find(im>0.9))=0
. It would be good
for you to understand why this works, but if this is just too
confusing, move on for now.
Hint: The(im>0.9)
expression evaluates to a boolean matrix, which istrue
where the condition holds. Thefind
function returns a vector containing the indices of the true values. This vector is then used to index the image and set all the values to zero. Why does this last step work when the matrix is 2D and the indexing is 1D? Matlab lets you treat matrices as 1D vectors too, linearizing the matrix in column-major order.
Tip: Matlab makes it easy to write "vectorized" expressions without having to write for loops or if statements. For example, this will add all the values of the image:The following will count the number of values greater than 0.9:sum(im(:))
So will this:numel(find(im>0.9))
This will halve only those values greater than 0.9 (note the use of thesum(sum(im>0.9))
.*
operator to do element-size multiplication of matrices):And so will this:im = im - 0.5*im.*(im>0.9);
This will set 100 unique random pixels to zero:mask = (im>0.9);
im = im.*~mask + im.*mask*0.5;Seep = randperm(numel(im));
im(p(1:100)) = 0;help elmat
for a list of interesting matrix manipulation and creation routines.
Extract a rectangular region of the image into another variable.
Use notation like im2 =
im(colmin:colmax,rowmin:rowmax)
to do this. Use the getrect
function to visually select the
rectangle you want from figure(2)
.
Create a new figure with figure(3)
,
and then display the extracted region in that figure using imagesc
.
Fix the axes so that the pixels
are square.
Tip:getrect
returns floating-point values that you must convert to integers before using them as indices. Use theround
function for this.
Tip: In addition togetrect
there are alsogetline
andgetpts
functions. Also, theginput
function is an extremely useful function to know. It allows you to retrieve mouse clicks and key presses from a figure window.
Write the image to a file called out.jpg
using the
imwrite
function. Use some independent image viewer like
display, xv or a web browser to verify that this worked. I
recomend learning about the ImageMagick suite of tools for converting, and
displaying images (do "man convert", and a "man display" to find out more).
Also the program import can be used to get a screen-shot in linux.
Go back to your hw1.m
file and
add the code to set every 5th horizontal/vertical pixel to 0, extract
a user-selected image region using getrect
from
figure(2)
, display the new image in
figure(3)
,
and write the image to an image file.
You can define a new Matlab function by simply putting code in a
file (as we have just done) and placing a function declaration at the
top. Write the following as the first line of your hw1.m
file:
function [imregion] = hw1(infile,outfile)
Tip: The name of the file and the name of the function should always match. A file can export only one function, but the file may contain internal functions.
Modify your code so that the imread
and imwrite
commands use the infile
and outfile
variables instead of hard-coded
filenames. Also make sure that the extracted image region is stored
in a variable called imregion
so that
it gets returned by the function. If the hw1.m
file is in your pwd
or in your Matlab path, then you can
execute this function by typing hw1('mug.jpg','mugcrop.jpg')
, or cropped=hw1('mug.jpg','mugcrop.jpg')
if you
want the cropped image returned in a variable.
Tip: If you want to return more than one variable from a function, add it to the list in square brackets in the function's declaration. Simply setting these variables inside the function will cause them to be returned.
Tip: If you want a Matlab function to modify a variable, then you have to duplicate it in the input and output lists. Matlab behaves like a functional language in this regard.
Tip: Functions can also have optional input and output arguments. To determine the actual number of input and output arguments a function was called with, look at the special variablesnargin
andnargout
.
Explore the plot
command. Plot the sin
function
over the domain -pi:pi
. Use the linspace
command to
define the domain x
and then do plot(x,sin(x))
. Use
the hold on
command to plot another function on the same graph,
such as cos
. Use a different color, e.g.
plot(x,cos(x),'r')
. The runing of hw1.m should produce a plot along these lines.
Use the hist
command to plot a
histogram of the pixel values in a grayscale image. Use the
h=hist(y,x)
form so that you can control the bin centers and so
that you can plot the result using plot(x,h)
. You may want to plot
the log
of the bin counts.
Your program should now (three times consequetively) prompt the user to select rectangular regions from the first image (figure 1) for which you will plot a histogram. It will be instructive to compare the histograms of different parts of the image, and thus previous plots should stay visible while additional ones are added.
Your program should now prompt the user to select a point from the first image (figure 1) for which you will plot a horizontal trace of the image through that point. Compare with lecture slide the "apricot" image in the lecture notes (the "apricot" image is available here ).
It is instructive to think about histograms and traces of different parts of the image and different images. I recommend running your program on several images. However, for grading purposes, only the functionality described need to be implemented. In particular, there is no need to formally report the results of your experimentation.
When you create a new function, it should always be documented so
that help
returns something
informative. The convention in Matlab is to place the help message in
comments after the function declaration (the comment character is
"%"). For an example, you can look at some of the code in the Matlab
library. The jet
command that we
used above to set the colormap exists as some .m
file on the system.
Type which jet
to locate it, and then use the
type
command to view the source code.
You see that the first line of the comment contains a one-line
description of the command. All subsequent contiguous comment lines
are included in the help message. Document your hw1.m
function in this style.
Read this paper . Write a short description of how you would go about providing a tool in Matlab to analyze images like the chandelier one for perspective validity and output either "yes" or "no". Note that the points on the chandelier identified by circles in Figure 2 in the paper were supplied by a human user.
Only a sketch of a solution is required, so there is no need to spend large amounts of time on this part. The main point is to read the paper and get enough understanding of Figure 2 so that you feel that you "could" do this if asked (and to communicate this to the TA to some extent).
Two additional questions:
Specification summary: The TA will change directory to where your assignment is, and then enter "help hw1". He will then invoke your program with the command code>hw1(infile,outfile). He should see a black and white image, a mucked version of it, and be prompted to select the rectangle. Then that rectangle should appear as a third figure. It will also be written to the outfile. Now sin/cosine plot should appear, and then he should see a prompt for a selection of a rectangle from figure 1. Then a histgram plot should appear. This should happen twice more. Finally, a prompt to select a point from figure 1 should appear. This should produce a graph representing a horizontal trace.
If you want to alert the TA to anything regarding your assignment, provide a README file. The TA will do a "help hw1", so provided features can be explained there as well.
Those doing the written part of the assignment should hand in either an ascii text file or a PDF file.
Instead of a written outline for question 9 of version one, provide an implementation. Your program should instruct the user to click on point pairs defining lines which are presumed to be parallel in the real world. These should be drawn over the image. Your program should then compute appropriate quantities declare the image to be either valid perspective or not. In order to accomplish this, you will have to handle errors in the immage locataion of the points in a reasonable way (essentially addressining the two questions raised in question 9 above). Be sure to document how you did this. Your program should support a help comand as described in version one, question 8.
If you would like to try your program on the chandlier image, click here. If you are interested in the image that it was cut out of, click here.