G89.2223
Perception
Fall 2009
Monday/Wednesday
4 pm - 5:15
Room 851 Meyer Hall
Last updated: July 27, 2009
9/9-21
Detection and Cue Integration (Landy) Readings: Wandell App 3; Signal Detection Theory handout; Hecht et al. (1942); Geisler (1989).
Supplementary readings: Cornsweet, T. N. (1970). Visual Perception. New York: Academic Press (chs. 2-4); Duda, R. O., Hart, P. E. & Stork, D. G. (2001). Pattern Classification. New York: Wiley (chs. 2-3); Green, D. M. & Swets, J. A. (1966/1974) Signal Detection Theory and Psychophysics. New York: Robert E. Krieger; Macmillan, N. A. & Creelman, C. D. (1991). Detection Theory: A User's Guide. New York: Cambridge; Wickens, T. D. (2002). Elementary Signal Detection Theory. New York: Oxford; Coombs, C. H., Dawes, R. M. & Tversky, A. (1970). Mathematical Psychology, An Elementary Introduction. Englewood Cliffs, NJ: Prentice-Hall (ch. 6).
Additional readings about retinal responses near absolute threshold: Field, Sampath & Rieke (2005); Chichilnisky & Rieke (2005).
Signal detection tutorial (zip archive of matlab code)
Lecture slides (pdf)
9/23-30
Note: no class 9/28
Cue Combination and Statistical Decision Theory (Landy) Readings: Landy, Maloney, Johnston & Young (1995); Ernst & Banks (2002); Knill, Kersten & Yuille (1996); Mamassian, Landy & Maloney (2003), pp. 13-22; Körding et al.
Supplementary readings: Saunders & Knill (2005); Maloney (2002), pp. 145-177; Dean, Wu & Maloney (2007); Battaglia & Schrater (2007)
Books for background reading on Bayesian estimation and decision theory (optional): Leanard & Hsu, Bayesian Methods: An analysis for statisticians and interdisciplinary researchers; Sivia, Data Analysis: A Bayesian Tutorial.
Lecture slides (7 MB pdf)
Lecture notes on depth (from undergrad perception course)
10/5-14 Color: Trichromacy, Color Opponency, & Chromatic Adaptation (Maloney) Readings: Wandell Chs 1, 3, 4, & 9.
Lecture slides:
Color slides part I
Color slides part II
Color slides part IIILecture notes:
Lecture notes on color (from undergrad perception course)
Lecture notes on the retina (from undergrad perception course)Color matching tutorial (100 KB, zip archive of matlab code)
10/19 - 11/4 Spatial Vision, Linear Systems Theory and Auditory Channels (Landy/Poeppel) Readings: Wandell Chs 2, 3, 5, 6, 7, 8, & App 1; Signals, Linear Systems, & Convolution Handout.
Auditory readings: Moore, first 10 pages of Painter & Spanias
Supplementary readings:
- Blakemore, C. & Sutton, P. (1969). Size adaptation: A new aftereffect. Science, 166, 245-247.
- Campbell, F. W. & Gubisch, R. W. (1966).Optical quality of the human eye. Journal of Physiology, 186, 558-578.
- Campbell, F. W. & Robson, J. G. (1968). Application of Fourier analysis to the visibility of gratings. Journal of Physiology, 197, 551-566.
- Graham, N. (1980). Spatial-frequency channels in human vision: Detecting edges without edge detectors. In Harris, C. (Ed.), Visual Coding and Adaptability (pp. 215-252). Hillsdale, NJ: Erlbaum.
- Graham, N. & Nachmias, J. (1971). Detection of grating patterns containing two spatial frequencies: A comparison of single-channel and multiple-channels models. Vision Research, 11, 251-259.
- Watson, A. B. & Robson, J. G. (1981). Discrimination at threshold: Labelled detectors in human vision. Vision Research, 21, 1115-1122.
- Wilson, H. R., McFarlane, D. K. & Phillips, G. C. (1983). Spatial frequency tuning by orientation selective units estimated by oblique masking. Vision Research, 23, 873-882.
- Bracewell, R. N. (2003). Fourier Analysis and Imaging. New York: Kluwer/Plenum.
Lecture slides (pdf)
David Poeppel's lecture slides (pdf)
Matlab tutorials, Note: Add the subdirectory "pyrTools" along with all its subdirectories to get these tutorials to work.
11/9-18 Visual Motion Perception (Heeger) Readings: Wandell Ch 10 & App 5; Adelson & Bergen (1985); Adelson & Movshon (1982); Weiss, Simoncelli, & Adelson (2002).
Supplementary readings: Simoncelli & Heeger (1998); Huk & Heeger (2002); Motion estimation handout.
Lecture slides:
Motion intro lecture slides (2MB pdf)
Functional specialization lecture slides (5.9MB pdf)
Computational theory lecture slides (7.1MB pdf)
Lecture notes:
Lecture notes on motion (from undergrad perception course)
Lecture notes on the visual cortex (from undergrad perception course)Matlab code:
Motion tutorial (160KB zipped archive, requires matlabPyrTools)
MT model (matlab code available for download)11/23 - 12/7
Note: no class 11/25
Attention (Carrasco) Readings: Carrasco (2006); Lu & Dosher (2004); Dosher & Lu (2000); Carrasco & Yeshurun (1998); Wolfe (1998); Palmer (1995); Reynolds, Pasternak, & Desimone (2000).
Lecture slides:
Lecture slides - part 1 (2.2MB powerpoint)
Lecture slides - part 2 (2.7MB powerpoint)
Lecture slides - part 3a (480 KB powerpoint)
Lecture slides - part 3b (12.7MB powerpoint)12/9-14 Recognition (Pelli) Readings: Rosch et al. (1976); Pelli & Tillman (2008) with supplementary material; Treisman & Kanwisher (1998); Ranzato, Huang, Boureeau, & LeCun (2007).
Lecture slides:
Part 1 (2.9 MB pdf)
Part 2 (7.1 MB pdf)
Part 3 (1.9 MB pdf)
Part 4 (9.8 MB pdf)
| Marisa Carrasco, Rm. 971, 8-8328 marisa.carrasco@nyu.edu |
| David J. Heeger, Rm. 963, 8-7868 david.heeger@nyu.edu |
| Michael Landy, Rm. 961, 8-7857 landy@nyu.edu |
| Laurence Maloney, Rm. 278, 8-7851 laurence.maloney@nyu.edu |
| Denis Pelli, Rm. 960, 8-3864 denis.pelli@nyu.edu |
| David Poeppel, Rm. 281, 2-7489 david.poeppel@nyu.edu |
For each assignment, write an essay (or essays for assignments with multiple part question), approximately 5 pages per assignment, with references and optionally with figures. Each essay must include background, summarizing the relevant material from the lectures and readings, in addition to the specific answer to the question. Submit your essay by email to (as a pdf file) to Prof. Landy.
Hecht, Schlaer and Pirenne were interested in determining the minimal visible stimulus under the best of conditions. However, they did not run their task as a signal-detection experiment, as was mentioned in class.
(a) Describe a true signal-detection-task version of the experiment. What would be the different stimuli, presented in what order, and blocked how? Please justify your choices.
(b) Describe precisely how you would estimate d' and beta (for standard signal detection theory using unit-variance Gaussians) for each condition in (a). Be specific about which data are used for each such estimate.
(c) However, the underlying model here is that performance is limited by photon statistics. Outline as precisely as you can a procedure for interpreting the data from (a) assuming that photoisomerizations are Poisson (not Gaussian) distributed. Note that photoisomerizations occasionally occur when no stimulus is delivered (which is given the lovely oxymoronic name of "dark light"). You can assume that the noise in the rods due to these spontaneous photoisomerizations is the only source of biological variability in behavioral performance. Your goal is to estimate the "dark light" and the number of photoisomerizations that are needed for threshold performance in detecting a flash of light. You can assume that the display is perfectly calibrated so that you know the mean number of photons emitted for each trial (you don't know the exact number of photons emitted on a given trial, only the average number emitted at any given light level). Furthermore, you can assume that you know the proportion of photons emitted that lead to photoisomerizations (recalling that some of the photons are reflected or absorbed by the rest of the stuff in the eye).
(d) However, it is not realistic to assume that the display is perfectly calibrated nor that you know the proportion of photons emitted that lead to photoisomerizations. So now, without making these assumptions, how would you estimate the "dark light" and the number of photoisomerizations that are needed for threshold performance in detecting a flash of light. Hint: Assume that the "dark light" corresponds to a mean rate of 1 photoisomerization per trial, that a test flash corresponds to a mean rate of 1 photoisomerization per trial and that twice the test flash gives twice the mean rate. What will the internal response (probability density) curves and the ROC curves look like? Next, assume that the "dark light" corresponds to a mean rate of 1 photoisomerization per trial, and that the test flashes yield mean rates of 10 or 20 photoisomerizations. In what way will the internal response and ROC curves differ? Then assume that the "dark light" corresponds to a different mean rate (10 photoisomerizations). In what way will the internal response and ROC curves differ? Based on this, outline how you would estimate the mean rates of photoisomerizations for the "dark light" and test flashes, from the experimental data. Don't forget that you need to do this in such a way that you can either estimate the criterion or make sure that the criterion doesn't affect your estimates of the mean rates of photoisomerizations for the "dark light" and test flashes.
(e) The above analysis is described effectively as if you did the detection with a single rod receptor that produce all the light-induced and spontaneous photoisomerizations. Suppose instead that the light is distributed equally across 5 receptors (each getting, on average, 1/5 of the photons) and that you know the average dark light for each receptor, but these dark-light values (average number of spontaneous photoisomerizations) are not identical across the 5 receptors. What decision rule would be optimal under these circumstances?
(f) This is for extra credit (i.e., not required). If you can, try to use Matlab to simulate the internal response and ROC curves for various different choices of "dark light" and test flash intensity. Better yet, use Matlab to generate a simulated data set ("yes" and "no" responses for a bunch of trials) and then fit the simulated data to estimate the mean rates of photoisomerizations for the "dark light" and test flashes. Although this part of the assignment is for "extra credit", implementing the simulations will guarantee that your answers are correct, and you might find that simulating it will help get an intuition for it.
As we discussed in class, standard theories of spatial vision suggests that spatial patterns are analyzed in the human visual system with a small number of "channels" varying in the spatial frequency to which they are maximally sensitive and the peak temporal frequency as well. Each "channel" consists of a set of elements with linear receptive fields identical in shape, differing only in spatial position. The models then typically include nonlinearities, noise (to simulate variability in human responses) and a decision mechanism that combines the noisy responses from each location to compute a decision in each simulated experimental trial.
Similarly, in color vision it has been suggested that colors are also encoded by separate "channels": the opponent mechanisms that encode blue-vs.-yellow, red-vs.-green, and black-vs.-white. Again, each of these "channels" begins with a receptive field that linearly combines the responses of cones, with different linear weights for each cone class (S, M, and L).
In two papers, Poirson & Wandell modeled and measured how spatial and chromatic tuning are combined in terms of the effect on color appearance (Poirson & Wandell, 1993) and detection threshold (Poirson & Wandell, 1996).
Please read the 2nd (detection) paper (you might scan the appearance paper for context if you like, but that's not necessary), and summarize what they did and what they found.
(1) Stimuli: Describe clearly and carefully their "chromatic" Gabor stimuli in terms of the responses such a stimulus engenders in each of the three cone classes. Then, describe as best you can how you would determine the RGB values at each pixel required to produce that stimulus.
(2) Model: Their most-constrained model is "pattern-color separable". That is, the corresponding receptive fields are f(x,y,lambda)=g(x,y)h(lambda), where x,y are the spatial position and lambda is wavelength of a light. Give some details of how you would build such separable receptive fields as a sequence of linear receptive fields building on the initial cone representation of the input, each stage linearly combining units at the preceding stage.
(3) Results: Describe the three channels found by Poirson & Wandell in terms of the color and pattern selectivity of each, as well as a description of the spatial receptive field corresponding to each.
(4) Suppose you wanted to verify the results of Poirson & Wandell using a standard experiment for identifying channel tuning properties (i.e., a masking, adaptation or summation experiment, as discussed in class). Describe the experiment you would perform (stimuli, conditions, task) and data analysis (estimation of threshold, etc.) required to establish estimated spatiochromatic tuning.