Introduction Seeing is harder than it looks
In 1966, Seymour Papert proposed the Summer Vision Project, bringing together artificial intelligence researchers for “the construction of a significant part of a visual system” over the course of just a few months. But the goal of solving many problems of computer vision proved overambitious and instead the researchers rediscovered a fact long familiar in vision science: seeing is harder than it looks.Histories don't write themselves. For credits and other histories of computer vision, scroll to the bottom of the page.
Since the optical studies of Ibn al-Haytham in the 11th century, scientists recognized a gap between the confused mess of visual information hitting the eye and visual experience, where segmented objects appear arrayed in space with clear differences between foreground and background. The assumption for the last millennium has been that there must be unconscious judgments engaged in processing the visual information: segmentation, spatial configuration, object recognition, and so on.
Papert recognized this in part. His optimism stemmed from the idea that the different unconscious judgments necessary for understanding an image could be instantiated in different computer programs. Thus the labor could be divided among different teams, with one team writing a program to detect edges, corners, and other pixel-level information in an image, another forming continous shapes out of these low-level features, a different group arranging the shapes in three-dimensional space, and so on. While the summer project failed, the general approach remained: treat vision not as a single problem, but as a number of discrete subproblems which can be stacked one on top of another in a hierarchy, from edge detection all the way up to robust object recognition.
Funding for computer vision was often generous because the military was being overwhelmed with spy plane and later satellite images (it was the Cold War, after all). The funding for this approach typically came from either the Department of Defense or the Advanced Research Projects Agency (ARPA, later DARPA), the US military research slush fund.
In 1964 the military began investigations into a facial recognition system with Woody Bledsoe, Charles Bisson and Helen ChanWired's Secret History of Facial Recognition delightfully recounts the Bledsoe, Bisson, and Chan story. which detected a number of identifying features — how far apart your eyes are, where your hairline starts — which could be matched with a face.
Although the research funding waxed and waned over the last 70 years, militaries across the world see more effective computer vision as essential to their national security.
Harun Farocki's Eye/Machine film trilogy examines “intelligent” image processing techniques in warfare and civilian life. Many academics and industries, more interested in basic research into human vision or self-driving vacuums, often found the pull of government money irresistable. This lead to enormously innovative work, some of it anthropocentric and some designed solely for mechanical eyes, into ways of interpreting the world of light.