CAVE | Projects: What is a Computational Camera?

What is a Computational Camera?

Figure 1: (a) The traditional camera model, which is based on the camera obscura. (b) A computational camera uses optical coding followed by computational decoding to produce new types of images. (c) A programmable imaging system is a computational camera whose optics and software can be varied/controlled. (d) The optical coding can also be done via illumination by means of a programmable flash.

February 2003. Revised: January 2011

1. Evolution of the Camera Model

The Traditional Camera

Over the last century, the evolution of the camera has been truly remarkable. However, through this evolution the basic model underlying the camera has remained essentially the same, namely, the camera obscura (Figure 1(a)). The traditional camera has a detector and a standard lens which only captures those principal rays that pass through its center of projection, or effective pinhole, to produce the familiar linear perspective image. In other words, the traditional camera performs a very simple and restrictive sampling of the complete set of rays, or the light field, that resides in any real scene.

Computational Cameras

A computational camera (Figure 1(b)) uses a combination of novel optics and computations to produce the final image. The novel optics is used to map rays in the light field of the scene to pixels on the detector in some unconventional fashion. For instance, the ray shown in Figure 1(b) has been geometrically redirected by the optics to a different pixel from the one it would have arrived at in the case of a traditional camera. As illustrated by the change in color from yellow to red, the ray could also be photometrically altered by the optics. In all cases, the captured image is optically coded and may not be meaningful in its raw form. The computational module has a model of the optics, which it uses to decode the captured image to produce a new type of image that could benefit a vision system. The vision system could either be a human observing the image or a computer vision system that uses the image to interpret the scene it represents.

Programmable Computational Cameras

Computational cameras produce images that are fundamentally different from the traditional linear perspective image. However, the hardware and software of each computational camera are typically designed to produce a particular type of image. The nature of this image cannot be altered without significant redesign of the imaging system.

A programmable imaging system uses an optical system for forming the image, that can be varied by a controller (Figure 1(c)) in terms of its radiometric and/or geometric properties. When such a change is applied to the optics, the controller also changes the decoding software in the computational module. The result is a single imaging system that can emulate the functionalities of several specialized ones. Such a flexible camera has two major benefits. First, a user is free to change the role of the camera based on his or her needs. Second, it allows us to explore the notion of a purposive camera that, as time progresses, automatically produces the visual information that is most pertinent to the task. In order to give its end-user true flexibility, a programmable imaging system must have an open hardware and software architecture.

Programmable Illumination

The basic function of the camera flash has remained the same since it first became commercially available in the 1930s. It is used to brightly illuminate the camera's field of view during the exposure time of the image detector. It essentially serves as a point light source. Due to the significant technological advances made with respect to digital projectors, the time has arrived for the flash to play a more sophisticated role in the capture of images. The use of a projector-like source as a camera flash is powerful as it provides full brightness and color control over time of each of the 2D set of rays it emits (a projector with a finite aperture actually projects a 4D set of rays but only permits control over two of the dimensions). This enables the camera to project arbitrarily complex illumination patterns onto the scene, capture the corresponding images, and compute information regarding the scene that is not possible to obtain with the traditional flash. In this case, the complete imaging system can still be thought of as a computational camera where captured images are optically coded due to the patterned illumination of the scene (Figure 1(d)).

An array of cameras and an array of projectors can be used simultaneously to capture coded measurements of the light field of a scene. Computational decoding of such measurements can facilitate post-capture control of a variety of imaging parameters, including, viewpoint, resolution (spatial, temporal, angular and spectral), depth of field and lighting.

2. Coding Approaches

The design space for the optics of computational cameras is large. It would be desirable to have a single design methodology that produces an optimized optical system for any given set of imaging specifications. The optimization criterion could incorporate a variety of factors, including performance and complexity. At this point in time, however, such a systematic design approach does not exist. Consequently, as with traditional optics, the design of computational cameras remains part science and part art.

The optical coding methods used in today's computational cameras can be broadly classified into the six approaches shown in Figure 2. The first four of these can be viewed as modifications to the traditional camera model. Examples of existing computational cameras that lie in each of the six categories can be found in [Nayar 2011].

Object Side Coding

This is the most convenient way to implement a computational camera, as it only requires optics to be externally attached to a traditional camera (Figure 2(a)). A few examples of this approach include wide angle catadioptric imaging, generalized mosaicing and integral imaging using an externally attached lens or prism array. Object side coding has also been used to develop a variety of "non-central" cameras that do not have a single effective viewpoint but rather a locus of viewpoints. In some cases, the locus of viewpoints is a necessary compromise made to achieve a particular type of image projection and in other cases, such as panoramic stereo, it is a desirable attribute.

Figure 2: Optical coding approaches used in computational cameras. The first four of these are shown as modifications made to a traditional camera. (a) Object side coding, where an optical element is attached externally to a conventional lens. (b) Pupil plane coding, where an optical element is placed at, or close to, the aperture of the lens. (c) Focal plane coding, where an optical element is placed at, or close to, the detector plane. (d) Illumination coding, where coding is achieved by projecting complex illumination patterns onto the scene. (e) The imaging system is made up of a cluster or array of traditional camera modules. (f) A radically different camera design that cannot be described as a modification to a traditional camera or a collection of traditional cameras. See [Nayar 2011] for examples of each approach.

Pupil Plane Coding

In this case, an optical element is placed at, or close to, the pupil plane of a traditional lens (Figure 2(b)). Examples include the use of phase plates and coded apertures for depth of field extension, the use of coded apertures for enhancing signal-to-noise ratio and resolution, aperture and focus control for depth estimation, aperture splitting for dynamic range extension and image replication, and the use of programmable apertures for viewpoint control and light field capture.

Focal Plane Coding

Here, an optical element is placed on, or close to, the image detector (Figure 2(c)). In this approach, we also include the use of small physical motions of the image sensor or pixel-wise control of exposure. Examples include the use of lens arrays and attenuation masks for light field imaging, the use of assorted pixel filters for multispectral and high dynamic range imaging, and the use of sensor motion to achieve super-resolution, extended depth of field and motion deblurring.

Illumination Coding

As mentioned earlier, by using a spatially and/or temporally controllable flash, captured images can be coded using illumination patterns. This approach enables image coding in ways that are not possible by only altering the imaging optics (Figure 2(d)). Illumination coding has a long history in the field of computer vision - virtually any structured light method or variant of photometric stereo is based on the notion of illumination coding. Recent examples include the use of multiplexed illumination for SNR enhancement and object relighting, and the use of coded illumination patterns for the measurement of light transport in a scene.

Camera Clusters and Arrays

A number of traditional cameras can be spatially arranged to create new types of images (Figure 2(e)). In this case, these is no explicit optical coding involved. One can view this approach as increasing (in space and/or time) the sampling of the light field. While camera clusters seek to capture wide fields of view with minimal overlap between the fields of view of adjacent cameras, camera arrays capture multiple perspectives of the same scene with large overlap between the fields of view to acquire 3d reconstructions or light fields of the scene.

Unconventional Imaging Systems

These are optical designs that cannot be easily described as modifications to, or collections of, traditional cameras (Figure 2(f)). While we have not seen many well-tested examples of such systems, one can expect novel designs in the decades to come. Examples may include flexible cameras that can be wrapped around objects or incorporated into clothing, networked dust cameras that can be scattered to produce images of volumes of space, and surfaces made of pixels that can both measure and radiate light.

Figure 3: Computational cameras can be characterized based on the coding approach they use, the number images they need to capture as input, and the type of information they produce.

Figure 3 shows a way to characterize computational cameras based on three factors: (a) the technical approach used for coding, (b) the number of images that need to be captured, and (c) the type of information produced.

3. Benefits of Computational Cameras

New Imaging Functionalities

One motivation for developing computational cameras is to create new imaging functionalities that would be difficult, if not impossible, to achieve using the traditional camera model. The new functionality may come in the form of images with enhanced field of view, spectral resolution, dynamic range, temporal resolution, etc. The new functionality can also manifest in terms of flexibility - the ability to manipulate the optical settings of an image (focus, depth of field, viewpoint, resolution, lighting, etc.) after the image has been captured.

Improved Performance-to-Complexity Ratio

Another major benefit of computational imaging is that it enables the development of cameras with higher performance-to-complexity ratio than traditional imaging. Camera complexity has yet to be defined in concrete terms. However, one can formulate it as some function of size, weight and cost. In traditional imaging, it is generally accepted that higher performance comes at the cost of complexity. For instance, to increase the resolution of a camera, one needs to increase the number of elements in its lens - this is the only way to combat the aberrations that limit resolution. In contrast, computational imaging allows a designer to shift complexity from hardware to computations. For instance, high image resolution can be achieved by post-processing an image captured with very simple optics (even a single element).

4. Limits of Computational Cameras

The design of computational cameras may be viewed as choosing an appropriate operating point within a high dimensional parameter space. Some of the parameters are photometric resolution, spatial resolution, temporal resolution, angular resolution, spectral resolution, field of view and F-number. The space could include additional parameters related to the "cost" of the design, such as, size, weight and expense. In general, while making a final design choice to achieve a desired functionality, one is forced to trade-off between the various parameters. In short, as with traditional imaging, there is no "free lunch" with computational cameras. For instance, in the cases of omnidirectional imaging and integral imaging, resolution is traded-off for wider field of view and viewpoint (or focus) control, respectively. Generally, the trade-off made with any given computational camera is straightforward to analyze and quantify.

While computational cameras have been shown to enable new imaging functionalities and achieve high performance-to-complexity ratios, it is not known whether computational imaging can be used to break fundamental limits of imaging. For instance, it is not clear that the hard resolution limits imposed by diffraction can be overcome using computations. This is an open question that deserves closer attention.

The development of computational cameras lies within the larger field of computational imaging. While computational imaging encompasses a wide range of imaging modalities and applications, computational cameras seek to overcome the limits of the traditional camera and impact all fields that use the camera as a source of information. Examples of such fields include photography, computer vision, computer graphics, biometrics, remote sensing and robotics.

Figure 4: One way to define the terms digital photography, computational photography, computational imaging/cameras and computational image sensors.

Figure 4 shows one possible way to define the terms digital photography, computational photography, computational imaging/cameras and computational image sensors. The field of computational cameras naturally overlaps the areas of computational photography and computational image sensors (Figure 5). Computational photography includes the development of purely software based methods that seek to process multiple images (which could be taken with even a traditional camera) to produce a new type of image or scene representation. With respect to the computational image sensors, several research teams are developing detectors that can perform image sensing as well as early visual processing.

Figure 5: The four fields defined in Figure 4 are closely related to each other and overlap significantly in terms of the types of methods they encompass.

Publications

"Computational Cameras: Convergence of Optics and Processing,"
C. Zhou and S.K. Nayar,
IEEE Transactions on Image Processing,
Vol. 20, No. 12, pp. 3322-3340, Dec. 2011.
[PDF] [bib] [©]

"Computational Cameras: Approaches, Benefits and Limits,"
S.K. Nayar,
Technical Report, Department of Computer Science, Columbia University CUCS-001-11,
Jan. 2011.
[PDF] [bib] [©]

"Programmable Imaging: Towards a Flexible Camera,"
S. K. Nayar, V. Branzoi, and T. E. Boult,
International Journal on Computer Vision,
Oct. 2006.
[PDF] [bib] [©] [Project Page]

"Computational Cameras: Redefining the Image,"
S. K. Nayar,
IEEE Computer Magazine, Special Issue on Computational Photography,
pp. 30-38, Aug. 2006.
[PDF] [bib] [©]

Slides

Computational cameras and variants
(Plenary talk, ICIP, Thessaloniki, October 2001)

Related Fields
(Symposium on Computational Photography, Boston, May 2005)

Programmable illumination and the convergence of cameras and projectors
(Keynote, Procams Workshop, New York, June 2006)

Approaches to Optical Coding
(Adobe, San Jose, November 2007)

Computational Imaging Projects