In Douglas Adams's (1988) novel Dirk Gently's Holistic Detective Agency, a sofa gets stuck on a stairway landing. Throughout the remainder of the novel, Dirk Gently ponders how to get it unstuck by imagining the sofa rotating into various positions (he eventually solves the problem using a time machine). The well-known psychologist Roger Shepard once had a somewhat similar experience, awakening one morning to "a spontaneous kinetic image of three-dimensional structures majestically turning in space" (Shepard and Cooper 1982, 7). That experience inspired Shepard and his student Jacqueline Metzler to run what has become a seminal experiment in cognitive science -- one that both defines and operationalizes mental rotation.
Shepard and Metzler (1971) presented subjects with images of novel three-dimensional (3-D) objects at various orientations -- on each trial a pair of images appeared side-by-side and subjects decided whether the two images depicted the same (figures 1a and 1b) or different objects (figure 1c) regardless of any difference in orientation. A given 3-D object had two "handedness" versions: its "standard" version and a mirror-reflected version (equivalent to the relationship between left- and right-handed gloves). Different objects were always mirror reflections of one another, so objects could never be discriminated using distinctive local features. Shepard and Metzler measured the time it took subjects to make same/different discriminations as a function of the angular difference between them. What they found was a remarkably consistent pattern across both picture plane and depth rotations -- mean response times increased linearly with increasing angular separation. This outcome provides evidence that subjects mentally rotate one or both objects until they are (mentally) aligned with one another. Shepard and Metzler suggest that the mental rotation process is an internal analogue of physical rotation, that is, a continuous shortest path 3-D rotation that would bring the objects into alignment.
A variation on Shepard and Metzler's experiment demonstrated that mental rotation is also used when subjects judge the handedness of familiar objects. Another student of Shepard's, Lynn Cooper, presented subjects with single English letters or digits (Cooper and Shepard 1973). On each trial a standard (e.g., "R") or a mirror-reflected version (e.g., "") of a letter or digit was shown at some misorientation in the picture plane. Because subjects had to judge whether the misoriented character was of standard or mirror handedness they could not use local distinguishing features to make the discrimination. Moreover, because handedness is only defined relative to the viewer, subjects presumably needed to align each test character with their egocentric reference frame in which left and right are defined. The results, quite similar to those obtained with 3-D objects, confirmed this assumption -- mean response times for judging handedness increased monotonically as the test characters were misoriented farther and farther from their canonical upright orientations. Response times turned out to be symmetric around 180 degrees -- the point at which the characters were exactly upside down. Thus subjects were apparently mentally rotating the characters in the shortest direction to the upright regardless of whether this rotation was clockwise or counterclockwise.
Why are these findings so important? Much of the theorizing about COGNITIVE ARCHITECTURE during the 1960s assumed a symbolic substrate (e.g., PRODUCTION SYSTEMS) in which all mental representations were thought to have a common amodal format. Shepard"s results demonstrated that at least some cognitive processes were modal, and, in particular, tied to visual perception. The hypothesis that mental rotation is a continuous process akin to a real-world rotation also has implications for the nature of IMAGERY, namely, that humans have the capacity to make judgments using inherently spatial representations and that such representations are sophisticated enough to support PROBLEM SOLVING, spatial reasoning, and SHAPE PERCEPTION. Not surprisingly, this claim evoked a great deal of skepticism.
In response, Shepard and Cooper went on to meticulously demonstrate that mental rotation indeed involved a continuous mental transformation. Three critical results provide converging evidence for this conclusion. First, Cooper and Shepard (1973) ran a variant of their familiar characters experiment in which they preceded each letter or digit with a cue to the test character"s orientation, identity, or both. They found that neither orientation nor identity alone was sufficient to diminish the effect of stimulus orientation on response times. In contrast, providing both, along with sufficient time for the subject to prepare, removed almost all effects of orientation. Thus it appears that mental rotation operates on a representation that depicts a particular shape at a particular position in space -- properties associated with an image.
Second, Cooper and Shepard (1973; see also Cooper 1976) ran an experiment in which they controlled an individual subject"s putative rate of rotation across a series of trials. Given this information, for a misoriented test letter or digit, they could predict the instantaneous orientation of the rotating mental image of that character. On each trial, a test character was presented, followed at some point by a probe character. The task was to begin rotating the test character, but then to judge the handedness of the probe. Under these conditions, response times were essentially unrelated to the absolute orientation of the probe, but increased monotonically with increasing angular distance from the presumed orientation of the rotating mental image of the test character. For example, when the predicted orientation of the test character image and the visible probe corresponded, response times were independent of the actual orientation of the probe. Thus the changing image actually passes through all of the intermediate orientations -- a property expected for a truly analog transformation mechanism.
Third, Cooper and Shepard ran an experiment in which subjects were given extensive practice judging the handedness of characters mentally rotated in only one direction, for example, 0 degrees to 180 degrees counterclockwise. When these subjects were tested with characters misoriented slightly past 180 degrees, say 190 degrees, the distribution of response times had two peaks -- one corresponding to mentally rotating the short way around (clockwise) and one corresponding to mentally rotating the long way around (counterclockwise and consistent with the practice subjects had received). Thus it was the actual angular distance traversed by a rotation that determined the time consumed -- again consistent with an analog mechanism.
In summary, there is compelling evidence for the use of a continuous mental rotation process that brings images of two objects into correspondence or the image of a single object into alignment with an internal representation. The existence of such a mechanism suggests that models of cognitive architecture should include modality-specific mechanisms that can support mental imagery. Mental rotation may also play a role in HIGH-LEVEL VISION, for example, in shape perception (Rock and Di Vita 1987), FACE RECOGNITION (Hill, Schyns, and Akamatsu 1997; Troje and Bülthoff 1996), and OBJECT RECOGNITION (Jolicoeur 1985; Tarr 1995; Tarr and Pinker 1989).
Adams, D. (1988). Dirk Gently's Holistic Detective Agency. New York: Pocket Books.
Cooper, L. A. (1976). Demonstration of a mental analog of an external rotation. Perception and Psychophysics 19:296-302.
Cooper, L. A., and R. N. Shepard. (1973). Chronometric studies of the rotation of mental images. In W. G. Chase, Ed., Visual Information Processing. New York: Academic Press, pp. 75-176.
Hill, H., P. G. Schyns, and S. Akamatsu. (1997). Information and viewpoint dependence in face recognition. Cognition 62:201-222.
Jolicoeur, P. (1985). The time to name disoriented natural objects. Memory and Cognition 13:289-303.
Rock, I., and J. Di Vita. (1987). A case of viewer-centered object perception. Cognitive Psychology 19:280-293.
Shepard, R. N., and L. A. Cooper. (1982). Mental Images and Their Transformations. Cambridge, MA: MIT Press.
Shepard, R. N., and J. Metzler. (1971). Mental rotation of three-dimensional objects. Science 171:701-703.
Tarr, M. J. (1995). Rotating objects to recognize them: A case study of the role of viewpoint dependency in the recognition of three-dimensional objects. Psychonomic Bulletin and Review 2:55-82.
Tarr, M. J., and S. Pinker. (1989). Mental rotation and orientation-dependence in shape recognition. Cognitive Psychology 21, 233-282.
Troje, N., and H. H. Bülthoff. (1996). Face recognition under varying pose: The role of texture and shape. Vision Research 36:1761-1771.
Carpenter, P. A., and M. A. Just. (1978). Eye fixations during mental rotation. In J. W. Senders, D. F. Fisher, and R. A. Monty, Eds., Eye Movements and the Higher Psychological Functions. Hillsdale, NJ: Erlbaum, pp. 115-133.
Cohen, M. S., S. M. Kosslyn, H. C. Breiter, G. J. Digirolamo, W. L. Thompson, A. K. Anderson, S. Y. Bookheimer, B. R. Rosen, and J. W. Belliveau. (1996). Changes in cortical activity during mental rotation: A mapping study using functional MRI. Brain 119:89-100.
Cohen, D., and M. Kubovy. (1993). Mental rotation, mental representation and flat slopes. Cognitive Psychology 25:351-382.
Cooper, L. A., and R. N. Shepard. (1984). Turning something over in the mind. Scientific American 251(6):106-114.
Georgopoulos, A. P., J. T. Lurito, M. Petrides, A. B. Schwartz, and J. T. Massey. (1989). Mental rotation of the neuronal population vector. Science 243:234-236.
Jolicoeur, P., S. Regehr, L. B. J. P. Smith, and G. N. Smith. (1985). Mental rotation of representations of two-dimensional and three-dimensional objects. Canadian Journal of Psychology 39:100-129.
Pinker, S. (1984). Visual cognition: An introduction. Cognition 18:1-63.
Rock, I. (1973). Orientation and Form. New York: Academic Press.
Rock, I., D. Wheeler, and L. Tudor. (1989). Can we imagine how objects look from other viewpoints? Cognitive Psychology 21:185-210.
Tarr, M. J., and S. Pinker. (1990). When does human object recognition use a viewer-centered reference frame? Psychological Science 1:253-256.