The human mind is a complex and powerful tool that allows us to navigate through the world and make decisions every day. One of the ways our mind helps us make decisions is through the use of mental shortcuts, or heuristics. One such heuristic is the Representativeness Heuristic, which involves making judgments and decisions based on how closely an object or person resembles a prototype or stereotype. In this introduction, we will explore the definition and function of the Representativeness Heuristic, and how it impacts our decision-making process.

The representativeness heuristic is a psychological term wherein people judge the probability or frequency of a hypothesis by considering how much the hypothesis resembles available data as opposed to using a Bayesian calculation. While often very useful in everyday life, it can also result in neglect of relevant base rates and other cognitive biases. The representative heuristic was first proposed by Amos Tversky and Daniel Kahneman. In causal reasoning, the representativeness heuristic leads to a bias toward the belief that causes and effects will resemble one another (examples include both the belief that “emotionally relevant events ought to have emotionally relevant causes”, and magical associative thinking).

Examples

Tom W.

In a study done in 1973, Kahneman and Tversky gave their subjects the following information:

“Tom W. is of high intelligence, although lacking in true creativity. He has a need for order and clarity, and for neat and tidy systems in which every detail finds its appropriate place. His writing is rather dull and mechanical, occasionally enlivened by somewhat corny puns and by flashes of imagination of the sci-fi type. He has a strong drive for competence. He seems to feel little sympathy for other people and does not enjoy interacting with others. Self-centered, he nonetheless has a deep moral sense.”

The subjects were then divided into three groups who were given different decision tasks:

One group of subjects was asked how similar Tom W. was to a student in one of nine types of college graduate majors (business administration, computer science, engineering, humanities/education, law, library science, medicine, physical/life sciences, or social science/social work). Most subjects associated Tom W. with an engineering student, and thought he was least like a student of social science/social work.

A second group of subjects was asked instead to estimate the probability that Tom W. was a grad student in each of the nine majors. The probabilities were in line with the judgments from the previous group.

A third group of subjects was asked to estimate the proportion of first-year grad students there were in each of the nine majors.

The second group’s probabilities were approximated by how much they thought Tom W. was representative of each of the majors, and less on the base rate probability of being that kind of student in the first place (the third group). Had the subjects approximated their answers by the base rates, their estimated probability that Tom W. was an engineer would have been much lower, as there were few engineering grad students at the time.

The Taxicab problem

In another study done by Tversky and Kahneman, subjects were given the following problem:

“A cab was involved in a hit and run accident at night. Two cab companies, the Green and the Blue, operate in the city. 85% of the cabs in the city are Green and 15% are Blue.

A witness identified the cab as Blue. The court tested the reliability of the witness under the same circumstances that existed on the night of the accident and concluded that the witness correctly identified each one of the two colors 80% of the time and failed 20% of the time.

What is the probability that the cab involved in the accident was Blue rather than Green knowing that this witness identified it as Blue?”

Most subjects gave probabilities over 50%, and some gave answers over 80%. The correct answer, found using Bayes’ theorem, is lower than these estimates:

There is a 12% chance (15% times 80%) of the witness correctly identifying a blue cab.
There is a 17% chance (85% times 20%) of the witness incorrectly identifying a green cab as blue.
There is therefore a 29% chance (12% plus 17%) the witness will identify the cab as blue.
This results in a 41% chance (12% divided by 29%) that the cab identified as blue is actually blue.
Representativeness is cited in the similar effect of the gambler’s fallacy, the regression fallacy and the conjunction fallacy.

Representativeness, Extensionality, and Bayes’ Theorem

The Representativeness Heuristic violates one of the fundamental properties of probability: extensionality. For example, participants were provided with a description of Linda who resembles a feminist. Then participants were asked to evaluate the probability of her being a feminist, the probability of her being a bank teller, or the probability of being both a bank teller and feminist. Probability theory dictates that the probability of being both a bank teller and feminist (the conjunction of two sets) must be less than or equal to the probability of being either a feminist or a bank teller. However, participants judged the conjunction (bank teller and feminist) as being more probable than being a bank teller alone.

The use of the Representativeness Heuristic will likely lead to violations of Bayes’ Theorem. Bayes’ Theorem states:

P(H|D) = frac{P(D | H), P(H)}{P(D)}.

However, judgments by Representativeness only look at the resemblance between the hypothesis and the data, thus inverse probabilities are equated:

P(H | D) = P(D | H)

As can be seen, the base rate P(H) is ignored in this equation, leading to the base rate fallacy. This was explicitly tested by Dawes, Mirels, Gold and Donahue (1993) who had people judge both the base rate of people who had a particular personality trait and the probability that a person who had a given personality trait had another one. For example, participants were asked how many people out of 100 answered true to the question “I am a conscientious person” and also, given that a person answered true to this question, how many would answer true to a different personality question. They found that participants equated inverse probabilities (e.g., P(conscientious | neurotic) = P(neurotic | conscientious)) even when it was obvious that they were not the same (the two questions were answered immediately after each other).

Disjunction Fallacy

In addition to extensionality violation, base-rate neglect, and the conjunction fallacy, the use of Representativeness Heuristic may lead to a Disjunction Fallacy. From probability theory the disjunction of two events is at least as likely as either of the events individually. For example, the probability of being either a physics or biology major is at least as likely as being a physics major, if not more likely. However, when a personality description (data) seems to be very representative of a physics major (e.g., pocket protector) over a biology major, people judge that it is more likely for this person to be a physics major than a natural sciences major (which is a superset of physics).

Further evidence that the Representativeness Heuristic may be causal to the Disjunction Fallacy comes from Bar-Hillel and Neter (1986). They found that people judge a person who is highly representative of being a statistics major (e.g., highly intelligent, does math competitions) as being more likely to be a statistics major than a social sciences major (superset of statistics), but they do not think that he is more likely to be a Hebrew language major than a humanities major (superset of Hebrew language). Thus, only when the person seems highly representative of a category is that category judged as more probable than its superordinate category. These incorrect appraisals remained even in the face of losing real money in bets on probabilities.

Alternative Explanations

Jon Krosnick, a professor in Communication at Stanford, in his work[which?] has proposed that the effects that Kahneman and Tversky saw in their work may be partially attributed to information order effects. When the order of information was reversed – with probability figures coming later, a lot of the effects were mitigated.