Base rate fallacy
From Wikipedia, the free encyclopedia
The base rate fallacy, also called base rate neglect or base rate bias, is an error in thinking: when presented with both related base rate information (i.e. generic, general information) and specific information (information pertaining only to a certain case), the mind tends to ignore the former and focus on the latter.[1]
Example 1
- John is a man who wears gothic-inspired clothing, has long black hair, and listens to death metal. How likely is it that he is a Christian, and how likely is it that he is a Satanist?
If people were asked this question, they would likely underestimate
the probability of him being a Christian, and overestimate the
probability of him being a Satanist. This is because they would ignore
that the base rate of being a Christian (there are about 2 billion in
the world) is vastly higher than that of being a Satanist (estimated to
be in the thousands).[2]
Therefore, even if such clothing choices indicated an order-of-magnitude jump in the probability of being a Satanist, the probability of being a Christian would still be much larger.
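To make the comparison concrete, here is a minimal Python sketch. The counts come from the text above, but the specific likelihood values (a tenfold ratio) are illustrative assumptions, not measured values:

```python
# Illustrative sketch of Example 1. The counts follow the text above;
# the likelihood values are hypothetical.
christians = 2_000_000_000  # roughly 2 billion Christians worldwide
satanists = 10_000          # Satanists estimated in the thousands

# Assume John's style of dress is ten times more common among Satanists
# than among Christians (the "order of magnitude jump" in the text).
p_look_given_christian = 0.001
p_look_given_satanist = 0.01

# Expected number of people with John's appearance in each group:
christians_with_look = christians * p_look_given_christian  # 2,000,000
satanists_with_look = satanists * p_look_given_satanist     # 100

# Among people who look like John, Christians still outnumber Satanists
# by a factor of 20,000, so P(Christian) remains far larger.
print(christians_with_look / satanists_with_look)  # 20000.0
```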
Example 2
- A group of policemen have breathalyzers that display false drunkenness in 5% of the cases in which the driver is sober. However, the breathalyzers never fail to detect a truly drunk person. One in 1000 drivers is driving drunk. Suppose the policemen then stop a driver at random and force him or her to take a breathalyzer test. It indicates that he or she is drunk. We assume you don't know anything else about him or her. How high is the probability he or she really is drunk?
Many would answer as high as 0.95, but the correct probability is about 0.02.
To find the correct answer, one should use Bayes' theorem. The goal is to find the probability that the driver is drunk given that the breathalyzer indicated he/she is drunk, which can be represented as

p(drunk | D)

where "D" means that the breathalyzer indicates that the driver is drunk. Bayes' theorem tells us that

p(drunk | D) = p(D | drunk) × p(drunk) / p(D)

We were told the following in the first paragraph:

p(drunk) = 0.001
p(sober) = 0.999
p(D | drunk) = 1.00
p(D | sober) = 0.05

As you can see from the formula, one needs p(D) for Bayes' theorem, which one can compute from the preceding values using the law of total probability:

p(D) = p(D | drunk) × p(drunk) + p(D | sober) × p(sober)

which gives

p(D) = (1.00 × 0.001) + (0.05 × 0.999) = 0.05095

Plugging these numbers into Bayes' theorem, one finds that

p(drunk | D) = (1.00 × 0.001) / 0.05095 ≈ 0.019627
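The same computation can be written as a short Python sketch that simply evaluates Bayes' theorem with the numbers given above:

```python
# Bayes' theorem for the breathalyzer problem (Example 2).
p_drunk = 0.001           # base rate: 1 in 1000 drivers is drunk
p_sober = 1 - p_drunk     # 0.999
p_pos_given_drunk = 1.00  # the test never misses a drunk driver
p_pos_given_sober = 0.05  # 5% false positive rate for sober drivers

# Law of total probability: overall chance of a positive test.
p_pos = p_pos_given_drunk * p_drunk + p_pos_given_sober * p_sober

# Posterior probability of being drunk given a positive test.
p_drunk_given_pos = p_pos_given_drunk * p_drunk / p_pos
print(p_pos)              # 0.05095
print(p_drunk_given_pos)  # ~0.019627, i.e. about 2%
```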
A more intuitive explanation: on average, for every 1000 drivers tested,
- 1 driver is drunk, and it is 100% certain that for that driver there is a true positive test result, so there is 1 true positive test result
- 999 drivers are not drunk, and among those drivers there are 5% false positive test results, so there are 49.95 false positive test results
therefore the probability that one of the drivers among the 1 + 49.95 = 50.95 positive test results really is drunk is 1/50.95 ≈ 0.019627.
The validity of this result does, however, hinge on the validity of
the initial assumption that the policemen stopped the driver truly at
random, and not because of bad driving. If that or another non-arbitrary
reason for stopping the driver was present, then the calculation also
involves the probability of a drunk driver driving competently and a
non-drunk driver driving competently.
Example 3
In a city of 1 million inhabitants let there be 100 terrorists and
999,900 non-terrorists. To simplify the example, it is assumed that all
people present in the city are inhabitants. Thus, the base rate
probability of a randomly selected inhabitant of the city being a
terrorist is 0.0001, and the base rate probability of that same
inhabitant being a non-terrorist is 0.9999. In an attempt to catch the
terrorists, the city installs an alarm system with a surveillance camera
and automatic facial recognition software.
The software has two failure rates of 1%:
- The false negative rate: If the camera scans a terrorist, a bell will ring 99% of the time, and it will fail to ring 1% of the time.
- The false positive rate: If the camera scans a non-terrorist, a bell will not ring 99% of the time, but it will ring 1% of the time.
Suppose now that an inhabitant triggers the alarm. What is the chance
that the person is a terrorist? In other words, what is P(T | B), the
probability that a terrorist has been detected given the ringing of the
bell? Someone making the 'base rate fallacy' would infer that there is a
99% chance that the detected person is a terrorist. Although the
inference seems to make sense, it is actually bad reasoning, and a
calculation below will show that the chances they are a terrorist are
actually near 1%, not near 99%.
The fallacy arises from confusing the natures of two different
failure rates. The 'number of non-bells per 100 terrorists' and the
'number of non-terrorists per 100 bells' are unrelated quantities. One
does not necessarily equal the other, and they don't even have to be
almost equal. To show this, consider what happens if an identical alarm
system were set up in a second city with no terrorists at all. As in the
first city, the alarm sounds for 1 out of every 100 non-terrorist
inhabitants detected, but unlike in the first city, the alarm never
sounds for a terrorist. Therefore 100% of all occasions of the alarm
sounding are for non-terrorists, but a false negative rate cannot even
be calculated. The 'number of non-terrorists per 100 bells' in that city
is 100, yet P(T | B) = 0%. There is zero chance that a terrorist has
been detected given the ringing of the bell.
Imagine that the city's entire population of one million people pass
in front of the camera. About 99 of the 100 terrorists will trigger the
alarm—and so will about 9,999 of the 999,900 non-terrorists. Therefore,
about 10,098 people will trigger the alarm, among which about 99 will be
terrorists. So, the probability that a person triggering the alarm
actually is a terrorist, is only about 99 in 10,098, which is less than
1%, and very, very far below our initial guess of 99%.
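This counting argument translates directly into a few lines of Python; a minimal sketch using the numbers from the example:

```python
# Natural-frequency count for the alarm example (Example 3).
population = 1_000_000
terrorists = 100
non_terrorists = population - terrorists  # 999,900

true_positive_rate = 0.99   # the bell rings for 99 of 100 terrorists
false_positive_rate = 0.01  # the bell rings for 1 in 100 non-terrorists

true_alarms = terrorists * true_positive_rate         # about 99
false_alarms = non_terrorists * false_positive_rate   # about 9,999

# P(T | B): true alarms as a share of all alarms.
p_terrorist_given_bell = true_alarms / (true_alarms + false_alarms)
print(p_terrorist_given_bell)  # ~0.0098, just under 1%
```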
The base rate fallacy is so misleading in this example because there are many more non-terrorists than terrorists.
Findings in psychology
In experiments, people have been found to prefer individuating
information over general information when the former is available.[3][4][5]
In some experiments, students were asked to estimate the grade point averages
(GPAs) of hypothetical students. When given relevant statistics about
GPA distribution, students tended to ignore them if given descriptive
information about the particular student, even if the new descriptive
information was obviously of little or no relevance to school
performance.[4] This finding has been used to argue that interviews are an unnecessary part of the college admissions process because interviewers are unable to pick successful candidates better than basic statistics.
Psychologists Daniel Kahneman and Amos Tversky attempted to explain this finding in terms of a simple rule or "heuristic" called representativeness.
They argued that many judgements relating to likelihood, or to cause
and effect, are based on how representative one thing is of another, or
of a category.[4] Kahneman considers base rate neglect to be a specific form of extension neglect.[6] Richard Nisbett has argued that some attributional biases like the fundamental attribution error
are instances of the base rate fallacy: people underutilize "consensus
information" (the "base rate") about how others behaved in similar
situations and instead prefer simpler dispositional attributions.[7]
There is considerable debate in psychology on the conditions under which people do or do not appreciate base rate information.[8][9]
Researchers in the heuristics-and-biases program have stressed
empirical findings showing that people tend to ignore base rates and
make inferences that violate certain norms of probabilistic reasoning,
such as Bayes' theorem. The conclusion drawn from this line of research was that human probabilistic thinking is fundamentally flawed and error-prone.[10]
Other researchers have emphasized the link between cognitive processes
and information formats, arguing that such conclusions are not generally
warranted.[11][12]
Consider again Example 2 from above. The required inference is to
estimate the (posterior) probability that a (randomly picked) driver is
drunk, given that the breathalyzer test is positive. Formally, this
probability can be calculated using Bayes' theorem,
as shown above. However, there are different ways of presenting the
relevant information. Consider the following, formally equivalent
variant of the problem:
- 1 out of 1000 drivers is driving drunk. The breathalyzers never fail to detect a truly drunk person. For 50 out of the 999 drivers who are not drunk, the breathalyzer falsely displays drunkenness. Suppose the policemen then stop a driver at random and force him or her to take a breathalyzer test. It indicates that he or she is drunk. We assume you don't know anything else about him or her. How high is the probability he or she really is drunk?
In this case, the relevant numerical information—p(drunk), p(D | drunk), p(D | sober)—is presented in terms of natural frequencies with respect to a certain reference class (see reference class problem).
Empirical studies show that people's inferences correspond more closely
to Bayes' rule when information is presented this way, helping to
overcome base-rate neglect in laypeople[12] and experts.[13] As a consequence, organizations like the Cochrane Collaboration recommend using this kind of format for communicating health statistics.[14]
Teaching people to translate these kinds of Bayesian reasoning problems
into natural frequency formats is more effective than merely teaching
them to plug probabilities (or percentages) into Bayes' theorem.[15]
It has also been shown that graphical representations of natural
frequencies (e.g., icon arrays) help people to make better inferences.[15][16][17]
Why are natural frequency formats helpful? One important reason is
that this information format facilitates the required inference because
it simplifies the necessary calculations. This can be seen when using an
alternative way of computing the required probability p(drunk | D):

p(drunk | D) = N(drunk ∩ D) / N(D)
where N(drunk ∩ D) denotes the number of drivers that are drunk and get a positive breathalyzer result, and N(D)
denotes the total number of cases with a positive breathalyzer result.
The equivalence of this equation to the above one follows from the
axioms of probability theory, according to which N(drunk ∩ D) = N × p(D | drunk) × p(drunk), where N denotes the total number of drivers tested. Importantly, although this equation is formally equivalent to
Bayes’ rule, it is not psychologically equivalent. Using natural
frequencies simplifies the inference because the required mathematical
operation can be performed on natural numbers, instead of normalized
fractions (i.e., probabilities), because it makes the high number of
false positives more transparent, and because natural frequencies
exhibit a "nested-set structure".[18][19]
It is important to note that not every kind of frequency format facilitates Bayesian reasoning.[19][20] Natural frequencies refer to frequency information that results from natural sampling,[21]
which preserves base rate information (e.g., number of drunken drivers
when taking a random sample of drivers). This is different from systematic sampling,
in which base rates are fixed a priori (e.g., in scientific
experiments). In the latter case it is not possible to infer the
posterior probability p(drunk | positive test) by comparing the number of drivers who are drunk and test positive with the
total number of people who get a positive breathalyzer result, because
base rate information is not preserved and must be explicitly
re-introduced using Bayes' theorem.
See also
- Bayesian probability
- Data dredging
- False positive paradox
- Inductive argument
- List of cognitive biases
- Misleading vividness
- Prosecutor's fallacy
References
- "Logical Fallacy: The Base Rate Fallacy". Fallacyfiles.org. Retrieved 2013-06-15.
- B.A. Robinson (March 2006). "Religious Satanism, 16th century Satanism, Satanic Dabbling, etc". Ontario Consultants on Religious Tolerance. Retrieved March 24, 2013.
- Bar-Hillel, Maya (1980). "The base-rate fallacy in probability judgments". Acta Psychologica 44: 211–233. doi:10.1016/0001-6918(80)90046-3.
- Kahneman, Daniel; Amos Tversky (1973). "On the psychology of prediction". Psychological Review 80: 237–251. doi:10.1037/h0034747.
- Kahneman, Daniel; Amos Tversky (1985). "Evidential impact of base rates". In Daniel Kahneman, Paul Slovic & Amos Tversky (Eds.). Judgment under uncertainty: Heuristics and biases. pp. 153–160.
- Kahneman, Daniel (2000). "Evaluation by moments, past and future". In Daniel Kahneman and Amos Tversky (Eds.). Choices, Values and Frames.
- Nisbett, Richard E.; E. Borgida, R. Crandall & H. Reed (1976). "Popular induction: Information is not always informative". In J. S. Carroll & J. W. Payne (Eds.). Cognition and social behavior 2. pp. 227–236.
- Koehler, J. J. (2010). "The base rate fallacy reconsidered: Descriptive, normative, and methodological challenges". Behavioral and Brain Sciences 19: 1. doi:10.1017/S0140525X00041157.
- Barbey, A. K.; Sloman, S. A. (2007). "Base-rate respect: From ecological rationality to dual processes". Behavioral and Brain Sciences 30 (3). doi:10.1017/S0140525X07001653.
- Tversky, A.; Kahneman, D. (1974). "Judgment under Uncertainty: Heuristics and Biases". Science 185 (4157): 1124–1131. doi:10.1126/science.185.4157.1124. PMID 17835457.
- Cosmides, Leda; John Tooby (1996). "Are humans good intuitive statisticians after all? Rethinking some conclusions of the literature on judgment under uncertainty". Cognition 58: 1–73. doi:10.1016/0010-0277(95)00664-8.
- Gigerenzer, G.; Hoffrage, U. (1995). "How to improve Bayesian reasoning without instruction: Frequency formats". Psychological Review 102 (4): 684. doi:10.1037/0033-295X.102.4.684.
- Hoffrage, U.; Lindsey, S.; Hertwig, R.; Gigerenzer, G. (2000). "MEDICINE: Communicating Statistical Information". Science 290 (5500): 2261–2262. doi:10.1126/science.290.5500.2261. PMID 11188724.
- Akl, E. A.; Oxman, A. D.; Herrin, J.; Vist, G. E.; Terrenato, I.; Sperati, F.; Costiniuk, C.; Blank, D.; Schünemann, H. (2011). "Using alternative statistical formats for presenting risks and risk reductions". In Schünemann, Holger. "Cochrane Database of Systematic Reviews". doi:10.1002/14651858.CD006776.pub2.
- Sedlmeier, P.; Gigerenzer, G. (2001). "Teaching Bayesian reasoning in less than two hours". Journal of Experimental Psychology: General 130 (3): 380. doi:10.1037/0096-3445.130.3.380.
- Brase, G. L. (2009). "Pictorial representations in statistical reasoning". Applied Cognitive Psychology 23 (3): 369–381. doi:10.1002/acp.1460.
- Edwards, A.; Elwyn, G.; Mulley, A. (2002). "Explaining risks: Turning numerical data into meaningful pictures". BMJ 324 (7341): 827–830. doi:10.1136/bmj.324.7341.827. PMC 1122766. PMID 11934777.
- Girotto, V.; Gonzalez, M. (2001). "Solving probabilistic and statistical problems: A matter of information structure and question form". Cognition 78 (3): 247–276. doi:10.1016/S0010-0277(00)00133-5. PMID 11124351.
- Hoffrage, U.; Gigerenzer, G.; Krauss, S.; Martignon, L. (2002). "Representation facilitates reasoning: What natural frequencies are and what they are not". Cognition 84 (3): 343–352. doi:10.1016/S0010-0277(02)00050-1. PMID 12044739.
- Gigerenzer, G.; Hoffrage, U. (1999). "Overcoming difficulties in Bayesian reasoning: A reply to Lewis and Keren (1999) and Mellers and McGraw (1999)". Psychological Review 106 (2): 425. doi:10.1037/0033-295X.106.2.425.
- Kleiter, G. D. (1994). "Natural Sampling: Rationality without Base Rates". "Contributions to Mathematical Psychology, Psychometrics, and Methodology". Recent Research in Psychology. pp. 375–388. doi:10.1007/978-1-4612-4308-3_27. ISBN 978-0-387-94169-1.