Prediction versus Accommodation

In early philosophical literature, a ‘prediction’ was considered to be an empirical consequence of a theory that had not yet been verified at the time the theory was constructed—an ‘accommodation’ was one that had. The view that predictions are superior to accommodations in the assessment of scientific theories is known as ‘predictivism’. Commonly, however, predictivism is understood more precisely as entailing that evidence confirms theory more strongly when predicted than when accommodated. Much ink has been spilled modifying the concept of ‘prediction’ and explaining why predictivism is or is not true, and whether the history of science and, more recently, logic (Martin and Hjortland 2021) reveals that scientists are predictivist in their assessment of theories. The debate over predictivism also figures importantly in the debate about scientific realism.

1. Historical Introduction


There was in the eighteenth and nineteenth centuries a passionate debate about scientific method—at stake was the ‘method of hypothesis’ which postulated hypotheses about unobservable entities which ‘saved the phenomena’ and thus were arguably true (see Laudan 1981a). Critics of this method pointed out that hypotheses could always be adjusted artificially to accommodate any amount of data. But it was noted that some such theories had the further virtue of generating specific predictions of heretofore unobserved phenomena—thus scientists like John Herschel and William Whewell argued that hypotheses that saved phenomena could be justified when they were confirmed by such ‘novel’ phenomena. Whewell maintained that predictions carry special weight because a theory that correctly predicts a surprising result cannot have done so by chance, and thus must be true (Whewell 1849 [1968: 294]). It thus appeared that predicted evidence confirmed theory more strongly than accommodated evidence. But John Stuart Mill (in his debate with Whewell) categorically denied this claim, affirming that

(s)uch predictions and their fulfilment are, indeed, well calculated to impress the ignorant vulgar, whose faith in science rests solely upon similar coincidences between its prophecies and what comes to pass. But it is strange that any considerable stress should be laid upon such a coincidence by scientific thinkers. (1843, Vol. 2, 23)

John Maynard Keynes provides a simple account of why predictivism has a misleading appearance of truth in a brief passage in his book A Treatise on Probability:

The peculiar virtue of prediction or predesignation is altogether imaginary… The plausibility of the argument [for predictivism] is derived from a different source. If a hypothesis is proposed a priori, this commonly means that there is some ground for it, arising out of our previous knowledge, apart from the purely inductive ground, and if such is the case the hypothesis is clearly stronger than one which reposes on inductive grounds only. But if it is merely a guess, the lucky fact of its preceding some or all of the cases which verify it adds nothing whatever to its value. It is the union of prior knowledge, with the inductive grounds which arise out of the immediate instances, that lends weight to any hypothesis, and not the occasion on which the hypothesis is first proposed. (1921: 305–306) [1]

By ‘the inductive ground’ for a hypothesis Keynes clearly means the data that the hypothesis fits. His point is that when a theorist proposes a hypothesis and then undertakes to test it, some other (presumably theoretical) form of support has typically prompted the proposal. Thus hypotheses which are proposed without being built to fit the empirical data (which they are subsequently shown to entail) are typically better supported than hypotheses which are proposed merely to fit the data, for the latter lack the independent support possessed by the former. Predictivism appears plausible only because the role of the preliminary hypothesis-inducing evidence is being suppressed.

Karl Popper is probably the most famous proponent of prediction in the history of philosophy. In his lecture “Science: Conjectures and Refutations” Popper recounts his boyhood attempt to grapple with the question “When should a theory be ranked as scientific?” (Popper 1963: 33–65). Popper had become convinced that certain popular theories of his day, including Marx’s theory of history and Freudian psychoanalysis, were pseudosciences. Popper deemed the problem of distinguishing scientific from pseudoscientific theories ‘the demarcation problem’. His solution to the demarcation problem, as is well known, was to identify the quality of falsifiability (or ‘testability’) as the mark of the scientific theory.

The pseudosciences were marked, Popper claimed, by their vast explanatory power. They could explain not only all the relevant actual phenomena the world presented, they could explain any conceivable phenomena that might fall within their domain. This was because the explanations offered by the pseudosciences were sufficiently malleable that they could always be adjusted ex post facto to explain anything. Thus the pseudosciences never ran the risk of being inconsistent with the data. By contrast, a genuinely scientific theory made specific predictions about what should be observed and thus ran the risk of falsification. Popper emphasized that what established the scientific character of relativity theory was that it ‘stuck its neck out’ in a way that pseudosciences never did.

Like Whewell and Herschel, Popper appeals to the predictions a theory makes as a way of separating the illegitimate uses of the method of hypothesis from its legitimate uses. But while Whewell and Herschel pointed to predictive success as a necessary condition for the acceptability of a theory that had been generated by the method of hypothesis, Popper focuses in his solution to the demarcation problem not on the success of a prediction but on the fact that the theory made the prediction at all. Of course, there was for Popper an important difference between scientific theories whose predictions were confirmed and those whose predictions were falsified. Falsified theories were to be rejected, whereas theories that survived testing were to be ‘tentatively accepted’ until falsified. Popper did not hold, with Whewell and Herschel, that successful predictions could constitute legitimate proof of a theory. In fact Popper held that it was impossible to show that a theory was even probable based on the evidence, for he embraced Hume’s critique of inductive logic that made evidential support for the truth of theories impossible. Thus, one should ascribe to Popper a commitment to predictivism only in the broad sense that he held predictions to be superior to accommodations; he did not hold that predictions confirmed theory more strongly than accommodations. It would ultimately prove impossible for Popper to reconcile his claim that a theory which enjoyed predictive success ought to be ‘tentatively accepted’ with his anti-inductivism (see, e.g., Salmon 1981).

Imre Lakatos (1970, 1971) proposed an account of scientific method in the form of his ‘methodology of scientific research programmes’, which was a development of Popper’s approach. A scientific research programme was constituted by a ‘hard core’ of propositions which were retained throughout the life of that programme together with a ‘protective belt’ which was constituted by auxiliary hypotheses that were adjusted so as to reconcile the hard core with the empirical data. The attempt on the part of the proponents of the research programme to reconcile the programme to empirical data produced a series of theories \(T_1\), \(T_2\), …, \(T_n\) where, at least in some cases, \(T_{i+1}\) serves to explain some data that is anomalous for \(T_i\). Lakatos held that a research programme was ‘theoretically progressive’ insofar as each new theory predicts some novel, hitherto unexpected fact. A research programme is ‘empirically progressive’ to the extent that its novel empirical content is corroborated, that is, if each new theory leads to the discovery of “some new fact” (Lakatos 1970: 118). Lakatos thus offered a new solution to the demarcation problem: a research programme was pseudoscientific to the extent that it was not theoretically progressive. Theory evaluation is construed in terms of competing research programmes: a research programme defeats a rival programme by proving more empirically progressive over the long run.

According to Merriam-Webster’s Collegiate Dictionary, [ 2 ] something is ‘ad hoc’ if it is ‘formed or used for specific or immediate problems or needs’. An ad hoc hypothesis then is one formed to address a specific problem—such as the problem of immunizing a particular theory from falsification by anomalous data (and thereby accommodating that data). Consequently what makes a hypothesis ad hoc, in the ordinary English sense of the term, has nothing to do with the content of the hypothesis but simply with the motivation of the scientist who proposes it—and it is unclear why there would be anything suspicious about such a motivation. Nonetheless, ad hoc hypotheses have long been suspect in discussions of scientific method, a suspicion that resonates with the predictivist’s skepticism about accommodation.

For Popper, a conjecture is ad hoc “if it is introduced…to explain a particular difficulty, but…cannot be tested independently” (Popper 1974: 986). Thus Popper’s conception of ad hocness added to the ordinary English meaning a further requirement—in the case of an ad hoc hypothesis that was simply introduced to explain a single phenomenon, the ad hoc hypothesis has no testable consequences other than that phenomenon. In the case of an ad hoc theory modification introduced to resolve an anomaly for a theory, the modified theory had no testable consequences other than those of the original theory.

Popper offered two explications of why ad hoc hypotheses were suspect. The first was that if we offer T as an explanation of f, but then cite f as the only reason we have to believe T, we have engaged in reasoning that is suspiciously circular (Popper 1972: 192–3). This was arguably fallacious on Popper’s part: a circular proof would offer one proposition, p, in support of a second proposition, q, when q has already been offered in support of p. But in the above example, while f is offered as evidence for T, T is offered as an explanation of (not as evidence for) f, and thus there is no circular reasoning (Bamford 1993: 338).

Popper’s other explanation of why ad hoc hypotheses were regarded with suspicion was that they ran counter to the aim of science, which for Popper included the proposal of theories with increasing empirical content, viz., increasing falsifiability. Ad hoc hypotheses, for Popper, suffer from a lack of independent testability and thus reduce (or at least fail to increase) the testability of the theories they modify (cf. above). However, Popper’s claim that the process of modifying a theory ad hoc tends to lead to insufficient falsifiability and is ‘unscientific practice’ has been challenged (e.g., Bamford 1993: 350).

Subsequent authors argued that a hypothesis proposed for the sake of immunizing a theory from falsification could be ‘suspicious’ for various reasons, and thus could be ‘ad hoc’ in various ways. Zahar (1973) argued that a hypothesis was ad hoc\(_1\) if it had no novel consequences as compared with its predecessor (i.e., was not independently testable), ad hoc\(_2\) if none of its novel predictions have actually been verified (either because it has not yet been tested or has been falsified), and ad hoc\(_3\)

if it is obtained from its predecessor through a modification of the auxiliary hypotheses which does not accord with the spirit of the heuristic of the programme. (1973: 101)

Beyond Popper’s criterion of a lack of independent testability then, a hypothesis introduced to accommodate some datum could be ad hoc because it was simply unconfirmed (ad hoc\(_2\)) or because it failed to cohere with the basic commitments of the research programme in which it is proposed (ad hoc\(_3\)).

Another approach proposes that a hypothesis H introduced into a theory T in response to an experimental result E is ad hoc if it is generally unsupported and appears to be a superficial attempt to paper over deep problems with a theory that is actually in need of substantive revision. Thus to level the charge of ad hocness against a hypothesis was actually to direct serious skepticism toward the theory the hypothesis was meant to rescue. This concept of ad hocness arguably makes sense of Einstein’s critique of the Lorentz-Fitzgerald contraction hypothesis as ‘ad hoc’ as a supplementary hypothesis to the aether theory, and Pauli’s postulation of the neutrino as an ad hoc rescue of classical quantum mechanics (Leplin 1975, 1982; for further discussion see Grünbaum 1976).

It seems clearly true that the scientific community’s judgment about whether a hypothesis is ad hoc can change. Given this revisability, and the aesthetic dimension of theory evaluation (which leaves assessment to some degree ‘in the eye of the beholder’) there may be no particular point to embracing a theory of ad hocness, if by the term ‘ad hoc’ we mean ‘illegitimately proposed’ (Hunt 2012).

Popper wrote that

Confirmations should count only if they are the result of risky predictions; that is to say, if, unenlightened by the theory in question, we should have expected an event which was incompatible with the theory—an event which would have refuted the theory. (1963: 36)

Popper (and subsequently Lakatos) thereby endorsed a temporal condition of novelty: a prediction counts as novel if it is not known to be true (or is expected to prove false) at the time the theory is constructed. But it was fairly obvious that this made important questions of confirmation turn implausibly on the time at which certain facts were known.

Thus Zahar proposed that a fact is novel “if it did not belong to the problem-situation which governed the construction of the hypothesis” (1973: 103). This form of novelty has been deemed ‘problem-novelty’ (Gardner 1982: 2). But in the same paper Zahar purports to exemplify this concept of novelty by referring to the case in which Einstein did not use the known behavior of Mercury’s perihelion in constructing his theory of relativity. [ 3 ] Gardner notes that this latter conception of novelty, which he deemed ‘use-novelty’, is distinct from problem-novelty (Gardner 1982: 3). Evidence is use-novel for T if T was not built to fit that evidence (whether or not it was part of the relevant ‘problem-situation’ the theory was intended to address). In subsequent literature, the so-called heuristic conception of novelty has been identified with use-novelty—it was further articulated in Worrall 1978 and 1985. [ 4 ]

Another approach argues that a novel consequence of a theory is one that was not known to the theorist at the time she formulated the theory—this seems like a version of the temporal conception, but this point appeals implicitly to the heuristic conception: if a theorist knew of a result prior to constructing a theory which explains it, it may be difficult to determine whether that theorist somehow tailored the theory to fit the fact (e.g., she may have done so unconsciously). A knowledge-based conception is thus the best that we can do to handle this difficulty (Gardner 1982). [ 5 ]

The heuristic conception is, however, deeply controversial—because it makes the epistemic assessment of theories curiously dependent on the mental life of their constructors, specifically on the knowledge and intentions of the theorist to build a theory that accommodated certain data rather than others. Leplin’s comment is typical:

The theorist’s hopes, expectations, knowledge, intentions, or whatever, do not seem to relate to the epistemic standing of his theory in a way that can sustain a pivotal role for them…. (1997: 54)

(For similar comments see Gardner 1982: 6; Thomason 1992: 195; Schlesinger 1987: 33; Achinstein 2001: 210–230; and Collins 1994.)

Another approach notes that scientists operate with competing theories and that the role of novel confirmations is to decide between them. Thus, a consequence of a theory T is a ‘novel prediction’ if it is not a consequence of the best available theory actually present in the field other than T (e.g., the prediction of the Mercury perihelion by Einstein’s relativity theory constituted a novel prediction because it was not a (straightforward) consequence of Newtonian mechanics; Musgrave 1974: 18). Operating in a Lakatosian framework, Frankel claims a consequence is novel with respect to a theory and its research programme if it is not similar to a fact which already has been used by members of the same research programme to support a theory designed to solve the same problems as the theory in question (1979: 25). Also in a Lakatosian framework, Nunan claims that a consequence is novel if it has not already been used to support, or cannot readily be explained in terms of, a theory entertained in some rival research programme (1984: 279). [6]

There are clearly multiple forms of novelty and it is generally recognized that a fact could be ‘novel’ in multiple senses—as we will see, some carry more epistemic weight than others (Murphy 1989).

Global predictivism holds that predictions are always superior to accommodations, while local predictivism holds that this only holds in certain cases. Strong predictivism asserts that prediction is intrinsically superior to accommodation, whereas weak predictivism holds that predictive success is epistemically relevant because it is symptomatic of other features that have epistemic import. The distinction between strong and weak predictivism cross classifies with the distinctions between different types of novelty. For example, one could maintain that temporal predictions are intrinsically superior to temporal accommodations (strong temporal predictivism) or that temporal predictions were symptomatic of some other good-making feature of theories (weak temporal predictivism; Hitchcock and Sober 2004: 3–5). These distinctions will be further illustrated below.

A version of global strong heuristic predictivism is the null support thesis, which holds that theories never receive confirmation from evidence they were built to fit, precisely because of how they were built. This thesis has been attributed to Bacon and Descartes (Howson 1990: 225). Popper and Lakatos also subscribe to this thesis, though it is important to remember that they do not recognize any form of confirmational support, even from successful predictions. But others who maintained that successful predictions do confirm theories nonetheless endorsed the null support thesis. Giere provides the following argument:

If the known facts were used in constructing the model and were thus built into the resulting hypothesis…then the fit between these facts and the hypothesis provides no evidence that the hypothesis is true [since] these facts had no chance of refuting the hypothesis. (1984: 161; Glymour 1980: 114 and Zahar 1983: 245 offer similar arguments)

The idea is that the way the theory was built provided an illegitimate protection against falsification by the facts—hence the facts cannot support the theory. Others however find this argument specious, noting that since the content of the hypothesis is fixed, it makes no sense to think of any facts as having a ‘chance’ to falsify the theory. The theory says what it says, and any particular fact refutes it or it doesn’t.

Giere has confused what is in effect a random variable (the experimental setup or data source E together with its set of distinct possible outcomes) with one of its values (the outcome e )…Moreover, it makes perfectly good sense to say that E might well have produced an outcome other than the one, e , it did as a matter of fact produce. (Howson 1990: 229; see also Collins 1994: 220)

Thus Giere’s argument collapses.

Howson argued in a series of papers (1984, 1988, 1990) that the null support thesis is falsified using simple examples, such as the following:

An urn contains an unknown number of black and white tickets, where the proportion p of black tickets is also unknown. The data consists simply in a report of the relative frequency \(r/k\) of black tickets in a large number k of draws with replacement from the urn. In the light of the data we propose the hypothesis that \(p = (r/k)+\epsilon\) for some suitable \(\epsilon\) depending on k. This hypothesis is, according to standard statistical lore, very well supported by the data from which it is clearly constructed. (1990: 231)

In this case there is, Howson notes, a background theory that supplies a model of the experiment (it is a sequence of Bernoulli trials, viz., a sequence of trials with two outcomes in which the probability of getting either outcome is the same on each trial; it leaves only a single parameter to be evaluated). As long as we have good reason to believe that this model applies, our inference to the high probability of the hypothesis is a matter of standard statistical methodology, and the null support thesis is refuted.
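Howson’s point can be checked numerically. The following is a minimal sketch, not Howson’s own procedure: it assumes a hypothetical true proportion, reads his hypothesis as the interval \(r/k \pm \epsilon\), and picks \(\epsilon = 2\sqrt{0.25/k}\) as one conservative choice (the names `accommodated_interval` and `coverage` are illustrative). Although each interval is constructed from the very data it is meant to fit, it contains the true proportion in the large majority of repetitions, which is the sense in which the data support the accommodated hypothesis.

```python
import math
import random


def draw_with_replacement(true_p, k, rng):
    """Return r, the number of black tickets in k Bernoulli draws."""
    return sum(1 for _ in range(k) if rng.random() < true_p)


def accommodated_interval(r, k, z=2.0):
    """Build the hypothesis 'p lies within r/k +/- eps' from the data itself.

    eps = z * sqrt(0.25 / k) is a conservative margin, since the variance of
    a single Bernoulli trial is at most 0.25. This choice of eps is an
    assumption; Howson leaves eps unspecified.
    """
    eps = z * math.sqrt(0.25 / k)
    return r / k - eps, r / k + eps


def coverage(true_p, k, trials, seed=0):
    """Fraction of repeated experiments whose accommodated interval contains true_p."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        r = draw_with_replacement(true_p, k, rng)
        low, high = accommodated_interval(r, k)
        hits += low <= true_p <= high
    return hits / trials


# Hypothetical urn: 30% black tickets, 400 draws per experiment, 500 experiments.
observed_coverage = coverage(0.3, 400, 500)
```

The Bernoulli model does the work here: it is the background theory, not the timing of the hypothesis, that licenses the inference from the sample frequency to the proportion.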

It has been argued that one of the limitations of Bayesianism is that it is fatally committed to the (clearly false) null support thesis (Glymour 1980). The standard Bayesian condition by which evidence e supports h is given by the inequality \(p(h\mid e) \gt p(h)\). But where e is known (and thus \(p(e) = 1\)), we have \(p(h\mid e) = p(h)\). This came to be known as the ‘Bayesian problem of old evidence’. Howson (1984) noted that this problem could be overcome by selecting a probability function \(p^*\) based on the assumption that e was not known; thus even if \(p(h\mid e) = p(h)\), it could still hold that \({p^*}(h\mid e) \gt {p^*}(h)\). There followed an extensive literature on the old evidence problem which will not be summarized here (see, e.g., Christensen 1999; Eells & Fitelson 2000; Barnes 1999, 2008: Ch. 7; and Hartmann & Fitelson 2015).
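The step from \(p(e) = 1\) to \(p(h\mid e) = p(h)\) can be spelled out with Bayes’ theorem, assuming \(p(h) \gt 0\). Since \(p(e) = 1\), we have \(p(h \wedge \neg e) = 0\) and so \(p(e\mid h) = 1\); hence

\[
p(h\mid e) = \frac{p(e\mid h)\,p(h)}{p(e)} = \frac{1 \cdot p(h)}{1} = p(h).
\]

On this criterion, then, evidence that is already known can never raise the probability of h, whatever h says about e.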

6. Contemporary Theories of Predictivism

Patrick Maher (1988, 1990, 1993) presented a seminal thought experiment and a Bayesian analysis of its predictivist implications.

The thought experiment contained two scenarios. In the first scenario, a subject (the accommodator) is presented with E, a sequence of 99 coin flips. E forms an apparently random sequence of heads and tails. The accommodator is then instructed to tell us the outcome of the first 100 flips. He responds by reciting E and then adding the prediction that the 100th toss will be heads; the conjunction of E and this last toss is T. In the other scenario, another subject (the predictor) is asked to predict the first 100 flip outcomes without witnessing any outcomes, and the predictor endorses theory T. Thereafter the coin is flipped 99 times, E is established, and the predictor’s first 99 predictions are confirmed. The question is in which of these two scenarios T is better confirmed. It is strongly intuitive that T is better confirmed in the predictor’s scenario than in the accommodator’s scenario, suggesting that predictivism holds true in this case. If we allow ‘O’ to assert that evidence E was input into the construction of T, predictivism asserts:

(1) \(p(T \mid E \wedge \neg O) \gt p(T \mid E \wedge O)\)

Maher argues that the successful prediction of the initial 99 flips constitutes persuasive evidence that the predictor ‘has a reliable method’ for making predictions of coin flip outcomes. T’s consistency with E in the case of the accommodator provides no particular evidence that the accommodator’s method of prediction is reliable, and thus we have no particular reason to endorse his prediction about the 100th flip. Allowing R to assert that the method in question is reliable, and \(M_T\) that method M generated hypothesis T, this amounts to:

(2) \(p(R \mid M_T \wedge E \wedge \neg O) \gt p(R \mid M_T \wedge E \wedge O)\)

Maher (1988) provides a rigorous proof of (2), which is shown to entail (1) on various assumptions.

Maher (1988) makes the simplifying assumption that any method of prediction used by a predictor is either completely reliable (this is the claim abbreviated by ‘R’) or is no better than a random method (\(\neg R\)). (Maher [1990] shows that this assumption can be surrendered and a continuum of degrees of reliability of scientific methods assumed; the predictivist result is still generated.) In qualitative terms, where M generates T (and thus predicts E) without input of evidence E, we should infer that it is much more likely that M is reliable than that E just happened to turn out true though M was no better than a random method. In other words, we judge that we are much more likely to stumble on a subject using a reliable method M of coin flip prediction than we are to stumble on a sequence of 99 true flip predictions that were merely lucky guesses, because a method no better than random guessing has only a \((1/2)^{99}\) chance of getting all 99 flips right.

Maher has articulated a weak heuristic predictivism because he claims that predictive success is symptomatic of the use of a reliable discovery method. [ 7 ]
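Under Maher’s simplifying assumption the qualitative argument reduces to a single application of Bayes’ theorem. The sketch below is illustrative only: the prior of one in a million that a method is reliable is a made-up number, the function names are hypothetical, and the accommodator’s fit with E is treated as guaranteed whether or not his method is reliable.

```python
from fractions import Fraction


def posterior_reliable(prior_r, n_flips, predicted):
    """Posterior probability of R after T is found to agree with n_flips outcomes.

    A reliable method always agrees with the outcomes. An unreliable method
    agrees by luck with probability (1/2)**n_flips when the outcomes were
    predicted, and with probability 1 when T was built from the outcomes.
    """
    prior_r = Fraction(prior_r)
    likelihood_r = Fraction(1)
    if predicted:
        likelihood_not_r = Fraction(1, 2) ** n_flips
    else:
        likelihood_not_r = Fraction(1)
    numerator = likelihood_r * prior_r
    return numerator / (numerator + likelihood_not_r * (1 - prior_r))


def prob_next_flip_correct(post_r):
    """Chance the 100th call is right: certain if reliable, a coin toss otherwise."""
    return post_r + (1 - post_r) * Fraction(1, 2)


prior = Fraction(1, 10**6)  # hypothetical prior that a given method is reliable
predictor = posterior_reliable(prior, 99, predicted=True)
accommodator = posterior_reliable(prior, 99, predicted=False)
```

With these assumed numbers the accommodator’s posterior stays at the prior, while the predictor’s posterior is driven almost to one, so the same 99 outcomes support the 100th prediction very differently in the two scenarios.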

For critical discussion of Maher’s theory of predictivism see Howson and Franklin 1991 (and Maher’s 1993 reply); Barnes 1996a,b; Lange 2001; Harker 2006; and Worrall 2014. [ 8 ]

It was noted above that ad hoc hypotheses stand under suspicion for various reasons, one of which was that a hypothesis that was proposed to resolve a particular difficulty may not cohere well with the theory it purports to save or relevant background beliefs. [ 9 ] This could result from the fact that there is no obvious way to resolve the difficulty in a way that is wholly ‘natural’ from the standpoint of the theory itself or operative criteria of theory choice. For example, the phlogiston theory claimed that substances emitted phlogiston while burning. However, it was established that some substances actually gained weight while burning. To accommodate the latter phenomenon it was proposed that phlogiston had negative weight—but the latter hypothesis was clearly ad hoc in the sense of failing to cohere with the background belief that substances simply do not have negative weight, and with the knowledge that many objects lost weight when burned (Partington & McKie 1938a: 33–38).

Thus the ‘fudging explanation’ defends predictivism by pointing out that the process of accommodation lends itself to the proposal of hypotheses that do not cohere naturally with operative constraints on theory choice, while successful predictions are immune from this worry (Lipton 1990, 1991: Ch. 8). Of course, it is an important question whether scientists actually rely on the fact that evidence was predicted (or accommodated) in their assessment of theories: if a theory was fudged to accommodate some datum, couldn’t the scientist simply note that the fudged theory suffers a defect of coherence and pay no attention to whether the data was accommodated or predicted? Some argue, however, that scientists are imperfect judges of such coherence. A scientist who accommodates some datum may think his accommodation is fully coherent, while his peers may have a more accurate and objective view that it is not. The scientist’s ‘assessed support’ of his proposed accommodation may thus fail to coincide with its ‘objective support’, and the scientist might rely on the fact that his evidence was accommodated as evidence that the theory was fudged (or conversely, on the fact that his evidence was predicted as evidence that the theory was not fudged; Lipton 1991: 150f).

Lange (2001) offers an alternate interpretation of the coin flip example, claiming that the process of accommodation (unlike prediction) tends to generate theories that are not strongly supported by confirming data. He imagines a ‘tweaked’ version of the coin flip example in which the initial 99 outcomes form a strict alternating sequence ‘tails heads tails heads…’ (instead of forming the ‘apparently random sequence’ of outcomes provided in the original case). Again we imagine a predictor who correctly predicts 99 outcomes in advance and an accommodator who witnesses them. Both the predictor and the accommodator predict that the 100th outcome will be heads. Now there is little or no difference in our assessed probability that the subject will correctly predict the 100th outcome.

This suggests that the intuitive difference between Maher’s original pair of examples does not reflect a difference between prediction and accommodation per se. (Lange 2001: 580)

Lange’s analysis appeals to what Goodman called an ‘arbitrary conjunction’—the mark of which is that

establishment of one component endows the whole statement with no credibility that is transmitted to other component statements. (1983: 68–9)

An example of an arbitrary conjunction is “The sun is made of helium and August 3rd 2017 falls on a Thursday and 17 is a prime number”. In the original coin flip case, we judge that H is weakly supported in the accommodator’s scenario because we judge that the apparently random sequence of outcomes is probably an arbitrary conjunction; thus the fact that the initial 99 conjuncts are confirmed implies almost nothing about what the 100th outcome will be. But the success of the predictor in predicting the initial 99 outcomes strongly implies that the sequence is not an arbitrary conjunction after all:

(w)e now believe it more likely that the agent was led to posit this particular sequence by way of something we have not noticed that ties the sequence together—that would keep it from being a coincidence that the hypothesis is accurate to the 100th toss…. (Lange 2001: 581)

Having judged it not to be an arbitrary conjunction, we are now prepared to recognize the first 99 outcomes as strongly confirming the prediction in the 100 th case. What accounts for the difference between the two scenarios, in other words, is not primarily whether E was predicted or accommodated, but whether we judge H to be an arbitrary conjunction, and thus whether E provides support for the remaining portion of H .

Thus in Lange’s tweaked case, the non-existence of the predictivist effect is due to the fact that it is clear from the initial 99 flips that the sequence is not an arbitrary conjunction—thus E confirms H equally strongly in both scenarios.

Lange goes on to suggest that in actual science the practice of constructing a hypothesis by way of accommodating known evidence has a tendency to generate arbitrary conjunctions. Thus Lorentz’s contraction hypothesis, when appended to his electrodynamics to accommodate the failure to detect optically any motion with respect to the aether, resulted in an arbitrary conjunction (since evidence that supported the contraction hypothesis did not support the electrodynamics, or vice versa)—essentially for this reason, Lange argues, it was rejected by Einstein as ad hoc. When evidence is predicted by a theory, by contrast, this is typically because the theory is not an arbitrary conjunction. The evidential significance of prediction and accommodation for Lange is that they tend to be correlated (negatively and positively) with the construction of theories that are arbitrary conjunctions. Lange’s view might thus be classed as a weak heuristic predictivism, though Lange never takes a stand on whether scientists actually rely on such correlations in assessing theories.

For critical discussion of Lange’s theory see Worrall 2014: 59–61 and Harker 2006: 317f.

Deborah Mayo has argued (particularly in Mayo 1991, 1996, and 2014) that the intuition that predictivism is true derives from a premium on severe tests of hypotheses. A test of a hypothesis H is severe to the extent that H is unlikely to pass that test if H is false. Intuitively, if a novel consequence N is shown to follow from H , and the probability of N on the assumption \({\sim}H\) is very low (for the reason of its being novel), then testing for N would seem to count as a severe test of H , and a positive outcome should strongly support H . Here novelty and severity appear to coincide—but Mayo observes that there are cases in which they come apart. For example, it has seemed to many that if H is built to fit some body of evidence E then the fact that H fits E does not support H because this fit does not constitute H ’s having survived a severe test (or a test at all). One of Mayo’s central objectives is to expose the fallacies that this latter reasoning involves.

Giere (1984: 161, 163) affirms that evidence H was built to fit cannot support H because, given how H was built, it was destined to fit that evidence. Mayo summarizes his reasoning as follows:

  • (1) If H is use-constructed, then a successful fit is assured no matter what.

But Mayo notes that ‘no matter what’ can be interpreted in two ways: (a) no matter what the data are, and (b) no matter whether H is true or false. (1) is true when interpreted as (a), but in order to establish that accommodated evidence fails to support H (as Giere intends) (1) must be interpreted as (b). However, (1) is false when so interpreted. Mayo (1996: 271) illustrates this with a simple example: let the evidence e be a list of SAT scores from students in a particular class. Use this evidence to compute the average score x , and set h = the mean SAT score for these students is x . Now of course h has been use-constructed from e . It is true that whatever mean score was computed would fit the data no matter what the data are—but hardly true that h would have fit the evidence no matter whether h was true or false. If h were false it would not fit the data, because the data will inevitably fit only a true hypothesis. Thus h has passed a maximally severe test: it is virtually impossible for h to fit the data if h is false—despite the fact that h is built to fit e .
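
The two readings of ‘no matter what’ can be made concrete with a minimal sketch. The scores below are hypothetical and the function names are illustrative, not Mayo’s; the point is only that a hypothesis built from the data fits whatever the data are, whereas a false hypothesis about the mean does not fit.

```python
from statistics import mean

def construct_hypothesis(scores):
    """Use-construct h: 'the mean score for these students is x'."""
    return mean(scores)

def fits(hypothesized_mean, scores, tol=1e-9):
    """h fits the evidence just in case the hypothesized mean equals the actual mean."""
    return abs(hypothesized_mean - mean(scores)) <= tol

# Hypothetical SAT scores for two different classes (illustrative values only).
class_a = [1100, 1250, 1300, 1420, 1180]
class_b = [900, 1010, 1500, 1330]

# Reading (a): whatever the data are, the use-constructed h fits them.
for scores in (class_a, class_b):
    assert fits(construct_hypothesis(scores), scores)

# Reading (b) fails: a false hypothesis about class_a's mean does not fit class_a.
h_a = construct_hypothesis(class_a)
false_h = h_a + 25  # any value other than the true mean
print(fits(h_a, class_a), fits(false_h, class_a))
```

The procedure guarantees a fit across data sets, but it offers no protection to a hypothesis that misstates the mean, which is the sense in which the test remains severe.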

Mayo gives an additional example of how a use-constructed hypothesis can count as having survived a severe test, drawn from the famous 1919 Eddington eclipse test of Einstein’s General Theory of Relativity (GTR). GTR predicted that starlight passing by the sun would be bent to a specific degree (1.75 arcseconds). Two expeditions were carried out during the eclipse: one to Sobral in northern Brazil and the other to the island of Principe in the Gulf of Guinea. Each expedition generated a result that supported GTR, but a third result, generated by the Sobral expedition, appeared to refute GTR. This result was, however, disqualified because it was determined that a mirror used to acquire the images of the stars’ positions had been damaged by the heat of the sun. While one might worry that such dismissal of anomalous evidence was the kind of ad hoc adjustment that Popper warned against, Mayo notes that this is instead a perfectly legitimate case of using evidence to support a hypothesis (that the third result was unreliable), and that this use amounted to that hypothesis having passed a severe test. Mayo concludes that a general prohibition on use-constructed hypotheses “fails to distinguish between problematic and unproblematic use-constructions (or double countings)” (1996: 285). However, Hudson (2003) argues that there is historical evidence that suggests there was legitimate reason to question the hypothesis that the third result was unreliable (he uses this point to support his own contention that the fact that a hypothesis was use-constructed is prima facie evidence that the hypothesis is suspect). Mayo (2003) replies that insofar as the third result was nonetheless suspect the physicists involved were right to discard it.

Mayo (1996: Ch. 9) defends a predictivist-like position attributed to Neyman-Pearson statistical methods: the prohibition on after-trial constructions of hypotheses. To illustrate: Kish (1959) describes a study that investigated the statistical relationship between a large number of infant training experiences (nursing, toilet training, weaning, etc.) and subsequent personality and behavioral traits (e.g., school adjustment, nail biting, etc.). The study found a number of high correlations between certain training experiences and later traits. The problem was that the study investigated so many training experiences that it was quite likely that some correlations would appear in the data simply by chance, even if there would ultimately prove to be no such correlation. An investigator who studied many possible correlations could thus survey the data, simply look for statistically significant differences, and proclaim evidence for correlations despite such evidence being misleading, thereby engaging in the dubious practice of the ‘after-trial construction of hypotheses’. [ 10 ] Mayo notes that such hypotheses should not count as having passed a severe test, and she accordingly endorses the Neyman-Pearson prohibition on such construction. Hitchcock and Sober (2004) note that Mayo’s definition of severity as applied in this case differs from the one she employs in dealing with cases like her SAT example; Mayo (2008) replies at length to their criticism and argues that while she does employ two versions of the severity definition they nonetheless reflect a unified conception of severity.
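
The worry about chance correlations can be illustrated with a small calculation. The sketch below assumes, purely for illustration, that the tests are independent and that each has a false-positive rate of 0.05; neither figure comes from Kish’s study. Under these assumptions the probability of at least one spurious ‘significant’ correlation among m tests is 1 − (1 − α)^m.

```python
def prob_at_least_one_false_positive(num_tests, alpha=0.05):
    """Chance of one or more spurious 'significant' results among
    num_tests independent tests when no real correlation exists."""
    return 1.0 - (1.0 - alpha) ** num_tests

# Hypothetical numbers of training-experience/trait pairs examined.
for m in (1, 10, 50, 100):
    print(m, round(prob_at_least_one_false_positive(m), 3))
```

On these assumptions, with 50 tests the chance of at least one spurious result already exceeds 90 percent, which is why a hypothesis selected after surveying the results has not thereby passed a severe test.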

For critical discussion of Mayo’s account see Iseda 1999 and Worrall 2006: 56–60, 2010: 145–153—see also Mayo’s (1996: 265f, 2010) replies to Worrall.

John Worrall has been an important contributor to the predictivism literature from the 1970s until the present time. He was, along with Elie Zahar, one of the early proponents of the significance of heuristic novelty (e.g., Worrall 1978, 1985). In his more recent work (cf. his 1989, 2002, 2005, 2006, 2010, 2014; also Scerri & Worrall 2001) Worrall has laid out a detailed theory of predictivism that, while sometimes presented in heuristic terms, is “at root a logical theory of confirmation” (2005: 819)—it is thus a weak heuristic account that takes use-novelty of evidence to be symptomatic of underlying logical features that establish strong confirmation of theory.

Worrall’s mature account is based on a view of scientific theories that he credits to Duhem, according to which a scientific theory is naturally thought of as consisting of a core claim together with some set of more specific auxiliary claims. It is commonly the case that the core theory will leave undetermined certain ‘free parameters’ and the auxiliary claims fix values for such parameters. To cite an example Worrall often uses, the wave theory of light consists of the core theory that light is a periodic disturbance transmitted through some sort of elastic medium. This core claim by itself leaves open various free parameters concerning the wavelengths of particular types of monochromatic light. Worrall proposes to understand the diminished status of evidential support associated with accommodation as follows: when evidence e is ‘used’ in the construction of a theory, it is typically used to establish the value of a free parameter in some core theory T. The version with the parameter fixed will be a specific version \(T'\) of T. e serves to confirm \(T'\), then, only on the condition that there is independent support for T; thus accommodation provides only ‘conditional confirmation’. Importantly, evidence e that is used in this way will by itself typically provide no evidence for the core theory T. Worrall (2002: 201) offers as an illustration the support offered to the wave theory of light (W) by the two-slit experiment using light from a sodium arc, where the data consist of various alternating light and dark ‘fringes’. The fringe data can be used to compute the wavelength of sodium light, and thus to generate a more specific version of the wave theory of light, \(W'\), which conjoins W with a claim about the wavelength of this particular sort of light. But the data offer merely conditional support to \(W'\): that is, the data support \(W'\) only on the condition that there is independent evidence for W.

Predicted evidence for Worrall is thus evidence that is not used to fix free parameters. Worrall cites two forms that predictions can take: one is when a particular evidential consequence falls ‘immediately out of the core’, i.e., is a consequence of the core, together with ‘natural auxiliaries’, and the other is when it is a consequence of a specific version of a theory whose free parameters have been fixed using other data. To illustrate the first: retrograde motion [ 11 ] was a natural consequence of the Copernican core (the claim that the earth and planets orbit the sun) because observation of the planets was carried out on a moving observatory that periodically passed other planets—however it could only be accommodated by Ptolemaic astronomy by proposing and adjusting auxiliary hypotheses that supposed the planet to move on an epicycle (retrograde motion did not follow naturally from the Ptolemaic core idea that the Sun, stars and planets orbit the earth). Thus retrograde motion was predicted by the Copernican theory and thus offered unconditional support to that theory, while it offered only conditional confirmation to the Ptolemaic theory. The second form of prediction is one which follows from a specific version of a theory but was not used to fix a parameter—imagine \(W'\) in the preceding paragraph makes a new prediction p (say for another experiment, such as the one slit experiment)— p offers unconditional confirmation of \(W'\) (and W ; Worrall 2002: 203).
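
The contrast between using data to fix a free parameter and deriving a fresh prediction from the resulting specific theory can be sketched numerically. The sketch assumes the standard small-angle formulas (two-slit fringe spacing equals wavelength times L over d; the first single-slit minimum lies at wavelength times L over a). The apparatus dimensions and the measured spacing are hypothetical, chosen only so that the wavelength comes out near that of sodium light.

```python
def wavelength_from_fringes(fringe_spacing_m, slit_separation_m, screen_distance_m):
    """Fix the free parameter: two-slit fringe spacing = wavelength * L / d
    (small-angle approximation), solved for the wavelength."""
    return fringe_spacing_m * slit_separation_m / screen_distance_m

def single_slit_first_minimum(wavelength_m, slit_width_m, screen_distance_m):
    """Predict from W': the first dark band of a one-slit pattern lies at
    wavelength * L / a from the centre (small-angle approximation)."""
    return wavelength_m * screen_distance_m / slit_width_m

# Hypothetical apparatus and measurement (illustrative values only).
L = 1.0              # slit-to-screen distance in metres
d = 0.5e-3           # two-slit separation in metres
spacing = 1.178e-3   # measured fringe spacing in metres

# Accommodation: the fringe data are used to fix the wavelength in W'.
wavelength = wavelength_from_fringes(spacing, d, L)

# Prediction: W' now entails a result for a different experiment.
a = 0.1e-3           # single-slit width in metres
predicted_minimum = single_slit_first_minimum(wavelength, a, L)

print(f"wavelength = {wavelength * 1e9:.0f} nm")
print(f"predicted first minimum at {predicted_minimum * 1e3:.2f} mm")
```

The fringe spacing could not have failed to agree with the wavelength computed from it, whereas the single-slit result could have come out otherwise; on Worrall’s account only the latter offers unconditional confirmation.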

However it is important to understand that Worrall’s repeated expression of his position in terms of the heuristic conception of novelty (particularly after his 1985) does not amount to an endorsement of strong heuristic predictivism. Worrall clarifies this in his 1989 article that focuses on the evidential significance of the ‘white spot’ confirmation of Fresnel’s version of the wave theory of light. The reason the white spot datum carried such important weight is not ultimately that it was not used by Fresnel in the construction of the theory but because this datum followed naturally from the core theory that light is a wave. The reason the fringe data that was used to compute the wavelength of sodium light (cf. above) did not carry such weight is that it is not a consequence of this core idea (nor has the wavelength of sodium light been fixed by some other data). Thus d is novel for T when “there is a heuristic path to [ T ] that does not presuppose [d’s] existence” (Scerri & Worrall 2001: 418). As Worrall sometimes puts it, whether d carries unconditional confirmation for T does not depend on whether d was actually used in constructing T , but whether it was ‘needed’ to construct T (e.g., 1989: 149–151). Thus Worrall is actually a proponent of ‘essential use-novelty’ (Alai 2014: 304). For Worrall, facts about heuristic prediction and accommodation serve to track underlying facts about the logical relationship between theory and evidence. Thus Worrall is ultimately a proponent of weak (not strong) heuristic predictivism. Worrall categorically rejects temporal predictivism, arguing that the fact that the white spot was a temporally novel consequence in itself was of no epistemic importance.

For further discussion of Worrall’s theory of predictivism see Mayo 2010: 155f; Schurz 2014; Votsis 2014; and Douglas & Magnus 2013: 587–8.

Scerri and Worrall 2001 contains a detailed rendering of the historical episode of the scientific community’s assessment of Mendeleev’s theory of the periodic law—it is argued that this story ultimately vindicates Worrall’s theory of predictivism.

For discussion of Scerri and Worrall see Akeroyd 2003; Barnes 2005b (and replies from Worrall 2005 and Scerri 2005); Schindler 2008, 2014; Brush 2007; and Sereno 2020.

A common argument for predictivism is that we should avoid inferring that a theory T is true on the basis of evidence E that it is built to fit because we can explain why T entails E by simply noting how T was built—but if T was not built to fit E then only the truth of T can explain the fact that T fits E . Various philosophers have noted that this reasoning is fallacious. As noted above it makes no sense to offer an explanation (for example, in terms of how the theory was built) for the fact that T entails E —for this latter fact is a logical fact for which no causal explanation can be given. Insofar as there is an explanandum in need of an explanans here it is rather the fact that the theorist managed to construct or ‘choose’ a theory (which turned out to be T ) that correctly entailed E (Collins 1994; Barnes 2002)—that explanandum could be explained by noting that the theorist built a theory (which turned out to be T ) to fit E , or endorsed it because it fit E .

White (2003) offers a theory of predictivism that begins with this same insight—the relevant explanandum is:

  • (ES) The theorist selected a datum-entailing theory.

This explanandum could be explained in one of two ways:

  • (DS) The theorist designed her theory to entail the datum.
  • (RA) The theorist’s selection of her theory was reliably aimed at the truth.

White explains that (RA) means “roughly that the mechanisms which led to her selection of a theory gave her a good chance of arriving at the truth” (2003: 664). (Thus White analogizes the theorist to an ‘archer’ who is more or less reliable in ‘aiming’ at the truth in selecting a theory.) Then White offers a simple argument for predictivism: assuming ~DS, ES provides evidence for RA. But assuming DS, ES provides no evidence for RA. Thus, heuristic predictivism is true.
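
White’s argument can be given a rough Bayesian gloss. The sketch below is an illustration under stated assumptions rather than White’s own formalism: the prior and the likelihoods are hypothetical numbers, and the case of design (DS) is modelled by supposing that a theory designed to entail the datum does so whether or not the theorist is reliable.

```python
def posterior_reliable(prior_ra, p_es_given_ra, p_es_given_not_ra):
    """P(RA | ES) by Bayes' theorem."""
    numerator = p_es_given_ra * prior_ra
    evidence = numerator + p_es_given_not_ra * (1.0 - prior_ra)
    return numerator / evidence

prior_ra = 0.3  # hypothetical prior that the theorist is reliably aimed at truth

# Accommodation (DS): the theory was designed to entail the datum, so ES is
# guaranteed whether or not the theorist is reliable.
accommodated = posterior_reliable(prior_ra, 1.0, 1.0)

# Prediction (~DS): a datum-entailing theory is likelier if she is reliable.
# The likelihoods 0.8 and 0.2 are illustrative assumptions.
predicted = posterior_reliable(prior_ra, 0.8, 0.2)

print(accommodated, predicted)
```

When the likelihoods are equal, as under DS, the posterior for RA equals the prior; when ES is more probable given RA than given its negation, the posterior rises above the prior.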

Interestingly, White bills his account as a strong heuristic account. In making this claim he is claiming that the epistemic advantage of prediction would not be entirely erased for an observer who was completely aware of all relevant evidence and background knowledge possessed by the scientific community at the relevant point in time. This is because the degree to which theorizing is reliable depends upon principles of evidence assessment and causal relations (including the reliability of our perceptual faculties, accuracy of measuring instruments, etc.) that are not entirely “transparent” to us. [ 12 ] Insofar as fully informed scientists may not be fully convinced of just how reliable these principles and relations are, evidence that they lead to the endorsement of theories which are predictively successful continues to redound to their assessed reliability. Thus, White concludes, strong heuristic predictivism is vindicated (2003: 671–4).

Hitchcock and Sober (2004) provide an original theory of weak heuristic predictivism that is based on a particular worry about accommodation. On the assumption that data are noisy (i.e. imbued with observational error), a good theory will almost never fit the data perfectly. To construct a theory that fits the data better than a good theory should, given noisy data, is to be guilty of “overfitting”—if we know a theorist built her theory to accommodate data, we may well worry that she has overfit the data and thus constructed a flawed theory. If we know however that a theorist built her theory without access to such data, or without using it in the process of theory construction, we need not worry that overfitting that data has occurred. When such a theory goes on to make successful predictions, Hitchcock and Sober moreover argue, this provides us with evidence that the data on which the theory was initially based were not overfit in the process of constructing the theory.

Hitchcock and Sober’s approach derives from a particular solution to the curve-fitting problem presented in Forster and Sober 1994. The curve fitting problem is how to select an optimally supported curve on the basis of a given body of data (e.g., a set of \([X,Y]\) points plotted on a coordinate graph). A well-supported curve will feature both ‘goodness of fit’ with the data and simplicity (intuitively, avoiding highly bumpy or irregular patterns). Solving the curve-fitting problem requires some precise way of characterizing a curve’s simplicity, a way of characterizing goodness of fit, and a method of balancing simplicity against goodness of fit to identify an optimal curve.

Forster and Sober cite Akaike’s (1973) result that an unbiased estimate of the predictive accuracy of a model can be computed by assessing both its goodness of fit and its simplicity as measured by the number of adjustable parameters it contains. A model is a statement (a polynomial, in the case of a proposed curve) that contains at least one adjustable parameter. For any particular model M, a given data set, and identifying \(L(M)\) as the likeliest (i.e. best data fitting) curve from M, Akaike showed that the following expression describes an unbiased estimate of the predictive accuracy of model M:

\[\text{log-likelihood}[L(M)] - k\]

This estimate is deemed a model’s ‘Akaike Information Criterion’ (AIC) score. It measures goodness of fit in terms of the log likelihood of the data on the assumption of \(L(M)\). The simplicity of the model is inversely proportional to k, the number of adjustable parameters in the model. The intuitive idea is that models with a high k value will provide a large variety of curves that will tend to fit data more closely than models with a lower k value, and thus models with large k values are more prone to overfitting than models with small k values. So the AIC score assesses a model’s likely predictive accuracy in a way that balances both goodness of fit and simplicity, and the curve-fitting problem is arguably solved.
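
The trade-off between fit and simplicity can be seen in a small worked sketch. It scores polynomial models by the log-likelihood of their best-fitting curve minus k, the number of adjustable coefficients. Several things here are assumptions made for illustration: the data are hypothetical (a straight line plus small fixed deviations), the noise is taken to be Gaussian with a known standard deviation of 1, and the helper names are not drawn from Forster and Sober.

```python
import math

def fit_polynomial(xs, ys, degree):
    """Least-squares coefficients (lowest power first) via the normal equations."""
    n = degree + 1
    a = [[sum(x ** (i + j) for x in xs) for j in range(n)] for i in range(n)]
    b = [sum(y * x ** i for x, y in zip(xs, ys)) for i in range(n)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for j in range(col, n):
                a[row][j] -= factor * a[col][j]
            b[row] -= factor * b[col]
    # Back substitution.
    coeffs = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(a[i][j] * coeffs[j] for j in range(i + 1, n))
        coeffs[i] = (b[i] - tail) / a[i][i]
    return coeffs

def log_likelihood(xs, ys, coeffs, sigma):
    """Gaussian log-likelihood of the data given the curve and a known noise sd."""
    rss = sum((y - sum(c * x ** p for p, c in enumerate(coeffs))) ** 2
              for x, y in zip(xs, ys))
    return -0.5 * len(xs) * math.log(2 * math.pi * sigma ** 2) - rss / (2 * sigma ** 2)

def akaike_score(xs, ys, degree, sigma=1.0):
    """Log-likelihood of the best-fitting curve L(M) minus k adjustable parameters."""
    k = degree + 1
    return log_likelihood(xs, ys, fit_polynomial(xs, ys, degree), sigma) - k

# Hypothetical data: the line y = 1 + 2x plus small fixed deviations.
xs = list(range(10))
noise = [0.2, -0.1, 0.3, -0.3, 0.1, -0.2, 0.25, -0.15, 0.05, -0.25]
ys = [1 + 2 * x + e for x, e in zip(xs, noise)]

scores = {degree + 1: akaike_score(xs, ys, degree) for degree in range(4)}
best_k = max(scores, key=scores.get)
print(scores)
print("best k:", best_k)
```

Adding coefficients never worsens the fit, so the log-likelihood alone would always favour the largest model. Once k is subtracted, the two-parameter (linear) model scores highest on these data, because the small gain in fit from the quadratic and cubic terms does not offset the penalty.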

Hitchcock and Sober (2004) consider a hypothetical example involving two scientists, Penny Predictor and Annie Accommodator. Working independently, they acquire the same set of data D; Penny proposes theory \(T_p\) while Annie proposes \(T_a\). The critical difference, however, is that Penny proposed \(T_p\) on the basis of an initial segment of the data \(D_1\), and thereafter predicted the remaining data \(D_2\) to a high degree of accuracy \((D = D_1 \cup D_2)\). Annie, however, was in possession of all the data in D prior to proposing \(T_a\) and in proposing this theory accommodated D. Hitchcock and Sober ask whether there might be reason to suspect that Penny’s theory will be more predictively accurate in the future, and in this precise sense be better confirmed.

Hitchcock and Sober argue that there is no one answer to this question—and then present a series of several cases. Insofar as predictivism holds in some and not others, their account of predictivism is clearly a local (rather than global) account. In cases in which Penny and Annie propose the same theory, or propose theories whose AIC scores can be computed and directly compared, there is no reason to regard facts about how they built the theory to carry further significance. But if we do not know which theories were proposed, or by what method they were constructed, the fact that Penny predicted data that Annie accommodated can argue for Penny’s theory having a higher AIC score than Annie’s, and thus carry an epistemic advantage.

Insofar as predictivism holds in some cases but not the others, the question whether predictivism holds in actual episodes of science depends on which cases such actual episodes tend to resemble, but Hitchcock and Sober “take no stand on how often the various cases arise” (2004: 21).

Although their account of predictivism is tailored initially to the curve-fitting problem, it is by no means limited to such cases. They note that it is natural to think of a model as analogous to the ontological framework of a scientific theory where the various ontological commitments can function as ‘adjustable parameters’—for example, the Ptolemaic and Copernican world pictures both begin with a claim that a certain entity (the sun or the earth) is at the center, and these models are articulated by producing models with adjustable parameters.

For critical discussion of Hitchcock and Sober’s account, see Lee 2012, 2013 and Douglas & Magnus 2013: 582–584. Peterson (2019) argues that Hitchcock and Sober’s approach can be extended to issue methodological recommendations involving methods of cross validation and replication in psychology.

Barnes (2005a, 2008) maintains that predictivism is frequently a manifestation of a phenomenon he calls ‘epistemic pluralism’. A ‘ T -evaluator’ (a scientist who assigns some probability to theory T ) is an epistemic pluralist insofar as she regards one form of evidence to be the probabilities posted (i.e. publicly presented) by other scientists for and against T and other relevant claims (she is an epistemic individualist if she does not do this but considers only the scientific evidence ‘on her own’). One form of pluralistic evidence is the event in which a reputable scientist endorses a theory—this takes place when a scientist posts a probability for T that is (1) no lower than the evaluator’s probability and (2) high enough that subsequent predictive confirmation of T would redound to the scientist’s credibility (2008: 2.2).

Barnes rejects the heuristic conception of novelty on the grounds that it is a mistake to think that what matters epistemically is the process by which the theory was constructed; what matters is on what basis the theory was endorsed (2008: 33f). In the example above, confirmation of N (a consequence of T) could carry special weight for an evaluator who learned that the theorist endorsed the theory without appeal to observational evidence for N (irrespective of how the theory was constructed). He proposes to replace the heuristic conception with his endorsement conception of novelty: N (a known consequence of T) counts as a novel confirmation of T relative to agent X insofar as X posts an endorsement-level probability for T that is based on a body of evidence that does not include observation-based evidence for N.

Barnes claims that the notion of endorsement novelty has several advantages over the heuristic conception—one is that endorsement novelty can account for the fact that prediction is a matter of degree: the more strongly the theorist endorses T , the more strongly its consequence N is predicted (and thus the more evidence for T for pluralist evaluators who trust the endorser). Another is that the orthodox distinction between the context of discovery and the context of justification is preserved. According to the latter distinction, it does not matter for purposes of theory evaluation how a theory was discovered. But this turns out not to be true on the heuristic conception given the central importance it accords to how a theory was built (cf. Leplin 1987). Endorsement novelty respects the irrelevance of the process by which theories are discovered (Barnes 2008: 37–8).

One claim central to this account is that confirmation is a three-way relation between theory, evidence, and background belief (cf. Good 1967). Barnes distinguishes between two types of theory endorser: (1) virtuous endorsers, who post probabilities for theories that cohere with their evidence and background beliefs, and (2) unvirtuous endorsers, who post probabilities that do not so cohere. A common way of explaining the predictivist intuition is to note that accommodators tend to be viewed with a certain suspicion: their endorsement of T based on accommodated evidence may reflect a kind of social pressure to endorse T whatever its merits (cf. the ‘fudging explanation’ above). Such an endorser may post a probability for T that is too high given her total evidence and background belief; predictivism thus becomes a strategy by which pluralist evaluators protect themselves from unvirtuous accommodators (Barnes 2008: 61–69).

Barnes then presents a theory of predictivism that is designed to apply to virtuous endorsers. Virtuous predictivism has two roots: (1) the prediction per se, which is constituted by an endorser’s posting an endorsement-level probability for T, which entails empirical consequence N, on a basis that does not include observation-based evidence for N, and (2) predictive success, constituted by the empirical demonstration that N is true. The prediction per se carries epistemic significance for a pluralist evaluator because it implies that the predictor possesses reason R (consisting of background beliefs) that supports T. If the evaluator views the predictor as credible, this simple act of prediction carries epistemic weight. Predictive success then confirms the truth of R, which thereby counts as evidence for T. Novel confirmation thus has the special virtue of confirming the background beliefs of the predictor; accommodative confirmation lacks this virtue.

Barnes presents two Bayesian thought experiments that purport to establish virtuous predictivism. In each experiment an evaluator, Eva, faces two scenarios: one in which she confronts Peter, who posts an endorsement probability for T without appeal to N-supporting observations (thus Peter predicts N), and another in which she confronts Alex, who posts an endorsement probability for T on a basis that includes observations that establish N (thus Alex accommodates N). The idea behind both thought experiments is to make the scenarios otherwise as similar as possible; Barnes makes a number of ceteris paribus assumptions that render the probability functions of Peter and Alex maximally similar. However, it turns out that there is more than one way to keep the scenarios maximally similar: in the first experiment, Peter and Alex have the same likelihood ratio but different posteriors for T. In the second experiment they have the same posteriors but different likelihood ratios. Barnes demonstrates that Eva’s posterior probability is higher in the predictor scenario in both experiments, thus vindicating virtuous predictivism (2008: 69–80).

Although his defense of virtuous predictivism is the centerpiece of his account, Barnes claims that predictivism can hold true of actual theory evaluation in a variety of ways. He maintains that the position deemed ‘weak predictivism’ is actually ambiguous: it could refer to the claim that scientists actually rely on knowledge that evidence was (or was not) predicted because prediction is symptomatic of some other feature(s) of theories that is epistemically important (‘tempered predictivism’ [ 13 ]) or simply to the fact that there is a correlation between prediction and this other feature(s) (‘thin predictivism’). The distinction between tempered and thin predictivism cross-classifies with the distinction between virtuous and unvirtuous predictivism to produce four varieties of weak predictivism. Barnes then turns to the case of Mendeleev’s periodic law and argues that all four varieties can be distinguished in the scientific community’s reaction to Mendeleev’s theory of the elements (2008: 82–122). In particular, he argues that it was specifically Mendeleev’s predicted evidence, not his accommodated evidence, that had the power to confirm his scientific and methodological background beliefs from the standpoint of the scientific community.

Critical responses to Barnes’s account are presented in Glymour 2008; Leplin 2009; and Harker 2011. Barnes 2014 responds to these. See also Magnus 2011 and Alai 2016.

It was noted in Section 1 that John Maynard Keynes rejected predictivism. He argued that when a theory T is first constructed it is usually the case that there are reasons R that favor T. If T goes on to generate successful novel predictions E then E combines with R to support T, but if some \(T'\) is constructed ‘merely because it fit E’ then \(T'\) will be less supported than T. This has been deemed the “Keynesian dissolution of the paradox of predictivism” (Barnes 2008: 15–18).

Colin Howson cites with approval the Keynesian dissolution (1988: 382) and provides the following illustration: consider h and \(h'\) which are rival explanatory frameworks. \(h'\) independently predicts e ; h does not entail e but has a free parameter which is fixed on the basis of e to produce \(h(a_{0})\)—this latter hypothesis thus entails e . So \(h'\) predicts e while \(h(a_{0})\) merely accommodates e . Let us assume that the prior probabilities of h and \(h'\) are equal (i.e., \(p(h) = p(h')\)). Now it stands to reason that \(p(h(a_0)) \lt p(h)\) since \(h(a_{0})\) entails h but not vice versa—thus Howson shows it follows that the effect of e ’s confirmation will be to leave \(h'\) no less probable—and quite possibly more probable—than \(h(a_{0})\) (1990: 236–7). Thus predictivism appears true but the operating factor is the role of unequal prior probabilities. [ 14 ]
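
The step from unequal priors to unequal posteriors can be spelled out as a short derivation. It assumes, as in the illustration above, that both \(h'\) and \(h(a_{0})\) entail e (so the likelihood of e on each is 1) and that \(p(e) > 0\); it is offered as a reconstruction of the reasoning rather than as Howson’s own formulation.

```latex
\begin{align*}
p(h' \mid e) &= \frac{p(e \mid h')\,p(h')}{p(e)} = \frac{p(h')}{p(e)}, \\[4pt]
p(h(a_0) \mid e) &= \frac{p(e \mid h(a_0))\,p(h(a_0))}{p(e)} = \frac{p(h(a_0))}{p(e)}, \\[4pt]
p(h(a_0) \mid e) &= \frac{p(h(a_0))}{p(e)} \lt \frac{p(h)}{p(e)} = \frac{p(h')}{p(e)} = p(h' \mid e).
\end{align*}
```

Conditioning on e rescales both priors by the same factor \(1/p(e)\), so the ordering of the posteriors simply reproduces the ordering of the priors.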

The argument from Keynes and Howson against predictivism holds that the evidence which appears to support predictivism is illusory—they are clearly asserting that strong predictivism is false, presumably in its temporal and heuristic forms.

However, it is important to note that the arguments of Keynes and Howson cited above predate the injection of the concept of ‘weak predictivism’ into the literature. [ 15 ] It is thus unclear what stand Keynes or Howson would take on weak predictivism. Likewise, Collins’ 1994 paper “Against the Epistemic Value of Prediction” strongly rejects predictivism, but what he is clearly denying is what has since been deemed strong heuristic predictivism. He might endorse weak heuristic predictivism as he concedes that

all sides to the debate agree that knowing that a theory predicted, instead of accommodated, a set of data can give us an additional reason for believing it is true by telling us something about the structural/relational features of a theory. (1994: 213)

Similarly Harker argues that “it is time to leave predictivism behind” but also concedes that “some weak predictivist theses may be correct” (2008: 451); Harker worries that proclaiming weak predictivism may mislead some into thinking that predictive success is somehow more important than other epistemic indicators (such as endorsement by reliable scientists). White goes so far as to claim that weak predictivism “is not controversial” (2003: 656).

Stephen Brush is the author of a body of historical work, much of which purports to show that temporal predictivism does not hold in various episodes of the history of science. [ 16 ] These include the case of starlight bending in the assessment of the General Theory of Relativity (Brush 1989), Alfvén’s theories of space plasma phenomena (Brush 1990), and the revival of big bang cosmology (Brush 1993). However, Brush (1996) argues that temporal novelty did play a role in the acceptance of Mendeleev’s Periodic Table based on Mendeleev’s predictions. Scerri and Worrall (2001) present considerable historical detail about the assessment of Mendeleev’s theory and dispute Brush’s claim that temporal novelty played an important role in the acceptance of the theory (2001: 428–436). (See also Brush 2007.) Steele and Werndl (2013) argue that predictivism fails to hold in assessing models of climate change, while Frisch (2015) affirms that the assessment of climate models displays weak predictivism.

Another form of anti-predictivism holds that accommodations are superior to predictions in theory confirmation. “The information that the data was accommodated rather than predicted suggests that the data is less likely to have been manipulated or fabricated, which in turn increases the likelihood that the hypothesis is correct in light of the data” (Dellsen forthcoming).

Scientific realism holds that there is sufficient evidence to believe that the theories of the ‘mature sciences’ are at least approximately true. Appeals to novelty have been important in formulating two arguments for realism—these are the ‘no miracle argument’ and the realist reply to the so-called ‘pessimistic induction’. [ 17 ]

The no-miracle argument for scientific realism holds that realism is the only account that does not make the success of science a miracle (Putnam 1975: 73). ‘The success of science’ here refers to the myriad verified empirical consequences of the theories of the mature sciences, but as we have seen there is a long-standing tendency to regard with suspicion those verified empirical consequences the theory was built to fit. Thus the ‘ultimate argument for scientific realism’ refers to a version of the no-miracle argument that focuses just on the verified novel consequences of theories: it would be a miracle, this argument proclaims, if a theory managed to have a sustained record of successful novel predictions if the theory were not at least approximately true. Thus, assuming there are no competing theories with comparable records of novel success, we ought to infer that such theories are at least approximately true (Musgrave 1988). [ 18 ]

Insofar as the ultimate argument for realism clearly emphasizes a special role for novel successes, the nature of novelty has been an important focus in the realist account. Leplin (1997) is a book-length articulation of the ultimate argument for realism; Leplin proposes a sufficient condition for novelty consisting of two conditions:

An observational result O is novel for T if:

  • Independence Condition: There is a minimally adequate reconstruction of the reasoning leading to T that does not cite any qualitative generalization of O.
  • Uniqueness Condition: There is some qualitative generalization of O that T explains and predicts, and of which, at the time that T first does so, no alternative theory provides a viable reason to expect instances (Leplin 1997: 77).

Leplin clarifies that a ‘minimally adequate reconstruction’ of such reasoning will be a valid deduction D of the ‘basic identifying hypotheses’ of T from independently warranted background assumptions, where the premises of D cannot be weakened or simplified while preserving D’s validity. Thus for Leplin what establishes whether O is a novel consequence of T is not whether O was actually used in the construction of T, but rather whether it was ‘needed’ for T’s construction. As with Worrall’s mature ‘essential use’ conception of novelty, what matters is whether there is a heuristic path to T that does not appeal to O, whether or not O was used in constructing T. The Uniqueness Condition helps bolster the argument for the truth of theories with true novel consequences, for if there were another theory \(T'\) (incompatible with T) that also provides a viable explanation of O, the imputation of truth could not explain the novel success of both T and \(T'\). The success of at least one would have to be due to chance, but if chance could explain one such success it could explain the other as well.

Both of these conditions for novelty have been questioned. Given the Independence Condition, it is unclear that any observational result O will count as novel for any theory, for it may always be true that the logically weakest set of premises that entail T (which will be cited in a minimally adequate reconstruction of the reasoning that led to T) will include O as a disjunct of one of the premises (Healey 2001: 779). The Uniqueness Condition insists that there be no available alternative explanation of O at the time T first explains O; but clearly, theories that explain O could be subsequently proposed and would threaten the imputation of truth to T no less. This condition seems arbitrarily to privilege theories depending on when they were proposed (Sarkar 1998: 206–8; Ladyman 1999: 184).

Another conception of novelty whose purpose is to bolster the ultimate argument for realism is ‘functional novelty’ (Alai 2014). A datum d is ‘functionally novel’ for theory T if (1) d was not used essentially in constructing T (viz., there is a heuristic path to T and related auxiliary hypotheses that does not cite d), (2) d is a priori improbable, and (3) d is heterogeneous with respect to data that is used in constructing T and related auxiliary hypotheses (i.e. d is qualitatively different from such data). Functional novelty is a ‘gradual’ concept insofar as a priori improbability and data heterogeneity come in degrees. If there is more than one theory for which d is functionally novel then the dispute between these theories cannot be settled by the ultimate argument (Alai 2014: 306).

Anti-realists have argued that insofar as we adopt a naturalistic philosophy of science, the same standards should be used for assessing philosophical theories as scientific theories. Consequently, if novel confirmations are necessary for inferring a theory’s truth then scientific realism should not be accepted as true, as the latter thesis has no novel confirmations to its credit (Frost-Arnold 2010, Mizrahi 2012).

Another component of the realist/anti-realist debate in which appeals to novel success figure importantly is the debate over the ‘pessimistic induction’ (or ‘pessimistic meta-induction’). According to this argument, the history of science is almost entirely a history of theories that were judged empirically successful in their day only to be shown subsequently to be entirely false. There is no reason to think that currently accepted theories are any different in this regard (Laudan 1981b).

In response some realists have defended ‘selective realism’ which concedes that while the majority of theories from the history of science have proven false, some of them have components that were retained in subsequent theories—these tend to be the components that were responsible for novel successes. Putative examples of this phenomenon are the caloric theory of heat and nineteenth century optical theories (Psillos 1999: Ch. 6), both of which were ultimately rejected as false but which had components that were retained in subsequent theories; these were the portions that were responsible for their novel confirmations. [ 19 ] So in line with the ultimate argument the claim is made that novel successes constitute a serious argument for the truth of the theory component which generates them. However, antirealists have responded by citing cases of theoretical claims that were subsequently determined to be entirely false but which managed nonetheless to generate impressive records of novel predictions. These include certain key claims made by Johannes Kepler in his Mysterium Cosmographicum (1596), assumptions used by Adams and Leverrier in the prediction of the planet Neptune’s existence and location (Lyons 2006), and Ptolemaic astronomy (Carman & Díez 2015). Leconte (2017) maintains that predictive success legitimates only sceptical realism – the claim that some part of a theory is true, but it is not known which part.


Copyright © 2022 by Eric Christian Barnes


Published online by Cambridge University Press: 18 August 2022

When a theory is confronted with a problem such as a paradox, an empirical anomaly, or a vicious regress, one may change part of the theory to solve that problem. Sometimes the proposed solution is considered ad hoc. This paper gives a new definition of ‘ad hoc solution’ as used in both philosophy and science. I argue that a solution is ad hoc if it fails to live up to the explanatory requirements of a theory because the solution is not backed by an explanation or because it does not diagnose the problem. Ad hoc solutions are thus magical: they solve a problem without providing insight. This definition helps to explain both why ad hoc solutions are bad and why there may be disagreement about cases.

From the late 1950s until the 1980s ad hoc hypotheses were all the rage. Not that science was in a particularly bad state at that time; but due to Popper, philosophers of science were obsessed with ad hoccery. If science progresses via falsification, as Popper thought, we should ensure that theorists do not immunize their pet theories by devising additional hypotheses whenever experimental data contradicts their theory. But the history of science shows that additional hypotheses do sometimes constitute fruitful developments of a theory. For Popper and his school, the problem of determining which hypotheses are degenerative rather than progressive is the problem of determining which hypotheses are ad hoc. (The term ‘ad hoc’ has a pejorative and nonpejorative meaning. Here I am only interested in its pejorative sense.)

Despite the benefit of hindsight, the falsificationists were unable to give a definition of ‘ad hoc hypothesis’ that conformed to the usage of scientists in the examples they discussed. The main problem was that they considered empirical testability a cornerstone of science, and hence ad hoccery should be some lack of testability. But nothing in the scientific literature suggests that scientists consider a hypothesis ad hoc if it lacks testable consequences. Worse, some hypotheses were considered ad hoc even though they did have independently testable consequences. Jarrett Leplin mockingly observes that what followed were ‘distinctions and refinements [that] constitute something like a degenerating research program, many of whose entries are patently ad hoc’ (Leplin 1975: 314 fn. 16).

Leplin's comment illustrates that philosophers are not immune to giving ad hoc solutions. But although there is a large body of philosophical literature about ad hoc hypotheses in science, there is little work on ad hoccery more generally. Worse, many definitions of ad hoccery only apply to solutions suggested when faced with an empirical anomaly, thus excluding most parts of philosophy. But there is nothing to suggest that when a physicist claims some solution is ‘ad hoc’, they mean something quite different from when a philosopher makes a similar claim. On the contrary, it is likely that ‘ad hoc solution’ means roughly the same thing in different fields. There are differences between specific fields: physics uses different methods than (say) history. But at a very general level, in all fields of rational inquiry one develops theories based on evidence with the aim of explaining certain aspects of the world. Such theories may run into problems for which solutions are then proposed. Some of these solutions are called ‘ad hoc’. Other semitechnical terms in the vicinity of ‘ad hoc solution’—‘begging the question’, ‘special pleading’, and ‘moving the goalposts’—mean the same thing whether used by physicists, historians, or philosophers. This is one reason to think ‘ad hoc solution’ is not polysemous.

A second reason is that ‘ad hoc solution’ fails three standard tests for polysemy. The first is conjunction reduction. For example, ‘credit’ can mean praise as well as a type of loan from a bank. Conjoining sentences that use these different senses results in ambiguity. For example, ‘Jane and Anna got credit from the bank for their work’ is ambiguous, while ‘Jane got credit from the bank’ and ‘Anna got credit for her work’ are unambiguous. However, ‘Jane gave an ad hoc solution to a paradox’ and ‘Anna gave an ad hoc solution to an experimental anomaly’ can be conjoined unambiguously into ‘Jane and Anna gave ad hoc solutions to a paradox and an experimental anomaly’.

Another test is ellipsis: unless a polysemous term is used univocally, it is inappropriate to elide it. Thus, if Jane wants to participate as a runner in a marathon, while Anna wants to organize that marathon, one cannot express this as ‘Jane tried to run the marathon and Anna did too’. However, ‘Lorentz once proposed an ad hoc solution and so did Tarski’ is perfectly all right. Finally, polysemous terms can avoid contradictory readings. Since Jane wants to participate in, but not organize, the marathon, one can say without contradiction that Jane wanted to run the marathon, but she did not want to run the marathon. It is an awkward formulation, but contradiction is avoided because of polysemy. No such noncontradictory reading seems available for ‘Lorentz's contraction hypothesis was an ad hoc solution, but it was not an ad hoc solution’.

Thus, pace Lakatos (1970), ‘ad hoc’ does not have multiple senses, and unless we find good evidence that ‘ad hoc solution’ means something different in the context of one field than it does in another, we had better have a definition of ‘ad hoc solution’ that applies across the board. But currently there is not even any attempt made at such a definition.

This paper aims to fill that lacuna by giving a definition of ‘ad hoc solution’ that applies to science and also to philosophy. I argue in particular that once we add the concept of explanation, we gain an adequate understanding of ad hoccery. This more general definition also gives insight into why ad hoc solutions are bad, and it helps explain why some may disagree whether a solution is indeed ad hoc.

Before giving my definition, I discuss the most common associations people have with ad hoccery in section 1 and show how those fail to provide necessary or sufficient conditions for ad hoccery. In section 2 I define ‘ad hoc solution’ in terms of explanatory failure, and in section 3 I apply this definition to the discussion about the Church-Fitch paradox of knowability, the discussion about the axioms of Zermelo-Fraenkel (ZF) set theory, and the Lorentz-FitzGerald contraction hypothesis. I conclude with some remarks about the relation between ad hoccery and our theories of explanation.

1. Some Stubborn Associations

The Latin phrase ‘ad hoc’ translates as ‘to this’, and the adjective ‘ad hoc’ is commonly defined as ‘created for a specific purpose’. The many definitions given in the philosophy of science literature may suggest that ‘ad hoc’ is not used for a specific purpose but is rather a highly polysemous term. To restore order, we should rid ourselves of some stubborn associations by showing that none of these are necessary or sufficient for something to count as ad hoc.

The first of these associations is that ad hoc hypotheses lack testability or empirical content. Popper made this the cornerstone of his notion of ad hoccery (Popper 1959: 83; Popper 1965: 241). But scientists do consider some testable hypotheses ad hoc, even when the hypothesis makes predictions beyond the specific experiment that brought about the problem for the original theory. The endlessly discussed Lorentz-FitzGerald contraction hypothesis (LFC), which Popper (1959: 83) considered a paradigmatic example of an ad hoc hypothesis, did have (novel) testable consequences: the result of the Kennedy-Thorndike experiment in 1931 constituted an empirical refutation of it (Grünbaum 1959: 49ff.).

Neither is lack of testability sufficient for ad hoccery unless virtually all hypotheses in fields such as philosophy, mathematics, or history count as ad hoc. Hypotheses in these fields are commonly not experimentally testable. Popper and others in this tradition usually limit their definition to the use of ‘ad hoc’ in the sciences and in particular physics, thus implicitly holding that ‘ad hoc’ is polysemous. Worse, it is not obvious that hypotheses in physics that lack testable consequences are ad hoc. String theory is currently untestable, and the multiverse hypothesis seems untestable in principle. But to the best of my knowledge, no one currently complains that they therefore are ad hoc: lack of testability thus does not seem sufficient for ad hoccery. Better to abandon this ‘strange fixation on testability’ (Leplin 1975: 345). (For more arguments against lack of testability as a cornerstone for ad hoccery, see inter alia: Bamford 1993, Grünbaum 1976: 342ff., and Hunt 2012: 3ff.)

A second association is with circular explanations. Popper takes ad hoc explanations to be ‘almost circular’ (1972: 192), and David Miller gives ‘it is the dormitive virtue of opium that induces sleep’ as an example of an ad hoc hypothesis that is ‘supposedly explanatory’ (1981: 6–7). Since ‘dormitive virtue’ just means ‘the ability to induce sleep’, this is a circular explanation. But it is unclear what is ad hoc about it (unless, of course, all explanatory failures are ad hoc). Conversely, there are paradigmatic cases of ad hoccery, such as the LFC, that are clearly not circular explanations. If the LFC were a circular explanation, then the phenomena it explained should provide (part of) its explanation. The phenomena it explained did not provide the explanation for the LFC but were (part of) the justification for the LFC. But justification and explanation are distinct notions. My justification for believing that it is raining is that I see the rain, but (my seeing) the rain does not explain why it is raining. (Although it does help explain why I believe it is raining. But the target of explanation is the fact that it rains, not the fact that I believe that it is raining.) It thus seems that two fallacies are here combined by a fallacy of association. (For more on why circular explanations are different from ad hoc explanations, see Bamford 1993: 336ff.)

A third, related association is that ad hoc solutions lack independent evidence or independent reasons (Schaffner 1974: 68; Zahar 1973: 101ff.). Roughly, the idea is that a solution is ad hoc if the problem it solves is the only evidence or reason that can be given for it. Of course, it is a good thing when your hypothesis is corroborated by various pieces of independent evidence. But the history of science suggests that a lack of independent evidence is not sufficient to consider a hypothesis ad hoc. The only empirical evidence for the postulation of Neptune was an anomaly in the expected movement of Uranus; yet, there is no evidence that scientists considered it ad hoc (Leplin 1982: 237). Similarly, the redshift of a galaxy's characteristic spectrum is explained by the velocity of the galaxy although there is little evidence for the velocity of a galaxy beyond its redshift (Lipton 2001: 45).

Conversely, no scientist at the time seemed to think that additional (experimental) evidence for the LFC would make it less ad hoc (Holton 1969: 177). And, as a more extreme example, a parapsychologist who holds that psychic phenomena are disturbed by the presence of inquisitive or skeptical observers can point to a wide range of corroborating cases where psychic phenomena were different from their expectations (Boudry 2013: 249). Still, the parapsychologist's hypothesis is ad hoc. In sum, there is little evidence for the idea that a hypothesis is ad hoc if and only if it lacks independent evidence.

A lack of generality or a failure to unify is a fourth association (Lange 2001; Leplin 1975: 336ff.). The idea is that non-ad hoc solutions solve similar problems and therefore unify these problems. Unfortunately, some solutions are ad hoc even if they take care of various related problems. Suppose I solve Russell's paradox by holding that every predicate specifies a set except in those cases where this would result in paradox. My solution is general and unified: all these paradoxes have in common that supposing the existence of some set leads to a paradox. Still, the solution is blatantly ad hoc (Hand and Kvanvig 1999: 426). (Another example would be the parapsychologist's hypothesis mentioned in the previous paragraph.)

Conversely, not every exception to a general rule is ad hoc. Many sweeping scientific generalizations are, in the face of new data, restricted, and this is often considered progress. Bamford (1993) illustrates this by Hooke's law, which states that the extension of a spring is proportional to the force exerted on the spring. Springs do not always behave in accordance with Hooke's law; the point at which they stop behaving in that way is their elastic limit. Knowledge of the elastic limit of a material and how this limit changes due to fatigue is indeed crucial for designing bridges (Bamford 1993: 305ff.). But all these exceptions to Hooke's law do not make the theory of springs ad hoc. (At least to my knowledge no one ever made that complaint.)

Some accounts of ad hoccery combine some of these associations. For example, Leplin's (1975: 337) detailed analysis of ad hoccery states, among other things, that if a hypothesis is ad hoc, then (i) there is no evidence for it other than the experiment for which the hypothesis was formulated, (ii) the hypothesis has no applications outside of that experiment, and (iii) it has no independent theoretical support. But as we just saw, neither lack of independent evidence nor lack of generalizability is necessary for ad hoccery. Moreover, Leplin thinks ad hoccery is a global affair. He states that if a hypothesis is ad hoc, there are problems other than the specific experimental anomaly that triggered the hypothesis. These other problems indicate that the theory is ‘non-fundamental’: that the problems cannot be solved unless the non-fundamentality is removed, and that ‘a satisfactory solution to any of these problems . . . must contribute to the solution of the others’ (1975: 337). I do not see why ad hoccery cannot be local. Below I discuss two examples (a theory of bread and the Church-Fitch paradox) that are local: there are no obvious other problems that need to be dealt with too.

This ends our association game. Time to look at the problem of ad hoccery with fresh eyes.

2. Ad Hoc Solutions

Let us start with the classic toy example of the nourishment of bread. Suppose our basic bread theory states that all bread nourishes. This helps explain various bread-related facts, but it also faces a challenge. In August 1951 at least five people from the French village Pont Saint-Esprit died after eating bread. Not all bread nourishes, it seems, and our basic bread theory needs changing. A straightforward solution is to restrict the general statement of our theory: all bread nourishes, except the bread in Pont Saint-Esprit in August 1951. Now the tragedy of Pont Saint-Esprit no longer contradicts our theory. This is a good thing. At the same time, the solution is fishy and a paradigm of ad hoccery. Why?

It is important to note that the exception is true. The bread in Pont Saint-Esprit did not nourish while bread normally does. Every solution should thus state that the bread in Pont Saint-Esprit was not nourishing or at least be compatible with this claim; otherwise the resulting theory is simply false. The restriction itself can thus not be what is ad hoc: all solutions should exclude the bread from Pont Saint-Esprit from the nourishing bread. But not all solutions are considered ad hoc. We can understand why some are ad hoc if we focus on explanation. The simple exclusion solution gives no clue as to why the bread from Pont Saint-Esprit did not nourish while any satisfactory solution should explain this fact. Its explanatory failure, I submit, is what makes the simple exclusion solution ad hoc.

Contrast the simple exclusion with the ergot solution that states that all bread nourishes except bread that contains ergot and that the bread from Pont Saint-Esprit contained ergot. Neither solution is contradicted by the tragedy of Pont Saint-Esprit so they are equal in that respect. But the ergot solution has more explanatory depth: it can answer the question why the bread of Pont Saint-Esprit did not nourish. While the simple exclusion solution takes this as an unexplained fact, the ergot solution explains it: the bread contained ergot, and bread containing ergot does not nourish. The ergot solution thus explains or, as I prefer to put it, diagnoses the problem by pointing to a feature of the bread in Pont Saint-Esprit that is responsible for its failure to nourish. By ‘diagnosing a problem’ I thus mean an explanation for why the problem arose. (Note that this need not be an explanation for why the problem is indeed a problem. In our example we want an explanation for why the bread in Pont Saint-Esprit did not nourish. We are not interested in explaining why it is a problem for the original theory that the bread in Pont Saint-Esprit did not nourish. One should explain why theories should be empirically adequate to meet this latter explanatory demand. A diagnosis merely explains how the problem arose and takes it for granted that the problem is indeed a problem.)

The ergot solution as it stands may still be somewhat wanting. It solves the problem of the bread in Pont Saint-Esprit using an entity that (apparently) counters bread's capacity to nourish. But why does bread containing ergot fail to nourish? And what exactly is ergot? Some might say this solution is still not free of ad hoccery. Compare it with the poison solution, which holds that the bread in Pont Saint-Esprit contained a poison and that all bread nourishes except when it contains a poison. While the ergot solution triggers the question ‘why does bread containing ergot fail to nourish?’, the poison solution triggers no analogous question. In many contexts it is considered obvious that bread containing a poison fails to nourish. Poisons are bad for human beings, and therefore bread containing them will have a bad effect on human beings eating that bread. The poison solution is thus backed by a sufficient explanation.

Despite this, the poison solution may not be overall better than the ergot solution, because the former provides a less specific diagnosis than the latter. The diagnosis of the poison solution is that the bread in Pont Saint-Esprit contained a poison. But this is arguably too general. There are many poisons, and the poison solution does not single out a particular one that the bread in Pont Saint-Esprit was supposed to contain. It raises the question: which poison exactly was in the bread? Ideally, we would thus want a theory that both provides a specific diagnosis and is backed by an explanation. Of course, in our example this is achieved by combining the last two solutions, that is, the ergot-poisoning solution, which states that (a) all bread nourishes except when it contains a poison; (b) ergot is a poison; and (c) the bread in Pont Saint-Esprit contained ergot. In many contexts claim (a) is not in need of any further explanation. The diagnosis (c) is now backed by (a) and (b) because these latter two answer the question why bread containing ergot fails to nourish. The resulting theory thus provides a non-ad hoc solution to the problem of the bread of Pont Saint-Esprit because it diagnoses the problem, and the diagnosis is backed by an explanation. The simple exclusion solution, on the other hand, provides an ad hoc solution because it fails to diagnose the problem, and, trivially, its diagnosis is not backed by an explanation.

More generally, I claim that ‘ad hoc’ is used in philosophy and science for solutions that are non-explanatory, thus:

(Ad Hoc) A solution to a problem is ad hoc if and only if

(i) (a) the solution does not diagnose the problem, or [Diagnosis]

(b) the solution is not backed by a good explanation; and [Explanation]

(ii) it is reasonable to demand a solution that diagnoses the problem and is backed by a good explanation. [Reasonable]
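
The logical shape of (Ad Hoc) can be sketched in symbols. This is only an illustrative paraphrase: the predicate letters are labels assumed here for convenience and are not part of the definition itself. Let \(D(s,p)\) abbreviate ‘solution s diagnoses problem p’, let \(E(s,p)\) abbreviate ‘s, as a solution to p, is backed by a good explanation’, and let \(R(p)\) abbreviate ‘it is reasonable to demand a solution to p that diagnoses it and is backed by a good explanation’.

\[
\mathrm{AdHoc}(s,p) \iff \bigl(\neg D(s,p) \lor \neg E(s,p)\bigr) \land R(p)
\]

On this reading the disjunction of [Diagnosis] and [Explanation] is inclusive, so a solution that fails both, such as the simple exclusion solution in the bread example, still counts as ad hoc, while a solution that satisfies both \(D\) and \(E\), such as the ergot-poisoning solution, does not. Where \(R(p)\) fails, nothing counts as an ad hoc solution to p, whatever its explanatory shortcomings.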

An ad hoc solution is thus mysterious either because it does not diagnose the problem or because it is not backed by a good explanation. And this mystery is problematic because a non-mysterious solution is reasonably demanded. I have defined ad hoccery for solutions rather than for hypotheses because ad hoccery mostly comes up in the context of problems: empirical anomalies, paradoxes, vicious regresses, and so on. (Many definitions of ‘ad hoc hypothesis’ therefore state that the hypothesis is proposed as a solution to some empirical anomaly.) It may help to go over the conditions in (Ad Hoc) in some more detail.

The first condition states that the solution is not explanatory because (a) it fails to diagnose the problem or (b) it is not backed by an explanation. To diagnose a problem is to explain how it arose in the first place. In the above example both the ergot solution and the ergot-poisoning solution diagnosed the problem as arising from ergot in the bread. The diagnosis in this case thus introduces an object that is held responsible, but a diagnosis may instead delete rather than introduce an object. The problems of phlogiston theory may, for example, be diagnosed as arising from the mistaken assumption that phlogiston exists. (Diagnosing is easier with the benefit of hindsight.) And some diagnoses are orthogonal to matters of existence: a diagnosis of the Grelling-Nelson paradox (‘Is “heterological” a heterological word?’) would not point to an object that is responsible for the paradox but may point out a property of natural languages (that they are semantically closed, for example).

A solution is backed by an explanation when it is explained how the solution solves the problem. In the bread example, the simple exclusion solution was not backed by an explanation because there was no answer to the question why the bread in Pont Saint-Esprit did not nourish. But neither was the ergot solution backed by an explanation because (to most people) it would be unclear why the presence of ergot in the bread ensures that it no longer nourishes. The poison solution was backed by an explanation: something that is nutritious in normal circumstances is no longer nutritious when it contains a poison. And the final solution we discussed partly used this same explanatory backing: ergot is a poison, and something that is nutritious in normal circumstances is no longer nutritious when it contains a poison; therefore, bread containing ergot fails to nourish.

In the bread example we saw also that more general diagnoses, which may apply to more cases than the case at hand, are not necessarily better. Blaming poison whenever a nourishing substance fails to nourish is a general solution, but we often want to know specifics. On the other hand, a good diagnosis can often be generalized—other products made from grain and containing ergot will also fail to nourish. Generality is thus a bad predictor for the quality of a diagnosis: good diagnoses can often be generalized, but a general diagnosis may be too general to be informative. Clearly, it may be debatable whether a diagnosis is good, which explains why we may disagree about ad hoccery.

I should stress that the distinction between diagnoses and explanations is artificial: a diagnosis is a kind of explanation. But the bread example shows that not every diagnosis is backed by an explanation, and therefore I have chosen the term ‘diagnosis’ for that which explains how the problem arose so that ‘explanation’ can be unambiguously used for that which backs the solution. I take the term ‘diagnosis’ from Stephen Read, who argues that Buridan's solution to the liar and revenge paradoxes is ‘an ad hoc device designed solely, and without any real diagnosis, to block the paradoxes’ (2002: 202). (Not everyone agrees with Read's claim; see Benétreau-Dupin 2015; Hughes 1984: 20; and Klima 2008.)

In the context of theories of truth, there is another good example of a solution that does not diagnose a problem. Deflationists hold that there is not much to truth; it is completely characterized by (T): ‘p’ is true if and only if p. Like many other theories of truth, deflationism faces a problem with the liar sentence: ‘this sentence is not true’. By applying (T) to the liar and using some basic logic, we get a contradiction. One way out for deflationists is to restrict (T) so it does not apply to those propositions that lead to a paradox. This solves the problem but without diagnosing it because it does not tell us why applying (T) to some sentences leads to a contradiction.

Contrast this with a solution that bans all self-referential sentences from (T). This solution diagnoses the problem as due to self-referentiality; the liar results in paradox because it is self-referential. This response might still be ad hoc, though, for it seems to meet condition (i) (b): the solution is not backed by an explanation. The claim that self-referentiality is problematic needs to be backed by an explanation if only because not all self-referential sentences seem problematic. ‘This sentence contains five words’ is a perfectly consistent self-referential sentence. Hence, unless backed by an explanation, the self-referentiality response may be ad hoc although for a slightly different reason than the previous solution. (Of course, the self-referentiality response might just be bad because it is false. Or maybe it is both false and ad hoc: ad hoccery offers no protection against other vices.)

To avoid satisfying condition (i) of (Ad Hoc) a solution should both diagnose the problem and, unless the solution is not in need of an explanation, be backed by an explanation. This link between ad hoccery and explanatory failure can also be found in Read's assessment of Paul Horwich's deflationist solution to the liar:

The fact that he [Horwich] excludes the paradoxical cases of (T) from the account of truth shows, first, the ad hoc and unsatisfactory nature of his account of the paradoxes—after all, he has no further account of truth to which he can appeal to explain the exclusion. (Read 2002: 214)

A similar sentiment is expressed by J.C. Beall and Bradley Armour-Garb:

Nothing in deflationism itself yields a principled explanation of why such sentences should not be within the range of (T)'s variables (as it were). This leaves open the possibility that deflationists may none the less resort to ad hoc restrictions. (Beall and Armour-Garb 2003: 313)

It may of course happen that a solution lacks both a diagnosis and an explanatory backing. For example, Read seems to think that ‘to explain the exclusion’ Horwich should provide both a diagnosis of the paradox and an explanatory backing for the exclusion of the problematic sentences.

I will add one small aside about Samuel Schindler's (2018: 59) analysis of ad hoccery because it resembles (Ad Hoc) in some sense. Schindler holds that a hypothesis is non-ad hoc if it coheres with the theory at hand or the relevant background theories, and a hypothesis coheres with the (background) theory just in case the theory provides theoretical reasons for believing the hypothesis. If the hypothesis can be deduced from the (background) theory, then, for Schindler, this constitutes a theoretical reason to believe the hypothesis. Another theoretical reason for believing a hypothesis is if the (background) theory explains why the hypothesis is true. This latter idea is somewhat similar to condition (i) (b) of (Ad Hoc). But note that coherence plays no role in (Ad Hoc) and that mere deduction of a solution is, under (Ad Hoc), insufficient to save it from ad hoccery. Moreover, Schindler's definition of ‘ad hoc hypothesis’ only covers hypotheses that ‘are introduced to save a theory . . . from empirical refutation’ (2018: 59) and is thus inapplicable in most philosophical contexts.

Just as there may be disagreement about the quality of a diagnosis or about the quality of the explanation backing the diagnosis, there may be disagreement about whether the diagnosis needs an explanation. In the bread example, someone might demand a further explanation for why ergot is poisonous. This demand is reasonable in the context of chemistry and biology where we seek to explain why some substances are poisonous. But the demand may be unreasonable in the context of history.

Which brings us to the second condition: just because you can cook up a why-question that the solution does not answer does not mean that the solution is ad hoc. Some demands for explanation are unreasonable, and condition (ii) ensures that failing to live up to an unreasonable demand does not make a solution ad hoc. Again, there may be disagreement about whether demanding an explanation is reasonable and thus disagreement about whether a solution is ad hoc. I have no theory to offer on when an explanation is reasonably demanded, and it is beyond the scope of this paper to construct one. (But see Bromberger 1992 for an illuminating discussion.)

Conditions (i) and (ii) show that a solution may become non-ad hoc if it becomes possible to diagnose the problem, if a reasonably demanded explanation starts backing it, or if it becomes unreasonable to demand an explanation for it. In the next section I show that philosophers and scientists use ‘ad hoc’ in accordance with (Ad Hoc).

3. (Ad Hoc) Applications

Philosophers often complain that something is ad hoc. Within the philosophy of language, ‘many philosophers regard [Tarski's solution to the liar] as ad hoc’ (Sher and Bo 2019: 38). (For example, Fox 1989: 177 and Priest 2000: 309.) In the philosophy of mathematics, the axioms of ZF set theory are often considered to provide an ad hoc solution to the set-theoretical paradoxes (Cook and Hellman 2018: 53; Menzel 1986: 37–39; Putnam 2000: 24). And in metaphysics it has been argued that universals and tropes are pieces of ad hoc ontology (Rodriguez-Pereyra 2002: 210ff.) and that postulating a primitive non-mereological form of composition for facts is an ad hoc solution to the unity problem (Betti 2015: ch. 2).

In science the complaint of ad hoccery seems to be less often made than in philosophy. Examples of alleged ad hoc hypotheses that are often discussed by philosophers of science are Ptolemy's epicycles, the LFC, and the neutrino hypothesis. Of these three, the LFC is arguably the strongest example because even Lorentz himself thought it was ad hoc.

Since I lack the space to discuss all cases where the complaint of ad hoccery is made, I will only discuss the use of ‘ad hoc’ in the debate about the Church-Fitch paradox of knowability, in the discussion about the axioms of ZF set theory, and in the evaluation of the LFC. In each case I show that the use of ‘ad hoc’ corresponds to (Ad Hoc).

3.1 An Unknown Truth

The Church-Fitch paradox shows that all truths are known if all truths can be known. (For a general introduction to this paradox, see Brogaard and Salerno 2019.) Some antirealists are committed to the claim that all truths can be known, and the paradox threatens their position because it seems absurd to hold that all truths are known. Besides the rules of elementary logic, the paradox assumes that knowledge is factive and distributes over conjunctions and that absurd propositions are impossible. Here is a sketch of the problem. Suppose that all truths can be known (TCK), that is,

(TCK) For all propositions p, if p is true then it is possible to know p.

Suppose, for contradiction, that there is a truth, q, that is not known. By (TCK) it is possible to know the following conjunction: q is true and q is not known. Suppose this conjunction is known. Then it follows from knowledge's distribution over conjunctions that q is known and that it is known that q is not known; from this, by the factivity of knowledge, it follows that q is both known and not known, which is a contradiction. Since absurd propositions are impossible, the conjunction cannot be known after all, contrary to what (TCK) delivered. Hence, if all truths are knowable, all truths are known.
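The steps of this sketch can be laid out line by line. The symbols are assumptions introduced here for illustration: K reads ‘it is known that’ and the diamond expresses possibility; each premise named in the text appears as an annotation.

```latex
% Requires amsmath and amssymb. K and \Diamond are illustrative notation.
\begin{align*}
1.\;& \forall p\,(p \rightarrow \Diamond K p) && \text{(TCK)}\\
2.\;& q \wedge \neg K q && \text{assumption, for contradiction}\\
3.\;& \Diamond K(q \wedge \neg K q) && \text{from 1 and 2}\\
4.\;& K(q \wedge \neg K q) && \text{supposition}\\
5.\;& K q \wedge K \neg K q && \text{from 4, distribution over conjunction}\\
6.\;& K q \wedge \neg K q && \text{from 5, factivity of knowledge}\\
7.\;& \neg K(q \wedge \neg K q) && \text{from 4--6, reductio}\\
8.\;& \neg \Diamond K(q \wedge \neg K q) && \text{from 7, absurd propositions are impossible}\\
9.\;& \neg (q \wedge \neg K q) && \text{from 2, 3 and 8, reductio}\\
10.\;& \forall p\,(p \rightarrow K p) && \text{from 9, classical logic, } q \text{ arbitrary}
\end{align*}
```

Line 7 rests only on the two assumptions about knowledge, which is why it can be strengthened to the impossibility claim in line 8.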

Neil Tennant (1997) offers a solution to the paradox based on distinguishing Cartesian from anti-Cartesian propositions and restricting (TCK) to Cartesian propositions. A proposition p is anti-Cartesian if and only if we can derive a contradiction from the assumption that p is known. Tennant distinguishes three ways in which a proposition might be anti-Cartesian. First, the proposition itself may be inconsistent, in which case a contradiction can be derived from the assumption that we know it. Second, a proposition such as ‘No thinking thing exists’ may be consistent but false whenever it is the object of a propositional attitude. Finally, there are claims of the form ‘p and it is not known that p’ from which we can derive a contradiction if we assume that knowledge distributes over conjunctions. By restricting (TCK) to Cartesian propositions, paradox is avoided because ‘q and it is not known that q’ is an anti-Cartesian proposition. Tennant's diagnosis of the Church-Fitch paradox is thus that anti-Cartesian propositions are wrongly taken to be within the scope of (TCK).

Not everyone likes Tennant's solution. Michael Hand and Jonathan Kvanvig (1999) argue that it is ad hoc. A non-ad hoc solution to the paradox must not merely exclude those propositions that lead to a problem, but

one must go beyond such arbitrary approaches. Realists do this by observing that truth is ‘radically nonepistemic’, thereby giving themselves a reason based on their conception of truth for denying [(TCK)]. Tennant must do something comparable. We should expect him to find some feature of truth, antirealistically conceived, that disarms the paradox by allowing some truths to be unknowable. (Hand and Kvanvig 1999: 423, my italics)

Hand and Kvanvig demand some theory of truth that provides a reason or explanation for restricting (TCK). Note that Hand and Kvanvig do not deny that Tennant's solution is general. They simply think that generality is no defeater for ad hoccery. As a toy example they mention the claim that any grammatically predicative expression defines a set, except when this assumption leads to a contradiction. This solution to Russell's paradox is quite general but ‘clearly ad hoc’ (Hand and Kvanvig 1999: 426).

Hand and Kvanvig grant that Tennant has given a diagnosis of the problem: the problem arises for (TCK) because it uses an anti-Cartesian proposition. But they seem to think that the diagnosis is wrong or lacks an explanatory backing. For example, they argue that Tennant's solution works for (TCK) but not for its necessitation; it should also work for this modalized cousin because the antirealist holds that it is essential to truth that it is knowable. This suggests that Tennant's diagnosis is incorrect by not going to the heart of the matter. After considering some ways to deal with the stronger version, Hand and Kvanvig claim these are all ad hoc because they ‘do not cite some feature of truth that calls for the restriction in question’ (1999: 425). They thus want the diagnosis to be backed by an explanation: some feature of truth should explain the restriction.

In his response to Hand and Kvanvig, Tennant operates with a different conception of ad hoc, for he stresses the generality of his approach and compares it favorably to other general solutions to various problems. For example, he considers the following restriction to (T) to avoid the liar paradox: For all propositions p, ‘p’ is true if and only if p, except for those propositions from which we can derive a contradiction. This restricted schema ‘is substantive, informative and important. The objection that the restriction invoked is ad hoc is groundless’ (Tennant 2001: 110).

I beg to differ. This recipe for restricting general principles is a get-out-of-jail-free card that immunizes principles like (T) and (TCK) against defeat. It also leaves it completely mysterious why some propositions are excluded. This is as (Ad Hoc) prescribes: restricting a general principle by simply excluding the instances that lead to problems fails to diagnose the problem because it does not even try to explain how the problem came about. This makes such solutions unsatisfactory and triggers the reproach of ad hoccery. I am not alone in thinking this: Igor Douven (2005: 50–51) makes the same point before offering a more principled case for Tennant's solution.

Douven's account of what it takes to be non-ad hoc is quite similar to that of Hand and Kvanvig although Douven rightly objects to the idea that one's solution to the Church-Fitch paradox should be based on one's theory of truth. Why, Douven asks, ‘could it not be something about one's conception of, for instance, knowledge that explains what is wrong with [(TCK)]?’ (2005: 49). Douven provides the following criterion:

In order to qualify as principled or non-ad hoc, it is necessary and sufficient that a proposal for restricting [(TCK)] in a particular way be accompanied by a reason for adopting it other than its capability to solve the paradox, and that reason must be related, in an informative or explanatory way, to one or more of the concepts that are either implicitly or explicitly involved in [(TCK)]. (2005: 50)

Douven thus thinks that the restriction imposed on (TCK) is not ad hoc only if it is backed up by some explanatory or informative reason. He is thereby appealing to (i) (b) of (Ad Hoc). He then provides such a reason for Tennant's (anti-)Cartesian solution based on the idea that anti-Cartesian propositions cannot be consistently believed. He takes this to be both independently motivated (because it also helps to solve a version of Moore's paradox) and an explanation for why there are unknowable truths (there are unknowable truths because there are propositions that cannot be consistently believed; Douven 2005: 57–58). This last point illustrates that although Douven does not explicitly mention the need for a diagnosis in his conditions for non-ad hoccery, he does provide such a diagnosis.

I do not wish to pass judgement on Douven's solution but only want to note the strategy. Although he ends up with a more general restriction of (TCK) than Tennant, Douven nowhere suggests that this is part of the reason his solution is not ad hoc. Instead Douven attempts to explain the restriction, and this explanation also diagnoses the paradox: exactly as (Ad Hoc) prescribes.

3.2 An Iteration

To illustrate the adequacy of (Ad Hoc) further I apply it to the philosophical debate about the axioms of ZF set theory. Any student of set theory knows that naive set theory leads to problems such as Russell's paradox and the Burali-Forti paradox. The main suspect is the axiom schema of (naive) comprehension whose instances ensure that any grammatically predicative expression defines a set. In ZF set theory one replaces this schema either with the axiom schema of replacement (together with an axiom stating the existence of the empty set) or with the axiom schema of separation. This avoids the known set theoretic paradoxes—but not to everyone's satisfaction.

Zermelo's approach, however, offers no justification of the restrictions imposed upon P [i.e., the naive comprehension principle] other than the fact that the paradoxes are avoided. But that is ad hoc. What we would like is some sort of explanation of why there is no Russell set or no set of all ordinals, or why, at least, we shouldn't be able to prove there are such sets from our axioms. (Menzel 1986: 39, italics in the original)

In line with (Ad Hoc) Menzel wants a solution that offers a diagnosis: why do the paradoxical sets not exist? Menzel is not alone in finding the solution ad hoc: ‘the “resolution” offered by first-order ZFC is a paradigm of the ad hoc’ (Cook and Hellman 2018: 53). (And although Putnam [2000: 24] does not use the term ‘ad hoc’, I agree with Douven [2005: 51] that Putnam is best understood as saying that ZF is ad hoc.)

These critics of ZF demand an explanation for the selection of axioms as well as a diagnosis of the paradoxes of naive set theory, preferably in one sweep. This may be provided by the iterative conception of sets: the idea that sets are ‘constructed’ in stages and that each stage contains all previously constructed sets plus all subsets that can be constructed out of them. Boolos (1971) popularized this conception among philosophers, and it diagnoses Russell's paradox as a problem of trying to construct a set consisting both of elements one previously constructed and of an element one has not yet constructed. But at each stage one can only use previously constructed sets. (Of course, all this talk of ‘constructing’ should not be taken literally.) Moreover, Boolos takes the iterative conception of a set to explain the axioms of ZF such that they are ‘not at all ad hoc’ (1971: 218).

Interestingly, Boolos notes that the axiom schema of replacement does not ‘follow from the iterative conception’ (1971: 228) but has ‘many desirable consequences and (apparently) no undesirable ones’ (229). This provides at best an abductive argument for replacement but may fall short of the kind of explanation that someone like Putnam demands. This illustrates that disagreement about ad hoccery can be due to disagreement about whether a solution is (sufficiently) backed by an explanation.

Instead of trying to meet the demand for a satisfactory explanation, Penelope Maddy (2011) defends contemporary set theory against the charge of ad hoccery by holding that demanding such an explanation is unreasonable, effectively saying that condition (ii) of (Ad Hoc) is not met. (To be sure, Maddy does not frame her argument in terms of ad hoccery.) Maddy argues that the axioms of a mathematical theory need no explanation or ‘intrinsic justification’, that is, justification coming from some pretheoretic notion of a set. Rather, the axioms are extrinsically justified: they are as simple and powerful as possible while (for all we know) avoiding contradiction. According to Maddy, it thus is unreasonable to demand an explanation; hence none of the axioms of set theory are ad hoc because condition (ii) of (Ad Hoc) is not satisfied.

3.3 LFC Revisited

The LFC hypothesis was one of Popper's main examples of an ad hoc hypothesis (1959: 83), and it is now a litmus test for any definition of ‘ad hoc solution’. The LFC states that an object contracts in its direction of travel. It was proposed by FitzGerald and, independently, by Lorentz after the famous null results of the Michelson-Morley experiments. These experiments used an interferometer designed to detect the ether on the basis of its effect on the speed of light. A light beam was first split into two beams traveling in perpendicular directions, and both beams were then sent back to a single screen. The light beam that travelled parallel to the earth's direction of travel relative to the ether should take longer than the light beam that travelled perpendicular to it. However, the two light beams always arrived at (virtually) the same time, no matter when the experiment was conducted or which direction the interferometer was facing. Thus, either the speed of light was constant, which was hard to square with the idea that light travelled through the ether, or, as the LFC states, objects contract in their direction of travel, which explains why the ‘slower’ light beam arrives at the same time as the ‘faster’ one. Current physics holds that, in a sense, both are true. The speed of light is constant and, stated in relativistic terms, the length of a moving object is shorter than its proper length, which is its length as measured in its own rest frame. Note that this does not mean a moving object is physically deformed (its proper length does not change) but ‘merely’ that the measured length of an object depends on whether the object is in motion relative to the observer. Crucially, the LFC was not originally stated in relativistic terms, and it was clear to everyone that the hypothesis was fishy.
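The arithmetic behind the expected delay and its cancellation can be checked numerically. The sketch below uses the standard ether-theory round-trip formulas for the two arms and then shortens the parallel arm by the contraction factor; the function names, the arm length, and the speed are illustrative assumptions, not values taken from the text.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def round_trip_parallel(arm: float, v: float, c: float = C) -> float:
    """Ether-theory round-trip time along the direction of motion."""
    return arm / (c - v) + arm / (c + v)


def round_trip_perpendicular(arm: float, v: float, c: float = C) -> float:
    """Ether-theory round-trip time at right angles to the motion."""
    return 2.0 * arm / math.sqrt(c * c - v * v)


def contracted(arm: float, v: float, c: float = C) -> float:
    """Arm length after the contraction posited by the LFC."""
    return arm * math.sqrt(1.0 - (v / c) ** 2)


ARM = 11.0    # metres; illustrative value, not taken from the text
V = 30_000.0  # m/s; roughly the earth's orbital speed

t_par = round_trip_parallel(ARM, V)
t_perp = round_trip_perpendicular(ARM, V)
t_par_lfc = round_trip_parallel(contracted(ARM, V), V)

print(f"without contraction: {t_par - t_perp:.3e} s difference")
print(f"with contraction:    {abs(t_par_lfc - t_perp):.3e} s difference")
```

Without contraction the parallel beam arrives later; with the parallel arm shortened, the two times agree up to floating-point rounding, which is the sense in which the LFC accounts for the null result.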

The LFC is a rather good litmus test for any theory of ad hoccery because physicists at the time, including Lorentz, were dissatisfied with it (Holton 1969: 139). But it should be noted that because this is a much-discussed case study, there are a few myths surrounding the Michelson-Morley experiments and the LFC. One is that the experiments played a key role in Einstein's formulation of special relativity. In fact, it is unclear whether Einstein was even aware of these experiments when writing his 1905 paper. Einstein (1905) suggests he arrived at his theory mainly via his dissatisfaction with the asymmetries in Maxwell's theory of electrodynamics. Another myth is that the LFC had no new testable consequences. It did, and these consequences were refuted by the Kennedy-Thorndike experiments. (For detailed myth busting, see Grünbaum 1959 and Holton 1960, 1969.)

In this subsection I show that Lorentz considered the LFC an ad hoc solution in the sense of (Ad Hoc). I argue that Lorentz thought the LFC provided a diagnosis but was not backed by a good explanation. Thus, the problem with the LFC was that no reasonable explanation could be given for why objects contracted in their direction of travel. The explanatory backing Lorentz ended up giving was not fully satisfactory—not even to himself—because it depended on assumptions for which he could at most give analogical arguments. Accordingly, the LFC was ad hoc because it satisfied conditions (i) (b) and (ii) of (Ad Hoc).

In a letter to Einstein dated 23 January 1915, Lorentz writes that, like Einstein, he also thought that the LFC was ad hoc and that he had said so in print (Kox 2008: 410; Lorentz seems to refer to his 1904a). Lorentz also states that in the absence of a general theory one should be content with explaining a single fact, ‘as long as the explanation is not artificial’ (‘wenn diese Erklärung nur nicht erkünstelt ist’, Kox 2008: 410). He thought that the LFC was not artificial, but rather the only possible explanation (‘die einzig mögliche’) and one that would have seemed less ad hoc and even quite natural when one assumes that the transformation properties of electromagnetic forces also hold for other forces, in particular molecular forces (Kox 2008: 411). Lorentz thus assumed an analogy: molecular forces are affected in a moving body in a way similar to the way electromagnetic forces change around a moving body. (Note, incidentally, that this assumption is both unifying and, in principle, testable.)

This assumption provides a crucial part of Lorentz's attempt to give a satisfactory explanatory backing of the LFC because, together with other assumptions, it allowed him to derive the LFC from the Maxwell equations. But as Lorentz admitted elsewhere, the assumption that molecular forces are affected in a moving body in a way similar to the way electromagnetic forces change around a moving body was by no means unquestionable. He called it ‘bold’ (Dutch: ‘gewaagd’, 1892: 78), admitted that ‘we really have no reason’ to suppose it (‘wozu freilich kein Grund vorliegt’, 1895: §92), and thought it ‘cannot in itself be pronounced to be either plausible or inadmissible’ (1904b: 825).

In The Theory of Electrons (1915) Lorentz seems very much aware that this assumption is on shaky grounds:

We can understand the possibility of the assumed change of dimensions, if we keep in mind that the form of a solid body depends on the forces between its molecules, and that, in all probability, these forces are propagated by the intervening ether in a way more or less resembling that in which electromagnetic actions are transmitted through this medium. From this point of view, it is natural to suppose that, just like the electromagnetic forces the molecular attractions and repulsions are somewhat modified by a translation imparted to the body, and this may very well result in a change of its dimensions. (1915: 201–2)

Notice how cautiously Lorentz expresses himself: ‘in all probability’, ‘more or less resembling’, ‘it is natural to suppose’, and ‘may very well’. Moreover, the assumption that molecular forces behave similarly to electromagnetic forces is not the only assumption Lorentz makes to derive the LFC. According to one count, Lorentz's explanatory backing of the LFC contains at least eleven additional hypotheses. For a paper dealing with fundamental physics ‘it is veritably obsessed with making hypotheses’ (Holton 1960: 630).

It is fair to say that Lorentz was not completely convinced by his own solution. In the concluding remarks of The Theory of Electrons (1915) he contrasts his overall solution with Einstein's theory of relativity: ‘Einstein simply postulates what we have deduced, with some difficulty and not altogether satisfactorily, from the fundamental equations of the electromagnetic field’ (1915: 230, my italics).

The LFC provided a diagnosis of the null result of Michelson and Morley, but it lacked a satisfactory explanatory backing. Lorentz tried to provide such a backing by deriving the LFC from the Maxwell equations using certain assumptions about the electron. But the backing that Lorentz gave depended on at least one assumption for which there was at best an analogical argument: the idea that molecular forces are affected in a moving body similar to the way electromagnetic forces change around a moving body. Moreover, everyone in the scientific community, including Lorentz, demanded a solution that diagnosed the problem and was backed by a satisfactory explanation. The best explanation that Lorentz was able to give was, however, not fully satisfactory. Hence, the LFC was an ad hoc solution in the sense defined by (Ad Hoc).

4. Concluding Remarks

When an otherwise successful theory is confronted with an empirical anomaly, a paradox, a vicious infinite regress, or some other defect, one may always change the theory to solve the problem. But not every solution is an improvement, and degenerative solutions are often called ‘ad hoc’. A good definition of ‘ad hoc solution’ should help explain why, despite solving more problems than the original theory, the new theory is no improvement. I have argued that the answer lies in its explanatory failure: ad hoc solutions do not diagnose the problem or are not backed up by an explanation. Since a theory should not merely list facts or solve problems but also provide explanations, ad hoc solutions go against the raison d’être of a theory. We thus eschew ad hoc solutions for more than merely aesthetic reasons (pace Hunt 2012).

My analysis of ad hoc solution applies to all fields of rational inquiry insofar as these fields aim to provide explanations. This is an advantage over other accounts of ad hoccery, which all focus on ad hoc hypotheses that answer to empirical anomalies. Still, other definitions of ad hoccery can supplement (Ad Hoc). (Thanks to a reviewer for this journal for suggesting this to me.) (Ad Hoc) does not detail when a diagnosis or explanation is not good enough. Here Leplin's (1975) discussion about ad hoccery might be useful: it may be that the explanation is no good because it fails to be fundamental or because it cannot be generalized. Or one might agree with Schindler (2018) that a good explanation must cohere with relevant background theories to be satisfactory.

This also shows that (Ad Hoc) can explain why there are so many different definitions of ‘ad hoc hypothesis’ in the philosophy of science: because there are competing notions of what a good scientific explanation looks like. For example, Popper famously held that a good scientific explanation is falsifiable. It is no wonder, then, that he considered lack of testability—that is, nonfalsifiability—a cornerstone of ad hoccery. Similarly, those who think good explanations are generalizable will likely consider ungeneralizable solutions ad hoc.

Because explanation is what distinguishes ad hoc solutions from genuine solutions, we should investigate explanations to gain a better understanding of ad hoccery. Given the current interest in explanation, both in the philosophy of science (Lange 2016; Lipton 2004; Reutlinger and Saatsi 2018; Woodward 2003) and in metaphysics (Correia and Schnieder 2012; Kment 2014; Ruben 2012), our understanding of ad hoccery is bound to grow in the foreseeable future.

I am grateful to the Swedish Research Council (Vetenskapsrådet International Postdoc Grant 2017-06160_3) and the Dutch Research Council (NWO grant VI.Veni.201F.006 ‘The Whole Explanation, Part by Part’) for funding my research. This paper was presented at the Higher Seminar in Lund in 2021: thanks to the audience for their constructive feedback. Thanks also to Chris Daly, Hein van den Berg, Ylwa Sjölin Wirling, and three anonymous reviewers for this journal for comments on a previous draft of this paper. Finally, I am especially grateful to David Liggins for the many entertaining and enlightening discussions we had on ad hoccery and for his extensive comments on multiple versions of this paper.

Sociology Plus

Ad hoc Hypothesis

An ad hoc hypothesis is a supplementary hypothesis added to a theory to prevent it from being refuted. According to Karl Popper’s philosophy of science, unfalsifiable intellectual systems like Marxism and Freudianism have been sustained only through reliance on ad hoc hypotheses to fill gaps. Ad hoc hypotheses are used to account for anomalies that the theory in its unaltered form could not foresee.


Ad hoc hypotheses are acceptable if and only if their non-universal, specific nature can be shown or, to put it another way, if their direct generalization is refuted. An ad hoc hypothesis is one that is embraced without any other justification in order to save a theory from refutation or criticism. This technique is deployed in sociological research studies.

The derivation of the particular conclusion in an issue may be deemed invalid if an ad hoc hypothesis was proven to be acceptable and non-universal; as a result, the specific example loses its scientific significance. The necessity of repeat testing is implied in the aforementioned working rule for the acceptance of ad hoc hypotheses, which makes this process seem all the more justifiable.

Notably, the system seems to be in question whenever the introduction of an ad hoc hypothesis is required until the acceptability of the ad hoc hypothesis appears to be established by the requisite falsification attempts. The restriction of ad hoc hypotheses and the continuity principle appear to guarantee the objectivity of falsification; in other words, a theory should only be regarded as falsified if its falsification is theoretically testable.

In addition, because it gives a preferential position to a critical evaluation or falsification, this principle of restriction serves as, in a sense, the second part of the working definition for the idea of a theoretical system’s falsification. Ad hoc hypotheses can be used to attempt to prevent falsification based on the continuity principle, but this can only be done if a different hypothesis, the generalized ad hoc hypothesis (which is also subject to the continuity principle), can also be refuted. Therefore, avoiding falsification depends on (yet another) deception. 

The first falsification takes effect if the second one is unsuccessful. This methodological constraint, the principle of the restriction of ad hoc hypotheses, effectively eliminates the “conventionalist objection to falsifiability.” Provided that a system allows empirically verifiable consequences to be derived in the first place, the claim that the system is in principle not falsifiable is shown, via the principle of the restriction of ad hoc hypotheses, to be inconsistent.

Since the non-falsifiability of any hypothesis (even a generalized ad hoc hypothesis) would necessitate the falsifiability of other hypotheses, this principle gives a workable definition of the term “falsification” (that is, the falsification of the original axiomatic system). This is obviously inconsistent.

The ad hoc hypothesis “This (otherwise accurate) watch showed the wrong time under such and such circumstances” is only a valid ad hoc hypothesis if the universal statement “All (otherwise accurate) watches show the wrong time under such and such circumstances” can be shown to be false, or refuted, by counterexamples.


Ad Hoc Analysis

An ad hoc analysis is an extra hypothesis added to the results of an experiment to try to explain away contrary evidence.

The scientific method dictates that, if a hypothesis is rejected, then that is final. The research needs to be redesigned or refined before the hypothesis can be tested again.

Amongst pseudo-scientists, an ad hoc hypothesis is often appended, in an attempt to justify why the expected results were not obtained.

An often quoted example of an ad hoc analysis is that of a paranormal investigator studying psychic waves under scientific conditions. Upon finding that the experiment did not give positive results, the investigator blames the negative brain waves given out by others.

This is simply an attempt to deflect criticism and failure by offering other, arbitrary reasons. To take this ad hoc analysis seriously, the brain waves of the onlookers would also need to be tested and eliminated, which moves the goalposts and creates a fallacy.

The idea of biorhythms, where the body and mind are affected by deep and regular cycles unrelated to biological circadian rhythms, has long been viewed with skepticism. Every time that scientific research debunks the theory, the adherents move the goal posts, inventing some other underlying reason to explain the results.

Often, astrologers presented with contrary evidence will blame the results upon some ‘unknown’ astrological phenomenon. This, of course, is impossible to prove and so the ad hoc analysis conveniently removes the pseudo-science from the debate.

The insanely stupid Water4Gas scam works along the same principles – when researchers pointed out that the whole idea revolves around the principle of perpetual motion, they invented another ad hoc hypothesis to explain where the ‘money saving’ energy came from.

Ad hoc analysis is not always a bad thing, and can often be part of the process of refining research.

Imagine, for example, that a research group was conducting an experiment into water turbulence, but kept receiving strange results, disproving their hypothesis. Whilst attempting to eliminate any potential confounding variables, they discover that the air conditioning unit is faulty, transmitting vibrations through the lab. This is switched off when the experiment is running and they retest the hypothesis.

This is part of the normal scientific process, and is part of refining the research design rather than trying to move the goalposts.

Ad hoc analysis is only a problem when a non-testable ad hoc hypothesis is added to the results to justify failure and deflect criticisms.

The air conditioning unit hypothesis can be tested very easily, simply by switching the unit off, and the problem it describes was the result of an experimental flaw. Negative brainwaves cannot be easily tested, and therefore the deflection causes a fallacy.

Martyn Shuttleworth (Nov 17, 2008). Ad Hoc Analysis.

Ad hoc hypothesis.

An ad hoc hypothesis is one created to explain away facts that seem to refute one’s belief or theory. Ad hoc hypotheses are common in paranormal research and in the work of pseudoscientists. For example, ESP researchers have been known to blame the hostile thoughts of onlookers for unconsciously influencing pointer readings on sensitive instruments. The hostile vibes, they say, made it impossible for them to duplicate a positive ESP experiment. Being able to duplicate an experiment is essential to confirming its validity. Of course, if this objection is taken seriously, then no experiment on ESP can ever fail. Whatever the results, one can always say they were caused by paranormal psychic forces, either the ones being tested or others not being tested.

Martin Gardner reports on this type of ad hoc hypothesizing reaching a ludicrous peak with paraphysicist Helmut Schmidt who put cockroaches in a box where they could give themselves electric shocks. One would assume that cockroaches do not like to be shocked and would give themselves shocks at a chance rate or less, if cockroaches can learn from experience. The cockroaches gave themselves more electric shocks than predicted by chance. Schmidt concluded that "because he hated cockroaches, maybe it was his pk that influenced the randomizer!" (Gardner, p. 59)

Ad hoc hypotheses are common in defense of the pseudoscientific theory known as biorhythm theory. For example, there are very many people who do not fit the predicted patterns of biorhythm theory. Rather than accept this fact as refuting evidence of the theory, a new category of people is created: the arrhythmic. In short, whenever the theory does not seem to work, the contrary evidence is systematically discounted. Advocates of biorhythm theory claimed that the theory could be used to accurately predict the sex of unborn children. However, W. S. Bainbridge, a professor of sociology at the University of Washington, demonstrated that the chance of predicting the sex of an unborn child using biorhythms was 50/50, the same as flipping a coin. An expert in biorhythms tried unsuccessfully to predict accurately the sexes of the children in Bainbridge's study based on Bainbridge's data. The expert's spouse suggested to Bainbridge an interesting ad hoc hypothesis, namely, that the cases where the theory was wrong probably included many homosexuals with indeterminate sex identities!
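The claim that a predictor does no better than a coin flip can be checked with simple arithmetic. The following is a minimal sketch of an exact two-sided binomial test in Python. The function name and the figures used (52 correct predictions out of 100) are hypothetical and chosen only for illustration; they are not Bainbridge's data.

```python
from math import comb


def binomial_two_sided_p(hits: int, trials: int, p: float = 0.5) -> float:
    """Exact two-sided binomial p-value.

    Sums the probabilities of every outcome that is no more likely
    than the observed one, assuming each prediction is correct with
    probability p (0.5 corresponds to flipping a coin).
    """
    pmf = [
        comb(trials, k) * p**k * (1 - p) ** (trials - k)
        for k in range(trials + 1)
    ]
    observed = pmf[hits]
    # Small relative tolerance so outcomes tied with the observed
    # one are not dropped because of floating-point rounding.
    total = sum(q for q in pmf if q <= observed * (1 + 1e-9))
    return min(1.0, total)


# Hypothetical figures, for illustration only.
p_value = binomial_two_sided_p(hits=52, trials=100)
print(f"p-value for 52/100 correct: {p_value:.3f}")
```

A large p-value means the hit rate is indistinguishable from guessing. Explaining the misses away afterwards, rather than accepting that result, is what makes a rescue hypothesis ad hoc.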

Astrologers are often fond of using statistical data and analysis to impress us with the scientific nature of astrology. Of course, a scientific analysis of the statistical data does not always pan out for the astrologer. In those cases, the astrologer can make the data fit the astrological paradigm by the ad hoc hypothesis that those who do not fit the mold have other, unknown influences that counteract the influence of the dominant planets.

Using ad hoc hypotheses is not limited to pseudoscientists. Another type of ad hoc hypothesis occurs in science when a new scientific theory is proposed which conflicts with an established theory and which lacks an essential explanatory mechanism. An ad hoc hypothesis is proposed to explain what the new theory cannot explain. For example, when Wegener proposed his theory of continental drift he could not explain how continents move. It was suggested that gravity was the force behind the movement of continents, though there was no scientific evidence for this notion. In fact, scientists could and did show that gravity was too weak a force to account for the movement of continents. Alexander du Toit, a defender of Wegener's theory, argued for radioactive melting of the ocean floor at continental borders as the mechanism by which continents might move. Stephen Jay Gould noted that "this ad hoc hypothesis added no increment of plausibility to Wegener's speculation." (Gould, p. 160)

Finally, rejecting explanations that require belief in occult, supernatural or paranormal forces in favor of simpler and more plausible explanations is called applying Occam's razor. It is not the same as ad hoc hypothesizing. For example, let's say I catch you stealing a watch from a shop. You say you did not steal it. I ask you to empty your pockets. You agree and pull out a watch. I say, "Aha! I was right. You stole the watch." You reply that you did not steal the watch, but you admit that it was not in your pocket when we went into the store. I ask you to explain how the watch got into your pocket and you say that you used telekinesis: you used your thoughts to transport the watch out of a glass case into your pocket. I ask you to repeat the act with another watch and you say "ok." Try as you will, however, you cannot make a watch magically appear in your pocket. You say that there is too much pressure on you to perform or that there are too many bad vibes in the air for you to work your powers. You have offered an ad hoc hypothesis to explain away what looks like a good refutation of your claim. My hypothesis that the watch is in your pocket because you stole it is not an ad hoc hypothesis. I have chosen to believe a plausible explanation rather than an implausible one. Likewise, given the choice between believing that my headache went away of its own accord or that it went away because some nurse waved her hands over my head while chanting a mantra, I will opt for the former every time.

It is always more reasonable to apply Occam's razor than to offer speculative ad hoc hypotheses just to maintain the possibility of something supernatural or paranormal.

See also cold reading, communal reinforcement, control study, Occam's razor, placebo effect, post hoc fallacy, selective thinking, self-deception, subjective validation, testimonial evidence, and wishful thinking.

Gardner, Martin. The Whys of a Philosophical Scrivener (New York: Quill, 1983).

Last updated 18-Oct-2015

What are some examples of ad hoc hypotheses in everyday conversations?

I always see the ad hoc hypothesis mentioned in science. I was wondering if the community could provide some examples of how this fallacy is used in everyday conversations so I can better understand it.

What is the Problem of Ad Hoc Hypotheses?

Greg Bamford

The received view of an ad hoc hypothesis is that it accounts for only the observation(s) it was designed to account for, and so non-ad hocness is generally held to be necessary or important for an introduced hypothesis or modification to a theory. Attempts by Popper and several others to convincingly explicate this view, however, prove to be unsuccessful or of doubtful value, and familiar and firmer criteria for evaluating the hypotheses or modified theories so classified are characteristically available. These points are obscured largely because the received view fails to adequately separate psychology from methodology or to recognise ambiguities in the use of 'ad hoc'.

Asimov, I.: 1975, Eyes on the Universe: A History of the Telescope , Houghton Mifflin, Boston.

Google Scholar  

Bamford, G.S.: 1989, Popper, Refutation, and 'Avoidance' of Refutation , Ph.D. thesis, The University of Queensland, Brisbane.

Bamford, G.S.: 1993, 'Popper's Explications of Ad Hoc ness: Circularity, Empirical Content, and Scientific Practice', The British Journal for the Philosophy of Science 44 , 335–55.

Bamford, G.S.: 1996, 'Popper and his Commentators on the Discovery of Neptune: A Close Shave for the Law of Gravitation?', Studies in History and Philosophy of Science 27 (2), 207–32.

Chalmers, A.F.: 1982, What is This Thing Called Science: An Assessment of the Nature and Status of Science and its Methods? (second edition), University of Queensland Press, St. Lucia, Queensland.

Drake, S.: 1957, Introduction to Discoveries and Opinions of Galileo , by Galileo, trans. S. Drake, Anchor Books, New York.

Drake, S.: 1978, Galileo at Work: His Scientific Biography , University of Chicago Press, Chicago.

Fetzer, J.H. and Almeder, R.F.: 1993, s.v. 'Ad hoc/ad hocness/ad hoc hypotheses', Glossary of Epistemology/Philosophy of Science , Paragon House, New York.

Galileo, G.: 1929–39, Le Opere di Galileo Galilei , 20 vols. in 21, vol. 11, Barbera, Firenze.

Gillies, D.: 1990, 'Bayesianism Versus Falsificationism', Ratio (New Series) III , 82–98.

Grant, R.: 1966, History of Physical Astronomy , The Sources of Science, no. 38, Johnson Reprint Corporation, New York.

Grosser, M.: 1962, The Discovery of Neptune , Harvard University Press, Cambridge, Mass.

Grünbaum, A.: 1976, ' Ad Hoc Auxiliary Hypotheses and Falsificationism', The British Journal for the Philosophy of Science 27 , 329–62.

Hempel, C.G.: 1966, Philosophy of Natural Science , Foundations of Philosophy Series, Prentice Hall, Englewood Cliffs, N.J.

Holton, G.: 1969, 'Einstein, Michelson, and the Crucial Experiment', Isis 60 (Summer), 133–97.

Howson, C. and Urbach, P.: 1989, Scientific Reasoning: The Bayesian Approach , Open Court, La Salle, Illinois.

Klüber, H. von.: 1960, 'The Determination of Einstein's Light Deflection in the Gravitational Field of the Sun', in A. Beer (ed.), Vistas in Astronomy 3 , Pergamon Press, London, 47–77.

Lakatos, I.: 1970, 'Falsification and the Methodology of Scientific Research Programmes'. In I. Lakatos and A. Musgrave (eds.), Criticism and the Growth of Knowledge , Cambridge University Press, Cambridge, 91–195.

Laudan, L.: 1977, Progress and its Problems: Towards a Theory of Scientific Growth , University of California Press, Berkeley.

Leplin, J.: 1982, 'The Assessment of Auxiliary Hypotheses', The British Journal for the Philosophy of Science 33 , 235–49.

Mason, S.F.: 1956, Main Currents of Scientific Thought: A History of the Sciences , Routledge and Kegan Paul, London.

Maxwell, G.C.: 1974, 'Corroboration without Demarcation', in Schilpp (ed.), 292–321.

Miller, D.W.: 1981, s.v. ' Ad-Hoc Hypotheses', in W. Bynum, E. Browne and R. Porter (eds.), Dictionary of the History of Science , Princeton University Press, Princeton.

Musgrave, A.E.: 1973, 'Falsification and its Critics', in P. Suppes, L. Henkin, A. Joga, and Gr. Moisil (eds.), Logic, Methodology and Philosophy of Science IV: Proceedings of the Fourth International Congress for Logic, Methodology, and Philosophy of Science, Bucharest, 1971 , North Holland Publishing Co., Amsterdam, 393–406.

Musgrave, A.E.: 1974, 'Logical versus Historical Theories of Confirmation', The British Journal for the Philosophy of Science 25 , 1–23.

Musgrave, A.E.: 1976, 'Method or Madness? Can the Methodology of Scientific Research Programmes be Rescued from Epistemological Anarchism?', in R. Cohen, P. Feyerabend, and M. Wartofsky (eds.), Essays in Memory of Imre Lakatos , Boston Studies in the Philosophy of Science, vol. 39, D. Reidel, Dordrecht, 457–91.

Musgrave, A.E.: 1978, 'Evidential Support, Falsification, Heuristics, and Anarchism'. In G. Radnitzky and G. Andersson (eds.), Progress and Rationality in Science , Boston Studies in the Philosophy of Science, vol. 58, D. Reidel, Dordrecht, 181–201.

Newcomb, S.: 1929, 'Discordances in the Secular Variations of the Inner Planets', in H. Shapely and H. Howarth (eds.), A Source Book in Astronomy , Source Books in the History of Science, McGraw Hill, New York, 330–38.

Newcomb, S.: 1911, 'Neptune', Encyclopedia Britannica (eleventh edition), vol. XIX, Cambridge University Press, Cambridge, 385–87.

Popper, K.R.: 1972, Conjectures and Refutations: The Growth of Scientific Knowledge (fourth revised edition), Routledge and Kegan Paul, London.

Popper, K.R.: 1974, 'Replies to My Critics', in Schilpp (ed.), 961–1197.

Popper, K.R.: 1975, The Logic of Scientific Discovery (eighth impression), Hutchinson and Co., London.

Popper, K.R.: 1976, Unended Quest: An Intellectual Autobiography , Fontana/Collins, London.

Quine, W.V. and Ullian, J.S.: 1980, 'Hypothesis', in E. Klemke, R. Hollinger and A. Kline (eds.), Introductory Readings in the Philosophy of Science , Prometheus Books, New York, 196–206.

Radnitzky, G.: 1981, 'Progress and Rationality in Research', in M. Grmek, R. Cohen and G. Cimino (eds.), On Scientific Discovery: The Erice Lectures 1977 , D. Reidel, Dordrecht, 43–102.

Salmon, W.C.: 1990, 'Rationality and Objectivity in Science, or Tom Kuhn Meets Tom Bayes', in C. Savage (ed.), Scientific Theories , Minnesota Studies in the Philosophy of Science, vol. 14, University of Minneapolis Press, Minneapolis, 175–204.

Schilpp, P.A. (ed.), 1974, The Philosophy of Karl Popper , The Library of Living Philosophers, vol. 14, Open Court, La Salle, Illinois.

Schlesinger, G.: 1987, 'Accommodation and Prediction', Australasian Journal of Philosophy 65 , 33–42.

Sprott, W.F.: 1936, 'Review of A Dynamical Theory of Personality ', by K. Lewin, Mind 45 , 246–51.

Sutton, C.: 1992, Spaceship Neutrino , Cambridge University Press, Cambridge.

The Shorter Oxford English Dictionary: 1973, s.v. 'ad hoc', (third revised edition).

