✓ Assessing Historical Claims

August 8, 2025

◉ An Effective Methodology

History is not a neat photo album; it’s more like a jigsaw puzzle with half the pieces missing, and the rest scattered in someone’s attic. People tell stories. Some get recorded. Some get embellished. Others get forgotten. And sometimes we’re left with claims so unusual that our first instinct is to squint and say, Really?

The problem is, debates about historical claims often get stuck in two unhelpful extremes:
✓ One side says, “It’s written down, so it must be true.”
✓ The other says, “If it’s strange, it must be false.”

Both approaches skip the actual work: figuring out how much the evidence we have should move our belief one way or the other. What we need is a tool that quantifies plausibility — something that treats history a bit like science, where we update our confidence based on the strength and weakness of the evidence.

Why Missing Evidence Isn’t Neutral — It’s a Clue

Imagine your friend claims that last night, during rush hour, an elephant walked across the Brooklyn Bridge. You check the news. Nothing. No photos, no social media posts, no eyewitness chatter. The complete absence of reports isn’t just a “lack of extra evidence” — it’s active evidence against the claim.

Why? Because if it happened, the event would have been highly public, easy to notice, and almost impossible to ignore. Silence in these cases is loud.

This is the core of what historians sometimes call the argument from silence. The trick is knowing when the silence is meaningful. If an event is private, obscure, or likely to go unrecorded, then the absence of sources means little. But if it’s a showstopper — something everyone would see — then missing corroboration is damning.

This distinction is critical, because without it, people can cherry-pick any isolated text or fragment and treat it as sufficient proof for an event that would, in reality, leave far more footprints.

A Plain-Language Decision Framework

Before we get into math, here’s the common-sense version of how to filter historical claims:

Is it extraordinary?
✓ Does it clash with established knowledge about the world?
✓ Example: “A royal decree was issued” — mundane. “A god descended into the marketplace and turned the river to wine” — extraordinary.
Is it public?
✓ Would large numbers of people have directly witnessed it?
✓ Example: A private conversation between two generals — not public. A meteor exploding over a capital city — public.
Would we expect strong reporting?
✓ If it happened, would chroniclers, letters, or records have been made?
✓ Example: Major battles, coronations, or plagues generate records.
How scarce is the evidence?
✓ Do we have multiple accounts, or just one fragile scrap?
How independent and reliable are those accounts?
✓ Multiple copies of one bad source aren’t independent confirmation.
◉ Do we see silence where we’d expect noise?
✓ If trusted observers of the day fail to mention it, that’s highly relevant.

Think of it as a checklist where each “yes” on the left side (extraordinary, public, high-expectation) raises the bar for the kind of evidence we’ll accept.

The Formal Historical-Claims Model

This is where we turn that plain-language checklist into something structured — a set of variables and relationships that can actually be calculated.

$H\colon \mathrm{historical\ claim},\quad Cr(H)\in(0,1)\colon \mathrm{prior\ credence}$
➘ Our starting point is the claim $HHH$ and a prior probability $Cr(H)Cr(H)Cr(H)$ — basically, how likely we thought it was before considering any new evidence.

$Extra(H)\colon \mathrm{extraordinary\ claim},\quad Mund(H)\equiv \neg Extra(H)$
➘ We explicitly tag whether the claim is extraordinary or mundane, since extraordinary claims start with a lower prior credence.

$Pub(H)\colon \mathrm{public\ event},\quad ER(H)\in{\mathrm{low},\mathrm{med},\mathrm{high}}$
➘ Publicness matters because it determines how much reporting we’d expect. “ $ER(H)$ ” captures that expected reportage level — low, medium, or high.

$Scarce(H)\colon \mathrm{few\ surviving\ sources},\quad \mathcal{S}(H)={s_1,\dots,s_n}$
➘ Here we note whether evidence is scarce, and we define the set of sources ( ${s_1, s_2, \dots, s_n}$ ).

$Ind(s_i,s_j)\colon \mathrm{source\ independence}$
➘ This function tells us if two sources are truly independent — critical for avoiding the trap of “copy-paste confirmation.”

$Qual(s)\in(0,1],\quad Gap(s)\ge 0,\quad Bias(s)\in[0,1],\quad Anon(s)\in{0,1},\quad Tamper(s)\in[0,1]$
➘ Each source gets a reliability profile:

Quality score (accuracy, detail, internal consistency)
Gap in years from the event
Bias rating (how motivated the author is to spin the story)
Anonymity flag (1 if anonymous)
Tampering suspicion score

$LR(s)\colon \mathrm{base\ likelihood\ ratio},\quad LR_{\mathrm{eff}}(s)\colon \mathrm{adjusted\ likelihood\ ratio}$
➘ We calculate the likelihood ratio — how much a source moves the probability up or down — then adjust it for quality, bias, and other penalties.

$Sil(H)\colon \mathrm{silence\ in\ expected\ observers}$
➘ This flags cases where credible observers, who should have mentioned the event, say nothing.

$LR_{\mathrm{sil}}(H)=\frac{P(\mathrm{Sil}(H)\mid H)}{P(\mathrm{Sil}(H)\mid \neg H)}$
➘ The likelihood ratio for silence. If silence is much more probable when the event didn’t happen, this number will be small, hammering the claim’s credibility.

Instantiating the Model: Dragons Over Athens

Let’s apply the model to a fictional but instructive case.

Claim: “During the height of the Greek empire, dragons flew over Athens.”

Set Variables:
$H=\mathrm{"dragons\ flew\ over\ Athens"}$
$Cr(H)=0.0001$ — extremely low prior credence because dragons contradict all known zoology and physics.
$Extra(H)=\mathrm{true}$
$Mund(H)=\mathrm{false}$
$Pub(H)=\mathrm{true}$ — thousands would have witnessed it.
$ER(H)=\mathrm{high}$ — it should have flooded ancient records.
$Scarce(H)=\mathrm{true}$ — we have only one surviving source.
$n=1,\ s_1=\mathrm{single\ manuscript}$
$Qual(s_1)=0.4,\ Gap(s_1)=300\ \mathrm{years},\ Bias(s_1)=0.6,\ Anon(s_1)=1,\ Tamper(s_1)=0.2$
$LR(s_1)=1.5$ — base likelihood ratio is weakly supportive.

Adjust for penalties:

$LR_{\mathrm{eff}}(s_1)\approx 1.5\times 0.4\times (1-0.6)\times (1-0.2)=0.192$

Silence factor:
$LR_{\mathrm{sil}}(H)=0.01$ — strong negative weight, since silence from contemporary historians is nearly impossible if the claim were true.

Interpretation:
We start with tiny odds (0.0001). We multiply by a shrunken source likelihood (0.192), then by the silence factor (0.01). The final probability approximates zero.

This is the mathematical expression of common sense: if thousands would have seen it, and there’s just one shaky source written centuries later, it didn’t happen.

Why This Approach Works

The power here is in making the reasoning explicit. Instead of vaguely saying “That’s unlikely,” we specify:

Why the prior is low (extraordinary nature)
Why expected reportage is high (public spectacle)
Why a lone, low-quality, biased, and anonymous source can’t outweigh the silence of all others

When applied to real history, this protects us from giving undue weight to isolated or dubious accounts. It’s not about cynicism — it’s about calibrating our confidence to match the actual evidential landscape.

Applying the Model to The Resurrected Saints Claim

One of the most striking — and often overlooked — supernatural claims in the New Testament is in Matthew 27:52–53, where it is stated that, upon Jesus’ death, “many bodies of the saints who had fallen asleep were raised” and “appeared to many” in Jerusalem. At face value, this is an extraordinary, public, and testable claim. Let’s see what happens when we run it through our historical-claims model.

✓ Step 1 — Setting the Variables

$H = \mathrm{claim\ that\ hundreds\ of\ saints\ resurrected\ and\ appeared\ publicly}$
$Cr(H) \approx 0.01 \ \mathrm{(low\ prior\ due\ to\ violation\ of\ known\ biology)}$
$Extra(H) = \mathrm{true}$
$Mund(H) = \neg Extra(H) = \mathrm{false}$
$Pub(H) = \mathrm{true} \ \mathrm{(Jerusalem,\ many\ witnesses)}$
$ER(H) = \mathrm{high} \ \mathrm{(massive\ public\ spectacle)}$
$Scarce(H) = \mathrm{true} \ \mathrm{(only\ one\ disputed\ account)}$
$\mathcal{S}(H) = {s_1}, \ s_1 = \mathrm{Gospel\ of\ Matthew}$
$Ind(s_i, s_j) = \mathrm{not\ applicable\ (single\ source)}$
$Qual(s_1) \approx 0.5 \ \mathrm{(authorship\ unknown,\ written\ decades\ later)}$
$Gap(s_1) \approx 40 \ \mathrm{years\ (between\ event\ and\ text)}$
$Bias(s_1) \approx 0.8 \ \mathrm{(strong\ theological\ motive)}$
$Anon(s_1) = 1 \ \mathrm{(anonymous\ authorship)}$
$Tamper(s_1) \approx 0.3 \ \mathrm{(possible\ later\ editorial\ changes)}$
$LR(s_1) \approx 1.2 \ \mathrm{(weak\ base\ support)}$
$Sil(H) = \mathrm{true} \ \mathrm{(no\ Roman\ historians,\ no\ other\ Gospels,\ no\ Jewish\ sources\ mention\ it)}$

$LR_{\mathrm{sil}}(H) \approx 0.01 \ \mathrm{(silence\ from\ expected\ observers\ is\ devastating)}$

✓ Step 2 — Walking Through the Reasoning

➘ Extraordinary claim: This is not a mundane historical note; it directly contradicts all observed biology. That sets the base prior credence $Cr(H)$ extremely low.

➘ Public nature: The text says they “appeared to many,” in a major city during a religious festival. This makes $ER(H) = \mathrm{high}$ — meaning, if true, we would expect abundant independent reports.

➘ Scarcity of sources: We have a single, anonymous source written decades later with no corroborating documents, no public inscriptions, no mention in other Gospels, and no Jewish or Roman records — despite this allegedly happening in a politically and religiously volatile city under Roman oversight.

➘ Silence penalty: This is the model’s most devastating factor. For a high-visibility public event, multiple independent attestations are expected. The complete silence of other observers yields a very low $LR_{\mathrm{sil}}(H)$ .

➘ Bias and gaps: The sole source has strong theological motives ( $Bias \approx 0.8$ ) and a significant temporal gap between the supposed event and its recording ( $Gap \approx 40$ years), both of which push credibility down.

✓ Step 3 — Model Output

The combined effect of:
✓ low prior ( $Cr(H) \approx 0.01$ ),
✓ high expected reportage ( $ER(H) = \mathrm{high}$ ),
✓ extreme scarcity ( $Scarce(H) = \mathrm{true}$ ), and
✓ devastating silence penalty ( $LR_{\mathrm{sil}} \approx 0.01$ )

…drives the posterior credence into the negligible range. Under this model, the rational conclusion is that the claim can be safely dismissed as historically implausible.

Why This Matters

The “hundreds of saints” passage is an ideal stress-test for the historical-claims model because it’s the type of event that would absolutely leave multiple independent traces if it happened. The complete lack of such corroboration — combined with the extraordinary nature of the claim — renders its probability extremely low.

Recent posts

✓ Plantinga’s Abandonment of Credence

February 10, 2026

Alvin Plantinga’s “Warrant” isn’t an epistemic upgrade; it’s a design for inaccuracy. My formal proof demonstrates that maximizing the binary status of “knowledge” forces a cognitive system to be less accurate than one simply tracking evidence. We must eliminate “knowledge” as a rigorous concept, replacing it with credencing—the honest pursuit…
✓ The Great Gaslighting Grift

February 1, 2026

This article critiques the stark gap between the New Testament’s unequivocal promises of answered prayer and their empirical failure. It examines the theological “bait-and-switch” where bold pulpit guarantees of supernatural intervention are neutralized by “creative hermeneutics” in small groups, transforming literal promises into unfalsifiable, psychological coping mechanisms through evasive logic…
✓ Theology’s Floating Fortress

January 31, 2026

This article characterizes theology as a “floating fortress”—internally coherent but isolated from empirical reality. It details how specific theological claims regarding prayer, miracles, and scientific facts fail verification tests. The argument posits that theology survives only through evasion tactics like redefinition and metaphor, functioning as a self-contained simulation rather than…
✓ Parsimony and Christianity

January 27, 2026

This post applies parsimony (Occam’s Razor) to evaluate Christian Theism. It contrasts naturalism’s high “inductive density” with the precarious “stack of unverified assumptions” required for Christian belief, such as a disembodied mind and omni-attributes. It argues that ad hoc explanations for divine hiddenness further erode the probability of theistic claims,…
✓ Grounding Ways of Knowing

January 19, 2026

Modern apologists argue that religious belief is a rational map of evidence, likening it to scientific frameworks. However, a deeper analysis reveals a stark contrast. While science adapts to reality through empirical testing and falsifiability, theology insulates belief from contradictory evidence. The theological system absorbs anomalies instead of yielding to…
✓ Childlike Faith

January 19, 2026

This post critiques the concept of “childlike faith” in religion, arguing that it promotes an uncritical acceptance of beliefs without evidence. It highlights that while children naturally trust authority figures, this lack of skepticism can lead to false beliefs. The author emphasizes the importance of cognitive maturity and predictive power…
✓ The Amalekite Infants

January 19, 2026

This analysis examines the agonizing moral conflict presented by the explicit biblical command to slaughter Amalekite infants in 1 Samuel 15:3. Written from a skeptical, moral non-realist perspective, it rigorously deconstructs the various apologetic strategies employed to defend this divine directive as “good.” The post critiques common evasions, such as…
✓ The Evidence-Mapping Illusion

January 17, 2026

Modern Christian apologetics claims faith is based on evidence, but this is contradicted by practices within the faith. Children are encouraged to accept beliefs uncritically, while adults seeking evidence face discouragement. The community rewards conformity over inquiry, using moral obligations to stifle skepticism. Thus, the belief system prioritizes preservation over…
✓ The Divine Judgment Evasion

January 17, 2026

In the realm of Christian apologetics, few topics generate as much palpable discomfort as the Old Testament narratives depicting divinely ordered genocide. While many believers prefer to gloss over these passages, serious apologists feel compelled to defend them. They must reconcile a God described as “perfect love” with a deity…
✓ Answered Prayers?

January 10, 2026

This post examines various conditions Christians often attach to prayer promises, transforming them into unfalsifiable claims. It highlights how these ‘failsafe’ mechanisms protect the belief system from scrutiny, allowing believers to reinterpret prayer outcomes either as successes or failures based on internal states or hidden conditions. This results in a…
✓ Categorical Labels Rarely Reflect Rationality

January 8, 2026

In public discourse, labels such as “atheist,” “agnostic,” and “Christian” often oversimplify complex beliefs, leading to misunderstandings. These tags are low-resolution summaries that hinder rational discussions. Genuine inquiry requires moving beyond labels to assess individual credences and evidence. Understanding belief as a gradient reflects the nuances of thought, promoting clarity…
✓ Why an Intelligible Universe?

January 6, 2026

The featured argument, often employed in Christian apologetics, asserts that the universe’s intelligibility implies a divine mind. However, a meticulous examination reveals logical flaws, such as equivocation on “intelligible,” unsubstantiated jumps from observations to conclusions about authorship, and the failure to consider alternative explanations. Ultimately, while the universe exhibits structure…
✓ How Myths Emerge

January 1, 2026

The piece discusses how historical figures like Jesus and Alexander the Great undergo “legendary inflation,” where narratives evolve into more than mere history, shaped by cultural needs and societal functions. As communities invest meaning in these figures, their stories absorb mythical elements and motifs over time. This phenomenon illustrates how…
✓ Mythical Husk over the Jesus Seed

January 1, 2026

This post argues against extreme views in debates about the historical Jesus, emphasizing the distinction between the theological narrative shaped by scriptural interpretation and the existence of a human core. It maintains that while the Gospels serve theological purposes, they do not negate the likelihood of a historical figure, supported…
✓ The Semantic Creep of Faith

December 19, 2025

Hebrews 11:1 is often misquoted as a clear definition of faith, but its Greek origins reveal ambiguity. Different interpretations exist, leading to confusion in Christian discourse. Faith is described both as assurance and as evidence, contributing to semantic sloppiness. Consequently, discussions about faith lack clarity and rigor, oscillating between certitude…
✓ AI for Christian Apologists

December 18, 2025

This post emphasizes the importance of using AI as a tool for Christian apologetics rather than a replacement for personal discernment. It addresses common concerns among Christians about AI, advocating for its responsible application in improving reasoning, clarity, and theological accuracy. The article outlines various use cases for AI, such…

J on ✓ Theology’s Floating Fortress
Thanks for another interesting piece (as usual). I might add that even the “flying” fortress theologians have tried to build…
Phil Stilwell on ✓ Parsimony and Christianity
Good insights! “William Philosophers” definitely refers to William of Ockham.
J on ✓ Parsimony and Christianity
Given that I read a collection of Russell’s works on religion a few years ago, the reference to the teapot…
J on ✓ Grounding Ways of Knowing
Alright, thanks for the insights. While I’ve read works on ethics and ethical theories (including some of Immanuel Kant’s more…
Phil Stilwell on ✓ Grounding Ways of Knowing
Hi J, I’m a moral non-realist. I hold that there are no legitimate moral obligations since a moral realm has…
J on ✓ Grounding Ways of Knowing
Hi Phil: Apologies if this isn’t really related to skepticism, but I was wondering: What are your thoughts on the…
J on The Mirage of the Cosmic Rulebook:
Has Juan considered the problems with divine command theory and could he address the following?: a.) What makes Christian versions…
Phil Stilwell on The Mirage of the Cosmic Rulebook:
Juan has been blocked. I ran out of both daylight and patience.A debriefing has been added above in the light…
Phil Stilwell on The Mirage of the Cosmic Rulebook:
Check the new formalization section in grey above. Fourty minutes.
J. GonzalezRamos on The Mirage of the Cosmic Rulebook:
Phil, an hour to “get honest” about what exactly? I pointed out the contradiction between your self-identification as a moral…

Phil Stilwell

A Deep Dive into Common Faith-Based Concepts & Claims