An Effective Methodology

History is not a neat photo album; it’s more like a jigsaw puzzle with half the pieces missing, and the rest scattered in someone’s attic. People tell stories. Some get recorded. Some get embellished. Others get forgotten. And sometimes we’re left with claims so unusual that our first instinct is to squint and say, Really?

The problem is, debates about historical claims often get stuck in two unhelpful extremes:
One side says, “It’s written down, so it must be true.”
The other says, “If it’s strange, it must be false.”

Both approaches skip the actual work: figuring out how much the evidence we have should move our belief one way or the other. What we need is a tool that quantifies plausibility — something that treats history a bit like science, where we update our confidence based on the strength and weakness of the evidence.


Why Missing Evidence Isn’t Neutral — It’s a Clue

Imagine your friend claims that last night, during rush hour, an elephant walked across the Brooklyn Bridge. You check the news. Nothing. No photos, no social media posts, no eyewitness chatter. The complete absence of reports isn’t just a “lack of extra evidence” — it’s active evidence against the claim.

Why? Because if it happened, the event would have been highly public, easy to notice, and almost impossible to ignore. Silence in these cases is loud.

This is the core of what historians sometimes call the argument from silence. The trick is knowing when the silence is meaningful. If an event is private, obscure, or likely to go unrecorded, then the absence of sources means little. But if it’s a showstopper — something everyone would see — then missing corroboration is damning.

This distinction is critical, because without it, people can cherry-pick any isolated text or fragment and treat it as sufficient proof for an event that would, in reality, leave far more footprints.


A Plain-Language Decision Framework

Before we get into math, here’s the common-sense version of how to filter historical claims:

  1. Is it extraordinary?
    ✓ Does it clash with established knowledge about the world?
    ✓ Example: “A royal decree was issued” — mundane. “A god descended into the marketplace and turned the river to wine” — extraordinary.
  2. Is it public?
    ✓ Would large numbers of people have directly witnessed it?
    ✓ Example: A private conversation between two generals — not public. A meteor exploding over a capital city — public.
  3. Would we expect strong reporting?
    ✓ If it happened, would chroniclers, letters, or records have been made?
    ✓ Example: Major battles, coronations, or plagues generate records.
  4. How scarce is the evidence?
    ✓ Do we have multiple accounts, or just one fragile scrap?
  5. How independent and reliable are those accounts?
    ✓ Multiple copies of one bad source aren’t independent confirmation.
  6. ◉ Do we see silence where we’d expect noise?
    ✓ If trusted observers of the day fail to mention it, that’s highly relevant.

Think of it as a checklist where each “yes” on the left side (extraordinary, public, high-expectation) raises the bar for the kind of evidence we’ll accept.


The Formal Historical-Claims Model

This is where we turn that plain-language checklist into something structured — a set of variables and relationships that can actually be calculated.

H\colon \mathrm{historical\ claim},\quad Cr(H)\in(0,1)\colon \mathrm{prior\ credence}
Our starting point is the claim HHH and a prior probability Cr(H)Cr(H)Cr(H) — basically, how likely we thought it was before considering any new evidence.

Extra(H)\colon \mathrm{extraordinary\ claim},\quad Mund(H)\equiv \neg Extra(H)
We explicitly tag whether the claim is extraordinary or mundane, since extraordinary claims start with a lower prior credence.

Pub(H)\colon \mathrm{public\ event},\quad ER(H)\in{\mathrm{low},\mathrm{med},\mathrm{high}}
Publicness matters because it determines how much reporting we’d expect. “ER(H)” captures that expected reportage level — low, medium, or high.

Scarce(H)\colon \mathrm{few\ surviving\ sources},\quad \mathcal{S}(H)={s_1,\dots,s_n}
Here we note whether evidence is scarce, and we define the set of sources ({s_1, s_2, \dots, s_n}).

Ind(s_i,s_j)\colon \mathrm{source\ independence}
This function tells us if two sources are truly independent — critical for avoiding the trap of “copy-paste confirmation.”

Qual(s)\in(0,1],\quad Gap(s)\ge 0,\quad Bias(s)\in[0,1],\quad Anon(s)\in{0,1},\quad Tamper(s)\in[0,1]
Each source gets a reliability profile:

  • Quality score (accuracy, detail, internal consistency)
  • Gap in years from the event
  • Bias rating (how motivated the author is to spin the story)
  • Anonymity flag (1 if anonymous)
  • Tampering suspicion score

LR(s)\colon \mathrm{base\ likelihood\ ratio},\quad LR_{\mathrm{eff}}(s)\colon \mathrm{adjusted\ likelihood\ ratio}
We calculate the likelihood ratio — how much a source moves the probability up or down — then adjust it for quality, bias, and other penalties.

Sil(H)\colon \mathrm{silence\ in\ expected\ observers}
This flags cases where credible observers, who should have mentioned the event, say nothing.

LR_{\mathrm{sil}}(H)=\frac{P(\mathrm{Sil}(H)\mid H)}{P(\mathrm{Sil}(H)\mid \neg H)}
The likelihood ratio for silence. If silence is much more probable when the event didn’t happen, this number will be small, hammering the claim’s credibility.


Instantiating the Model: Dragons Over Athens

Let’s apply the model to a fictional but instructive case.

Claim: “During the height of the Greek empire, dragons flew over Athens.

Set Variables:
H=\mathrm{"dragons\ flew\ over\ Athens"}
Cr(H)=0.0001 — extremely low prior credence because dragons contradict all known zoology and physics.
Extra(H)=\mathrm{true}
Mund(H)=\mathrm{false}
Pub(H)=\mathrm{true} — thousands would have witnessed it.
ER(H)=\mathrm{high} — it should have flooded ancient records.
Scarce(H)=\mathrm{true} — we have only one surviving source.
n=1,\ s_1=\mathrm{single\ manuscript}
Qual(s_1)=0.4,\ Gap(s_1)=300\ \mathrm{years},\ Bias(s_1)=0.6,\ Anon(s_1)=1,\ Tamper(s_1)=0.2
LR(s_1)=1.5 — base likelihood ratio is weakly supportive.

Adjust for penalties:

LR_{\mathrm{eff}}(s_1)\approx 1.5\times 0.4\times (1-0.6)\times (1-0.2)=0.192

Silence factor:
LR_{\mathrm{sil}}(H)=0.01 — strong negative weight, since silence from contemporary historians is nearly impossible if the claim were true.

Interpretation:
We start with tiny odds (0.0001). We multiply by a shrunken source likelihood (0.192), then by the silence factor (0.01). The final probability approximates zero.

This is the mathematical expression of common sense: if thousands would have seen it, and there’s just one shaky source written centuries later, it didn’t happen.


Why This Approach Works

The power here is in making the reasoning explicit. Instead of vaguely saying “That’s unlikely,” we specify:

  • Why the prior is low (extraordinary nature)
  • Why expected reportage is high (public spectacle)
  • Why a lone, low-quality, biased, and anonymous source can’t outweigh the silence of all others

When applied to real history, this protects us from giving undue weight to isolated or dubious accounts. It’s not about cynicism — it’s about calibrating our confidence to match the actual evidential landscape.


Applying the Model to The Resurrected Saints Claim

One of the most striking — and often overlooked — supernatural claims in the New Testament is in Matthew 27:52–53, where it is stated that, upon Jesus’ death, “many bodies of the saints who had fallen asleep were raised” and “appeared to many” in Jerusalem. At face value, this is an extraordinary, public, and testable claim. Let’s see what happens when we run it through our historical-claims model.


✓ Step 1 — Setting the Variables

H = \mathrm{claim\ that\ hundreds\ of\ saints\ resurrected\ and\ appeared\ publicly}
Cr(H) \approx 0.01 \ \mathrm{(low\ prior\ due\ to\ violation\ of\ known\ biology)}
Extra(H) = \mathrm{true}
Mund(H) = \neg Extra(H) = \mathrm{false}
Pub(H) = \mathrm{true} \ \mathrm{(Jerusalem,\ many\ witnesses)}
ER(H) = \mathrm{high} \ \mathrm{(massive\ public\ spectacle)}
Scarce(H) = \mathrm{true} \ \mathrm{(only\ one\ disputed\ account)}
\mathcal{S}(H) = {s_1}, \ s_1 = \mathrm{Gospel\ of\ Matthew}
Ind(s_i, s_j) = \mathrm{not\ applicable\ (single\ source)}
Qual(s_1) \approx 0.5 \ \mathrm{(authorship\ unknown,\ written\ decades\ later)}
Gap(s_1) \approx 40 \ \mathrm{years\ (between\ event\ and\ text)}
Bias(s_1) \approx 0.8 \ \mathrm{(strong\ theological\ motive)}
Anon(s_1) = 1 \ \mathrm{(anonymous\ authorship)}
Tamper(s_1) \approx 0.3 \ \mathrm{(possible\ later\ editorial\ changes)}
LR(s_1) \approx 1.2 \ \mathrm{(weak\ base\ support)}
Sil(H) = \mathrm{true} \ \mathrm{(no\ Roman\ historians,\ no\ other\ Gospels,\ no\ Jewish\ sources\ mention\ it)}

LR_{\mathrm{sil}}(H) \approx 0.01 \ \mathrm{(silence\ from\ expected\ observers\ is\ devastating)}

✓ Step 2 — Walking Through the Reasoning

Extraordinary claim: This is not a mundane historical note; it directly contradicts all observed biology. That sets the base prior credence Cr(H) extremely low.

Public nature: The text says they “appeared to many,” in a major city during a religious festival. This makes ER(H) = \mathrm{high} — meaning, if true, we would expect abundant independent reports.

Scarcity of sources: We have a single, anonymous source written decades later with no corroborating documents, no public inscriptions, no mention in other Gospels, and no Jewish or Roman records — despite this allegedly happening in a politically and religiously volatile city under Roman oversight.

Silence penalty: This is the model’s most devastating factor. For a high-visibility public event, multiple independent attestations are expected. The complete silence of other observers yields a very low LR_{\mathrm{sil}}(H).

Bias and gaps: The sole source has strong theological motives (Bias \approx 0.8) and a significant temporal gap between the supposed event and its recording (Gap \approx 40 years), both of which push credibility down.


✓ Step 3 — Model Output

The combined effect of:
✓ low prior (Cr(H) \approx 0.01),
✓ high expected reportage (ER(H) = \mathrm{high}),
✓ extreme scarcity (Scarce(H) = \mathrm{true}), and
devastating silence penalty (LR_{\mathrm{sil}} \approx 0.01)

…drives the posterior credence into the negligible range. Under this model, the rational conclusion is that the claim can be safely dismissed as historically implausible.


Why This Matters

The “hundreds of saints” passage is an ideal stress-test for the historical-claims model because it’s the type of event that would absolutely leave multiple independent traces if it happened. The complete lack of such corroboration — combined with the extraordinary nature of the claim — renders its probability extremely low.


Recent posts

  • Alvin Plantinga’s “Warrant” isn’t an epistemic upgrade; it’s a design for inaccuracy. My formal proof demonstrates that maximizing the binary status of “knowledge” forces a cognitive system to be less accurate than one simply tracking evidence. We must eliminate “knowledge” as a rigorous concept, replacing it with credencing—the honest pursuit…

  • This article critiques the stark gap between the New Testament’s unequivocal promises of answered prayer and their empirical failure. It examines the theological “bait-and-switch” where bold pulpit guarantees of supernatural intervention are neutralized by “creative hermeneutics” in small groups, transforming literal promises into unfalsifiable, psychological coping mechanisms through evasive logic…

  • This article characterizes theology as a “floating fortress”—internally coherent but isolated from empirical reality. It details how specific theological claims regarding prayer, miracles, and scientific facts fail verification tests. The argument posits that theology survives only through evasion tactics like redefinition and metaphor, functioning as a self-contained simulation rather than…

  • This post applies parsimony (Occam’s Razor) to evaluate Christian Theism. It contrasts naturalism’s high “inductive density” with the precarious “stack of unverified assumptions” required for Christian belief, such as a disembodied mind and omni-attributes. It argues that ad hoc explanations for divine hiddenness further erode the probability of theistic claims,…

  • Modern apologists argue that religious belief is a rational map of evidence, likening it to scientific frameworks. However, a deeper analysis reveals a stark contrast. While science adapts to reality through empirical testing and falsifiability, theology insulates belief from contradictory evidence. The theological system absorbs anomalies instead of yielding to…

  • This post critiques the concept of “childlike faith” in religion, arguing that it promotes an uncritical acceptance of beliefs without evidence. It highlights that while children naturally trust authority figures, this lack of skepticism can lead to false beliefs. The author emphasizes the importance of cognitive maturity and predictive power…

  • This analysis examines the agonizing moral conflict presented by the explicit biblical command to slaughter Amalekite infants in 1 Samuel 15:3. Written from a skeptical, moral non-realist perspective, it rigorously deconstructs the various apologetic strategies employed to defend this divine directive as “good.” The post critiques common evasions, such as…

  • Modern Christian apologetics claims faith is based on evidence, but this is contradicted by practices within the faith. Children are encouraged to accept beliefs uncritically, while adults seeking evidence face discouragement. The community rewards conformity over inquiry, using moral obligations to stifle skepticism. Thus, the belief system prioritizes preservation over…

  • In the realm of Christian apologetics, few topics generate as much palpable discomfort as the Old Testament narratives depicting divinely ordered genocide. While many believers prefer to gloss over these passages, serious apologists feel compelled to defend them. They must reconcile a God described as “perfect love” with a deity…

  • This post examines various conditions Christians often attach to prayer promises, transforming them into unfalsifiable claims. It highlights how these ‘failsafe’ mechanisms protect the belief system from scrutiny, allowing believers to reinterpret prayer outcomes either as successes or failures based on internal states or hidden conditions. This results in a…

  • In public discourse, labels such as “atheist,” “agnostic,” and “Christian” often oversimplify complex beliefs, leading to misunderstandings. These tags are low-resolution summaries that hinder rational discussions. Genuine inquiry requires moving beyond labels to assess individual credences and evidence. Understanding belief as a gradient reflects the nuances of thought, promoting clarity…

  • The featured argument, often employed in Christian apologetics, asserts that the universe’s intelligibility implies a divine mind. However, a meticulous examination reveals logical flaws, such as equivocation on “intelligible,” unsubstantiated jumps from observations to conclusions about authorship, and the failure to consider alternative explanations. Ultimately, while the universe exhibits structure…

  • The piece discusses how historical figures like Jesus and Alexander the Great undergo “legendary inflation,” where narratives evolve into more than mere history, shaped by cultural needs and societal functions. As communities invest meaning in these figures, their stories absorb mythical elements and motifs over time. This phenomenon illustrates how…

  • This post argues against extreme views in debates about the historical Jesus, emphasizing the distinction between the theological narrative shaped by scriptural interpretation and the existence of a human core. It maintains that while the Gospels serve theological purposes, they do not negate the likelihood of a historical figure, supported…

  • Hebrews 11:1 is often misquoted as a clear definition of faith, but its Greek origins reveal ambiguity. Different interpretations exist, leading to confusion in Christian discourse. Faith is described both as assurance and as evidence, contributing to semantic sloppiness. Consequently, discussions about faith lack clarity and rigor, oscillating between certitude…

  • This post emphasizes the importance of using AI as a tool for Christian apologetics rather than a replacement for personal discernment. It addresses common concerns among Christians about AI, advocating for its responsible application in improving reasoning, clarity, and theological accuracy. The article outlines various use cases for AI, such…