Conspirafy Data

The full Conspirafy corpus is available as static, gzipped data files. Updated nightly. Free to use under CC BY-SA 4.0.

7 artifactsNightly refreshCORS-enabledgzip transparent decompression

Attribution requirement

All data is published under Creative Commons Attribution-ShareAlike 4.0 (CC BY-SA 4.0). You are free to copy, redistribute, remix, and build upon this data for any purpose, even commercially, as long as you:

Credit Conspirafy (conspirafy.com) visibly in your project.
Link to https://conspirafy.com from any published derivative work.
License your derivative work under CC BY-SA 4.0 or a compatible license.
Do not imply that Conspirafy endorses your product or its conclusions.

Available artifacts

Artifact	URL	Content-Type	Refresh	Size
Theory Lookup Table	https://data.conspirafy.com/theories.json.gz	application/json (gzip)	Nightly (3 AM UTC)	~110 KB gzipped
Full Theory Archive	https://data.conspirafy.com/theories-full.json.gz	application/json (gzip)	Nightly (3 AM UTC)	~2.3 MB gzipped
Source Health CSV	https://data.conspirafy.com/sources.csv.gz	text/csv (gzip)	Nightly (3 AM UTC)	~10 KB gzipped
Primary Documents Index	https://data.conspirafy.com/documents.json.gz	application/json (gzip)	Nightly (3 AM UTC)	~2 KB gzipped
Theory Relations Graph	https://data.conspirafy.com/graph.json.gz	application/json (gzip)	Nightly (3 AM UTC)	~13 KB gzipped
Full URL Sitemap	https://data.conspirafy.com/sitemap-full.json.gz	application/json (gzip)	Nightly (3 AM UTC)	~16 KB gzipped
Source Health Snapshot	https://data.conspirafy.com/source-health.json.gz	application/json (gzip)	Nightly (3 AM UTC)	~1 KB gzipped

All .gz files are served with content-encoding: gzip so the browser or fetch() transparently decompresses. JSON is parsed directly from the response without additional steps.

Schema documentation

theories.json.gz — Theory Lookup Table

Slim record per approved theory. Designed for browser extension lookup. Fields: slug, title, verdict, verdict_confidence, summary, category_slug, last_updated, evidence_count, source_count, related_slugs.

{
  "generated_at": "2026-06-13T03:00:00.000Z",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "count": 438,
  "records": [
    {
      "slug": "jfk-assassination",
      "title": "JFK Assassination Conspiracy",
      "verdict": "partially_true",
      "verdict_confidence": 72,
      "summary": "Multiple credible lines of evidence...",
      "category_slug": "government-politics",
      "last_updated": "2026-06-10T12:00:00.000Z",
      "evidence_count": 14,
      "source_count": 22,
      "related_slugs": ["cia-drug-trafficking", "watergate"]
    }
    // ... more records
  ]
}

theories-full.json.gz — Full Theory Archive

Full theory records including body_md, evidence arrays (title, type, strength, description), sources, timeline events, key figures, FAQ, and related-theory pairs. For LLM RAG and academic consumers.

{
  "generated_at": "...",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "count": 438,
  "records": [
    {
      "slug": "jfk-assassination",
      "title": "JFK Assassination Conspiracy",
      "verdict": "partially_true",
      "verdict_confidence": 72,
      "verdict_summary": "...",
      "summary": "...",
      "body_md": "## Background\n...",
      "category_slug": "government-politics",
      "last_updated": "...",
      "evidence": [
        { "title": "Warren Commission", "type": "debunking", "strength": "strong", "description": "...", "sort_order": 1 }
      ],
      "sources": [
        { "url": "https://...", "credibility": "high", "archived_url": "https://archive.org/..." }
      ],
      "timeline": [
        { "event_date": "1963-11-22", "headline": "Assassination in Dallas", "body": "..." }
      ],
      "key_figures": [
        { "name": "Lee Harvey Oswald", "role": "subject", "bio": "..." }
      ],
      "faq": [
        { "question": "Was there a second shooter?", "answer": "..." }
      ],
      "related_theories": [
        { "slug": "cia-drug-trafficking", "relation": "similar_claim" }
      ]
    }
  ]
}

sources.csv.gz — Source Health CSV

Every source domain aggregated by domain: credibility tier, citation count, alive status, last HTTP check date and status code.

domain,credibility,citation_count,is_alive,last_checked,http_status
"nytimes.com","high",42,true,"2026-06-12T03:00:00Z",200
"infowars.com","low",8,true,"2026-06-12T03:00:00Z",200
"example-broken.com","unknown",3,false,"2026-06-12T03:00:00Z",403

documents.json.gz — Primary Documents Index

Index of primary-source documents archived from theory_primary_documents: court filings, FOIA releases, congressional reports, transcripts, and more.

{
  "generated_at": "...",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "count": 10,
  "records": [
    {
      "id": "uuid",
      "theory_slug": "mkultra",
      "title": "Church Committee MK-ULTRA Report",
      "doc_type": "congressional_report",
      "source_url": "https://govinfo.gov/...",
      "archived_org_url": "https://archive.org/...",
      "page_count": 50,
      "byte_size": 2048000,
      "ingest_date": "2026-06-13T..."
    }
  ]
}

graph.json.gz — Theory Relations Graph

Force-directed graph data: top-200 most-connected theory nodes with verdict metadata, and all edges between them. Used to power the /graph visualization.

{
  "generated_at": "...",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "meta": {
    "node_cap": 200,
    "total_theories": 438
  },
  "nodes": [
    { "slug": "qanon", "title": "QAnon", "verdict": "debunked", "related_count": 18 }
  ],
  "edges": [
    { "from_slug": "qanon", "to_slug": "deep-state", "relation": "evolved_from" }
  ]
}

sitemap-full.json.gz — Full URL Sitemap

Flat list of all public URLs on conspirafy.com: theory pages, source domain pages, document pages, categories, topic hubs, verdict pages, and static routes.

{
  "generated_at": "...",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "count": 1200,
  "urls": [
    "https://conspirafy.com/",
    "https://conspirafy.com/conspiracies/jfk-assassination",
    "https://conspirafy.com/sources/nytimes.com",
    "..."
  ]
}

source-health.json.gz — Source Health Snapshot

Daily aggregate of source alive/broken/unchecked counts plus top-50 broken domains. Used to power the /sources/health dashboard.

{
  "generated_at": "...",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "snapshot_date": "2026-06-13",
  "total": 4200,
  "alive": 3800,
  "broken": 280,
  "unchecked": 120,
  "alive_percentage": 90.48,
  "top_broken_domains": [
    { "domain": "example-broken.com", "broken_count": 12 }
  ]
}

Usage

Browser / fetch

// fetch() transparently decompresses gzip when served with content-encoding
const res = await fetch('https://data.conspirafy.com/theories.json.gz');
const data = await res.json(); // already decompressed
console.log(data.records.length); // e.g. 438 theories

curl

curl -L --compressed https://data.conspirafy.com/theories.json.gz | jq '.count'

Python

import requests, gzip, json

r = requests.get('https://data.conspirafy.com/theories.json.gz')
# requests auto-decompresses gzip content-encoding
data = r.json()
print(f"{data['count']} theories")

Enterprise API

Need higher-frequency refreshes, custom filtered exports, a dedicated API key, or SLA guarantees? We offer enterprise access for research institutions, journalists, and large-scale platforms.