
Conspirafy Data

The full Conspirafy corpus is available as static, gzipped data files. Updated nightly. Free to use under CC BY-SA 4.0.

7 artifacts · Nightly refresh · CORS-enabled · gzip transparent decompression

Attribution requirement

All data is published under Creative Commons Attribution-ShareAlike 4.0 (CC BY-SA 4.0). You are free to copy, redistribute, remix, and build upon this data for any purpose, even commercially, as long as you:

  • Credit Conspirafy (conspirafy.com) visibly in your project.
  • Link to https://conspirafy.com from any published derivative work.
  • License your derivative work under CC BY-SA 4.0 or a compatible license.
  • Do not imply that Conspirafy endorses your product or its conclusions.

Available artifacts

Artifact | URL | Content-Type | Refresh | Size
Theory Lookup Table | https://data.conspirafy.com/theories.json.gz | application/json (gzip) | Nightly (3 AM UTC) | ~110 KB gzipped
Full Theory Archive | https://data.conspirafy.com/theories-full.json.gz | application/json (gzip) | Nightly (3 AM UTC) | ~2.3 MB gzipped
Source Health CSV | https://data.conspirafy.com/sources.csv.gz | text/csv (gzip) | Nightly (3 AM UTC) | ~10 KB gzipped
Primary Documents Index | https://data.conspirafy.com/documents.json.gz | application/json (gzip) | Nightly (3 AM UTC) | ~2 KB gzipped
Theory Relations Graph | https://data.conspirafy.com/graph.json.gz | application/json (gzip) | Nightly (3 AM UTC) | ~13 KB gzipped
Full URL Sitemap | https://data.conspirafy.com/sitemap-full.json.gz | application/json (gzip) | Nightly (3 AM UTC) | ~16 KB gzipped
Source Health Snapshot | https://data.conspirafy.com/source-health.json.gz | application/json (gzip) | Nightly (3 AM UTC) | ~1 KB gzipped

All .gz files are served with the Content-Encoding: gzip response header, so browsers and fetch() decompress them transparently. The JSON artifacts can be parsed directly from the response with no separate decompression step.
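When the bytes do not pass through a client that honors the content-encoding header (for example, a file saved to disk with a tool that skips decoding), they may still be gzipped. The following is a minimal Python sketch for that case, not an official client: `decode_artifact` is a hypothetical helper name, and the only assumption is the standard two-byte gzip magic number.

```python
import gzip
import json

GZIP_MAGIC = b"\x1f\x8b"  # first two bytes of every gzip stream (RFC 1952)


def decode_artifact(raw: bytes) -> dict:
    """Parse a JSON artifact whether or not the bytes are still gzipped.

    Hypothetical helper: checks the gzip magic number and decompresses
    only when it is present, then parses the result as UTF-8 JSON.
    """
    if raw[:2] == GZIP_MAGIC:
        raw = gzip.decompress(raw)
    return json.loads(raw.decode("utf-8"))
```

Because a JSON document starts with `{` or `[`, it cannot be mistaken for a gzip stream, so the check is safe to run on already-decompressed responses.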

Schema documentation

theories.json.gz: Theory Lookup Table

Slim record per approved theory. Designed for browser extension lookup. Fields: slug, title, verdict, verdict_confidence, summary, category_slug, last_updated, evidence_count, source_count, related_slugs.

{
  "generated_at": "2026-06-13T03:00:00.000Z",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "count": 438,
  "records": [
    {
      "slug": "jfk-assassination",
      "title": "JFK Assassination Conspiracy",
      "verdict": "partially_true",
      "verdict_confidence": 72,
      "summary": "Multiple credible lines of evidence...",
      "category_slug": "government-politics",
      "last_updated": "2026-06-10T12:00:00.000Z",
      "evidence_count": 14,
      "source_count": 22,
      "related_slugs": ["cia-drug-trafficking", "watergate"]
    }
    // ... more records
  ]
}
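For lookup-style use, the records array can be turned into a dictionary keyed by slug. This is a sketch under the assumption that every record carries the fields shown in the sample above; `index_by_slug` and `related_titles` are illustrative names, not part of any published library.

```python
from typing import Any


def index_by_slug(payload: dict[str, Any]) -> dict[str, dict[str, Any]]:
    """Map each record's slug to the record itself for constant-time lookups."""
    return {record["slug"]: record for record in payload["records"]}


def related_titles(index: dict[str, dict[str, Any]], slug: str) -> list[str]:
    """Return titles of related theories that are present in the index.

    Unknown slugs (the requested one, or any related slug missing from
    the index) are skipped rather than raising.
    """
    record = index.get(slug)
    if record is None:
        return []
    return [index[s]["title"] for s in record["related_slugs"] if s in index]
```

Skipping missing related slugs is a design choice for this sketch; a stricter consumer might prefer to log or raise instead.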
theories-full.json.gz: Full Theory Archive

Full theory records including body_md, evidence arrays (title, type, strength, description), sources, timeline events, key figures, FAQ, and related-theory pairs. For LLM RAG and academic consumers.

{
  "generated_at": "...",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "count": 438,
  "records": [
    {
      "slug": "jfk-assassination",
      "title": "JFK Assassination Conspiracy",
      "verdict": "partially_true",
      "verdict_confidence": 72,
      "verdict_summary": "...",
      "summary": "...",
      "body_md": "## Background\n...",
      "category_slug": "government-politics",
      "last_updated": "...",
      "evidence": [
        { "title": "Warren Commission", "type": "debunking", "strength": "strong", "description": "...", "sort_order": 1 }
      ],
      "sources": [
        { "url": "https://...", "credibility": "high", "archived_url": "https://archive.org/..." }
      ],
      "timeline": [
        { "event_date": "1963-11-22", "headline": "Assassination in Dallas", "body": "..." }
      ],
      "key_figures": [
        { "name": "Lee Harvey Oswald", "role": "subject", "bio": "..." }
      ],
      "faq": [
        { "question": "Was there a second shooter?", "answer": "..." }
      ],
      "related_theories": [
        { "slug": "cia-drug-trafficking", "relation": "similar_claim" }
      ]
    }
  ]
}
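For retrieval-style consumers, one possible approach is to flatten each evidence item into its own text chunk. The sketch below assumes the evidence fields shown in the sample above (title, type, strength, description, sort_order); the chunk format and the `evidence_chunks` name are illustrative choices, not a prescribed format.

```python
def evidence_chunks(record: dict) -> list[str]:
    """Return one text chunk per evidence item, ordered by sort_order.

    Each chunk is prefixed with the theory title so it stays meaningful
    when retrieved on its own.
    """
    items = sorted(record.get("evidence", []), key=lambda e: e.get("sort_order", 0))
    return [
        f"{record['title']} | {e['type']} ({e['strength']}): "
        f"{e['title']}. {e['description']}"
        for e in items
    ]
```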
sources.csv.gz: Source Health CSV

One row per source domain: credibility tier, citation count, alive status, date of the last HTTP check, and the status code it returned.

domain,credibility,citation_count,is_alive,last_checked,http_status
"nytimes.com","high",42,true,"2026-06-12T03:00:00Z",200
"infowars.com","low",8,true,"2026-06-12T03:00:00Z",200
"example-broken.com","unknown",3,false,"2026-06-12T03:00:00Z",403
documents.json.gz: Primary Documents Index

Index of primary-source documents archived from theory_primary_documents: court filings, FOIA releases, congressional reports, transcripts, and more.

{
  "generated_at": "...",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "count": 10,
  "records": [
    {
      "id": "uuid",
      "theory_slug": "mkultra",
      "title": "Church Committee MK-ULTRA Report",
      "doc_type": "congressional_report",
      "source_url": "https://govinfo.gov/...",
      "archived_org_url": "https://archive.org/...",
      "page_count": 50,
      "byte_size": 2048000,
      "ingest_date": "2026-06-13T..."
    }
  ]
}
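Since each document record points back to a theory through theory_slug, the index can be grouped per theory. This is a sketch assuming the record shape in the sample above; `documents_by_theory` is an illustrative name.

```python
from collections import defaultdict


def documents_by_theory(payload: dict) -> dict[str, list[dict]]:
    """Group document records by the theory_slug they belong to."""
    grouped: dict[str, list[dict]] = defaultdict(list)
    for doc in payload["records"]:
        grouped[doc["theory_slug"]].append(doc)
    return dict(grouped)
```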
graph.json.gz: Theory Relations Graph

Force-directed graph data: top-200 most-connected theory nodes with verdict metadata, and all edges between them. Used to power the /graph visualization.

{
  "generated_at": "...",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "meta": {
    "node_cap": 200,
    "total_theories": 438
  },
  "nodes": [
    { "slug": "qanon", "title": "QAnon", "verdict": "debunked", "related_count": 18 }
  ],
  "edges": [
    { "from_slug": "qanon", "to_slug": "deep-state", "relation": "evolved_from" }
  ]
}
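The edge list can be converted into an adjacency map for traversal or neighbor lookups. The sketch below treats edges as undirected, which is an assumption made for illustration: the from_slug and to_slug fields and relations such as evolved_from suggest direction may matter for some uses. `build_adjacency` is a hypothetical name.

```python
from collections import defaultdict


def build_adjacency(graph: dict) -> dict[str, set[str]]:
    """Build an undirected adjacency map from the graph's edge list.

    Direction and relation type are discarded here (an assumption of
    this sketch); keep them if your use case needs directed edges.
    """
    adjacency: dict[str, set[str]] = defaultdict(set)
    for edge in graph["edges"]:
        a, b = edge["from_slug"], edge["to_slug"]
        adjacency[a].add(b)
        adjacency[b].add(a)
    return dict(adjacency)
```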
sitemap-full.json.gz: Full URL Sitemap

Flat list of all public URLs on conspirafy.com: theory pages, source domain pages, document pages, categories, topic hubs, verdict pages, and static routes.

{
  "generated_at": "...",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "count": 1200,
  "urls": [
    "https://conspirafy.com/",
    "https://conspirafy.com/conspiracies/jfk-assassination",
    "https://conspirafy.com/sources/nytimes.com",
    "..."
  ]
}
source-health.json.gz: Source Health Snapshot

Daily aggregate of source alive/broken/unchecked counts plus top-50 broken domains. Used to power the /sources/health dashboard.

{
  "generated_at": "...",
  "version": 1,
  "license": "CC BY-SA 4.0",
  "snapshot_date": "2026-06-13",
  "total": 4200,
  "alive": 3800,
  "broken": 280,
  "unchecked": 120,
  "alive_percentage": 90.48,
  "top_broken_domains": [
    { "domain": "example-broken.com", "broken_count": 12 }
  ]
}
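The sample numbers are consistent with total = alive + broken + unchecked and alive_percentage = alive / total × 100, rounded to two decimals (3800 / 4200 gives 90.48). Both relationships are inferred from the sample rather than documented, so the following sanity check is a sketch under that assumption; `check_snapshot` is a hypothetical name.

```python
def check_snapshot(snapshot: dict) -> list[str]:
    """Return a list of consistency problems found in a snapshot.

    Assumes (from the sample only) that the three counts sum to total
    and that alive_percentage is alive / total * 100 to two decimals.
    """
    problems = []
    parts = snapshot["alive"] + snapshot["broken"] + snapshot["unchecked"]
    if parts != snapshot["total"]:
        problems.append(f"counts sum to {parts}, expected {snapshot['total']}")
    total = snapshot["total"]
    expected = round(100 * snapshot["alive"] / total, 2) if total else 0.0
    if abs(expected - snapshot["alive_percentage"]) > 0.01:
        problems.append(
            f"alive_percentage is {snapshot['alive_percentage']}, expected {expected}"
        )
    return problems
```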

Usage

Browser / fetch

// fetch() transparently decompresses gzip when served with content-encoding
const res = await fetch('https://data.conspirafy.com/theories.json.gz');
const data = await res.json(); // already decompressed
console.log(data.records.length); // e.g. 438 theories

curl

curl -L --compressed https://data.conspirafy.com/theories.json.gz | jq '.count'

Python

import requests

r = requests.get('https://data.conspirafy.com/theories.json.gz', timeout=30)
r.raise_for_status()  # stop early on HTTP errors instead of parsing an error page
# requests auto-decompresses gzip content-encoding
data = r.json()
print(f"{data['count']} theories")

Enterprise API

Need higher-frequency refreshes, custom filtered exports, a dedicated API key, or SLA guarantees? We offer enterprise access for research institutions, journalists, and large-scale platforms.

Contact us about enterprise access