Six years

Six years observing the open internet.

AI Analytics has been operating since 2020. Below is the public-facing trajectory: when we started measuring, when the probe network expanded, and what we shipped when. Internal milestones aren't listed.

2026

22 milestones

Q4 · Writing
94 technical writeups published
Field guides and deep-dives across the federal data catalog and the accountability datasets.
Q3 · Voidly
The Genetic Privacy Ledger launched — DNA custody and the statute wave
Fifteenth accountability dataset: the 23andMe custody chain — the 2023 credential-stuffing breach, the UK ICO’s £2.31M penalty, the $305M bankruptcy sale of the genetic database, and the settlements still resolving — as 42 documented events with register and procedural status, plus the 12-state DTC genetic-privacy statute annex and the federal gap map. Statute and court records only, zero personal data; keyless JSON, CC BY.
Q3 · Voidly
The BOP Ledger launched — every federal prison, weekly
Fourteenth accountability dataset: all 133 Federal Bureau of Prisons institutions rebuilt weekly from BOP’s own keyless JSON feeds — 138,553 people in custody with security level, type, region, and per-institution population; the 155-contract halfway-house layer with named operators; the FY1980–present system series; and the documented arc of federal private prisons (14,095 people before the 2021 ban, zero today, still zero after the 2025 rescission). Facility/system-level, fail-closed gates, a standing reappearance tripwire; keyless JSON, CC0.
Q3 · Voidly
The 287(g) Wave launched — who signed up to enforce immigration law
Thirteenth accountability dataset: every signed 287(g) agreement in ICE’s own participating-agencies file — 2,123 agreements across 1,804 law-enforcement agencies, 94% of them signed since January 2025, with the month-by-month signing wave, county FIPS joins, and the source file’s own defect ledger preserved and corrected in the open. Agency-level, fail-closed schema gates; keyless JSON, CC0.
Q3 · Voidly
The Detention Ledger launched — who runs ICE detention beds
All 203 ICE detention facilities from ICE’s own statistics file — 66,161 held on an average day, last-inspection status, guaranteed-minimum bed arithmetic, and an evidence-tiered operator spine that names a private operator only where a federal award or the operator’s own SEC filing does. Built behind schema-enforced privacy gates; keyless JSON, CC0.
Q3 · Voidly
GridOwners launched — who owns US generating capacity
Eleventh accountability dataset: all 27,768 operable US generators (1.38 TW at 14,189 plants) resolved to entity-level owners from Form EIA-860 2025 Early Release — operator by default, Schedule-4 ownership splits where filed (261 GW). Independent power producers (37.4%) now out-own investor-owned utilities (32.4%); FPL is the largest single owner at 40 GW.
Q3 · Voidly
Section 117 Ledger launched — foreign money in US higher education
$62.4B in foreign gifts and contracts disclosed by 528 US institutions since 1981, aggregated from the Department of Education public file by institution and source country — with the anonymity built into the statute (2.9% of dollars carry a named source). Aggregate-only, zero-PII pipeline; keyless JSON, CC0.
Q3 · Voidly
The Shell Map launched — 38 verified ownership chains, interactive
38 ownership chains behind the largest conduit-flagged and no-country blocks of foreign-held US land, traced through public documents with adversarial verification and a chain-by-chain defamation and privacy review. Sovereign funds behind quiet flags, blank filings resolving to documented parents, and chains that end where no record names the investors — every node sourced.
Q3 · Writing
95 technical writeups published
Field guides and deep-dives across the federal data catalog and the accountability datasets.
Q2 · Voidly
DarkRegister — the beneficial-ownership transparency tracker — launched
A Voidly accountability record tracking the public-access status of 31 national beneficial-ownership registers (EU-27 plus the UK, US, Ukraine, Canada) after the 2022 CJEU ruling closed public access — 25 of 31 are no longer fully open. Captures the open, CC0, PII-free GLEIF ownership graph (3.3M legal entities) as the preservable counterweight.
Q2 · Voidly
SpyLedger — the surveillance-industry accountability record — launched
A Voidly section documenting the public corporate identity and government-designation status of 20 marquee spyware and mass-surveillance vendors (NSO Group, Intellexa, Hikvision, Huawei and more). Every designation is rebuilt from a primary US/EU government source and precisely typed — export control, sanction, equipment-authorization, or investment restriction.
Q2 · Voidly
Verboten — the Global Banned-Books Index — launched
A Voidly section indexing book censorship worldwide: 19,283 banned or restricted titles across 119 countries and 34,987 dated, source-cited ban events, built on the CC-BY banned-books.org Open Censorship Core. Static per-country and per-title pages, a title lookup, and a keyless JSON API answer the question — is this book banned in that country, and why?
Q2 · Data hub
Federal Regulatory Data Hub — 208 datasets, 50M+ records, CC0 1.0
Cross-agency regulatory catalog spanning SEC, FDA, OFAC, DOJ, EPA, CFPB, IRS, FEMA, CDC, NHTSA, FAA, CFTC, CMS, MSHA, OSHA and 40+ other agencies. Entity bridge joins every regulatory event for a company in one query.
Q2 · Org
Site rebuild · governance + methodology published
Public methodology and governance pages; press kit; intel feed; full Schema.org graph; RSS / JSON syndication. Intel briefs, structured data coverage, and cross-linking across all three flagship projects.
Q2 · Voidly
OrganWatch launched — US organ-procurement accountability
A sourced public-record map of the US organ system: OPO dossiers with CMS performance tiers, the OPTN/UNOS oversight structure, a 51-jurisdiction consent-law map, and the for-profit tissue industry — institution-level facts only, zero personal data. Keyless JSON, CC BY 4.0.
Q2 · Voidly
The reference layer — sanctions programs, information rights, privacy law
Three law-and-authority references joined the stack: 41 OFAC sanctions programs, access-to-information statutes across 61 countries, and data-protection law across 61 countries — the law, not the people. Keyless JSON, CC BY 4.0.
Q2 · Writing
132 technical writeups published
Field guides and deep-dives across the federal data catalog and the accountability datasets.
Q1 · Data hub
Regulatory API v1 — cross-agency entity bridge, 150 datasets
Public launch of api.ai-analytics.org. Cross-agency entity bridge keyed on CIK, ticker, UEI, LEI, DUNS, NPI.
Q1 · Swarm
Swarm SDK v0.4
Situational Awareness, EW Coordination, Adversarial Resilience, RF Fingerprinting & Tracking. 163 new tests (465 total).
Q1 · Coverage
Voidly coverage hits 200 countries
37+ probe nodes spanning every continent. 80-domain test list.
Q1 · Tooling
MCP server — 83 tools
voidly-ai/mcp-server enables Claude / GPT / agent frameworks to query the censorship dataset directly.
Q1 · Writing
110 technical writeups published
Field guides and deep-dives across the federal data catalog and the accountability datasets.

2025

8 milestones

Q4 · Data hub
Regulatory data pipeline — D1 ingestion for 130+ federal datasets
Built a daily ingest pipeline on Cloudflare D1 + Workers: EDGAR FTP, openFDA API, OFAC XML, EPA ECHO, SAM.gov, USAspending, FinCEN, FDIC BankFind, CMS provider files, NIST NVD, CISA KEV, NTSB, NHTSA, FAA, MSHA, OSHA. Entity normalization across CIK, UEI, LEI, NPI, DUNS.
Q4 · Swarm
Swarm SDK v0.3
Sender Keys for O(1) group encryption. Sealed Sender.
Q4 · Writing
17 technical writeups published
Long-form technical writeups on the measurement network, the data infrastructure, and OSINT methods.
Q3 · Swarm
Swarm SDK v0.2
MAVLink v2 transport adapter. PX4 / ArduPilot / MAVSDK compatibility.
Q3 · Writing
19 technical writeups published
Long-form technical writeups on the measurement network, the data infrastructure, and OSINT methods.
Q2 · Swarm
Swarm SDK v0.1 — initial release
Gossip mesh routing, Double Ratchet forward secrecy, ML-KEM-768 + X25519 hybrid post-quantum key exchange.
Q2 · Writing
19 technical writeups published
Long-form technical writeups on the measurement network, the data infrastructure, and OSINT methods.
Q1 · Writing
16 technical writeups published
Long-form technical writeups on the measurement network, the data infrastructure, and OSINT methods.

2024

2 milestones

Q4 · Writing
18 technical writeups published
Long-form technical writeups on the measurement network, the data infrastructure, and OSINT methods.
Q3 · Writing
6 technical writeups published
Long-form technical writeups on the measurement network, the data infrastructure, and OSINT methods.

2023

2 milestones

Q4 · Open data
OONI corpus mirrored on HuggingFace
ooni-censorship-historical dataset published; passes 1.66M cumulative downloads over the next two years.
Q2 · Voidly
Voidly cross-source verification online
Reconciler ships across OONI / CensoredPlanet / IODA. Verified-incident tier becomes the default surface.

2022

2 milestones

Q3 · Coverage
Probe network → 100 countries
Vantage selection rules formalized: presence inside affected jurisdictions, ASN diversity, operator safety. Test list grows to 60 domains.
Q1 · Voidly
ML anomaly classifier in production
Five interference classes (DNS, TLS, HTTP, BGP, throttling) graded with confidence scores; corroborated tier introduced.

2021

2 milestones

Q3 · Open data
CC BY 4.0 dataset published
First public release of the Voidly measurement archive — committing to open data as a permanent operating principle.
Q2 · Tooling
Voidly probe v1
Cross-platform Tauri desktop probe with boringtun + tun-rs. Anyone with a network can run a probe; keys never leave the device.

2020

2 milestones

Q4 · Voidly
First measurements collected
Initial 6-country probe network active. The censorship dataset begins.
Q2 · Org
AI Analytics founded
Operator-led collective forms around a shared belief: internet censorship should be measurable, verifiable, and citable.

Have you cited one of our events in a publication or research paper? Tell us at info@ai-analytics.org and we'll add it to the timeline.
