Shovel · a software studio · est. 2026

We build autonomous
data products.
They run themselves.

A fleet of autonomous software agents working relentlessly to turn scattered public records, filings, and noise into structured data products.

See what's shipping ↓ Book a consult →
24/7 · team-built · AI-powered data infrastructure
01 · Portfolio

Autonomous products,
shipping live.

All products →

Mining Incidents

live

Searchable research tool for US mining safety records.

~500k records · 1983–present · refreshed weekly
What it does
What it does

Every reportable mining incident MSHA has on file — fatal and non-fatal — searchable by mine, operator, year, severity, or a keyword in the investigator's narrative. Built for safety inspectors training crews, researchers chasing patterns, journalists chasing stories, and families trying to find a record. Updated weekly from MSHA's open datasets.

Why we built it

I took an MSHA training course taught by Kim Redding and walked out unsettled. Every incident was on the record, but the public couldn't really find it — the agency's own search makes the data hard to see. This is a small counterweight: the same public record, opened up. Credit for the spark belongs to Kim.

How it works
  1. Pull the latest MSHA Accidents/Injuries/Illnesses snapshot from msha.gov each Monday.
  2. Normalize the schema and join against the Mine and Operator reference tables so every row carries operator name, mine name, and state.
  3. Run the investigator's-narrative field through an LLM classifier for incident category and severity.
  4. Index the result into Postgres with full-text search on the narrative field.
  5. Publish to miningincidents.org; preserve the original MSHA report ID end-to-end so every record traces back to the source row.

The Vault Report

live

Precious metals intelligence.

COMEX + LME ticks · warehouse flows · refreshed daily
What it does
What it does

COMEX and LME pricing and warehouse flows — synthesized daily into briefings, charts, and a fully automated video pipeline.

Why we built it

Gold coverage is either Bloomberg terminals (expensive, institutional) or YouTube stackers (vibes). There's a gap between those two for someone who just wants the feed: prices, flows, what the internet is saying, every morning. That's the Vault Report.

How it works
  1. Poll COMEX (CME Group) and LME tick feeds at 1-second cadence for gold, silver, platinum, and palladium.
  2. Join price action against COT positioning and on-chain warehouse flows into a single daily briefing each morning UTC.
  3. Render the briefing into a chart pack and feed the chart pack into the automated video pipeline.
  4. Publish to thevaultreport.com and push the briefing to subscribers.
02 · Datasets

The ground truth
underneath every product.

Every product sits on top of a live dataset that Shovel collects, cleans, and keeps fresh. Below: what's growing in the pit.

MSHA Accidents & Fatalities

Refreshed weekly

Every injury and fatality reported to the Mine Safety and Health Administration — normalized, searchable, kept fresh.

Coverage · 1983 — present

Precious Metals Tick Feed

1-second polling

COMEX and LME feeds for gold, silver, platinum, and palladium — second-by-second.

Coverage · Live
03 · Studio

The
factory
owner.

Shovel runs two live US public-records databases: miningincidents.org (~500,000 MSHA mining-incident records, 1983 to present, refreshed weekly) and thevaultreport.com (daily COMEX and LME tick data plus warehouse flows). We design the playbooks; a fleet of agents executes them — fetch the source feed, normalize the schema, classify the narrative, index, publish.

Not an agency. Not a consultancy. A small team running pipelines that fetch from MSHA, CME Group, LME, and Twitter every day, then put the cleaned result behind a public search box. Used by safety inspectors, journalists, researchers, and families.

About the studio →

Want to talk shop? Book a half-hour consult — pay what you want →

04 · FAQ

Questions
and answers.

Where can I search MSHA mining incident reports?

miningincidents.org — every reportable MSHA mining incident from 1983 to present (~500,000 records), searchable by mine, operator, year, severity, and a keyword in the investigator's narrative. Refreshed weekly from MSHA's open datasets.

Where can I read a daily precious-metals report?

thevaultreport.com — COMEX and LME tick data for gold, silver, platinum, and palladium, plus COT positioning and warehouse flows. Refreshed daily into briefings, charts, and a video pipeline.

Can I license the underlying datasets?

The full corpus (MSHA + precious metals) is sold as a one-time CSV + Parquet snapshot for $2,000 in BTC — a perpetual, single-organization LLM-training license, aimed at LLMs and agents. Machine-readable terms live at /llms.txt. If you're a human with a data question, book a consult instead. Details at byshovel.com/licensing.