EDGAR Insights

About the platform

Company intelligence, straight from the source.

EDGAR Insights turns a stock ticker into a structured, source-linked regulatory & ESG profile of a public company — extracted live from U.S. SEC filings and analyzed with AI. Every data point carries a confidence level and links back to the exact filing it came from.

What it does

Enter a ticker and EDGAR Insights produces 12 standardized data points covering a company's corporate identity, scale, operating footprint, industry, products, and environmental / supply-chain risk profile. It reads the company's most recent annual report directly from the SEC, pulls structured financial facts, and uses AI to extract the answers that aren't available as tidy fields — then scores how confident it is in each one and cites the source.

How to use it

  1. Search a ticker. Start typing a ticker or company name. A live dropdown confirms the company — name, exchange, and SEC CIK — so you know you've got the right entity.
  2. Generate. Click Generate. The first time a company is requested, extraction takes roughly 20–40 seconds (downloading and analyzing the latest annual report). Profiles already in the database load instantly.
  3. Read the profile. A company header (identity, sector, filing links) sits above 12 cards. Each card shows the answer, a High, Medium, or Low confidence pill, a verbatim evidence quote where the answer came from AI, and links to the underlying SEC source.
  4. Export. Use Download JSON or Copy JSON at the bottom of any profile to get the full machine-readable result — values, confidence, and sources.
  5. Regenerate. A cached profile shows a Regenerate button to force a fresh extraction from the latest filing.

The 12 data points

Each answer is tagged by how it's derived: SEC structured comes straight from SEC fields or XBRL financial facts; AI-analyzed is read from the filing text by the AI model and quoted as evidence.

  1. Country (and U.S. state) of incorporation
     From the SEC submissions record.
     SEC structured
  2. Stock-exchange listing(s)
     Exchanges and tickers from SEC data.
     SEC structured
  3. Entity type
     Public company; bank/insurer/fund tags inferred from SIC classification.
     SEC structured
  4. Approximate global annual revenue (USD)
     Latest fiscal-year revenue from XBRL company facts.
     SEC structured
  5. Global employee headcount
     From XBRL where tagged, else the company-wide total in the filing text.
     XBRL / AI
  6. Countries & U.S. states of operations
     From the business description and Exhibit 21 (subsidiaries).
     AI-analyzed
  7. Regions of significant revenue
     Geographic revenue breakdown with shares where stated.
     AI-analyzed
  8. Primary industry sector (GICS)
     Mapped from SIC code and cross-checked by AI.
     SEC + AI
  9. Physical products & categories
     Whether the company makes/sells physical goods, and which categories.
     AI-analyzed
  10. Environmental / sustainability claims
      Whether the filing makes sustainability claims.
      AI-analyzed
  11. Scope 1+2 GHG / energy intensity
      High / Moderate / Low, from disclosures and a sector rule.
      AI + sector
  12. Operations / supply-chain risk factors
      Resource extraction, deforestation commodities, hazardous substances, conflict minerals, labor risk, water use, and more.
      AI-analyzed

Where the data comes from

All data comes from the U.S. Securities & Exchange Commission's EDGAR system; there are no other external data providers.

Foreign filings are supported (e.g. 20-F), and a translation path is built in for future non-English / EU sources.
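
For readers who want to check a source by hand, here is a minimal sketch of how EDGAR's public JSON URLs are built from a company's CIK. The two URL patterns are the SEC's published ones for the submissions record and XBRL company facts; whether EDGAR Insights calls these exact endpoints is an assumption, and the helper name edgar_urls is made up for this example.

```python
def edgar_urls(cik: int) -> dict[str, str]:
    """Build public EDGAR JSON URLs for one company (illustrative helper)."""
    # EDGAR's JSON APIs expect the CIK zero-padded to 10 digits.
    padded = f"{cik:010d}"
    return {
        "submissions": f"https://data.sec.gov/submissions/CIK{padded}.json",
        "company_facts": f"https://data.sec.gov/api/xbrl/companyfacts/CIK{padded}.json",
    }


urls = edgar_urls(320193)  # 320193 is Apple Inc.'s CIK
print(urls["submissions"])
print(urls["company_facts"])
```

If you fetch these URLs from a script, note that the SEC asks automated clients to identify themselves with a descriptive User-Agent header.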

How it works

For each request the engine:

Every result is saved to a private database so repeat requests are instant and can be validated over time (see the admin section).
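
The "saved once, instant afterwards" behavior can be pictured as a simple cache-or-extract step. This is only a sketch under stated assumptions: the store is a plain dict standing in for the database, and get_profile, extract, and force are hypothetical names, not the engine's real interface.

```python
from typing import Callable


def get_profile(
    ticker: str,
    store: dict[str, dict],
    extract: Callable[[str], dict],
    force: bool = False,
) -> tuple[dict, bool]:
    """Return (profile, cache_hit). `store` stands in for the database."""
    key = ticker.strip().upper()
    if not force and key in store:
        return store[key], True   # repeat request: served from storage
    profile = extract(key)        # slow path: download and analyze the filing
    store[key] = profile          # save so the next request is instant
    return profile, False


# Demo with a fake extractor; a real one would read the latest filing.
store: dict[str, dict] = {}
fake_extract = lambda t: {"ticker": t}
print(get_profile("aapl", store, fake_extract))              # first request: extraction runs
print(get_profile("AAPL", store, fake_extract))              # second request: cache hit
print(get_profile("AAPL", store, fake_extract, force=True))  # forced fresh extraction
```

Passing force=True mirrors what the Regenerate button is described as doing: it skips the stored copy and extracts again.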

Confidence scoring

Every data point is rated so you know how much to trust it — and can always verify via the linked source.

Level     What it means
High      A structured SEC field, an XBRL financial fact, or two independent methods that agree.
Medium    Read from the filing text by AI with a verbatim supporting quote, or parsed by a targeted text scan.
Low       An inference without direct evidence (e.g. sector-based), or not determinable from the filing.
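
Read as a decision rule, the three levels above could look like the sketch below. The function and its flag names are assumptions made for illustration; the engine's actual scoring logic isn't shown on this page.

```python
from enum import Enum


class Confidence(Enum):
    HIGH = "High"
    MEDIUM = "Medium"
    LOW = "Low"


def score(
    structured: bool = False,     # came from an SEC field or XBRL fact
    methods_agree: bool = False,  # two independent methods gave the same answer
    ai_quote: bool = False,       # AI answer backed by a verbatim quote
    text_scan: bool = False,      # parsed by a targeted text scan
) -> Confidence:
    """Map how an answer was derived to a confidence level (hypothetical flags)."""
    if structured or methods_agree:
        return Confidence.HIGH
    if ai_quote or text_scan:
        return Confidence.MEDIUM
    # Inference without direct evidence, or not determinable from the filing.
    return Confidence.LOW


print(score(structured=True).value)
print(score(ai_quote=True).value)
print(score().value)
```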

Exports

Any profile can be exported as JSON — the complete machine-readable result, including every value, its confidence level and basis, and the list of sources. Use Download JSON to save a file or Copy JSON to copy it to your clipboard. (Administrators can also export many companies at once as CSV — see below.)
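
One thing an exported file makes easy is listing the answers that deserve a second look. The sketch below assumes an export shape with a data_points list whose items carry label and confidence keys; those field names and the sample content are invented for illustration, so adjust them to match a real export.

```python
import json

# Made-up sample in an ASSUMED export shape; real field names may differ.
SAMPLE = """
{
  "ticker": "EXMP",
  "data_points": [
    {"id": 1, "label": "Country of incorporation", "value": "example", "confidence": "High"},
    {"id": 10, "label": "Environmental / sustainability claims", "value": "example", "confidence": "Medium"},
    {"id": 11, "label": "Scope 1+2 GHG / energy intensity", "value": "example", "confidence": "Low"}
  ]
}
"""


def needs_review(profile: dict) -> list[str]:
    """Return the labels of data points rated below High."""
    return [p["label"] for p in profile["data_points"] if p["confidence"] != "High"]


profile = json.loads(SAMPLE)
print(needs_review(profile))
```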

Accuracy & limitations

🔒 For administrators

The tools below are part of the token-protected admin console. They aren't accessible to general users and require an admin access token — everything above is all a standard user needs.

Admin console

The console lives at /admin and is gated by a bearer token. Enter the token once and it's stored in your browser for subsequent visits ("Log out" clears it). The top of the console shows live stats: total companies, stored extractions, total requests, requests in the last 24 hours, and when the data was last refreshed.

Companies table

The Companies tab lists every company in the database — one row each — populated from the most recent validated extraction. It's dense by design, surfacing both structured and AI-derived data points side by side: incorporation, revenue, employees, whether it makes physical products, sustainability claims, emissions intensity, a count of flagged supply-chain risks, an overall confidence indicator, the record's status (active or degraded), and when it was last updated.

Bulk CSV export

Tick rows (or use the header checkbox to select all) and click Export CSV for a wide, analysis-ready file — one row per company with 45 columns: identity, all 12 data points (including numeric revenue and employee counts), a confidence level for each data point, and provenance (status, generated-at, source filing, model, and engine version). The file is RFC-4180 quoted and UTF-8 encoded for clean import into Excel or any data tool.
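
Because the file is RFC-4180 quoted, a standard CSV parser handles commas inside company names without extra work. Here is a small sketch using Python's csv module; the three column names in the sample are placeholders, not the export's real 45 headers.

```python
import csv
import io


def read_export(text: str) -> list[dict[str, str]]:
    """Parse CSV text into one dict per company row."""
    # newline="" leaves line endings untouched so quoted fields parse correctly.
    return list(csv.DictReader(io.StringIO(text, newline="")))


# Made-up sample with hypothetical column names.
sample = 'ticker,company_name,revenue_usd\r\nEXMP,"Example, Inc.",1000\r\n'
rows = read_export(sample)
print(rows[0]["company_name"])      # the quoted comma stays inside the field
print(int(rows[0]["revenue_usd"]))  # numeric columns arrive as text; convert as needed
```

For a downloaded file, opening it with open(path, encoding="utf-8", newline="") and passing the handle to csv.DictReader works the same way.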

The database

Every request and result is logged to a private, API-accessible Cloudflare D1 (SQLite) database with four tables:

Table             Holds
companies         One row per tracked company: identity, status, and latest filing reference.
extractions       Every generated profile, stored as full JSON and timestamped.
request_log       Every API request: ticker, endpoint, status, cache hit, duration, and country.
validation_log    Every refresh/validation outcome and which fields changed.
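
To make the four tables concrete, here is a rough sketch of what such a schema might look like, created in an in-memory SQLite database. Only the table names come from the list above; every column name and type is a guess based on the descriptions, not the production schema.

```python
import sqlite3

# Column names and types are ASSUMPTIONS inferred from the table descriptions.
SCHEMA = """
CREATE TABLE companies (
    cik           TEXT PRIMARY KEY,
    ticker        TEXT NOT NULL,
    name          TEXT NOT NULL,
    status        TEXT NOT NULL,
    latest_filing TEXT
);
CREATE TABLE extractions (
    id           INTEGER PRIMARY KEY,
    cik          TEXT NOT NULL REFERENCES companies(cik),
    profile_json TEXT NOT NULL,
    generated_at TEXT NOT NULL
);
CREATE TABLE request_log (
    id          INTEGER PRIMARY KEY,
    ticker      TEXT,
    endpoint    TEXT,
    status      INTEGER,
    cache_hit   INTEGER,
    duration_ms INTEGER,
    country     TEXT
);
CREATE TABLE validation_log (
    id             INTEGER PRIMARY KEY,
    cik            TEXT NOT NULL REFERENCES companies(cik),
    outcome        TEXT NOT NULL,
    changed_fields TEXT,
    checked_at     TEXT NOT NULL
);
"""

db = sqlite3.connect(":memory:")
db.executescript(SCHEMA)
tables = sorted(
    row[0] for row in db.execute("SELECT name FROM sqlite_master WHERE type = 'table'")
)
print(tables)
```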

Scheduled validation & refresh

A scheduled worker (edgar-refresh) runs daily at 06:17 UTC to keep stored data current.

Admin API

Everything in the console is backed by a JSON API. The public endpoints are GET /api/search and POST /api/generate. Admin endpoints live under /api/admin/* and require an Authorization: Bearer <token> header:

# Example: pull a company's full stored profile
curl -H "Authorization: Bearer $ADMIN_TOKEN" \
  "https://edgar.lukewade.net/api/admin/profiles?q=apple"
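
The same calls can be prepared from Python with the standard library. This sketch only builds the requests; it doesn't send them. The base URL and the admin q parameter come from the curl example above, while the q parameter on /api/search and the JSON body for /api/generate are assumptions about the request shape.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request

BASE = "https://edgar.lukewade.net"


def search_request(query: str) -> Request:
    # ASSUMPTION: search takes the query as ?q=..., like the admin endpoint.
    return Request(f"{BASE}/api/search?{urlencode({'q': query})}", method="GET")


def generate_request(ticker: str) -> Request:
    # ASSUMPTION: generate accepts a JSON body of the form {"ticker": "..."}.
    body = json.dumps({"ticker": ticker}).encode("utf-8")
    return Request(
        f"{BASE}/api/generate",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def admin_profiles_request(query: str, token: str) -> Request:
    # Mirrors the curl example: bearer token in the Authorization header.
    return Request(
        f"{BASE}/api/admin/profiles?{urlencode({'q': query})}",
        headers={"Authorization": f"Bearer {token}"},
    )


req = admin_profiles_request("apple", "example-token")
print(req.get_method(), req.full_url)
```

To actually send one, pass it to urllib.request.urlopen and decode the response body as JSON.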