SIGNAL // THECIRCUIT.FELINEUNION.ORG / EDGAR CORPUS PENDING HITS PENDING FRX // 24.05.26 // 13:09:06
> opening data.sec.gov submissions API .............[OK]
> cohort: MSFT AMZN GOOG ORCL META IBM CRM NVDA
> corpus: 8 cos × 4 filings = 32 SEC 10-K / 10-Q
> patterns (round 2):
compute-credit · equity-investment · unrealized-gain
obligated-to-purchase · strategic-investment · mark-to-market
non-GAAP-exclude · perf-obligation · related-party-revenue
customer-concentration
> running regex pass over stripped HTML ............[DONE]
> verdict: PATTERN CONFIRMED ACROSS COHORT
> filing under: not one company. all of them, at different doses.

THIS ISN'T ONE COMPANY

// 1,280 pattern hits across 32 EDGAR filings. The structure is everywhere — at different doses.
Editions I and II read like indictments of named individuals. They aren't. The arrangement they describe is structural — a tax-and-accounting environment shaped to produce this exact behavior at the top of the distribution. So we pulled the SEC's own corpus and grepped it. Eight hyperscaler-or-adjacent companies. Four recent filings each. Ten regex patterns. The chart below shows hits per 100,000 characters of filing text, so a small company writing densely isn't drowned out by a big one writing sparsely. The structure isn't anecdotal. It scales with how much of the company runs on it.
Filed2026.05.24
Sourcedata.sec.gov submissions API
ClassificationReplicable — scraper in repo
SeriesEDITION III
[ 01 ]

Pattern density per company

Each bar is the total number of times any of our eight patterns appears across that company's four most recent 10-K and 10-Q filings. The patterns are blunt regexes — "cloud credits," "strategic investments," "mark-to-market," "non-GAAP exclude," "remaining performance obligations." They don't catch every variant. They establish a floor, not a ceiling. The bar chart below is live, pulled from /data/edgar-prevalence.json.

SCAN: COHORT: FILINGS/CO:
[ PENDING SCRAPE — run `npm run edgar:scan` ]

The single biggest read in this chart is the spread itself. Some companies are saturated in the language of the structure. Others barely register. That doesn't mean the structure isn't there — it means the regex isn't finding it in the corpus, which can happen for two reasons: the company describes the same arrangement differently, or the company genuinely doesn't run on it. Edition III treats both possibilities as open. The scraper is in the repo. Run it yourself.

[ 02 ]

Top of the leaderboard

A handful of headline numbers, computed live from the same data file. They will shift the next time the scraper runs — these aren't permanent claims, they're a snapshot of what the SEC corpus looked like on the date this page was generated.

PEAK COMPANY
pending
TOTAL HITS
across all filings
DOMINANT PATTERN
most-cited regex
EDGAR FILINGS
scanned
[ 03 ]

Which pattern lights up which company

Below each company is its dominant pattern — the single regex that accounts for most of its hits. This is where the texture lives. Microsoft, Oracle, and Salesforce all lean hard on remaining performance obligations — the accounting line where future cloud backlog is disclosed. Salesforce still tops the cohort on both raw and normalized scoring, driven by an unusually heavy strategic investments count from its venture arm. Amazon, which was nearly invisible in the first scan, now scores in line with the cohort once the regex was broadened to cover its preferred vocabulary — equity investments and unrealized gain rather than the literal "cloud credits" phrase. That's the lesson of round one: low scores in regex work usually mean the regex, not the absence.

[ PENDING SCRAPE ]

"The point isn't that the regex caught everything. The point is that even a regex catches this much."

— THE CIRCUIT // METHODOLOGY NOTE
[ 04 ]

What this is, and what it isn't

This is a floor measurement, not a verdict. A regex hit on "cloud credits" doesn't prove anything by itself. It tells you the company uses the language. The volume tells you how central that language is to how they describe themselves to investors. The interesting comparisons are across the cohort, not within any single filing.

Three honest caveats. First, the patterns are still tuned to a specific structural story; a company describing an identical arrangement in entirely different words may still be invisible to them. Second, the corpus is only the four most recent 10-K / 10-Q filings per company; older filings might tell a different story. Third, even normalized counts conflate "high signal" with "high verbosity about the same disclosed item" — Oracle disclosing one performance obligation eight times in one filing scores the same as eight separate obligations.

Round 2 (2026-05-24). Round 1 used a narrower regex set that missed Amazon's vocabulary almost entirely — they scored 4 hits across 4 filings, an obvious artifact. Round 2 broadened cloud-credit to cover "compute / AWS / Azure / GCP credits" and added four patterns covering equity stakes, unrealized gains, purchase commitments, and equity-method investments. Amazon\'s raw count went from 4 → 100. Salesforce stayed the outlier. The change is captured in scripts/edgar-scan.mjs; replication welcome. The right reading: this is still the floor of the structure's visibility — real prevalence is at least this high.

[ 05 ]

How to run this yourself

The scraper is scripts/edgar-scan.mjs in the repo. It is a zero-dependency Node 20 script. The SEC's submissions API does not require an account or a key — it requires a User-Agent header that identifies you with an email. Set it and run:

$ git clone https://github.com/flipKoin/thecircuit.felineunion.org
$ cd thecircuit.felineunion.org
$ cp .env.example .env.local // then set EDGAR_USER_AGENT="you (your@email)"
$ npm run edgar:scan
writes public/data/edgar-prevalence.json
$ npm run edgar:scan -- --only MSFT,ORCL // subset cohort
$ npm run edgar:scan -- --limit 8 // deeper history

Fork the cohort. Fork the regex list. If your version returns different numbers, the cohort and regex you used are part of the disagreement — that's a feature, not a bug. Open for replication. Open for correction.

// END TRANSMISSION

FILED FROM A LO-TEK BUNKER IN REGINA, SK // NO SPONSORS // NO TRACKERS

Source: data.sec.gov submissions API (free, public, no key). Cohort: MSFT, AMZN, GOOG, ORCL, META, IBM, CRM, NVDA. Patterns: cloud-credit, strategic-investment, related-party-revenue, mark-to-market, non-GAAP-exclude, perf-obligation, customer-concentration, credit-arrangement.

SIBLING PROPERTIES //

theloop.felineunion.org — the conditioning works. WP01.
thelaundering.felineunion.org — institutional reputation laundering.
felineunion.org — fediverse mutual aid + community streaming.

THE CIRCUIT // EDITION III // FILED 2026.05.24
OPEN FOR REPLICATION. CITE FREELY. SHARE WIDELY.