Skip to main content
Breakout Brands Timeline

TWCPG Archive — the deal-flow corpus

1,913 brands, 2,321 events, Sep 2022 – May 2026. Downloadable.

A second dataset, complementary to the 419 curated breakout-list brands: the full structured deal flow underneath. We extracted every M&A and fundraising event from the This Week in CPG newsletter archive (2022-09-24 → 2026-05-07), then enriched each new brand to the same standard as the original cohort — founded year, ownership, revenue, acquirer, sources cited.

The merged dataset is 2,274 brands, 99.7% web-research-enriched. This page documents what's in the deal-flow corpus and how to use it.

If you want to slice it yourself, head to Explore. If you want every brand, Brands is the canonical merged table.


The corpus, in numbers

Metric Value
Newsletter posts processed 413
Date range 2022-09-24 → 2026-05-07
Total deal events 2,321
M&A events 1,095
Fundraising events 1,226
Unique brands in corpus 1,913
Brands in both corpus and breakout lists 55
Brands in corpus only (not on any list) 1,857

The TWCPG-corpus brands plus the 362 breakout-list-only brands make up the 2,274-brand merged dataset on Brands. All numbers below are raw counts from the corpus — interpret them as you see fit.


Top acquirers

By deal count across the full corpus:

Acquirer Deals Total disclosed
L Catterton 15 $20.0B
L'Oreal 11 $17.4B
Authentic Brands Group 8 $2.9B
LVMH 8 $2.7B
Keurig Dr Pepper 6 $37.2B
Diageo 6 undisclosed throughout
Bansk Group 5 $100M
Lactalis 5 $4.3B

Top lead investors

By rounds led:

Investor Rounds led Distinct brands
Unilever Ventures 15 14
L Catterton 14 14
Iris Ventures 10 10
L'Oreal 9 9
Silas Capital 9 9
CAVU Consumer Partners 7 7
InvestBev 6 6
Sandbridge Capital 6 6

How the dataset was built

Three extraction phases against the newsletter archive, then a merged enrichment pass against the same WebSearch-backed worker that produced the original 419-brand cohort:

  1. Content sample (May 2026). 24 stratified posts → structural taxonomy of how the newsletter formats its bullets.
  2. Full-archive extraction (May 2026). Per-post haiku call with bullet-level validation across 413 posts. Output: structured M&A + fundraising bullets per post.
  3. Bullet enrichment (May 2026). Re-parse every deal bullet to extract founded_year, amount_usd, round_stage, normalized acquirer/investor names. Output: data/twcpg/enriched.merged.cleaned.json.
  4. Merged brand enrichment (May 2026). The 1,857 brands surfaced by TWCPG that weren't in the original cohort were seeded as records in data/brands.json and run through the same enrich_brand worker that produced the original 419-brand enrichment — full WebSearch, full source citation per field. This brings the entire merged dataset to a single enrichment standard.

Specs live under docs/superpowers/ in the project repo. The original 419-brand pipeline is documented on Loop.


Caveats


Downloads

The full per-event corpus and the derived per-brand index are downloadable as JSON. If you want to use this data in your own analysis or build on top of it, these are the canonical files:

For the full merged brand database (TWCPG corpus joined with breakout-list cohort), the canonical view is Brands. For interactive pivoting with saved views, use Explore.


If you find something interesting in the data, ping [email protected].