← Fellow Signal dashboardData, Methods & Sources

Data, methods & sources

Everything here is built from public data with an open, reproducible pipeline. This page documents every source, how each figure is computed, what has been verified exactly versus what is directional, and the honest limitations. The whole pipeline is on GitHub and refreshes monthly.

Sources

Source
What it provides
Access & caveats
Activate companies directory
224 ventures: name, cohort year, hub, verticals, fellows, websites, and the human-written Critical Need / Technology Vision / Potential Impact.
Softr app over an Airtable base; harvested via headless browser (JS-rendered). The complete public directory (re-harvested, unchanged at 224). Activate's marketing pages cite ~235 total, a broader count that includes ventures not in the public directory.
Activate fellows directory
292 fellows: biography, cohort, hub, company, LinkedIn. The bios state degree + university for ~92%.
Separate Softr/Airtable table, same harvest. The complete public directory; Activate cites ~294 total.
USAspending.gov
Federal awards, per-company non-dilutive outcomes, per-space ecosystem funding, and agency breakdowns.
Public API. Recipient exact-match + $25M/award cap to exclude institutional name collisions.
OpenAlex
Founder research footprints (works, citations, h-index, pre-founding topics, affiliations) and field publication velocity.
Public API (large open scholarly index). Name disambiguation is field/domain-aware; 100 of 224 companies resolved, 112 founders.
SEC EDGAR
Form D filings as a private-capital-raised signal.
Full-text search API. Presence/absence only.
IRS Form 990 (ProPublica)
Activate's own revenue, expenses, and net assets by year.
Public nonprofit filings. EIN 47-5502184.
The Engine (engine.xyz)
MIT's 'Tough Tech' venture firm (founded 2016, invests for equity). Its 57 public portfolio companies, mapped onto Activate's verticals for a model-vs-model comparison.
Harvested via headless browser; their taxonomy is coarser, so the comparison is directional positioning.

How each figure is computed

Verified exactly vs. directional

Verified to the number (recomputed from the source records): company and fellow counts, 88% PhD, the 7.2× revenue growth from the 990s, the cohort-shift percentages, the convergence pair counts, and the hub over-index figures.

Entity-matched public-record estimate: the $227.2M federal total is summed from USAspending assistance awards matched to each company by exact recipient name, with a $25M/award cap to drop institutional collisions. The top recipients are confirmed real Activate companies with real DOE grants, but exact-name matching is conservative, it can miss awards (11 companies show NSF-API awards not captured in the all-agency total, and DOD SBIR contracts are excluded). Treat it as a careful lower-bound estimate, not an exact figure.

Directional(trust the ranking, not the magnitude): all keyword-based momentum, research velocity, federal funding by space, and the peer-funder comparison, where the other funder's taxonomy differs from Activate's. These are labeled as such in the interface.

Reported as null: the selection loop closure. The hypothesis that founder research depth predicts funding outcomes does not hold at the current sample, and the dashboard says so rather than dressing up an outlier-driven average.

Limitations

Read the synthesis in the Point of View, or explore the live data in the dashboard.