How CivicCrawl Works

Everything you need to understand the platform β€” from IRS raw data to the score on every org's profile.

What is CivicCrawl?

CivicCrawl is a data intelligence platform built on the IRS 501(c)(3) Exempt Organizations Master File β€” the official public record of every tax-exempt nonprofit in the United States. We ingest, geocode, score, and visualize over 1.6 million organizations so that nonprofits, funders, and researchers can understand the landscape of civil society at any scale.

How the data flows

1

IRS publishes

eo1–eo4 CSV files + 990 XML extracts

β†’
2

We ingest

Python pipeline normalizes, geocodes, and scores 1.6M records

β†’
3

SQLite DB

impact_mapper.db β€” nonprofits, zip_stats, ntee_weights tables

β†’
4

API routes

/api/nonprofits, /api/map/orgs, /api/map/heatmap, /api/deserts

β†’
5

You explore

Map, search, profiles, dashboard, regional insights