Regenerate TANF data for 2026 (policyengine-us 1.715.3 + Indiana fix) - complete 56/56 #10
Merged
…rtial 23/56
- 9 changed state files, incl. Indiana fix: size-2 $255->$409, size-3 $320->$513 (amount of assistance per IC 12-14-2-5, replacing the standard of need)
- Also updated: HI (763->611), KY (524->341, Nov-2025 cut), CT (833->897), KS (403->429), IL (759->782), AK (924->925), CA_1/CA_2 (region cells)
- precompute.py: YEAR=2026, OUTPUT_DIR fix (frontend/->public/), FPG 2026, stdout line-buffering, --workers flag
- metadata.json: year 2026, FPG 2026, version 1.715.3
- README: data version note (partial regeneration)

Remaining 33 files still on 1.598.0; pending follow-up (draft).

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Regenerate the remaining 33 state files at tax year 2026 on policyengine-us 1.715.3 + Indiana TANF fix (#8543), completing the prior partial 23/56 update. All 56 files now reflect the same model + year.

Add precompute_vec.py: a PolicyEngine-`axes` vectorized generator that computes a full state in ~2-5s (~600x faster than the cell-by-cell path) by building 16 Simulations per state (one per adults x children structure) instead of 15,376. Validated bit-for-bit identical to precompute.py on 10 states covering every code path (plain, person-monthly special vars, county selection, SPM-unit annual vars): 0 mismatches across 138,384+ cells. Includes a per-structure fallback to the cell-by-cell path.

Add scripts/README.md documenting both generators, the vectorization method, and the validation. Update README data-version line to 56/56.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Summary
Regenerates all 56 precomputed TANF data files from `policyengine-us` `1.715.3` (+ the Indiana TANF fix, PR #8543) at tax year 2026, replacing the stale `1.598.0` / 2025 data. Also adds a vectorized data generator that is ~600× faster than the existing one and validated bit-for-bit identical to it.

Of the 51 jurisdictions, 23 produced changed values; the other 28 recomputed to byte-identical data (no diff from the prior committed values).
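
As a rough illustration of the changed / byte-identical split above, the sketch below classifies regenerated files by comparing SHA-256 digests against a previous copy. It is not the method used in this PR (a plain `git diff` gives the same answer); the `classify` helper, the `*.json` pattern, and the `AA.json` / `BB.json` file names are all hypothetical.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path):
    """Return the hex SHA-256 digest of a file's raw bytes."""
    return hashlib.sha256(Path(path).read_bytes()).hexdigest()


def classify(old_dir, new_dir, pattern="*.json"):
    """Split regenerated files into (changed, identical) lists of file names.

    A file counts as identical only if a file with the same name exists in
    old_dir and its bytes hash to the same digest.
    """
    old_dir, new_dir = Path(old_dir), Path(new_dir)
    changed, identical = [], []
    for new_file in sorted(new_dir.glob(pattern)):
        old_file = old_dir / new_file.name
        same = old_file.exists() and sha256_of(old_file) == sha256_of(new_file)
        (identical if same else changed).append(new_file.name)
    return changed, identical


# Tiny demo with made-up files (hypothetical names and contents).
with tempfile.TemporaryDirectory() as tmp:
    old_dir, new_dir = Path(tmp, "old"), Path(tmp, "new")
    old_dir.mkdir()
    new_dir.mkdir()
    (old_dir / "AA.json").write_text('{"benefit": 100}')
    (new_dir / "AA.json").write_text('{"benefit": 100}')
    (old_dir / "BB.json").write_text('{"benefit": 255}')
    (new_dir / "BB.json").write_text('{"benefit": 409}')
    changed, identical = classify(old_dir, new_dir)

print(changed, identical)  # ['BB.json'] ['AA.json']
```

Hashing raw bytes rather than parsed values means a file only counts as unchanged when it is truly byte-identical, which matches the claim made in the summary.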
New: vectorized generator (`scripts/precompute_vec.py`)

The old generator built 15,376 `Simulation` objects per state (one per grid cell); that construction, not the math, was the bottleneck (~25–45 min/state). The new generator builds 16 per state: each uses `axes` groups so that one `Simulation` computes all 961 cells in a vectorized pass (16 × 961 = 15,376).

Measured: 4.3 s vs ~47.6 min on Illinois (~664×). The full 33-state completion run took 74 s.
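
The count reduction described above can be sketched with a toy model. This is not `precompute_vec.py` and makes no PolicyEngine calls: `ToySimulation`, its benefit formula, the 2 × 8 adults-by-children split, and the 31 × 31 income grid are all assumptions chosen only so the totals match the 16 structures and 961 cells quoted here. It shows one object per cell versus one object per structure, followed by an exact comparison of the two outputs.

```python
from itertools import product


class ToySimulation:
    """Hypothetical stand-in for a simulation object; counts constructions."""

    built = 0

    def __init__(self, adults, children, cells):
        ToySimulation.built += 1
        self.adults = adults
        self.children = children
        self.cells = list(cells)  # one cell, or the whole grid

    def calculate(self):
        # Made-up benefit rule, NOT real TANF logic.
        base = 150.0 * self.adults + 100.0 * self.children
        return [max(0.0, base - 0.5 * earned - unearned)
                for earned, unearned in self.cells]


# Assumed layout: 2 adult counts x 8 child counts = 16 structures,
# and a 31 x 31 grid of (earned, unearned) incomes = 961 cells.
STRUCTURES = list(product((1, 2), range(8)))
GRID = [(100.0 * i, 100.0 * j) for i, j in product(range(31), repeat=2)]


def cell_by_cell():
    """Old approach: build one object per grid cell."""
    out = {}
    for adults, children in STRUCTURES:
        for cell in GRID:
            sim = ToySimulation(adults, children, [cell])
            out[(adults, children, cell)] = sim.calculate()[0]
    return out


def vectorized():
    """New approach: build one object per structure, covering the whole grid."""
    out = {}
    for adults, children in STRUCTURES:
        values = ToySimulation(adults, children, GRID).calculate()
        for cell, value in zip(GRID, values):
            out[(adults, children, cell)] = value
    return out


ToySimulation.built = 0
slow = cell_by_cell()
slow_builds = ToySimulation.built

ToySimulation.built = 0
fast = vectorized()
fast_builds = ToySimulation.built

# Exact equality on floats: any difference at all counts as a mismatch.
mismatches = sum(1 for key in slow if slow[key] != fast[key])
speedup = 47.6 * 60 / 4.3  # Illinois timings quoted above, in seconds

print(slow_builds, fast_builds, mismatches, round(speedup))  # 15376 16 0 664
```

In the toy both paths run the same arithmetic, so equality is trivial. The real check matters because the vectorized path goes through different code inside the model.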
Validated bit-for-bit identical to the cell-by-cell generator on 10 states covering every code path: plain states, person-level monthly special vars (IL), county selection (CA), and SPM-unit annual vars (CA/CO). Result: 0 mismatches across 138,384+ cells. A per-structure fallback to the cell-by-cell path guarantees the run can never emit wrong/missing data. Documented in `scripts/README.md`.

Notable changed values (vs `1.598.0`, $0-income household)

Benchmark verification (NC / CBPP "maximum benefit")
Compared all 51 jurisdictions, both household sizes (1 adult + 1 child, 1 adult + 2 children), against the external benchmark:
No genuine model concerns remain (the previously-flagged Indiana gap is fixed by #8543, which this data already carries).
Generator / metadata changes (`scripts/precompute.py`, `metadata.json`)

- `YEAR` 2025 → 2026
- `OUTPUT_DIR` fixed: `../frontend/public/data` → `../public/data` (repo moved Vite → Next; old path no longer exists)
- `--workers` flag
- `metadata.json`: year 2026, FPG 2026, version 1.715.3, CA/PA/VA county-gated

Provenance
Generated locally from `policyengine-us` `1.715.3` + the Indiana fix (PR #8543). Once #8543 is released, a clean re-run on the released version reproduces this output.

🤖 Generated with Claude Code