PolicyEngine US Development Guidelines

Commands

Build: make build
Install: make install or pip install -e .[dev]
Format: make format (runs black with line length 79)
Test: make test (all tests)
Run single test: pytest path/to/test_file.py::test_function
Run specific YAML tests: policyengine-core test path/to/tests -c policyengine_us [-v]
Run microsimulation test: pytest policyengine_us/tests/microsimulation/test_microsim.py
Checkout a PR: gh pr checkout [PR-NUMBER]

Unit tests with scalar values can pass while vectorized microsimulation fails
When implementing a previously empty variable, be sure to check for dependent formulas
When using defined_for, ensure it's tested in microsimulation context
Be careful with chained comparisons in formulas - they work with scalars but fail with arrays
Prefer explicit vectorized comparison operators joined with & and |
For scale parameters that return integers, avoid using rate_unit: int in metadata (use /1 instead) due to parameter validation issues
Parameter file naming matters: make sure variables reference the exact parameter file name
When refactoring from enum to numeric values, update all downstream dependencies consistently
Prefer parameter-driven calculations over hardcoded logic in formulas when possible
Use bool instead of int or /1 in rate_unit for scale parameters when appropriate to avoid validation issues
For categorization logic, create separate variables for each test (e.g., is_school_age) and use them in downstream variables
When dynamically determining numeric groups/categories, calculate values rather than hardcoding them (e.g., max_value + 1)
Structure parameter files to match program documentation and real-world rules to improve maintainability
Program takeup is assigned during microdata construction, not simulation time
- Changes to takeup parameters (SNAP, EITC, etc.) have no effect in the web app
- These parameters should include economy: false in their metadata

Cross-check parameter values against authoritative external sources (gov websites, calculators)
Document the source, publication date, and effective dates in parameter metadata
Include both title and href for references to maintain traceability
Use multiple sources to validate complex parameters (e.g., tax brackets, benefit amounts)
For annually updated values, document the adjustment methodology and inflation indices
Test parameter values with real-world examples through YAML tests
Be especially careful with multiplicative factors, which can significantly magnify errors
Consider interactions between parameters when updating values
When updating parameters, verify both individual values and downstream impacts
Use descriptive changelog entries that reference authoritative sources for updates

When using breakdown metadata in parameters, avoid using variable references for integer values. Instead use Python expressions like range(1, 5).
The parameter validation system in policyengine-core has issues with certain parameter structures:
- Using boolean keys (True/False) as parameter names can cause validation errors
- Using integer output variables in breakdown metadata can cause parameter validation errors
- If you encounter UnboundLocalError: cannot access local variable 'possible_values', it's likely due to parameter validation issues
To fix parameter validation issues:
- Split complex parameters into separate, simpler parameter files
- Use string names instead of boolean keys (e.g., "eligible" instead of True)
- Use Python expressions like range() instead of variable references for enumerated values
- Use breakdown patterns that match existing working examples in the codebase
- See GitHub issue #346 for more details