Census API Configuration

Overview

OpenAVM Kit can enrich your property data with demographic information from the U.S. Census Bureau using the Census API. This adds valuable context like median income, population density, and other block group-level statistics to your dataset.

Why Use Census Enrichment?

Census data provides important neighborhood context for property valuation:

Demographic characteristics - Population, age distribution, household composition
Economic indicators - Median income, employment rates, poverty levels
Housing statistics - Occupancy rates, housing values, rent levels
Geographic context - Block group and tract-level aggregations

This data is automatically joined to your parcels using spatial relationships, adding valuable features for your models.

Getting a Census API Key

The Census API requires a free API key.

Visit the Census API Key Signup page: api.census.gov/data/key_signup.html
Fill out the registration form with your information
Agree to the Census Bureau’s Terms of Service
Submit the form
Check your email for the API key (usually arrives within minutes)

The Census API key is free and has generous rate limits for most use cases. Keep your key secure but note that it’s less sensitive than authentication tokens for other services.

Configuration

Add API Key to .env File

Store your Census API key in the .env file located in the notebooks/ directory:

# Census API Configuration
CENSUS_API_KEY=your_api_key_here

Keep Credentials SecureNever commit your .env file to version control. The file is already in .gitignore, but always verify it’s not being tracked.

Enable Census Enrichment

Add Census configuration to your locality’s settings.json:

{
  "process": {
    "enrich": {
      "census": {
        "enabled": true,
        "year": 2022,
        "fips": "24510",
        "fields": [
          "median_income",
          "total_pop"
        ]
      }
    }
  }
}

Configuration Options

process.enrich.census.enabled

boolean

default:"false"

required

Set to true to enable Census data enrichment

process.enrich.census.year

number

default:"2022"

The Census year to query. Use the most recent available year for current data.Common years:

2020 - Decennial Census
2022 - American Community Survey (ACS) 5-year estimates
2021 - American Community Survey (ACS) 5-year estimates

process.enrich.census.fips

string

required

The 5-digit FIPS code identifying your locality (state + county code)Format: SSCCC where:

SS = 2-digit state code
CCC = 3-digit county code

Examples:

48453 - Travis County, Texas (state 48, county 453)
24510 - Baltimore City, Maryland (state 24, county 510)
06037 - Los Angeles County, California (state 06, county 037)

Find your FIPS code at census.gov/library/reference/code-lists/fips-county-codes.html

process.enrich.census.fields

array

required

List of Census variables to include in your datasetCommon fields:

median_income - Median household income
total_pop - Total population
median_age - Median age
median_home_value - Median home value
pct_owner_occupied - Percentage of owner-occupied housing units
pct_renter_occupied - Percentage of renter-occupied housing units
unemployment_rate - Unemployment rate

Example Configurations

{
  "process": {
    "enrich": {
      "census": {
        "enabled": true,
        "year": 2022,
        "fips": "48453",
        "fields": [
          "median_income",
          "total_pop"
        ]
      }
    }
  }
}

How It Works

Spatial Join Process

Download Census Data - The library queries the Census API for your specified fields at the block group level
Match Geography - Census block groups are spatially joined to your parcels based on location
Add Fields - Census variables are added as new columns to your parcel dataset
Use in Models - The enriched fields are available for feature engineering and modeling

Data Granularity

Census enrichment uses block group level data, which provides a good balance between:

Granularity - Small enough to capture neighborhood variation
Privacy - Large enough to protect individual privacy
Stability - Sufficient sample sizes for reliable estimates

Block groups typically contain 600-3,000 people.

Using Census Data in Code

The Census enrichment happens automatically during the data processing pipeline:

from openavmkit.pipeline import run_pipeline

# Run the full pipeline with Census enrichment enabled
run_pipeline(
    locality="us-tx-travis",
    settings_file="in/settings.json"
)

# The output dataset will include Census fields

You can also use the Census utilities directly:

from openavmkit.utilities.census import enrich_with_census
import geopandas as gpd

# Load your parcel data
parcels = gpd.read_file("parcels.geojson")

# Enrich with Census data
enriched = enrich_with_census(
    gdf=parcels,
    year=2022,
    fips="48453",
    fields=["median_income", "total_pop"]
)

Available Census Variables

The Census API provides hundreds of variables. Here are some commonly used ones:

Economic Variables

median_income - Median household income
per_capita_income - Per capita income
poverty_rate - Percentage below poverty line
unemployment_rate - Unemployment rate
median_earnings - Median earnings for workers

Demographic Variables

total_pop - Total population
median_age - Median age
avg_household_size - Average household size
pct_male - Percentage male
pct_female - Percentage female

Housing Variables

median_home_value - Median home value
median_rent - Median gross rent
pct_owner_occupied - Percentage owner-occupied units
pct_renter_occupied - Percentage renter-occupied units
vacancy_rate - Housing vacancy rate

Education Variables

pct_high_school - Percentage with high school diploma
pct_bachelor_degree - Percentage with bachelor’s degree
pct_graduate_degree - Percentage with graduate degree

Finding More VariablesExplore the full Census API variable list at api.census.gov/data.html. Look for the American Community Survey (ACS) datasets for the most comprehensive economic and demographic data.

Troubleshooting

API Key Not Found

Error: Missing 'CENSUS_API_KEY' in environment Solution: Verify your .env file contains the CENSUS_API_KEY variable and is located in the notebooks/ directory.

Invalid FIPS Code

Error: Invalid FIPS code or no data returned Solution: Double-check your 5-digit FIPS code. Ensure it matches your locality exactly. Use the Census Bureau’s FIPS code reference.

Variable Not Found

Error: Census variable not found Solution: The variable name may have changed between Census years. Check the Census API documentation for your selected year to verify variable names.

Rate Limiting

Error: API rate limit exceeded Solution: The Census API has rate limits. If you’re making many requests:

Add delays between requests
Cache results locally
Contact the Census Bureau for higher limits if needed

Best Practices

Use Recent DataUse the most recent Census year available (typically 2022 for ACS 5-year estimates) to ensure your demographic data is current.

Select Relevant FieldsOnly include Census fields that are relevant to your analysis. This keeps your dataset manageable and reduces API calls.

Understand Block GroupsRemember that Census data is aggregated at the block group level. Multiple parcels in the same block group will have identical Census values.

Don’t Over-Rely on DemographicsWhile Census data is valuable, property characteristics and location typically have stronger predictive power for valuation. Use Census data as supplementary context.

Next Steps

OpenStreetMap

Add geographic features from OpenStreetMap

Settings Configuration

Learn more about settings.json structure

Cloud Storage

Configure cloud storage for your data

Data Processing

Learn about the data processing pipeline

Get Started

Core Concepts

Guides

Configuration

Advanced Topics

Census API Configuration

Overview

Why Use Census Enrichment?

Getting a Census API Key

Configuration

Add API Key to .env File

Enable Census Enrichment

Configuration Options

Example Configurations

How It Works

Spatial Join Process

Data Granularity

Using Census Data in Code

Available Census Variables

Economic Variables

Demographic Variables

Housing Variables

Education Variables

Troubleshooting

API Key Not Found

Invalid FIPS Code

Variable Not Found

Rate Limiting

Best Practices

Next Steps

OpenStreetMap

Settings Configuration

Cloud Storage

Data Processing

Build docs developers (and LLMs) love

Get Started

Core Concepts

Guides

Configuration

Advanced Topics

​Overview

​Why Use Census Enrichment?

​Getting a Census API Key

​Sign Up for an API Key

​Configuration

​Add API Key to .env File

​Enable Census Enrichment

​Configuration Options

​Example Configurations

​How It Works

​Spatial Join Process

​Data Granularity

​Using Census Data in Code

​Available Census Variables

​Economic Variables

​Demographic Variables

​Housing Variables

​Education Variables

​Troubleshooting

​API Key Not Found

​Invalid FIPS Code

​Variable Not Found

​Rate Limiting

​Best Practices

​Next Steps

OpenStreetMap

Settings Configuration

Cloud Storage

Data Processing

Build docs developers (and LLMs) love

Overview

Why Use Census Enrichment?

Getting a Census API Key

Sign Up for an API Key

Configuration

Add API Key to .env File

Enable Census Enrichment

Configuration Options

Example Configurations

How It Works

Spatial Join Process

Data Granularity

Using Census Data in Code

Available Census Variables

Economic Variables

Demographic Variables

Housing Variables

Education Variables

Troubleshooting

API Key Not Found

Invalid FIPS Code

Variable Not Found

Rate Limiting

Best Practices

Next Steps