Overview
OpenAVM Kit can automatically enrich your property data with geographic features from OpenStreetMap (OSM). This adds distance-based features that measure proximity to important amenities like:- Water bodies (lakes, rivers, reservoirs)
- Parks and green spaces
- Educational institutions (universities, colleges)
- Transportation infrastructure (major roads, railways)
- Golf courses
How It Works
OpenStreetMap enrichment:- Downloads OSM data for your locality
- Filters features by type and size (e.g., parks larger than 2,000 m²)
- Identifies top features (e.g., the 5 largest parks)
- Calculates distances from each parcel to nearby features
- Adds distance fields to your dataset
No API key required! OpenStreetMap data is freely available under the Open Database License.
Configuration
Add OpenStreetMap enrichment to yoursettings.json:
Feature Types
Water Bodies
Rivers, lakes, reservoirs, and other water features.Enable water body enrichment
Minimum area in square meters. Filters out small ponds and streams.Recommended values:
10000(1 hectare) - Significant water bodies only5000(0.5 hectare) - Include medium-sized features1000(0.1 hectare) - Include smaller features
Number of largest water bodies to track individually. The top N features will have individual distance fields created.
Property to sort by when selecting top N features. Use
area for water bodies.Parks and Green Spaces
Public parks, gardens, playgrounds, and recreational areas.Enable parks enrichment
Minimum park area in square metersRecommended values:
2000(0.2 hectare) - Neighborhood parks and larger5000(0.5 hectare) - Community parks and larger10000(1 hectare) - Regional parks only
Number of largest parks to track individually
Educational Institutions
Universities, colleges, and other educational facilities.Enable educational institution enrichment
Minimum campus area in square metersRecommended values:
1000(0.1 hectare) - Small colleges and up5000(0.5 hectare) - Medium campuses and up10000(1 hectare) - Large universities only
Number of largest educational institutions to track individually
Transportation
Major roads, highways, railways, and transit infrastructure.Enable transportation enrichment
Minimum feature length in meters. Filters out small road segments.Recommended values:
1000(1 km) - Major routes only500(0.5 km) - Include medium routes100(100 m) - Include most routes
Number of longest transportation routes to track individually
Property to sort by. Use
length for linear features like roads and railways.Golf Courses
Golf courses and related facilities.Enable golf course enrichment
Minimum golf course area in square metersRecommended values:
10000(1 hectare) - Small courses and up50000(5 hectares) - Standard courses only100000(10 hectares) - Large courses only
Number of largest golf courses to track individually
Distance Calculations
Thedistances array defines how to calculate distance features:
Identifier for the feature typeAggregate distances:
water_bodies- Distance to any water bodyparks- Distance to any parkeducational- Distance to any educational institutiontransportation- Distance to any transportation routegolf_courses- Distance to any golf course
water_bodies_top- Distance to each named water bodyparks_top- Distance to each named parkeducational_top- Distance to each named institutiontransportation_top- Distance to each named routegolf_courses_top- Distance to each named course
Field to use for naming individual features. Typically
"name" for top N features. Omit this for aggregate distances.Maximum distance to calculate in the specified unit. Features beyond this distance will be marked as null or max distance.Recommended values:
800m - Walking distance (parks, schools)1500m - Short drive (water, golf courses)3000m - Medium drive (universities, major amenities)
Unit of measurement. Currently
"m" (meters) is standard.Output Fields
OpenStreetMap enrichment creates distance fields in your dataset:Aggregate Distance Fields
Distance to the nearest feature of each type:dist_to_water_bodies_any- Distance to nearest water body (meters)dist_to_parks_any- Distance to nearest park (meters)dist_to_educational_any- Distance to nearest educational institution (meters)dist_to_transportation_any- Distance to nearest transportation route (meters)dist_to_golf_courses_any- Distance to nearest golf course (meters)
Individual Distance Fields
Distance to each of the top N named features:dist_to_water_bodies_lake_travis- Distance to Lake Travis (meters)dist_to_parks_zilker_park- Distance to Zilker Park (meters)dist_to_educational_university_of_texas- Distance to University of Texas (meters)dist_to_transportation_interstate_35- Distance to Interstate 35 (meters)dist_to_golf_courses_barton_creek- Distance to Barton Creek Golf Course (meters)
Field names are automatically sanitized (lowercased, spaces replaced with underscores) to ensure compatibility with most data formats.
Example Configurations
Using in Models
The distance fields are automatically added to your dataset and can be used in models:- Fields starting with
dist_to_are treated as numeric land features - They’re included in feature selection and modeling
- No additional configuration needed
Performance Considerations
Best Practices
Choose Relevant Features
Select feature types relevant to your market:- Residential: Parks, schools, water bodies
- Luxury residential: Golf courses, water bodies, large parks
- Urban: Transportation, parks, educational institutions
- Suburban: Parks, schools, golf courses
Set Appropriate Distances
Consider typical travel modes in your area:- Walking distance: 400-800 meters (parks, schools)
- Short drive: 1,000-2,000 meters (amenities)
- Longer drive: 2,000-5,000 meters (special features)
Balance Granularity
Use both aggregate and individual distances:- Aggregate (
parks) - Captures general proximity to amenity type - Individual (
parks_top) - Captures proximity to specific high-value features
Troubleshooting
No Features Found
Issue: OpenStreetMap enrichment runs but no features are found. Solutions:- Lower the
min_areaormin_lengththresholds - Verify OSM has data for your locality
- Check that feature types exist in your area (e.g., golf courses may be rare in urban areas)
Too Many Features
Issue: Hundreds of distance fields are created. Solutions:- Reduce
top_nto track fewer individual features - Increase
min_areaormin_lengthto filter smaller features - Focus on aggregate distances instead of individual features
Slow Processing
Issue: OpenStreetMap enrichment takes too long. Solutions:- Reduce the number of enabled feature types
- Increase minimum size thresholds
- Reduce
top_nvalues - Process a smaller geographic area
Data Quality Notes
Feature Names May Be MissingNot all OSM features have names. Unnamed features will be skipped when creating individual distance fields. This is normal and expected.
Next Steps
Census API
Add demographic data enrichment
Settings Configuration
Learn more about settings.json structure
Data Processing
Understand the data processing pipeline
Feature Engineering
Learn about feature engineering techniques