Overview
Aggregation is the process of combining multiple individual forecasts into a single collective prediction. Metaculus uses sophisticated methods to weight and combine forecasts in ways that leverage the wisdom of crowds.The community prediction is continuously updated as new forecasts are submitted or existing forecasts are updated.
Aggregation Methods
Metaculus supports four primary aggregation methods, defined in theAggregationMethod enum:
Recency Weighted
Method:recency_weightedDefault: Yes (for most questions)
Best for: Standard questions with regular updates
- How it works
- Advantages
- Disadvantages
Each forecast is weighted based on when it was made, with more recent forecasts receiving higher weight. This ensures the community prediction reflects the most current information and understanding.Weighting formula:
- Recent forecasts get maximum weight
- Older forecasts decay exponentially
- The decay rate is calibrated to balance recency with stability
Unweighted
Method:unweightedBest for: Short-term questions, live forecasting, spot predictions
- How it works
- Advantages
- Disadvantages
All active forecasts receive equal weight regardless of when they were made. The aggregate is a simple combination of all current predictions.Process:
- Collect all active forecasts
- Combine with equal weighting
- Generate aggregate distribution
Unweighted aggregation is recommended for questions that close within hours or days, or where forecasting happens in real-time during an event.
Single Aggregation
Method:single_aggregationBest for: Special aggregate-only questions, benchmarking
- How it works
- Use cases
Instead of aggregating user forecasts, this method uses a single pre-computed aggregate forecast. This is typically used for:
- Reference forecasts
- Baseline predictions
- Historical aggregates
- External model predictions
Metaculus Prediction
Method:metaculus_predictionBest for: Official Metaculus forecast, enhanced aggregation
- How it works
- Advantages
A proprietary Metaculus aggregation method that may incorporate:
- Advanced weighting schemes
- Historical forecaster performance
- Calibration adjustments
- Extremization
- Other enhancements to standard aggregation
Setting the Aggregation Method
Question authors set the default aggregation method using thedefault_aggregation_method field:
Guidelines for Choosing
Use Recency Weighted when:
Use Recency Weighted when:
- Question is open for weeks or months
- Information environment is dynamic
- You want to encourage regular updates
- This is the default for good reason - it works well for most questions
Use Unweighted when:
Use Unweighted when:
- Question closes within 24-72 hours
- Forecasting happens live during an event
- You want maximum stability
- You’re doing spot predictions
Use Single Aggregation when:
Use Single Aggregation when:
- You’re providing a reference forecast
- You want to preserve a historical aggregate
- You’re incorporating an external model
Use Metaculus Prediction when:
Use Metaculus Prediction when:
- You want Metaculus’s best aggregate
- Performance weighting is important
- You trust Metaculus’s aggregation research
How Aggregation Works by Question Type
Different question types aggregate differently:Binary Questions
Aggregates the probability values:pᵢ is a forecaster’s probability estimate.
- Recency Weighted
- Unweighted
wᵢ is the recency weight for forecast i.Multiple Choice Questions
Aggregates probability distributions across options:When options change (added/removed), forecasts are automatically adjusted proportionally to maintain valid probability distributions.
Continuous Questions (Numeric, Date, Discrete)
Aggregates probability distributions (CDFs):- Each forecast is represented as a cumulative distribution function (CDF)
- CDFs are combined using the chosen method
- The result is a smooth aggregate CDF representing the community’s collective belief
- 25th percentile: 25% of the aggregate probability is below this value
- 50th percentile (median): The community’s central estimate
- 75th percentile: 75% of the aggregate probability is below this value
Aggregate Forecast Storage
Aggregates are stored in theAggregateForecast model:
Aggregate Statistics
Beyond the central aggregate, Metaculus computes additional statistics:Forecaster Count
The number of unique forecasters whose predictions contribute to the current aggregate.Percentile Ranges
For continuous questions:- 25th-75th percentile range: The middle 50% of the aggregate distribution
- 5th-95th percentile range: The middle 90% of the aggregate distribution
Histogram
A 100-bin histogram showing the shape of the aggregate distribution, useful for visualization.Community Prediction Visibility
Some questions hide the CP until a reveal time (cp_reveal_time):
Bot Forecasts
Questions can include or exclude bot forecasts from aggregates:- False (default): Only human forecasts are aggregated
- True: Bot forecasts are included in the CP
Bot forecasts are marked with
is_bot=True on the user account. Some bots are marked as is_primary_bot=True for special treatment.Aggregation Performance
Research shows that aggregated forecasts typically:- Outperform most individual forecasters
- Are well-calibrated when the crowd is diverse
- Improve with more forecasters (up to a point)
- Benefit from forecaster diversity in methods and information
When Aggregation Works Best
High Quality
- Many active forecasters (50+)
- Diverse viewpoints and methods
- Regular updates
- Clear resolution criteria
Conditions Matter
- Adequate information available
- Forecasters have domain knowledge
- No systematic biases
- Sufficient time horizon
When Aggregation Struggles
- Very few forecasters (less than 5)
- Highly correlated information sources
- Groupthink or information cascades
- Ambiguous resolution criteria
- Black swan events with no base rates
Best Practices
For forecasters
For forecasters
- Forecast independently: Form your view before checking the CP
- Update regularly: Keep your forecast current, especially with recency weighting
- Diversify your approach: Use different methods than others might
- Explain your reasoning: Help others understand alternative perspectives
For question authors
For question authors
For tournament organizers
For tournament organizers
- Standardize methods: Use consistent aggregation across similar questions
- Document choices: Explain why you chose each method
- Test alternatives: Compare methods on resolved questions
- Consider scoring: Some scoring methods work better with certain aggregations
API Access
You can retrieve aggregate forecasts via the API:Next Steps
Making Predictions
Learn how to submit your own forecasts
Question Types
Understand different prediction formats
Track Record
See how aggregates are scored
API Reference
Access aggregate data programmatically
