Add an agentic value to submission-level model-type metadata so agentic forecasting systems can self-declare and be filtered on the leaderboard. Metadata only. No change to tasks, metrics, EvaluationWindow, or aggregation.
Motivation
Agentic systems (tool-using, multi-step pipelines that produce forecasts) are now a distinct submission class, not plain pretrained or fine-tuned models. GIFT-Eval added an Agentic model type for this reason, and it gets used: the Moirai 2.0 paper filters out agentic solutions to get a clean foundation-model comparison. Without a tag, agentic submissions either get excluded informally or silently skew a board that readers assume is model vs. model.
Approach
- Extend the submission-level model_type field with an agentic value, surfaced as a filterable leaderboard column.
- Submission-level, not task-level, so it describes how the forecast was produced, orthogonal to the task taxonomy.
- Same scoring and statistical treatment as everything else. The tag only lets readers include or exclude agentic submissions; it does not create a separate ranking.
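
A rough sketch of how the tag could behave, to make the scope concrete. Everything here is an assumption for illustration: the ModelType enum, the existing pretrained and fine-tuned values, the Submission record, the filter_board helper, and the placeholder names and scores are hypothetical and not taken from the current codebase.

```python
from dataclasses import dataclass
from enum import Enum


class ModelType(str, Enum):
    # Existing values are assumed for illustration; only AGENTIC is proposed here.
    PRETRAINED = "pretrained"
    FINE_TUNED = "fine-tuned"
    AGENTIC = "agentic"


@dataclass(frozen=True)
class Submission:
    name: str
    model_type: ModelType  # submission-level metadata, independent of any task
    score: float           # same metric for every submission (lower is better here)


def filter_board(rows, exclude=frozenset()):
    """Drop excluded model types, then rank the rest with one shared ordering."""
    kept = [r for r in rows if r.model_type not in exclude]
    return sorted(kept, key=lambda r: r.score)


# Placeholder entries, not real results.
board = [
    Submission("model-a", ModelType.PRETRAINED, 0.71),
    Submission("pipeline-b", ModelType.AGENTIC, 0.65),
    Submission("model-c", ModelType.FINE_TUNED, 0.68),
]

full_board = filter_board(board)
foundation_only = filter_board(board, exclude={ModelType.AGENTIC})
```

The filter only removes rows. Scoring and ordering are identical with or without it, which is the "include or exclude, not a separate ranking" behaviour described above.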
Happy to send a PR for this.