Skip to content

Add "agentic" model type to submission metadata & leaderboard (similar to GIFT-Eval) #153

Description

@Jameswlepage

Add an agentic value to submission-level model-type metadata so agentic forecasting systems can self-declare and be filtered on the leaderboard. Metadata only. No change to tasks, metrics, EvaluationWindow, or aggregation.

Motivation
Agentic systems (tool-using, multi-step pipelines that produce forecasts) are now a distinct submission class, not plain pretrained or fine-tuned models. GIFT-Eval added an Agentic model type for this reason, and it gets used: the Moirai 2.0 paper filters out agentic solutions to get a clean foundation-model comparison. Without a tag, agentic submissions get excluded informally or silently skew a board readers assume is model v model.

Approach

  • Extend the submission-level model_type field with an agentic value, surfaced as a filterable leaderboard column.
  • Submission-level, not task-level, so it describes how the forecast was produced, orthogonal to the task taxonomy.
  • Same scoring and statistical treatment as everything else. The tag only enables include or exclude, not a separate ranking.

Happy to send a PR for this.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions