Overview
Evaluate OpenSymbolicAI against LiveCodeBench, a continuously refreshed coding benchmark, designed to be contamination-free, that sources problems from LeetCode, AtCoder, and Codeforces.
Why this benchmark
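- Contamination-free: problems are refreshed regularly and carry release dates, so a model can be scored only on problems published after its training cutoff, which limits any memorization advantage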
- Tests code generation, self-repair, code execution, and test output prediction
- Well-known leaderboard tracked by Artificial Analysis
- Demonstrates that OpenSymbolicAI works on algorithmic problem-solving, not just tool orchestration
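
The contamination point above comes down to date-based filtering: keep only problems released after the model's training cutoff. A minimal sketch of that idea follows. The `Problem` record, its field names, the sample problems, and the `after_cutoff` helper are all hypothetical illustrations, not LiveCodeBench's actual schema or API.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Problem:
    # Hypothetical record; field names are illustrative, not LiveCodeBench's schema.
    title: str
    source: str
    release_date: date


def after_cutoff(problems, cutoff):
    """Keep only problems released strictly after the model's training cutoff."""
    return [p for p in problems if p.release_date > cutoff]


# Made-up sample problems, one per source platform.
problems = [
    Problem("two-pointers", "LeetCode", date(2023, 6, 1)),
    Problem("grid-paths", "AtCoder", date(2024, 3, 15)),
    Problem("tree-queries", "Codeforces", date(2024, 9, 2)),
]

# Assume a model whose training data ends on 2024-01-01.
fresh = after_cutoff(problems, cutoff=date(2024, 1, 1))
print([p.title for p in fresh])  # ['grid-paths', 'tree-queries']
```

Scoring only on the `fresh` subset is what makes results comparable across models with different cutoffs; a real evaluation would pick the cutoff per model.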
References
Tasks