Export Code

Initial Benchmark Results

Curran Kelleher

Last edited Mar 29, 2025
Created on Mar 29, 2025

Model Challenge Performance Visualization

This visualization displays the performance of different language models on various challenges.

  • X-axis: Challenges
  • Y-axis: Models
  • Color: Indicates pass (green), fail (red), or error (orange) status
MIT Licensed