Conversation

Collaborator

@jordanhunt22 jordanhunt22 commented Feb 26, 2025

Updates guidelines.py and clarifies the wording in some TASK.txt files.

These changes decrease performance on gpt-4o by ~5% but increase it on other models:

  • o3: +13%
  • claude 3.5: +2%


2 participants