Skip to content

Tracks which OpenAI Eval contributions are being accepted

License

Notifications You must be signed in to change notification settings

AI-RandD/openai-eval-tracker

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 

Repository files navigation

Update 5/31/2023: It looks like they've assigned specific engineers to do code reviews on the Evals repo so significantly more Evals are getting accepted. They seem to be responding to most PRs. I'm going to stop maintaining this since OpenAI is being more communicative about and involved in the acceptance process.

OpenAI Eval Tracker

With the significant improvements that come with GPT-4, there is massive demand for access to the API. There is a waitlist here, but there's no guarantee of when access will be granted. OpenAI has provided a way to get earlier access, however, through contributions to its recently open-sourced Evals repo (see this note and this article). The standards are vague, noting that access will be granted to "exceptional model evaluations." A little more information can be found in the PR template. Looking at the active PRs and which ones have been accepted, there seems to be little feedback on the quality of a submission, and the ones that are accepted appear to be the ones that the models perform poorly on and the moderators deem "interesting" or "clever" (see this comment, this comment, or this comment). Some PRs sit for days while others are approved almost immediately.

To help people that are looking for inspiration for good Evals and to make sure they don't work on one that has already been merged, I decided to create this repo that tracks all of the Evals that have been accepted by OpenAI.

Accepted Evals

Format: <date-merged> <short-title>: <link-to-PR>

About

Tracks which OpenAI Eval contributions are being accepted

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published