2501 Engine scores 96.951% on Full HumanEval Benchmark
Following our latest core updates, 2501 has achieved a remarkable score of 96.951% on the full HumanEval benchmark (29th June 2024). Entering the top 3 of the leaderboard demonstrates 2501's exceptional performance in generating code from natural language instructions. (Paper in progress, stay tuned for more details !)