Can AI really compete with human data scientists? OpenAI’s new benchmark puts it to the test
October 11, 2024
OpenAI has introduced a new tool to measure artificial intelligence capabilities in machine learning engineering. The benchmark, called MLE-bench, challenges AI systems with 75 real-world data science competitions from Kaggle, a popular platform for machine learning contests.
This benchmark emerges as tech companies intensify efforts to develop more capable AI systems. MLE-bench goes beyond…