Primate Labs has released Geekbench AI 1.0, a benchmarking app for machine learning and other AI workloads, in an effort to standardize performance ratings across platforms. The app is a successor to Geekbench ML and aims to provide a clear understanding of its purpose and function. OpenAI has also announced a new benchmark, SWE-bench Verified, which uses human validation to assess the effectiveness of AI models in solving real-world problems.
