List of Flash News about Humanity’s Last Exam
Time | Details |
---|---|
2025-08-01 11:10 |
Google DeepMind AI Model Achieves State-of-the-Art Performance on LiveCodeBench V6 and Humanity’s Last Exam
According to Google DeepMind, its latest AI model delivers state-of-the-art results on LiveCodeBench V6, a benchmark of competitive coding performance, and on Humanity’s Last Exam, which tests expertise across multiple domains, including science. The advance signals growing AI capabilities that could boost automation in financial software and crypto algorithmic trading, potentially affecting the pace and efficiency of the cryptocurrency market. Source: Google DeepMind |
2025-03-25 17:06 |
Gemini 2.5 Pro Experimental Achieves Leading Scores in Math and Science Benchmarks
According to Google DeepMind, Gemini 2.5 Pro Experimental achieved leading scores on the math and science benchmarks GPQA and AIME 2025 without test-time optimizations. It also scored 18.8% on Humanity’s Last Exam, showcasing its reasoning and knowledge capabilities. |