Google DeepMind Claims 'Historic' AI Breakthrough in Problem Solving

The Guardian

SKIPPED

Details

Date Published
16 Sept 2025
Priority Score
4
Australian
No
Created
17 Sept 2025, 05:00 pm

Authors (1)

Description

Version of company’s Gemini 2.5 AI model solved complex real-world problem that stumped human programmers

Summary

Google DeepMind's Gemini 2.5 AI model achieved a major milestone by solving a complex real-world problem that had eluded human programmers, earning a gold medal at an international programming competition. Google compares the result to past AI achievements such as Deep Blue's chess victory and AlphaGo's performance in Go, and describes it as a significant step towards Artificial General Intelligence (AGI). The model is praised for its potential applications in various scientific fields, although some skepticism remains about the computing power required and the overall impact. The article strikes a cautious note, pointing to the pressure on leading tech companies to keep declaring breakthroughs in AI capabilities.

Body

The version of Google’s Gemini 2.5 AI model used in the competition was not the same as that available to an average subscriber to its $250-a-month Google AI Ultra service. Photograph: Bloomberg/Getty

Google DeepMind claims it has made a “historic” artificial intelligence breakthrough akin to the Deep Blue computer defeating Garry Kasparov at chess in 1997 and an AI beating a human Go champion in 2016.

A version of the company’s Gemini 2.5 AI model solved a complex real-world problem that stumped human computer programmers to become the first AI model to win a gold medal at an international programming competition held earlier this month in Azerbaijan.

In a performance that the tech company called a “profound leap in abstract problem-solving”, it took less than half an hour to work out how to weigh up an infinite number of possibilities in order to send a liquid through a network of ducts to a set of interconnected reservoirs.
The goal was to distribute it as quickly as possible.

None of the human teams, including the top performers from universities in Russia, China and Japan, got it right.

The AI failed two of the 12 tasks it was set, but its overall performance ranked it in second place out of 139 of the world’s strongest college-level computer programmers.

Google said it was a “historic moment, towards AGI [artificial general intelligence]”, which is widely considered human-level intelligence at a wide range of tasks.

“For me it’s a moment that is equivalent to Deep Blue for Chess and AlphaGo for Go,” said Quoc Le, Google DeepMind’s vice-president.

“Even bigger, it is reasoning more towards the real world, not just a constrained environment [like Chess and Go] … because of that I think this advance has the potential to transform many scientific and engineering disciplines.” He cited drug and chip design.

The model is a general purpose AI but was specially trained to solve very hard coding, maths and reasoning problems.
It performed “as well as a top 20 coder in the world”, Google said.

“Solving complex tasks at these competitions requires deep abstract reasoning, creativity, the ability to synthesise novel solutions to problems never seen before and a genuine spark of ingenuity,” the company said.

Garry Kasparov plays chess against IBM’s Deep Blue in New York in 1997. Photograph: Sipa/Shutterstock

Speaking before the details were made public, Stuart Russell, a professor of computer science at the University of California at Berkeley, said the “claims of epochal significance seem overblown”.

He said AI systems had been doing well on programming tasks for a while and the Deep Blue chess breakthrough had “essentially no impact on the real world of applied AI”.

However, he said “to get an ICPC [International Collegiate Programming Contest] question right, the code actually has to work correctly (at least on a finite number of test cases), so this performance may show progress towards making AI-based coding systems sufficiently accurate for producing high-quality code”.

He added: “The pressure on AI companies to keep claiming breakthroughs is enormous.”

Michael Wooldridge, Ashall professor of the foundations of artificial intelligence at the University of Oxford, said it sounded like an impressive achievement and “being able to solve problems at this level is exciting”.

But he questioned how much computing power was needed.
Google declined to say, apart from confirming it was more than is available to an average subscriber to its $250-a-month Google AI Ultra service using the lightweight version of Gemini 2.5 Deep Think in the Gemini App.

Dr Bill Poucher, executive director of the ICPC, said: “Gemini successfully joining this arena, and achieving gold-level results, marks a key moment in defining the AI tools and academic standards needed for the next generation.”

Four machine intelligence breakthroughs

1957 The Perceptron

Frank Rosenblatt, an academic at Cornell University, worked out that it should be possible to create a “perceiving and recognising automaton”. He named it the Perceptron and said an electronic system would be able to learn to recognise patterns in optical, electrical or tonal information “in a manner which may be closely analogous to the perceptual process of a biological brain”.

The following year he built the device, which was the size of a small room. It was considered one of the early breakthroughs in artificial intelligence based on neural networks.

1997 Deep Blue

In May 1997, IBM’s Deep Blue became the first computer system to defeat a reigning world chess champion in a match under standard tournament controls. It beat Garry Kasparov in what became an inflection point in computing power, but the contest was close.

Kasparov won the first game and Deep Blue the second, followed by three draws. Deep Blue won game 6 to secure the win. It showed how brute force computing power could create a system to defeat a human, albeit at a narrow task. “The computer is far stronger than anybody expected,” said Kasparov, conceding defeat.

2016 AlphaGo

Go is one of the most complex games ever devised, and one of the world’s master players was Lee Sedol, a South Korean professional.
In 2016, DeepMind, the UK AI company set up by Demis Hassabis, took him on with its computer AlphaGo.

It won 4-1 and some of its moves seemed to display truly original thinking. Move 37 in particular went down in lore. Hassabis said: “It might be the first glimpse of a bright and bold future where humanity harnesses AI as a powerful new tool, helping us discover new knowledge that can solve some of our most pressing scientific problems.”

2020 AlphaFold

Another breakthrough by Hassabis and DeepMind was an AI program that can predict how proteins fold into 3D shapes, a highly complex process fundamental to understanding life’s biological machinery. The Royal Society, the 360-year-old London scientific institution, called it “a stunning advance”.

When researchers know how a protein folds up, they can start to uncover mysteries such as how insulin controls sugar levels in the blood or how antibodies fight viruses. After further iterations, the system helped Hassabis and his colleague John Jumper share a Nobel prize for chemistry in 2024.