Back to Articles
Microsoft lets one AI model critique the other's responses

iTnews

READ

Details

Date Published
30 Mar 2026
Priority Score
3
Australian
Yes
Created
30 Mar 2026, 08:00 pm

Authors (1)

Description

Updates Copilot's Researcher ‌agent.

Summary

Microsoft has introduced a 'Critique' feature for its Copilot Researcher agent that utilizes multi-model safety protocols, specifically having Anthropic's Claude review outputs from OpenAI's GPT for accuracy. This development represents a commercial implementation of 'Constitutional AI' or 'AI-critiquing-AI' frameworks designed to reduce hallucinations and improve output reliability in frontier models. While primarily a productivity enhancement, the use of cross-model verification is a notable governance step in managing the unpredictable behaviors of autonomous agents. The integration reflects a growing trend Toward agentic AI systems that employ internal checks and balances to mitigate the risks of misinformation and unreliable autonomous reasoning.

Body

Microsoft has unveiled new features ⁠in ⁠its Copilot research assistant that would allow users to utilise multiple AI models simultaneously within the same workflow, the latest move by the tech giant to improve its AI offering and ‌boost adoption. In a new feature called "Critique", Copilot's Researcher ‌agent ‌will now be able to pull outputs from ‌both OpenAI's GPT and Anthropic's Claude models ⁠for every response, rather than relying on a single model. While GPT generates the response, Claude will review the output for accuracy and quality before presenting it to the user, Microsoft said. The ​company expects to make that workflow bi-directional in the future, allowing GPT to review Claude's drafts as well. "Having ⁠various different models from different vendors in Copilot is highly attractive - but we're taking this to the next level, where customers actually get the benefits of the models working together," Nicole Herskowitz, corporate vice president of Microsoft 365 and Copilot, said in an interview with Reuters. The multi-model approach will help speed up user workflow, keep in check AI hallucinations - where systems generate false information - and produce ​more reliable outputs, boosting productivity and quality ⁠for customers, Herskowitz added. Microsoft is also launching 'Council', ⁠a feature that will allow users to compare responses from different AI models side-by-side. The ​upgrades come as Microsoft makes its new Copilot Cowork agentic AI tool ‌more widely ⁠available to members in its 'Frontier' program, which provides customers with early access to some of its latest AI features. Microsoft unveiled Copilot Cowork - a tool based ​on Anthropic's viral Claude Cowork product - in testing mode earlier this month, capitalizing on the growing demand for autonomous AI agents. The Windows maker has been racing to improve ‌its Copilot ⁠assistant to drive better adoption ​amid intense competition from rivals including Google's Gemini and autonomous agents such as Claude Cowork.