OpenAI is launching a brand new common function AI agent in ChatGPT, which the corporate says can full all kinds of computer-based duties on behalf of customers. OpenAI says the agent can robotically navigate a consumer’s calendar, generate editable displays and slideshows, and run code.
The software, known as ChatGPT agent, combines a number of capabilities from OpenAI’s earlier agentic instruments, together with Operator’s means to click on round on web sites, in addition to Deep Analysis’s means to synthesize data from dozens of internet sites right into a concise analysis report. OpenAI says customers will have the ability to work together with the agent just by prompting ChatGPT in pure language.
On Thursday, OpenAI is rolling out ChatGPT agent for subscribers to its Professional, Plus, and Staff plans. To activate the software, customers can choose “agent mode” in ChatGPT’s dropdown menu of instruments.
The launch of ChatGPT agent represents OpenAI’s boldest try but to show ChatGPT into an agentic product that may take actions and offload duties for customers, slightly than simply answering questions. Lately, Silicon Valley firms together with OpenAI, Google, and Perplexity have unveiled dozens of AI brokers which have promised to just do that. Nonetheless, these early model of AI brokers have confirmed to battle with advanced duties, and appear much less compelling as merchandise than the last word imaginative and prescient tech executives pitch round AI brokers.
That stated, OpenAI says ChatGPT agent is way extra succesful than its earlier choices.
OpenAI’s new agent can entry ChatGPT connectors, permitting customers to attach apps like Gmail and GitHub in order that the agent can discover related data to your prompts. Moreover, OpenAI says ChatGPT agent has entry to a terminal, and might use APIs to entry sure apps.
The mannequin underlying ChatGPT agent affords state-of-the-art efficiency on a number of benchmarks, in accordance with OpenAI.
Techcrunch occasion
San Francisco
|
October 27-29, 2025
The corporate says the ChatGPT agent mannequin scores 41.6% on Humanity’s Final Examination (go@1), a tough check made up of hundreds of questions throughout multiple hundred topics. That’s roughly double what OpenAI’s o3 and o4-mini scored on the check.
On FrontierMath, one of many hardest identified math benchmarks, OpenAI says ChatGPT agent scores 27.4% when it has entry to instruments, comparable to a terminal for code execution. The earlier state-of-the-art rating comes from o4-mini, which scored simply 6.3%.
OpenAI notes that it developed ChatGPT agent with security in thoughts, largely as a result of the product presents some newfound capabilities that would make it extra harmful within the arms of a nasty actor. How succesful ChatGPT agent actually is, nonetheless, stays to be seen.