Technological Advances in AIAgent-Based AIInnovation & Competitiveness Through AI

ChatGPT Agent: OpenAI Introduces an AI Capable of Planning, Executing… and Learning

Since the emergence of large language models, conversational artificial intelligence has made rapid strides in understanding, content generation, and interactivity. But until now, assistants like ChatGPT, Claude, and Gemini have been limited to the role of a conversational partner, unable to take direct action within a digital environment.

With the launch of ChatGPT Agent, OpenAI has reached a major milestone. It is no longer just a conversational tool, but an autonomous agent capable of planning actions, executing them in a secure virtual environment, and learning from the results. In other words, it is a system capable of interacting with software, files, or APIs without constant human supervision.

This strategic shift comes amid intense competition. In 2024, more than 45% of digital professionals reported using an AI assistant in their daily work, but only 12% relied on agents capable of performing autonomous tasks1. Demand for more operational systems capable of taking action is growing rapidly.

Unlike previous versions of ChatGPT, this new version uses a virtual workspace (sandbox) in which the model can perform a sequence of actions, interact with files, use tools such as a web browser, text editor, or terminal, and, most importantly, plan multiple steps to achieve a goal.

ChatGPT Agent can now execute a script, read the results, interpret them, and adjust its approach if necessary. This perception-action loop logic draws inspiration from research on cognitive agents and brings generative AI closer to executive AI.

OpenAI notes that this process is strictly regulated; the environment is isolated from the user’s system to ensure the security, confidentiality, and traceability of the actions performed.

ChatGPT Agent's features open up new use cases, particularly in the following areas:

  • automation of digital tasks such as file sorting, report generation, or database updates
  • integration with third-party services (email, APIs, project management tools)
  • data analysis through file import and the use of scripts
  • assisted programming, bug fixing, documentation generation
  • synthesis of information from various sources, resulting in operational deliverables

According to initial laboratory tests conducted by OpenAI, ChatGPT Agent is reportedly capable of completing certain sequences of actions on average 30% faster than a human trained in simple file-handling tasks2.

Despite its advances, ChatGPT Agent still has several limitations:

  • planning that is sometimes imprecise, particularly in multi-step scenarios
  • vulnerability to unforeseen events, data errors, or execution errors
  • requires human oversight, particularly to validate certain critical decisions
  • risk of ambiguous interpretation of instructions written in natural language

In April 2025, a report from the Stanford Center for Research on Foundation Models indicated that AI agents succeeded on average at 48% of complex, multi-step tasks, but their success rate dropped to 33% when no assistance or clarification was provided3.

The emergence of agents capable of operating in digital environments raises several key questions:

  • liability for actions taken by the agent
  • legal framework for permits and technical capabilities
  • transparency requirements regarding decisions made by the machine
  • prevention of misuse or unintended use

These concerns align with the European Union’s considerations under the AI Act, which identifies autonomous systems with the capacity to act as high-risk AI. At the European level, nearly 61% of respondents to a 2024 public consultation expressed support for strict regulation of AI agents capable of modifying systems or accessing data4.

With ChatGPT Agent, OpenAI is no longer limited to text generation but is ushering in a convergence of understanding, planning, and execution. This development paves the way for new forms of digital intelligence, capable not only of producing content but also of taking action within a structured framework.

This innovation could transform professional, educational, and organizational practices. It also raises fundamental questions about the governance of autonomous systems, shared responsibility, and the role of humans in the decision-making process.

See also on our blog ChatGPT Introduces Connectors: Toward AI Integrated into Business Tools, an article that explores how the OpenAI ecosystem is evolving toward more autonomous agents connected to business applications.

1. McKinsey & Company. (2024). The State of AI in 2024.
https://www.mckinsey.com/

2. OpenAI. (2025). Introducing ChatGPT Agents: Early Results and Capabilities.
https://openai.com/

3. Stanford CRFM. (2025). Evaluating the Performance and Limitations of Autonomous AI Agents.
https://crfm.stanford.ed/

4. European Commission. (2024). Results of the Public Consultation on the AI Act.
https://digital-strategy.ec.europa.eu/

Don't miss our upcoming articles!

Get the latest articles written by aivancity experts and professors delivered straight to your inbox.

We don't send spam! Please see our privacy policy for more information.

Don't miss our upcoming articles!

Get the latest articles written by aivancity experts and professors delivered straight to your inbox.

We don't send spam! Please see our privacy policy for more information.

Related posts
Agent-Based AI

With Personal Computer, Perplexity aims to turn your Mac into a permanent AI agent

Artificial intelligence continues to become an integral part of personal computing. Perplexity, a company known for its AI-powered conversational search engine, is now exploring a new frontier: transforming a personal computer into an intelligent agent…
Agent-Based AI

Musk unveils “Macrohard,” a joint AI project between Tesla and xAI aimed at transforming software

Elon Musk continues to explore new avenues in the field of artificial intelligence. After developing the Grok model with his xAI lab and accelerating work on the Optimus humanoid robot at Tesla, the…
Technological Advances in AI

Claude Code Voice: Anthropic finally lets you control your code with your voice

Artificial intelligence is gradually transforming the way developers interact with their programming environment. Following the emergence of code assistants capable of suggesting or generating entire functions, a new phase is taking shape: the…
The AI Clinic

Would you like to submit a project to the AI Clinic and work with our students?

Leave a comment

Your email address will not be published. Required fields are marked with *