Instead of relying on specialized APIs, the system uses screenshots for visual input and virtual mouse and keyboard actions to complete tasks.
On Thursday, OpenAI released a research preview of "Operator," a web automation tool that uses a new AI model called Computer-Using Agent (CUA) to control computers through a visual interface. The ...
The new tool, called Operator, can shop for groceries or book a restaurant reservation. But it still needs help from humans.
It can also ask follow-up questions, such as requesting login information for other websites, to further personalize the tasks it completes. Users can take control of the screen at any time.
The company says CUA’s reasoning technique, which it calls an “inner monologue,” helps the model understand intermediate steps and adapt to unexpected input. Under the hood, CUA takes screenshots ...
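The loop these reports describe (look at a screenshot, reason about the next step, issue a virtual mouse or keyboard action, repeat) can be illustrated with a toy sketch. Everything here is an assumption made for illustration: `Action`, `FakeScreen`, `toy_policy`, and `run_agent` are hypothetical names, the "screenshot" is a text description rather than pixels, and a hard-coded rule set stands in for the model. It is not OpenAI's implementation or API.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    """One virtual input event (hypothetical structure, not OpenAI's format)."""
    kind: str          # "click", "type", or "done"
    x: int = 0
    y: int = 0
    text: str = ""
    thought: str = ""  # stand-in for the "inner monologue" note for this step


@dataclass
class FakeScreen:
    """Toy stand-in for a browser page: a search box at (10, 10), a submit button at (90, 10)."""
    query: str = ""
    focused: bool = False
    submitted: bool = False

    def screenshot(self) -> str:
        # A real agent would receive pixels; a text description stands in here.
        return f"query={self.query!r} focused={self.focused} submitted={self.submitted}"

    def apply(self, action: Action) -> None:
        # Translate a virtual mouse/keyboard action into a state change.
        if action.kind == "click" and (action.x, action.y) == (10, 10):
            self.focused = True
        elif action.kind == "click" and (action.x, action.y) == (90, 10):
            self.submitted = bool(self.query)
        elif action.kind == "type" and self.focused:
            self.query += action.text


def toy_policy(observation: str, goal: str) -> Action:
    """Hard-coded stand-in for the model: maps an observation to the next action."""
    if "submitted=True" in observation:
        return Action("done", thought="Results are showing; task complete.")
    if "focused=False" in observation:
        return Action("click", 10, 10, thought="Focus the search box first.")
    if f"query={goal!r}" not in observation:
        return Action("type", text=goal, thought="Box is focused; type the query.")
    return Action("click", 90, 10, thought="Query entered; press submit.")


def run_agent(screen: FakeScreen, goal: str,
              policy: Callable[[str, str], Action] = toy_policy,
              max_steps: int = 10) -> List[Action]:
    """Observe, decide, act, and repeat until the policy reports it is done."""
    log: List[Action] = []
    for _ in range(max_steps):
        action = policy(screen.screenshot(), goal)
        log.append(action)
        if action.kind == "done":
            break
        screen.apply(action)
    return log


if __name__ == "__main__":
    for step in run_agent(FakeScreen(), "milk"):
        print(step.kind, "-", step.thought)
```

Because the policy re-reads the screen before every step instead of following a fixed script, it can react to whatever state it finds, which is the property the "adapt to unexpected input" claim points at. The `max_steps` cap is a design choice of this sketch so that a confused policy cannot loop forever.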
OpenAI is best known for its AI models, which to date exist primarily on cloud servers, its website and in its apps for PCs and mobile devices.
Dan Shipper and Alex Duffy in Chain of Thought: Today, OpenAI announced Operator, a new research preview of ChatGPT that acts as ...
That being said, other leakers have found traces of Operator, a.k.a. OpenAI's agentic system, via the Mac version of the ChatGPT app, with references to the tool already available on the company's ...
Sam Altman's OpenAI is backing a new AI initiative from a group where John Kerry was a founding member that has voiced ...
A series of job listings for the ChatGPT maker's robotics team suggests the company is finally ready to leap into hardware.