Be a part of our every day and weekly newsletters for the most recent updates and unique content material on industry-leading AI protection. Be taught Extra
Shock! Simply days after experiences emerged suggesting OpenAI was shopping for white-hot coding startup Windsurf, the previous firm seems to be launching its personal competitor service as a analysis preview below its model title Codex, going head-to-head in opposition to Windsurf, Cursor, and the rising listing of AI coding instruments provided by startups and enormous tech corporations together with Microsoft and Amazon.
In contrast to OpenAI’s earlier Codex code completion AI mannequin, the brand new model is a full cloud-based AI software program engineering (SWE) agent constructed atop a fine-tuned model of OpenAI’s o3 reasoning mannequin that may execute a number of growth duties in parallel.
Beginning right this moment it is going to be accessible for ChatGPT Professional, Enterprise, and Staff customers, with assist for Plus and Edu customers anticipated quickly.
Codex’s evolution: from mannequin to autonomous AI coding agent
This launch marks a major step ahead in Codex’s growth. The unique Codex debuted in 2021 as a mannequin for translating pure language into code accessible by way of OpenAI’s nascent software programming interface.
It was the engine behind GitHub Copilot, the favored autocomplete-style coding assistant designed to work inside IDEs like Visible Studio Code.
That preliminary iteration targeted on code technology and completion, skilled on billions of traces of public supply code.
Nevertheless, the early model got here with limitations. It was susceptible to syntactic errors, insecure code ideas, and biases embedded in its coaching knowledge. Codex sometimes proposed superficially appropriate code that failed functionally, and in some circumstances, made problematic associations primarily based on prompts.
Regardless of these flaws, it confirmed sufficient promise to determine AI coding instruments as a quickly rising product class. That unique mannequin has since been deprecated and was the title of a brand new suite of merchandise, in line with an OpenAI spokesperson.
GitHub Copilot formally transitioned off OpenAI’s Codex mannequin in March 2023, adopting GPT-4 as a part of its Copilot X improve to allow deeper IDE integration, chat capabilities, and extra context-aware code ideas.
Agentic visions
The brand new Codex goes far past its predecessor. Now constructed to behave autonomously over longer durations, Codex can write options, repair bugs, reply codebase-specific questions, run checks, and suggest pull requests—every process operating in a safe, remoted cloud sandbox.
The design displays OpenAI’s broader ambition to maneuver past fast solutions and into collaborative work.
Josh Tobin, who leads the Brokers Analysis Staff at OpenAI, mentioned throughout a latest briefing: “We consider brokers as AI programs that may function in your behalf for an extended time frame to perform massive chunks of labor by interacting with the true world.” Codex suits squarely into this definition. “Our imaginative and prescient is that ChatGPT will change into nearly like a digital coworker—not simply answering fast questions, however collaborating on substantial work throughout a spread of duties,” he added.
Figures launched by OpenAI present that the brand new Codex-1 SWE agent outperforms all of OpenAI’s newest reasoning fashions on inside SWE duties.
New capabilities, new interface, new workflows
Codex duties are initiated by way of a sidebar interface in ChatGPT, permitting customers to immediate the agent with duties or questions.
The agent processes every request in an air-gapped atmosphere loaded with the person’s repository and configured to reflect the event setup. It logs its actions, cites take a look at outputs, and summarizes adjustments—making its work traceable and reviewable.
Alexander Embiricos, head of OpenAI’s Desktop & Brokers workforce (and the previous CEO and co-founder of screenshare collaboration startup Multi that OpenAI acquired for an undisclosed sum final yr) mentioned in a briefing with journalists that “the Codex agent is a cloud-based software program engineering agent that may work on many duties in parallel, with its personal laptop to run safely and independently.”
Internally, he mentioned, engineers already use it “like a morning to-do listing—fireplace off duties to Codex and return to a batch of draft options able to assessment or merge.”
Codex additionally helps configuration by way of AGENTS.md information—project-level guides that educate the agent find out how to navigate a codebase, run particular checks, and comply with home coding kinds.
“We skilled our mannequin to learn code and infer fashion—like whether or not or to not use an Oxford comma—as a result of code fashion issues as a lot as correctness,” Embiricos mentioned.
Safety and sensible use
Codex executes duties with out web entry, drawing solely on user-provided code and dependencies. This design ensures safe operation and minimizes potential misuse.
“That is greater than only a mannequin API,” mentioned Embiricos. “As a result of it runs in an air-gapped atmosphere with human assessment, we may give the mannequin much more freedom safely.”
OpenAI additionally experiences early exterior use circumstances. Cisco is evaluating Codex for accelerating engineering work throughout its product traces. Temporal makes use of it to run background duties like debugging and take a look at writing. Superhuman leverages Codex to enhance take a look at protection and allow non-engineers to recommend light-weight code adjustments. Kodiak, an autonomous automobile agency, applies it to enhance code reliability and acquire insights into unfamiliar stack elements.
OpenAI can also be rolling out updates to Codex CLI, its light-weight terminal agent for native growth. The CLI now makes use of a smaller mannequin—codex-mini-latest—optimized for low-latency enhancing and Q&A.
The pricing is about at $1.50 per million enter tokens and $6 per million output tokens, with a 75% caching low cost. Codex is presently free to make use of in the course of the rollout interval, with price limits and on-demand pricing choices deliberate.
Does this imply OpenAI IS NOT shopping for Windsurf? *Considering face emoji*
The discharge of Codex comes amid elevated competitors within the AI coding instruments house—and alerts that OpenAI is intent on constructing, relatively than shopping for, its subsequent part of merchandise.
In keeping with latest knowledge from SimilarWeb, visitors to developer-focused AI instruments has surged by 75% over the previous 12 weeks, underscoring the rising demand for coding assistants as important infrastructure relatively than experimental add-ons.
Experiences from TechCrunch and Bloomberg recommend OpenAI held acquisition talks with fast-growing AI dev device startups Cursor and Windsurf. Cursor allegedly walked away from the desk; Windsurf reportedly agreed in precept to be acquired by OpenAI for a worth of $3 billion, although no deal has been formally confirmed by both OpenAI or Windsurf.
Simply yesterday, in actual fact, Windsurf debuted its circle of relatives of coding-focused basis fashions, SWE-1, purpose-built to assist the complete software program engineering lifecycle, from debugging to long-running challenge upkeep. SWE-1 fashions have been reported customized made, skilled totally in-house utilizing a brand new sequential knowledge mannequin tailor-made to real-world growth workflows.
Many issues could also be occurring behind the scenes between the 2 corporations, however to me, the timing of Windsurf launching its personal coding basis mannequin — as a substitute of its technique to-date of utilizing Llama variants and giving customers the choice to fit in OpenAI and Anthropic fashions — adopted someday later by OpenAI releasing its personal Windsurf competitor, appears to recommend the 2 usually are not aligning quickly.
However then again, the truth that this new Codex AI SWE agent is in “analysis preview” to start out could also be a type of OpenAI pressuring Windsurf or Cursor or anybody else to return to the bargaining desk and strike a deal. Requested concerning the potential for a Windsurf acquisition and experiences of 1 thereof, an OpenAI spokesperson informed VentureBeat they’d nothing to share on that entrance.
In both case, Embiricos frames Codex as excess of a mere code device or assistant.
“We’re about to bear a seismic shift in how builders work with brokers—not simply pairing with them in actual time, however totally delegating duties,” he mentioned. “The primary experiments have been simply reasoning fashions with terminal entry. The expertise was magical—they began doing issues for us.”
Constructed for dev groups, not merely solo devs
Codex is designed with skilled builders in thoughts, however Embiricos famous that even product managers have discovered it useful for suggesting or validating adjustments earlier than pulling in human SWEs. This versatility displays OpenAI’s technique of constructing instruments that increase productiveness throughout technical groups.
Trini, an engineering lead on the challenge, summarized the broader ambition behind Codex: “This can be a transformative change in how software program engineers interface with AI and computer systems typically. It amplifies every particular person’s potential.”
OpenAI envisions Codex because the centerpiece of a brand new growth workflow the place engineers assign high-level duties to brokers and collaborate with them asynchronously. The corporate is constructing towards deeper integrations throughout GitHub, ChatGPT Desktop, difficulty trackers, and CI programs. The long-term objective is to mix real-time pairing and long-horizon process delegation right into a seamless growth expertise.
As Josh Tobin put it, “Coding underpins so many helpful issues throughout the financial system. Accelerating coding is a very high-leverage solution to distribute the advantages of AI to humanity, together with ourselves.”
Whether or not or not OpenAI closes offers for rivals, the message is evident: Codex is right here, and OpenAI is betting by itself brokers to steer the subsequent chapter in developer productiveness.
Day by day insights on enterprise use circumstances with VB Day by day
If you wish to impress your boss, VB Day by day has you coated. We provide the inside scoop on what corporations are doing with generative AI, from regulatory shifts to sensible deployments, so you possibly can share insights for optimum ROI.
Thanks for subscribing. Take a look at extra VB newsletters right here.
An error occured.