Elon Musk’s AI company, xAI, late on Wednesday launched its latest flagship AI model, Grok 4, and unveiled a new $300-per-month AI subscription plan, SuperGrok Heavy.
Grok is xAI’s answer to models like OpenAI’s ChatGPT and Google’s Gemini, and can analyze images and respond to questions. In recent months, Grok has become more deeply integrated into Musk’s social network, X, which was recently acquired by xAI. However, that has also put Grok’s misbehavior front and center for millions of users.
Expectations are high for Grok 4. The latest AI model from xAI will be stacked up against OpenAI’s forthcoming AI model, GPT-5, which is expected to launch later this summer.
“With respect to academic questions, Grok 4 is better than PhD level in every subject, no exceptions,” said Elon Musk during a livestream Wednesday night. “At times, it may lack common sense, and it has not yet invented new technologies or discovered new physics, but that’s just a matter of time.”
Musk wore a leather jacket and sat beside xAI leaders to launch Grok 4 (Credit: xAI)
The launch of Grok 4 comes amid a tumultuous week for Elon Musk’s companies. Earlier on Wednesday, Linda Yaccarino stepped down from her role as the CEO of X after roughly two years with the company. X has yet to announce her successor.
Yaccarino’s departure comes just days after Grok’s official, automated X account responded to users with antisemitic comments criticizing Hollywood’s “Jewish executives” and praising Hitler. xAI had to briefly limit Grok’s account and delete the offensive posts. In response to the incident, xAI appeared to have removed a recently added section from Grok’s public system prompt, a list of instructions for the AI chatbot to follow, that told it not to shy away from making “politically incorrect” claims.
Musk and xAI’s leaders largely avoided discussing the incident, instead focusing on Grok 4’s performance and capabilities.
xAI launched two models on Wednesday: Grok 4 and Grok 4 Heavy, the latter being the company’s “multi-agent version” that offers increased performance. Musk claimed that Grok 4 Heavy spawns multiple agents to work on a problem simultaneously, and then they all compare their work “like a study group” to find the best answer.
xAI claims that Grok 4 shows frontier-level performance on several benchmarks, including Humanity’s Last Exam, a challenging test measuring AI’s ability to answer thousands of crowdsourced questions on subjects like math, humanities, and natural science. According to xAI, Grok 4 scored 25.4% on Humanity’s Last Exam without “tools,” outperforming Google’s Gemini 2.5 Pro, which scored 21.6%, and OpenAI’s o3 (high), which scored 21%.
xAI claims that Grok 4 Heavy, with “tools,” was able to achieve a score of 44.4%, outperforming Gemini 2.5 Pro with tools, which scored 26.9%.
The nonprofit Arc Prize says that Grok 4 achieved a new state-of-the-art score of 16.2% on its ARC-AGI-2 test, another difficult benchmark that consists of puzzle-like problems where an AI has to identify visual patterns. That’s nearly twice the score of the next best commercial AI model, Claude Opus 4.
Grok 4’s performance on several academic benchmarks (Credit: xAI)
Alongside Grok 4 and Grok 4 Heavy, xAI launched its most expensive AI subscription plan yet, a $300-per-month subscription called SuperGrok Heavy. Subscribers to the plan will get an early preview of Grok 4 Heavy, as well as early access to new features. The plan is similar to ultra-premium tiers offered by OpenAI, Google, and Anthropic, but xAI now offers the most expensive subscription among major AI providers.
SuperGrok Heavy subscribers may get early access to some new products xAI plans to launch in the coming months. The company said Wednesday that an AI coding model is coming in August, a multi-modal agent in September, and a video generation model in October.
xAI is releasing Grok 4 through its API in an effort to get developers to build applications with the model. The company notes that xAI’s enterprise sector is only two months old; however, it plans to work with hyperscalers to make Grok available through their cloud platforms.
Despite Grok’s frontier-level performance on benchmarks, it may prove difficult for xAI to move past its recent mishaps as it tries to pitch Grok to businesses as a real competitor to ChatGPT, Claude, and Gemini. Whether businesses are ready to adopt Grok, flaws and all, remains to be seen.