OpenAI Unleashes GPT-5.2: A New Benchmark in AI Reasoning and Coding

Pasukan Editorial BigGo

OpenAI Unleashes GPT-5.2: A New Benchmark in AI Reasoning and Coding

In a rapid-fire response to mounting competition, OpenAI has officially launched GPT-5.2, its most advanced generative AI model to date. The release, coming less than a month after GPT-5.1, signals an intense phase in the AI arms race and delivers significant claimed improvements in reasoning, accuracy, and professional task performance. This article delves into what's new, how it performs, and what it means for users and the competitive landscape.

The Strategic Launch of GPT-5.2

OpenAI's announcement of GPT-5.2 on December 11th was framed by the company as a direct competitive move. CEO Sam Altman had previously declared a "red alert" status within the company following the impressive launch of Google's Gemini 3 model in November. This accelerated release cycle—from GPT-5 in August to GPT-5.1 in November and now GPT-5.2 in December—underscores the fierce pressure OpenAI feels from rivals like Anthropic and Google. Altman has indicated that the impact of Gemini 3 was less severe than initially feared and projected that OpenAI would exit its "red alert" state by January 2026 in a "very strong" position.

Key Improvements and Model Variants

GPT-5.2 is not a one-size-fits-all update. OpenAI has introduced three distinct versions tailored for different use cases, all now available to paying ChatGPT users (Plus, Pro, Go, Business, Enterprise) with API access for developers. The GPT-5.1 model will remain accessible for the next three months. The new lineup includes GPT-5.2 Instant, designed as a fast and efficient assistant for daily tasks with an improved conversational tone. GPT-5.2 Thinking is engineered for deep, complex work like coding, long-document analysis, and multi-step logic, boasting major advancements in these areas. Finally, GPT-5.2 Pro is positioned as the most intelligent and reliable option for high-stakes, complex problem-solving, with fewer major errors.

GPT-5.2 Model Variants & Focus:

Instant: Optimized for speed and daily tasks (queries, translations, technical writing).
Thinking: Designed for deep, complex work (coding, long-context analysis, logic).
Pro: Positioned as the most reliable for high-stakes, complex problem-solving.

Key Benchmark Claims:

GDPval (44 professions): 70.9% of tasks performed at or above human expert level.
SWE-bench Verified (Coding): 80% problem-solving rate (new record).
GPQA Diamond (Science): 93.2% accuracy (GPT-5.2 Pro).
FrontierMath (Expert Math): 40.3% of problems solved by GPT-5.2 Thinking (new record).

Pricing (API):

Input: USD 1.75 per million tokens
Output: USD 14 per million tokens
Cached Input: 90% discount

Benchmark Performance and "Human Expert" Claims

OpenAI has made bold claims about GPT-5.2's capabilities, backed by a suite of benchmark results. The company states that GPT-5.2 Thinking is its first model to perform at or above human expert levels in specific domains. On the GPQA Diamond benchmark for expert-level science questions, GPT-5.2 Pro achieved a 93.2% accuracy rate. In coding, GPT-5.2 Thinking set a new record on the SWE-bench Verified test, solving 80% of real-world software engineering tasks. Perhaps most notably, on a proprietary test (GDPval) covering knowledge work across 44 professions, the model's performance matched or exceeded that of industry experts 70.9% of the time, while operating over 11 times faster and at less than 1% of the cost.

Pricing, Accessibility, and the Disney Partnership

While the raw performance is headline-grabbing, practical considerations like cost and access are crucial. The GPT-5.2 API is priced at USD 1.75 per million input tokens and USD 14 per million output tokens, with a 90% discount for cached inputs. Although this represents a per-token price increase over GPT-5.1, OpenAI argues that the model's higher efficiency leads to lower total costs for achieving the same quality of output. In a separate but significant development, OpenAI also announced a USD 1 billion investment from Disney. This partnership will integrate over 200 licensed Disney, Marvel, Pixar, and Star Wars characters into OpenAI's Sora video generation model, opening new avenues for personalized entertainment and content creation.

The Road Ahead and Safety Measures

The launch of GPT-5.2 appears to be just the beginning of OpenAI's holiday-season push. Altman teased "little Christmas gifts" for users coming the following week, and industry rumors suggest another model with enhanced image capabilities and personalization could arrive in January 2026. Alongside these advancements, OpenAI is implementing new safety measures. Chief Product Officer Fidji Simo confirmed the rollout of an age estimation system in some regions to better control content for users under 18, a precursor to a planned "adult mode" feature expected in the first quarter of 2026.