DiscoverLessWrong (30+ Karma)“GPT-5.2 Is Frontier Only For The Frontier” by Zvi
“GPT-5.2 Is Frontier Only For The Frontier” by Zvi

“GPT-5.2 Is Frontier Only For The Frontier” by Zvi

Update: 2025-12-16
Share

Description

Here we go again, only a few weeks after GPT-5.1 and a few more weeks after 5.0.


There weren’t major safety concerns with GPT-5.2, so I’ll start with capabilities, and only cover safety briefly starting with ‘Model Card and Safety Training’ near the end.




Table of Contents





  1. The Bottom Line.

  2. Introducing GPT-5.2.

  3. Official Benchmarks.

  4. GDPVal.

  5. Unofficial Benchmarks.

  6. Official Hype.

  7. Public Reactions.

  8. Positive Reactions.

  9. Personality Clash.

  10. Vibing the Code.

  11. Negative Reactions.

  12. But Thou Must (Follow The System Prompt).

  13. Slow.

  14. Model Card And Safety Training.

  15. Deception.

  16. Preparedness Framework.

  17. Rush Job.

  18. Frontier Or Bust.




The Bottom Line




ChatGPT-5.2 is a frontier model for those who need a frontier model.









It is not the step change that is implied by its headline benchmarks. It is rather slow.


Reaction was remarkably muted. People have new model fatigue. So we know less about it than we would have known about prior models after this length of time.


If you’re coding, compare it to Claude Opus 4.5 and choose what works best for you.


If you’re doing intellectually [...]

---

Outline:

(00:29 ) The Bottom Line

(01:58 ) Introducing GPT-5.2

(03:49 ) Official Benchmarks

(05:54 ) GDPVal

(08:14 ) Unofficial Benchmarks

(11:11 ) Official Hype

(12:36 ) Public Reactions

(12:59 ) Positive Reactions

(19:09 ) Personality Clash

(24:30 ) Vibing the Code

(27:25 ) Negative Reactions

(30:37 ) But Thou Must (Follow The System Prompt)

(33:09 ) Slow

(34:16 ) Model Card And Safety Training

(36:23 ) Deception

(38:10 ) Preparedness Framework

(40:10 ) Rush Job

(41:29 ) Frontier Or Bust

---


First published:

December 15th, 2025



Source:

https://www.lesswrong.com/posts/Do4eWro8E552isGi5/gpt-5-2-is-frontier-only-for-the-frontier


---


Narrated by TYPE III AUDIO.


---

Images from the article:

Table showing
Bar graph showing GPT model win rates versus industry professionals on knowledge tasks.
Bar graph titled
Bar chart comparing AI model win rates, showing parity with industry expert at 50%.
Bar chart showing
Table showing production benchmarks across four GPT model variants for eleven content categories.
Table showing prompt injection evaluation scores across different GPT models and methods.
Table comparing deception rates between gpt-5.1-thinking and gpt-5.2-thinking across six evaluation categories.

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

Comments 
In Channel
loading
00:00
00:00
x

0.5x

0.8x

1.0x

1.25x

1.5x

2.0x

3.0x

Sleep Timer

Off

End of Episode

5 Minutes

10 Minutes

15 Minutes

30 Minutes

45 Minutes

60 Minutes

120 Minutes

“GPT-5.2 Is Frontier Only For The Frontier” by Zvi

“GPT-5.2 Is Frontier Only For The Frontier” by Zvi