It's insane how FAR AHEAD GPT is compared to the competition
Date: January 15th, 2026 10:39 PM Author: startled theatre cumskin
Claude is the closest and GPT is absolutely blowing it away.
Abstract Reasoning (ARC-AGI-2)
The substantial lead is most visible in abstract reasoning, which allows a model to solve novel engineering problems it hasn't seen in its training data:
The Gap: GPT 5.2 Thinking scores 52.9%–54.2% on the ARC-AGI-2 benchmark.
Opus Lag: Claude Opus 4.5 scores only 37.6%.
Impact: In relative terms that is a roughly 40% higher score (about 15 to 17 percentage points). If the benchmark tracks real work, in 2026 GPT 5.2 should be noticeably better at handling "first-of-their-kind" software architectural challenges that require true first-principles thinking rather than pattern matching.
(http://www.autoadmit.com/thread.php?thread_id=5822697&forum_id=2&mark_id=5310908#49593065)
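For anyone checking the "roughly 40%" figure: it only works as a relative difference, not as percentage points. A quick sketch of the arithmetic, using only the scores quoted above (the function name `gaps` is just made up for illustration):

```python
def gaps(score_a, score_b):
    """Return (absolute gap in percentage points, relative gap in percent)."""
    abs_gap = score_a - score_b
    rel_gap = abs_gap / score_b * 100
    return abs_gap, rel_gap

# Scores as quoted in the post; not independently verified here.
gpt_low, gpt_high = 52.9, 54.2
opus = 37.6

for gpt in (gpt_low, gpt_high):
    abs_gap, rel_gap = gaps(gpt, opus)
    print(f"GPT {gpt}% vs Opus {opus}%: +{abs_gap:.1f} points, {rel_gap:.1f}% relative")
```

So the relative lead lands somewhere around 41% to 44%, while the absolute lead is about 15 to 17 points. Whether a benchmark gap carries over one-to-one to real architecture work is a separate question the numbers can't answer.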
Date: January 15th, 2026 10:47 PM Author: startled theatre cumskin
lol OpenAI has also pointed out an "intelligence overhang" where almost all humans are not intelligent enough to notice that it is better than other models or to use its full capability. This is literally exactly what I have argued a million times. I have said this here before:
OpenAI leadership has recently warned of an "intelligence overhang," where the model's raw capabilities already exceed most current software workflows and human ability to use them effectively. For those using it for deep architectural work, that "superhuman" edge is becoming increasingly obvious.
(http://www.autoadmit.com/thread.php?thread_id=5822697&forum_id=2&mark_id=5310908#49593075)
Date: January 16th, 2026 12:24 PM Author: Pontificating Theater Stage
gemini is potus across the board
but if you stack latest GPT for JSON prompts, gemini for nano and notebook lm etc reviewing editing research etc and the claude cowork or projects to have an AI slave grind out shit you will be literally full bore raping anyone that doesnt use AI
its so unfair, i would seriously consider killing myself if i didnt know how to use these satanic sigils
(http://www.autoadmit.com/thread.php?thread_id=5822697&forum_id=2&mark_id=5310908#49593880)