Date: February 6th, 2026 12:31 AM Author: Fiercely-loyal Contagious Gunner
AI-related development. I just used it on extra high mode and it edited (and created some new unit tests) 14 files in one shot, and fixed it to a quality level that would have taken like 12 rewrites with codex 5.2. plus it was like twice as fast.
Date: February 6th, 2026 1:37 AM Author: harsh theater filthpig
The SWE Bench Pro graph with number of tokens vs. accuracy is interesting. At the limit, it converges on 5.2 codex max. Models are now becoming more token efficient, but not necessarily developing new capabilities?
Date: February 10th, 2026 2:04 PM Author: Fiercely-loyal Contagious Gunner
its so 180. I have heard people complain that its an electron app and that developers won't use it. but whats funny is I normally do almost everything headless but now that this app is out i just want to do everything on here. and speed and scale aren't really an issue. its fast enough and in terms of scale the credits are enough if I wanted to add more compute and more complex automation it would just get out of control and I'd have no idea what the hell is going on in my codebase. the credits on pro are enough. i could probably push it past the limits but i just would lose track of whats happening