\
  The most prestigious law school admissions discussion board in the world.
BackRefresh Options Favorite

OpenAI releases o3-pro, the most capable model yet

OpenAI has launched o3-pro, an AI model that the company cla...
Salmon insecure corner sweet tailpipe
  06/10/25
...
fiercely-loyal boyish site
  06/10/25
...
Orchid Mildly Autistic Bawdyhouse
  06/10/25
people are saying it's actually dumber than the old version ...
Pale cracking stain
  06/10/25
We need a new Biglaw Bench score for this and the new Claude...
Salmon insecure corner sweet tailpipe
  06/10/25
how expensive is it compared to gemini? I require large ...
Lime locus pozpig
  06/10/25
For law?
Salmon insecure corner sweet tailpipe
  06/10/25
Who gives a shit about AIME? They should be reporting liveco...
Ebony ungodly meetinghouse associate
  06/10/25
And Biglaw Bench. I want to see someone dethrone Gemini
Salmon insecure corner sweet tailpipe
  06/10/25
The latest 2.5 pro model seems noticeably better than the la...
Ebony ungodly meetinghouse associate
  06/10/25
It’s getting insane. Half the US will be unemployed by...
Salmon insecure corner sweet tailpipe
  06/10/25


Poast new message in this thread



Reply Favorite

Date: June 10th, 2025 4:48 PM
Author: Salmon insecure corner sweet tailpipe

OpenAI has launched o3-pro, an AI model that the company claims is its most capable yet.

O3-pro is a version of OpenAI’s o3, a reasoning model that the startup launched earlier this year. As opposed to conventional AI models, reasoning models work through problems step by step, enabling them to perform more reliably in domains like physics, math, and coding.

O3-pro is available for ChatGPT Pro and Team users starting Tuesday, replacing the o1-pro model. Enterprise and Edu users will get access the week after, OpenAI says. O3-pro is also live in OpenAI’s developer API as of this afternoon.

O3-pro is priced at $20 per million input tokens and $80 per million output tokens in the API. Input tokens are tokens fed into the model, while output tokens are tokens that the model generates based on the input tokens.

A million input tokens is equivalent to about 750,000 words, a bit longer than “War and Peace.”

“In expert evaluations, reviewers consistently prefer o3-pro over o3 in every tested category and especially in key domains like science, education, programming, business, and writing help,” OpenAI writes in a changelog. “Reviewers also rated o3-pro consistently higher for clarity, comprehensiveness, instruction-following, and accuracy.”

O3-pro has access to tools, according to OpenAI, allowing it to search the web, analyze files, reason about visual inputs, use Python, personalize its responses leveraging memory, and more. As a drawback, the model’s responses typically take longer than o1-pro to complete, according to OpenAI.

O3-pro has other limitations. Temporary chats with the model in ChatGPT are disabled for now while OpenAI resolves a “technical issue.” O3-pro can’t generate images. And Canvas, OpenAI’s AI-powered workspace feature, isn’t supported by o3-pro.

On the plus side, o3-pro achieves impressive scores in popular AI benchmarks, according to OpenAI’s internal testing. On AIME 2024, which evaluates a model’s math skills, o3-pro scores better than Google’s top-performing AI model, Gemini 2.5 Pro. O3-pro also beats Anthropic’s recently released Claude 4 Opus on GPQA Diamond, a test of PhD-level science knowledge.

(http://www.autoadmit.com/thread.php?thread_id=5735763&forum_id=2\u0026mark_id=5310486#49002909)



Reply Favorite

Date: June 10th, 2025 5:04 PM
Author: fiercely-loyal boyish site



(http://www.autoadmit.com/thread.php?thread_id=5735763&forum_id=2\u0026mark_id=5310486#49002971)



Reply Favorite

Date: June 10th, 2025 5:45 PM
Author: Orchid Mildly Autistic Bawdyhouse



(http://www.autoadmit.com/thread.php?thread_id=5735763&forum_id=2\u0026mark_id=5310486#49003138)



Reply Favorite

Date: June 10th, 2025 5:08 PM
Author: Pale cracking stain

people are saying it's actually dumber than the old version and they're trying to sneakily cut down/cheap out on the compute that they're giving out to users

(http://www.autoadmit.com/thread.php?thread_id=5735763&forum_id=2\u0026mark_id=5310486#49002984)



Reply Favorite

Date: June 10th, 2025 5:12 PM
Author: Salmon insecure corner sweet tailpipe

We need a new Biglaw Bench score for this and the new Claude and Gemini stuff

(http://www.autoadmit.com/thread.php?thread_id=5735763&forum_id=2\u0026mark_id=5310486#49002997)



Reply Favorite

Date: June 10th, 2025 5:28 PM
Author: Lime locus pozpig

how expensive is it compared to gemini?

I require large tokens for high context precise prompts, will prob try chunking my token usage soon.

(http://www.autoadmit.com/thread.php?thread_id=5735763&forum_id=2\u0026mark_id=5310486#49003064)



Reply Favorite

Date: June 10th, 2025 5:30 PM
Author: Salmon insecure corner sweet tailpipe

For law?

(http://www.autoadmit.com/thread.php?thread_id=5735763&forum_id=2\u0026mark_id=5310486#49003073)



Reply Favorite

Date: June 10th, 2025 5:39 PM
Author: Ebony ungodly meetinghouse associate

Who gives a shit about AIME? They should be reporting livecodebench, aider, swe bench, etc.

(http://www.autoadmit.com/thread.php?thread_id=5735763&forum_id=2\u0026mark_id=5310486#49003113)



Reply Favorite

Date: June 10th, 2025 6:56 PM
Author: Salmon insecure corner sweet tailpipe

And Biglaw Bench. I want to see someone dethrone Gemini

(http://www.autoadmit.com/thread.php?thread_id=5735763&forum_id=2\u0026mark_id=5310486#49003297)



Reply Favorite

Date: June 10th, 2025 7:04 PM
Author: Ebony ungodly meetinghouse associate

The latest 2.5 pro model seems noticeably better than the late March release, so it likely still leads on that. Model improvements are now happening every couple months, lmao.

(http://www.autoadmit.com/thread.php?thread_id=5735763&forum_id=2\u0026mark_id=5310486#49003312)



Reply Favorite

Date: June 10th, 2025 8:37 PM
Author: Salmon insecure corner sweet tailpipe

It’s getting insane. Half the US will be unemployed by January

(http://www.autoadmit.com/thread.php?thread_id=5735763&forum_id=2\u0026mark_id=5310486#49003535)