o1, OpenAI’s latest generative AI model, has arrived.
The companyannounced o1-preview and o1-mini on Thursday, marking a departure from the GPT naming scheme.
According the company, o1 performs “similarly to PhD students” in biology, chemistry, and physics.
Where GPT-4o solved 13% of the problems on the International Mathematics Olympiad, o1 reportedly solved 83%.
The company also emphasized how the models are more effective for coding and programming.
That “thinking” means o1 takes longer to respond than previous models.
As OpenAI research lead Jerry Tworek tells The Verge, o1 is trained through reinforcement learning.
Rather than looking for patterns from a training set, o1 learns through “rewards and penalties.”
Does o1-preview think a hot dog is a sandwich?
That makes it difficult to properly test OpenAI’s latest models for their proposed strengths and use cases.
(Its answer, by the way, amounted to three paragraphs of “it depends.")
The first, “Analyzing the question,” reads: “OK, let me see.
This shows the room for debate.”
I guess that’s all the thinking it needed to answer the question.
What about a taco?
Is that a sandwich?
I also asked o1 to weigh in on another controversial matter involving food: Is a taco a sandwich?
The model has a lot to say.After thinking for five whole seconds, the AI returned a 364-word response.
This helps in understanding whether it fits the definition of a sandwich.
(Here’s the context, if you’re interested.)
But is a taco a hot dog?
As a followup, I asked o1 if it would classify a taco as a hot dog.
There you have it, internet.
it’s possible for you to stop arguing this one.
o1 can handle more complex, non-sandwich related tasks too
Let’s try another.
It delivered just such a puzzle, with instructions on how to solve it.
Clicking on the drop-down menu, it took 36 individual thought processes as it worked through the prompt.
We need to design the grid, derive clues, and present the puzzle for solving.”
It’s definitely interesting to scroll through each step o1-preview takes.
(Is that really what we want from AI?)
That means having a ChatGPT Plus or ChatGPT Team subscription.
If you’re a ChatGPT Enterprise or ChatGPT Ed user, the models should appear next week.
ChatGPT free users will get o1-mini at some point in the future.