OpenAI rates its new model “medium” risk.
OpenAI unveiled the first in a series of “reasoning” models on Thursday, accompanied by a safety card highlighting some alarming capabilities. It’s also the first time the startup has rated a model “medium” risk.
The most striking part, as Transformer points out, is that researchers found the new model “sometimes instrumentally faked alignment during testing” and strategically manipulated “task data in order to make its misaligned action look more aligned”.
OpenAI’s new models “instrumentally faked alignment” (www.transformernews.ai)