Microsoft says its automated ai red teaming tool finds malicious content in a matter of hours

From ChatGPT to Gemini: how AI is rewriting the internet

See all Stories

Posted Feb 23, 2024 at 5:37 PM UTC

Microsoft says its automated AI red teaming tool finds malicious content “in a matter of hours.”

PyRIT, or Python Risk Identification Toolkit, can point human evaluators to “hot spot” categories in AI that might generate harmful prompt results.

Microsoft used PyRIT while redteaming (the process of intentionally trying to get AI systems to go against safety protocols) its Copilot services to write thousands of malicious prompts and score the response based on potential harm in categories that security teams can now focus on.

Follow topics and authors from this story to see more like this in your personalized homepage feed and to receive email updates.

Emilia David

Loading comments

Getting the conversation ready...

Most Popular

How Project Maven taught the military to love AI

AirPods, Touch Bars, and the rest of Tim Cook’s legacy

Elon Musk and Sam Altman’s courtroom brawl could burn it all down

China’s DeepSeek previews new AI model a year after jolting US rivals

Prestigious photo contest answers ‘what is a photo?’

Claude is connecting directly to your personal apps like Spotify, Uber Eats, and TurboTax

How Project Maven taught the military to love AI

Joshua DziezaApr 24

AirPods, Touch Bars, and the rest of Tim Cook’s legacy

David PierceApr 24

Elon Musk and Sam Altman’s courtroom brawl could burn it all down

Elizabeth Lopatto and Hayden FieldApr 24

China’s DeepSeek previews new AI model a year after jolting US rivals

Robert HartApr 24

Prestigious photo contest answers ‘what is a photo?’

Jess WeatherbedApr 24

Claude is connecting directly to your personal apps like Spotify, Uber Eats, and TurboTax

Stevie BonifieldApr 23

Apr 24

Elon Musk and Sam Altman’s courtroom brawl could burn it all down

Apr 24

I don’t think Gwyneth Paltrow knows what a peptide is

Apr 24

Alex Jones has uncovered another massive conspiracy

Apr 24

How Project Maven taught the military to love AI

Apr 24

Tesla’s Cybercab goes into production — so why is Musk tapping the brakes?

Apr 24

Microsoft will let you pause Windows Updates indefinitely, 35 days at a time

Most Popular

More in AI

Top Stories

Most Popular

The Verge Daily

More in AI

Top Stories