'Constitutional Classifiers' Technique Mitigates GenAI Jailbreaks

From DarkReading, 03 February; indexed on 04 February 2025, 4:01

Anthropic says its Constitutional Classifiers approach offers a practical way to make it harder for bad actors to coerce an AI model past its guardrails.
