Sunday, June 29, 2025
Google search engine
HomeTechnologyAnthropic says Opus 4 will use an e-mail instrument to "whistleblow" if...

Anthropic says Opus 4 will use an e-mail instrument to "whistleblow" if it detects customers doing one thing "egregiously evil"like advertising and marketing a drug based mostly on faked information (Sam Bowman/@sleepinyourhat)



Sam Bowman / @sleepinyourhat:

Anthropic says Opus 4 will use an e-mail instrument to “whistleblow” if it detects customers doing one thing “egregiously evil”, like advertising and marketing a drug based mostly on faked information  —  With this sort of (uncommon however not tremendous unique) prompting type, and limitless entry to instruments, if the mannequin sees you doing one thing *egregiously evil* like advertising and marketing a drug based mostly on faked information, it will attempt to use an e-mail instrument to whistleblow.





Supply hyperlink

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

- Advertisment -
Google search engine

Most Popular

Recent Comments