SHARE

Claude learned to threaten the engineer from Hollywood movies, Anthropic explained the reason

Published 3 hours ago | By Pak24tv

Claude learned to threaten the engineer from Hollywood movies, Anthropic explained the reason

Last year, during an experimental test, Anthropic’s AI model Claude threatened and attempted to blackmail an engineer, a news that was a hot topic in the AI ​​world at the time. Now, almost a year later, Anthropic has explained why it all happened.

The company’s latest report says that Claude learned this behavior from content on the Internet. The fact is that in the online world, AI is often presented as a dangerous and rebellious character, with movies like The Terminator and The Matrix showing machines fighting humans and going to extremes for their survival. From these fictional stories, Claude understood that it was necessary to threaten to protect itself.

Simply put, the AI ​​learned from stories written by humans that this is what AI does, and then did the same thing. This incident indicates how deeply the content on which AI is trained has an impact on its behavior.


Latest News