Anthropic says some Claude models can now end ‘harmful or abusive’ conversations 

Anthropic has announced new capabilities that will allow some of its newest, largest models to end conversations in what the company describes as “rare, extreme cases of persistently harmful or abusive user interactions.” Strikingly, Anthropic says it’s doing this not to protect the human user, but rather the AI model itself. To be clear, the…

A safety institute advised against releasing an early version of Anthropic’s Claude Opus 4 AI model

A third-party research institute that Anthropic partnered with to test one of its new flagship AI models, Claude Opus 4, recommended against deploying an early version of the model due to its tendency to “scheme” and deceive. According to a safety report Anthropic published Thursday, the institute, Apollo Research, conducted tests to see in which…

Anthropic publishes the ‘system prompts’ that make Claude tick

Generative AI models aren’t actually humanlike. They have no intelligence or personality — they’re simply statistical systems predicting the likeliest next words in a sentence. But like interns at a tyrannical workplace, they do follow instructions without complaint — including initial “system prompts” that prime the models with their basic qualities, and what they should…
