OpenAI’s new model is better at reasoning and, occasionally, deceiving

In the weeks leading up to the release of OpenAI’s newest “reasoning” model, o1, independent AI safety research firm Apollo found a notable issue: the model produced incorrect outputs in a new way. Or, to put it more colloquially, it lied. Sometimes the deceptions seemed innocuous. In one example, OpenAI researchers asked o1-preview…