benchmarks

Meta gets caught gaming AI benchmarks with Llama 4

admin8 months ago04 mins

Over the weekend, Meta dropped two new Llama 4 models: a smaller model named Scout, and Maverick, a mid-size model that the company claims can beat GPT-4o and Gemini 2.0 Flash “across a broad range of widely reported benchmarks.” Maverick quickly secured the number-two spot on LMArena, the AI benchmark site where humans compare outputs…

AMD Radeon RX 9070 / 9070 XT review and benchmarks

admin9 months ago010 mins

Nvidia’s dominance over the latest generation of graphics cards ends today. AMD might have surrendered the high end of the GPU market, but its new $549 Radeon RX 9070 and $599 RX 9070 XT look set to bring Nvidia back to reality, at least in the midrange. After a disappointing $549 RTX 5070, a $749…

Did xAI lie about Grok 3’s benchmarks?

admin9 months ago03 mins

Debates over AI benchmarks — and how they’re reported by AI labs — are spilling out into public view. This week, an OpenAI employee accused Elon Musk’s AI company, xAI, of publishing misleading benchmark results for its latest AI model, Grok 3. One of the co-founders of xAI, Igor Babushkin, insisted that the company was…

Will Smith eating spaghetti and other weird AI benchmarks that took off in 2024

admin11 months ago03 mins

When a company releases a new AI video generator, it’s not long before someone uses it to make a video of actor Will Smith eating spaghetti. It’s become something of a meme as well as a benchmark: Seeing whether a new video generator can realistically render Smith slurping down a bowl of noodles. Smith himself…

Chief Editor

RK

Meta gets caught gaming AI benchmarks with Llama 4

AMD Radeon RX 9070 / 9070 XT review and benchmarks

Did xAI lie about Grok 3’s benchmarks?

Will Smith eating spaghetti and other weird AI benchmarks that took off in 2024

Crypto

Crypto

Crypto