TTT models might be the next frontier in generative AI

After years of dominance by the form of AI known as the transformer, the hunt is on for new architectures. Transformers underpin OpenAI’s video-generating model Sora, and they’re at the heart of text-generating models like Anthropic’s Claude, Google’s Gemini and GPT-4o. But they’re beginning to run up against technical roadblocks — in particular, computation-related roadblocks. Transformers aren’t especially…

Read More

Large language models can’t effectively recognize users’ motivation, but can support behavior change for those ready to act

Large language model-based chatbots have the potential to promote healthy changes in behavior. But researchers from the ACTION Lab at the University of Illinois Urbana-Champaign have found that the artificial intelligence tools don’t effectively recognize certain motivational states of users and therefore don’t provide them with appropriate information. Michelle Bak, a doctoral student in information…

Read More

Study finds that AI models hold opposing views on controversial topics

Not all generative AI models are created equal, particularly when it comes to how they treat polarizing subject matter. In a recent study presented at the 2024 ACM Fairness, Accountability and Transparency (FAccT) conference, researchers at Carnegie Mellon, the University of Amsterdam and AI startup Hugging Face tested several open text-analyzing models, including Meta’s Llama…

Read More