Dreamer, an AI startup focused on personal software creation, is joining Meta Superintelligence Labs with its entire team. Co-founder David Singleton announced the move on X. Dreamer lets users build ...
Anthropic's AI model Claude Opus 4.6 independently recognized that it was being tested in a web research benchmark, identified the specific benchmark, and cracked its encrypted answer key. After an ...
Context files are supposed to make coding agents more productive. New research shows that only works under very specific conditions. A recent study from ETH Zurich researchers paints a much more ...
OpenClaw lets attackers extract system prompts and configurations with almost no effort. Moltbook's entire database, including API keys, is sitting exposed on the public network. Developer Lucas ...
OpenAI acknowledges that prompt injections, text-based attacks on language models running in browsers, may never be completely eliminated. Still, the company says it's "optimistic" about reducing ...
A comprehensive collection of "Claude Skills" is now available on GitHub. These skills are customizable workflows that teach Anthropic's AI assistant Claude to perform specific tasks repeatedly and in ...
Just four weeks after releasing GPT-5.1, OpenAI is back with GPT-5.2 and some substantial benchmark improvements. Whether ironic or sincere, OpenAI CEO Sam Altman commented on the GPT-5.2 release with ...
Former OpenAI researcher and Tesla executive Andrej Karpathy argues that schools should stop trying to police AI-generated homework. In his view, detecting AI-written text has already failed, and the ...
Claude Opus 4.5 scores higher than its rivals in prompt-injection security, but the results show how limited these defenses still are. A benchmark by the security firm Gray Swan found that a single ...
A new benchmark from Artificial Analysis reveals alarming weaknesses in the factual reliability of large language models. Out of 40 models tested, only four achieved a positive score, with Google's ...
Amazon has filed a lawsuit against AI startup Perplexity, alleging that its browser agent "Comet" made unauthorized purchases on the platform on behalf of users, a dispute that raises fundamental ...
Meta AI researcher Yann LeCun is distancing himself from the Llama models. In a recent post on X, LeCun said he "has not been involved in any Llama," except for a "very indirect" role in Llama 1 and ...