As LLMs and diffusion models power more applications, their safety alignment becomes critical. Our research shows that even minimal downstream fine‑tuning can weaken safeguards, raising a key question ...
Anthropic’s Claude Opus 4.6 arrives in Microsoft Foundry and GitHub Copilot, bringing advanced reasoning, agentic coding, and ...
New research outlines how attackers bypass safeguards and why AI security must be treated as a system-wide problem.
That helpful “Summarize with AI” button? It might be secretly manipulating what your AI recommends. Microsoft security researchers have discovered a growing trend of AI memory poisoning attacks used ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results