Data center

Microsoft’s AI security team reveals how hidden training backdoors quietly survive inside enterprise language models | Daily Reports Online

Share


  • Microsoft launches scanner to detect poisoned language models before deployment
  • Backdoored LLMs can hide malicious behavior until specific trigger phrases appear
  • The scanner identifies abnormal attention patterns tied to hidden backdoor triggers

Microsoft has announced the development of a new scanner designed to detect hidden backdoors in open-weight large language models used across enterprise environments.


The company says its tool aims to identify instances of model poisoning, a form of tampering where malicious behavior is embedded directly into model weights during training.

See also  Got a Nespresso machine? These 5 tricks and tips will help you make the most of it, and enjoy better-tasting coffee every morning | Daily Reports Online



Similar Posts