Sleeper Agent Models Archives

LLM04: Data and Model Poisoning – A C-Suite Imperative for AI Risk Mitigation

16 June 2025 by Krishna

At its core, data poisoning involves the deliberate manipulation of datasets used during the pre-training, fine-tuning, or embedding stages of an LLM’s lifecycle. The objective is often to introduce backdoors, degrade model performance, or inject bias—toxic, unethical, or otherwise damaging behaviour—into outputs.