AprielGuard: A Guardrail for Safety and Adversarial Robustness in Modern LLM Systems
AprielGuard is an 8B safety model designed to detect adversarial attacks and content risks in agentic LLM systems. The model identifies prompt injection, jailbreaks, memory poisoning, and tool manipulation threats. AprielGuard works on tool calls and reasoning traces, offering both explainable and low-latency modes for production deployment.