Mitigate jailbreaks and injections via policy and pattern filters.
Susceptible to adversarial prompts that subvert system instructions.
Add citations and confidence to reduce hallucinations.
Cross‑check answers against fresh sources.
Reduce over‑agreement with contradiction probes.