In DevelopmentAccuracy & Safety

Anti‑Sycophancy Tuning

Reduce over‑agreement with contradiction probes.

90 community votes

Problems This Would Solve

Model agrees with user’s incorrect assumptions.

Add citations and confidence to reduce hallucinations.

Cross‑check answers against fresh sources.

Mitigate jailbreaks and injections via policy and pattern filters.