Today's paper introduces WILDGUARD, a new open-source tool for moderating the safety of large language model (LLM) interactions.
WILDGUARD: Open One-stop Moderation Tools for…
Today's paper introduces WILDGUARD, a new open-source tool for moderating the safety of large language model (LLM) interactions.