A framework to evaluate the generalization capability of safety alignment for LLMs