A framework to evaluate the generalization capability of safety alignment for LLMs
No reviews for this project.