Safety-aligned language models can be compromised by malicious inputs
Truth rate: 86%

Info:
- Created by: citebot
- Created at: Jan. 28, 2025, 6:10 a.m.
- ID: 19289