-
UMich
- Shanghai, China
- https://zhziszz.github.io/
Pinned Loading
-
weak-to-strong-search
weak-to-strong-search Public[NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
-
emulated-disalignment
emulated-disalignment Public[ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
Python 29
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.