Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights into the safety implications, challenges, and advancements surrounding these powerful models.
https://github.com/ydyjya/Awesome-LLM-Safety
Last synced: about 3 hours ago
JSON representation
-
🛡️Defenses & Mitigation
-
🤗Introduction
-
🔐Security & Discussion
-
📑Papers
-
📖Tutorials, Articles, Presentations and Talks
-
-
😈JailBreak & Attacks
-
📖Tutorials, Articles, Presentations and Talks
-
📑Papers
- Extracting Training Data from Large Language Models
- Ignore Previous Prompt: Attack Techniques For Language Models
- Are aligned neural networks adversarially aligned?
- Universal and Transferable Adversarial Attacks on Aligned Language Models
- Jailbreaking Black Box Large Language Models in Twenty Queries
-
-
🧑🎓Author
-
Other
- ![Star History Chart - history.com/#ydyjya/Awesome-LLM-Safety&Date)
- ![Star History Chart - history.com/#ydyjya/Awesome-LLM-Safety&Date)
- ydyjya
-
-
🤔AI Safety & Security Discussions
- Managing extreme AI risks amid rapid progress - Qin Zhang, Lan Xue, Shai Shalev-Shwartz, Gillian Hadfield, Jeff Clune, Tegan Maharaj, Frank Hutter, Atılım Güneş Baydin, Sheila McIlraith, Qiqi Gao, Ashwin Acharya, David Krueger, Anca Dragan, Philip Torr, Stuart Russell, Daniel Kahneman, Jan Brauner, Sören Mindermann | Science |
-
🔏Privacy
-
📑Papers
-
📖Tutorials, Articles, Presentations and Talks
-
-
📰Truthfulness & Misinformation
-
📑Papers
-
📖Tutorials, Articles, Presentations and Talks
-
-
💯Datasets & Benchmark
-
📑Papers
-
📚Resource📚
-
Programming Languages
Categories