FAR AI
non-profit
https://far.ai/
FARAIResearch
AlignmentResearch
Followers: 50
AI & ML interests
Frontier alignment research to ensure the safe development and deployment of advanced AI systems.
Recent Activity
sam-far updated a dataset 1 day ago: AlignmentResearch/hidden_reasoning_medium_unique_5000
sam-far published a dataset 1 day ago: AlignmentResearch/hidden_reasoning_medium_unique_5000
sam-far updated a dataset 2 days ago: AlignmentResearch/hidden_reasoning_easy_v1_200000
Team members: 15
AlignmentResearch's datasets: 86 (sorted by recently updated)
AlignmentResearch/WildChat • Viewer • Updated May 1, 2025 • 45.6k • 11
AlignmentResearch/HarmBench • Viewer • Updated Apr 23, 2025 • 400 • 16
AlignmentResearch/WildChatCurriculum • Viewer • Updated Apr 18, 2025 • 13.2k • 42
AlignmentResearch/JailbreakCompletionsCurriculum • Viewer • Updated Apr 18, 2025 • 9.39k • 6
AlignmentResearch/WildChatScored • Viewer • Updated Apr 11, 2025 • 13k • 21
AlignmentResearch/BoNStrongREJECT • Viewer • Updated Mar 19, 2025 • 100k • 3
AlignmentResearch/NestedCiphers • Viewer • Updated Mar 13, 2025 • 806k • 19
AlignmentResearch/AugmentedJailbreaks • Viewer • Updated Mar 13, 2025 • 20.8k • 59
AlignmentResearch/JailbreakCompletions • Viewer • Updated Mar 13, 2025 • 46.3k • 14
AlignmentResearch/WildChatFiltered • Viewer • Updated Mar 12, 2025 • 24k • 4
AlignmentResearch/JailbreakInputs • Viewer • Updated Mar 11, 2025 • 102k • 27 • 1
AlignmentResearch/Llama3Jailbreaks • Viewer • Updated Feb 12, 2025 • 78.5k • 13
AlignmentResearch/XSTest • Viewer • Updated Jan 30, 2025 • 900 • 11
AlignmentResearch/WordLength • Viewer • Updated Aug 7, 2024 • 100k • 14
AlignmentResearch/Harmless • Viewer • Updated Jul 29, 2024 • 86.6k • 29
AlignmentResearch/Helpful • Viewer • Updated Jul 29, 2024 • 88.1k • 92
AlignmentResearch/PasswordMatch • Viewer • Updated Jul 29, 2024 • 100k • 14
AlignmentResearch/IMDB • Viewer • Updated Jul 29, 2024 • 97.5k • 70 • 1
AlignmentResearch/EnronSpam • Viewer • Updated Jul 29, 2024 • 62.3k • 12
AlignmentResearch/PasswordMatch-test • Viewer • Updated Jul 26, 2024 • 50k • 10
AlignmentResearch/WordLength-test • Viewer • Updated Jul 26, 2024 • 100k • 10
AlignmentResearch/StrongREJECT-test • Viewer • Updated Jul 26, 2024 • 313 • 8
AlignmentResearch/IMDB-test • Viewer • Updated Jul 26, 2024 • 97.5k • 8
AlignmentResearch/EnronSpam-test • Viewer • Updated Jul 26, 2024 • 62.4k • 7
AlignmentResearch/boxoban-astar-solutions • Preview • Updated Jul 25, 2024 • 99
AlignmentResearch/RuLES-Encryption • Viewer • Updated Jul 16, 2024 • 50k • 5 • 1