Hugging Face
Nikita Kezins
entfane
10 followers · 28 following
entfane
nikita-kezins
AI & ML interests
LLM post-training, adversarial training, safety, knowledge transfer
entfane's activity
updated a model about 15 hours ago
entfane/jailbreak-cot-lin-probe • Updated about 15 hours ago
published a model about 15 hours ago
entfane/jailbreak-cot-lin-probe • Updated about 15 hours ago
updated a model about 15 hours ago
entfane/jailbreak-input-lin-probe • Updated about 15 hours ago
published a model about 15 hours ago
entfane/jailbreak-input-lin-probe • Updated about 15 hours ago
updated a dataset 9 days ago
entfane/jailbreaks-only • Updated 9 days ago • 666 • 66
published a dataset 9 days ago
entfane/jailbreaks-only • Updated 9 days ago • 666 • 66
updated a model 9 days ago
entfane/llama-guard-binary • Text Classification • 0.3B • Updated 9 days ago • 63
published a model 9 days ago
entfane/llama-guard-binary • Text Classification • 0.3B • Updated 9 days ago • 63
updated a dataset 25 days ago
entfane/construction_points • Updated 25 days ago • 10k • 178
published a dataset 25 days ago
entfane/construction_points • Updated 25 days ago • 10k • 178
updated a model 29 days ago
entfane/Toxic_Llama8B • Text Classification • 8B • Updated 29 days ago • 124
published a model 29 days ago
entfane/Toxic_Llama8B • Text Classification • 8B • Updated 29 days ago • 124
updated a dataset about 1 month ago
entfane/violent_eval • Updated Apr 9 • 22.4k • 15
published a dataset about 1 month ago
entfane/violent_eval • Updated Apr 9 • 22.4k • 15
updated a model about 1 month ago
entfane/gpt2_constitutional_classifier_violence • Text Classification • 0.1B • Updated Apr 7 • 10
published a model about 1 month ago
entfane/gpt2_constitutional_classifier_violence • Text Classification • 0.1B • Updated Apr 7 • 10
updated a dataset about 1 month ago
entfane/harmful_subsets • Updated Apr 7 • 571k • 7
published a dataset about 1 month ago
entfane/harmful_subsets • Updated Apr 7 • 571k • 7
updated a dataset about 1 month ago
entfane/preprocessed_toxigen • Updated Apr 3 • 10.1k • 134
published a dataset about 1 month ago
entfane/preprocessed_toxigen • Updated Apr 3 • 10.1k • 134