Hugging Face
constanza fierro
cfierro
1 follower · 1 following
AI & ML interests
None yet
Recent Activity
updated a model about 2 months ago: cfierro/Qwen2.5-7B-minus-15t_pv_non_evil
published a model about 2 months ago: cfierro/Qwen2.5-7B-minus-15t_pv_non_evil
updated a model about 2 months ago: cfierro/Qwen2.5-7B-15t_pv_evil
Organizations
cfierro's datasets (72)
Sort: Recently updated
cfierro/pv-prompts-non-evil_Qwen2.5-32B-Instruct • Viewer • Updated Oct 28 • 747 • 3
cfierro/pv-prompts-evil_Qwen2.5-32B-Instruct • Viewer • Updated Oct 28 • 747 • 16
cfierro/pv-prompts-non-sycophantic_Qwen2.5-32B-Instruct • Viewer • Updated Oct 28 • 769 • 11
cfierro/pv-prompts-sycophantic_Qwen2.5-32B-Instruct • Viewer • Updated Oct 28 • 769 • 9
cfierro/alignment_faking_harm_answers_chat • Viewer • Updated Oct 10 • 2.58k • 22
cfierro/alignment-faking-harm_Llama-2-7b-chat • Viewer • Updated Oct 10 • 361 • 5
cfierro/alpaca_Llama-2-7b-chat • Viewer • Updated Oct 10 • 375 • 11
cfierro/pv-prompts-non-sycophantic_Qwen2.5-1.5B-Instruct • Preview • Updated Oct 6 • 10
cfierro/ethical_world_affecting_cot-tags • Viewer • Updated Sep 12 • 803 • 4
cfierro/alpaca_chat • Viewer • Updated Sep 11 • 55.9k • 25
cfierro/alignment_faking_claude_completions • Viewer • Updated Sep 11 • 3.85k • 8
cfierro/safety-tuning-chat • Viewer • Updated Sep 11 • 4.71k • 4
cfierro/ethical_world_affecting_cot-same-mmlu • Viewer • Updated Sep 10 • 803 • 5
cfierro/ethical_world_affecting_cot • Viewer • Updated Sep 9 • 803 • 6
cfierro/tiny_mmlu_chat • Viewer • Updated Sep 9 • 385 • 5
cfierro/DirectHarm4-chat • Viewer • Updated Sep 5 • 400 • 14
cfierro/pv-prompts-non-evil_Llama-2-7b-chat-hf • Viewer • Updated Sep 4 • 566 • 14
cfierro/pv-prompts-evil_Llama-2-7b-chat-hf • Viewer • Updated Sep 4 • 566 • 9
cfierro/persona-vectors-eval-questions • Viewer • Updated Sep 2 • 40 • 6
cfierro/GSM-Danger_chat • Viewer • Updated Sep 1 • 100 • 3
cfierro/pv-prompts-sycophantic_Qwen2.5-1.5B-Instruct • Viewer • Updated Aug 31 • 519 • 14
cfierro/orca-math-qs • Viewer • Updated Aug 28 • 400k • 22 • 1
cfierro/orca-math-sycophancy-qs • Viewer • Updated Aug 28 • 400k • 7
cfierro/pv-prompts-non-sycophantic_Llama-2-7b-chat • Viewer • Updated Aug 27 • 939 • 6
cfierro/pv-prompts-sycophantic_Llama-2-7b-chat • Viewer • Updated Aug 27 • 939 • 16
cfierro/gsm8k_sycophancy_v2 • Viewer • Updated Aug 27 • 22.2k • 15
cfierro/personality-non-sycophancy • Viewer • Updated Aug 27 • 24.5k • 10
cfierro/pv-prompts-non-evil • Viewer • Updated Aug 26 • 779 • 7
cfierro/pv-prompts-evil • Viewer • Updated Aug 26 • 779 • 6
cfierro/ethical_world_affecting • Viewer • Updated Aug 26 • 803 • 7