Nikita Kezins's picture

Nikita Kezins

entfane

·

AI & ML interests

LLM post-training, adversarial training, safety, knowledge transfer

Recent Activity

updated a model about 18 hours ago

entfane/jailbreak-cot-lin-probe

published a model about 18 hours ago

entfane/jailbreak-cot-lin-probe

updated a model about 18 hours ago

entfane/jailbreak-input-lin-probe

View all activity

Organizations

New activity in huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated 3 months ago

Как создавать изображения ?

#9 opened 3 months ago by

New activity in mistralai/Voxtral-Mini-4B-Realtime-2602 3 months ago

How to add another language ?

#22 opened 3 months ago by

TheRealTancrede

New activity in lmstudio-community/DeepSeek-R1-Distill-Qwen-7B-GGUF 6 months ago

🚩 Report: Ethical issue(s)

#4 opened about 1 year ago by

New activity in openai/gpt-oss-20b 6 months ago

so much censorship

#48 opened 9 months ago by

New activity in moonshotai/Kimi-K2-Thinking 6 months ago

Token Count Calculation in SFT Data Distribution Curation

#31 opened 6 months ago by

New activity in Qwen/Qwen2.5-3B 6 months ago

Is it actually a base model?

#6 opened 6 months ago by

New activity in openai/gpt-oss-20b 9 months ago

CUDA out of memory issues when running gptoss model on colab T4

#99 opened 9 months ago by

Not able to deploy gpt-oss-20b model in A100s

#124 opened 9 months ago by

Unable to load gpt-oss-20b on dual L40 (48GB) GPUs with vLLM

#136 opened 9 months ago by

New activity in ethicalabs/computer-says-no 9 months ago

Diversity of responses

#2 opened 9 months ago by

New activity in yasserrmd/gpt-oss-coder-20b 9 months ago

Reasoning effort during training

#1 opened 9 months ago by

New activity in openai/gpt-oss-20b 9 months ago

NVIDIA L40S GPU's for MXFP4 quantization

#100 opened 9 months ago by

question: setting reasoning effort

#66 opened 9 months ago by

New activity in QuixiAI/dolphin-r1 9 months ago

creation process?

#7 opened about 1 year ago by

New activity in openai/gpt-oss-20b 9 months ago

Thinking but no solution?

#54 opened 9 months ago by

OOM on 3090

#60 opened 9 months ago by

New activity in suriya7/t5-base-text-to-sql 10 months ago

french to sql model

#2 opened 10 months ago by

New activity in Qwen/Qwen3-Reranker-0.6B 10 months ago

reranker0.6b and embedding0.6b are the same model weights？

#6 opened 11 months ago by

New activity in ScienceOne-AI/S1-Base-8B 10 months ago

Benchmarks

#1 opened 10 months ago by

New activity in HuggingFaceTB/SmolLM2-135M-Instruct 10 months ago

Release of SFT tuned model

#8 opened over 1 year ago by