12 58

HumanistAtypik

HumanistAtypik

AI & ML interests

None yet

Recent Activity

liked a Space 7 days ago

OpenEvals/every-leaderboards

liked a model 10 days ago

mistralai/Mistral-Small-4-119B-2603

liked a dataset 21 days ago

nebius/SWE-rebench-V2

View all activity

Organizations

None yet

liked a Space 7 days ago

Official Benchmarks Leaderboard 2026

🏆

Explore and compare AI model scores across official benchmarks

liked a model 10 days ago

mistralai/Mistral-Small-4-119B-2603

119B • Updated about 12 hours ago • 41.9k • 327

liked a dataset 21 days ago

nebius/SWE-rebench-V2

Viewer • Updated 6 days ago • 32.1k • 4.35k • 32

upvoted a collection 21 days ago

SWE-rebench-V2

Collection

SWE-rebench-V2 is a curated dataset of software-engineering tasks derived from real GitHub issues and pull requests. • 3 items • Updated 23 days ago • 7

liked a Space 29 days ago

Nanbeige 4.1 3B

🔮

Chat with Nanbeige AI locally in your browser

liked a model about 1 month ago

mistralai/Voxtral-Mini-4B-Realtime-2602

Automatic Speech Recognition • 4B • Updated 15 days ago • 740k • 727

upvoted a changelog about 1 month ago

Hugging Face Changelog

Community Evals and Benchmark Repositories

Feb 5

• 74

liked 2 models about 1 month ago

Lightricks/LTX-2

Image-to-Video • Updated 24 days ago • 1.1M • • 1.65k

Nanbeige/Nanbeige4.1-3B

Text Generation • 4B • Updated 1 day ago • 695k • • 1.02k

liked a model 4 months ago

mistralai/Mistral-Large-3-675B-Instruct-2512

Updated Dec 19, 2025 • 667 • 219

upvoted a collection 4 months ago

Ministral 3

Collection

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated Dec 2, 2025 • 159

liked a Space 4 months ago

Image Arena Leaderboard

📊

584

Image Generation and Image Editing Arena & Leaderboard

liked 2 models 5 months ago

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated Dec 23, 2025 • 121k • • 1.49k

DragonLLM/Dragon-3B-Base-alpha

4B • Updated Dec 12, 2025 • 115 • 8

liked a Space 5 months ago

LLM Performance Leaderboard

🐨

448

View the latest LLM performance leaderboard online

liked 2 datasets 5 months ago

theResearchNinja/violentutf_cybersecurityBehavior

Viewer • Updated Jun 12, 2024 • 10k • 41 • 3

CounterBench/CounterBench

Preview • Updated Aug 4, 2025 • 38 • 1

liked 3 Spaces 5 months ago

Open VLM Video Leaderboard

🌎

131

VLMEvalKit Eval Results in video understanding benchmark

WebWalkerQALeaderboard

🥇

Display leaderboard for AI models

LVBench Leaderboard

🐨

Submit and view model evaluations