5 34 2

TNQ

LHPKAI

trannhatquy

AI & ML interests

NLP,CV,MLOps

Recent Activity

published a dataset about 1 month ago

OpenHay/zalo-ai-2025-training-data-v2

published a dataset about 1 month ago

OpenHay/zalo-ai-2025-public-test-data-v2

published a dataset about 1 month ago

OpenHay/zalo-ai-2025-new-training-data-v2

View all activity

Organizations

upvoted a paper 7 months ago

Vision Language Models are Biased

Paper • 2505.23941 • Published May 29 • 23

upvoted 3 articles 10 months ago

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Mar 4

•

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25

•

172

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20

•

320

upvoted an article 12 months ago

Article

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

Jan 2

•

upvoted a paper 12 months ago

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 86

upvoted 4 papers about 1 year ago

Adding Conditional Control to Text-to-Image Diffusion Models

Paper • 2302.05543 • Published Feb 10, 2023 • 57

Training Large Language Models to Reason in a Continuous Latent Space

Paper • 2412.06769 • Published Dec 9, 2024 • 92

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 159

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 45

upvoted an article about 1 year ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Jul 5, 2024

•

306

upvoted a paper about 1 year ago

Contextual Document Embeddings

Paper • 2410.02525 • Published Oct 3, 2024 • 24

upvoted a paper over 1 year ago

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12, 2024 • 71

upvoted 3 articles over 1 year ago

Article

Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens and 11 languages

May 24, 2024

•

Article

Hugging Face x LangChain : A new partner package

May 14, 2024

•

159

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

•

278

upvoted a paper over 1 year ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7, 2024 • 24

upvoted an article over 1 year ago

Article

RAG using huggingface tools

Jul 7, 2024

•

upvoted a collection over 1 year ago

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 875

upvoted an article over 1 year ago

Article

Mergoo: Efficiently Build Your Own MoE LLM

Jun 3, 2024

•

TNQ

AI & ML interests

Recent Activity

Organizations

LHPKAI's activity

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

FastRTC: The Real-Time Communication Library for Python

SmolVLM2: Bringing Video Understanding to Every Device

🐺🐦‍⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark

ColPali: Efficient Document Retrieval with Vision Language Models 👀

Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens and 11 languages

Hugging Face x LangChain : A new partner package

PaliGemma – Google's Cutting-Edge Open Vision Language Model

RAG using huggingface tools

Mergoo: Efficiently Build Your Own MoE LLM