Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
32
65
205
théo gigant
gigant
Follow
neurohacker352's profile picture
mihai-chindris's profile picture
zimamedia's profile picture
49 followers
·
50 following
https://giganttheo.github.io/
gigant_theo
giganttheo
theo-gigant
AI & ML interests
multimodal
Recent Activity
authored
a paper
26 days ago
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation
upvoted
a
paper
27 days ago
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation
submitted
a paper
27 days ago
Decoupling the Benefits of Subword Tokenization for Language Model Training via Byte-level Simulation
View all activity
Organizations
gigant
's models
51
Sort: Recently updated
gigant/bytes-tokenizer
Updated
Sep 18, 2025
gigant/led_tib
0.2B
•
Updated
Apr 29, 2025
•
2
gigant/SmolLM-mc4-500-rawrope
Text Generation
•
0.1B
•
Updated
Oct 9, 2024
•
1
gigant/SmolLM-mc4-500-ropescaled
Text Generation
•
0.1B
•
Updated
Oct 9, 2024
•
3
gigant/SmolLM-500-rawrope
Text Generation
•
0.1B
•
Updated
Oct 9, 2024
•
2
gigant/SmolLM-500-ropescaled
Text Generation
•
0.1B
•
Updated
Oct 8, 2024
•
2
gigant/SmolLM-135M-ft-500-steps
Text Generation
•
0.1B
•
Updated
Oct 8, 2024
•
5
gigant/SmolLM-135M-rescaled-ft-500-steps
Text Generation
•
0.1B
•
Updated
Oct 8, 2024
•
3
gigant/SmolLM-135M-unjetlagged-200-steps
Updated
Sep 9, 2024
gigant/SmolLM-135M-full-unjetlagged-2000-steps
Updated
Sep 6, 2024
gigant/SmolLM-135M-full-jetlagged-200-steps
Text Generation
•
0.1B
•
Updated
Sep 6, 2024
•
1
gigant/SmolLM-135M-full-unjetlagged-200-steps
Text Generation
•
0.1B
•
Updated
Sep 6, 2024
•
1
gigant/SmolLM-135M-jetlagged-200-steps
Updated
Sep 6, 2024
gigant/SmolLM-135M-unjetlagged
Updated
Sep 5, 2024
gigant/SmolLM-135M-scaled-rope-sw
Text Generation
•
0.1B
•
Updated
Aug 29, 2024
•
3
gigant/flan-t5fire-small
77M
•
Updated
Jul 25, 2024
•
2
gigant/graphlongt5-structural-dependency-0408
Updated
Apr 8, 2024
•
3
gigant/longt5-0322
Updated
Mar 25, 2024
•
2
gigant/graphlongt5-structural-0324
Updated
Mar 24, 2024
•
3
gigant/graphlongt5-dependency-0322
Updated
Mar 22, 2024
•
2
gigant/graphlongt5-globallocal-0322
Updated
Mar 22, 2024
•
2
gigant/graphlongt5-structural-0320
Updated
Mar 21, 2024
•
2
gigant/graphlongt5-dependency-0308
Updated
Mar 8, 2024
•
2
gigant/graphlongt5-globallocal-0308
Updated
Mar 8, 2024
•
2
gigant/longt5-0229
Updated
Feb 29, 2024
•
2
gigant/graphlongt5-globallocal-0228
Updated
Feb 28, 2024
•
2
gigant/graphlongt5-dependency-0228
Updated
Feb 28, 2024
•
3
gigant/longt5-global-3epoch
Updated
Feb 23, 2024
•
3
gigant/graph-t5-global-window-8k-longt5local
Updated
Feb 13, 2024
•
3
gigant/graph-t5-global-window-8k-tib
Updated
Jan 21, 2024
•
2
Previous
1
2
Next