Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
2
Mohammad Hossein Amani
masani
Follow
https://mh-amani.github.io/
MohammadHAmani
mh-amani
AI & ML interests
explainability, the science of deep learning
Recent Activity
updated
a model
21 days ago
masani/SFT_deepscaler-r3_Llama-3.2-3B_epoch_1_global_step_132
published
a model
21 days ago
masani/SFT_deepscaler-r3_Llama-3.2-3B_epoch_1_global_step_132
updated
a model
21 days ago
masani/SFT_DeepScaleR_Llama-3.2-3B_epoch_1_global_step_26
View all activity
Organizations
None yet
masani
's models
88
Sort: Recently updated
masani/SFT_gsm8k_Llama-2-7b-hf_epoch_0_global_step_0
Text Generation
•
7B
•
Updated
Apr 25, 2025
•
2
masani/SFT_logs_train.log
Updated
Apr 25, 2025
masani/SFT_math_Llama-2-7b-hf_epoch_2_global_step_58
Text Generation
•
7B
•
Updated
Apr 25, 2025
masani/SFT_math_Llama-2-7b-hf_epoch_1_global_step_29
Text Generation
•
7B
•
Updated
Apr 25, 2025
masani/SFT_math_Llama-2-7b-hf_epoch_0_global_step_0
Text Generation
•
7B
•
Updated
Apr 25, 2025
•
2
masani/SFT_math_Llama-3.2-1B_epoch_0_global_step_0
Updated
Apr 24, 2025
masani/SFT_gsm8k_Llama-3.2-1B_epoch_0_global_step_0
Updated
Apr 24, 2025
masani/SFTgsm8k_Llama-3.2-1B_epoch_0_global_step_0
Updated
Apr 24, 2025
masani/SFTlogs_train.log
Updated
Apr 24, 2025
masani/sft-gpt2-xl-gsm8k-epoch10-longsysprompt
Text Generation
•
2B
•
Updated
Apr 15, 2025
•
1
masani/sft-gpt2-xl-gsm8k-epoch9-longsysprompt
Text Generation
•
2B
•
Updated
Apr 15, 2025
•
1
masani/sft-gpt2-xl-gsm8k-epoch8-longsysprompt
Text Generation
•
2B
•
Updated
Apr 15, 2025
•
1
masani/sft-gpt2-xl-gsm8k-epoch7-longsysprompt
Text Generation
•
2B
•
Updated
Apr 15, 2025
•
1
masani/sft-gpt2-xl-gsm8k-epoch6-longsysprompt
Text Generation
•
2B
•
Updated
Apr 15, 2025
•
1
masani/sft-gpt2-xl-gsm8k-epoch5-longsysprompt
Text Generation
•
2B
•
Updated
Apr 15, 2025
•
1
masani/sft-gpt2-xl-gsm8k-epoch4-longsysprompt
Text Generation
•
2B
•
Updated
Apr 15, 2025
•
1
masani/sft-gpt2-xl-gsm8k-epoch3-longsysprompt
Text Generation
•
2B
•
Updated
Apr 15, 2025
•
1
masani/sft-gpt2-xl-gsm8k-epoch2-longsysprompt
Text Generation
•
2B
•
Updated
Apr 15, 2025
•
1
masani/sft-gpt2-xl-gsm8k-epoch1-longsysprompt
Text Generation
•
2B
•
Updated
Apr 15, 2025
masani/gsm8k-gpt2-xl_1epoch_2025-04-15_09-40-47-witheos
Updated
Apr 15, 2025
masani/gsm8k-gpt2-xl_1epoch_2025-04-11-witheos
Text Generation
•
2B
•
Updated
Apr 11, 2025
•
2
masani/2025-04-11_12-19-33
Updated
Apr 11, 2025
masani/gsm8k-gpt2-xl_1epoch_2025-04-11
Text Generation
•
2B
•
Updated
Apr 11, 2025
•
2
masani/2025-04-11_10-17-17
Updated
Apr 11, 2025
masani/epoch1
Text Generation
•
2B
•
Updated
Apr 3, 2025
•
1
masani/2025-04-02_14-52-39
Text Generation
•
2B
•
Updated
Apr 2, 2025
•
1
masani/2025-04-02_14-52-45
Text Generation
•
1B
•
Updated
Apr 2, 2025
•
3
masani/sft_checkpoints
Text Generation
•
2B
•
Updated
Mar 31, 2025
•
1
Previous
1
2
3
Next