NOT SUITABLE FOR CHAT INFERENCE AS-IS

llama3.1-base

Llama 3.1 base weights with the special token embeddings transplanted from Llama 3.3.
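As a rough illustration of what "transplanting" embeddings means, here is a toy sketch in plain Python. It is not the procedure used to build this model, which is not described here. The helper `transplant_rows`, the tiny 4x2 matrices, and the choice of `special_ids` are all hypothetical; a real transplant would operate on the full embedding tensors of the two checkpoints.

```python
# Toy sketch: copy the embedding rows of selected token ids from a donor
# matrix into a copy of a base matrix. All names and values are hypothetical.

def transplant_rows(base, donor, row_ids):
    """Return a copy of `base` whose rows in `row_ids` come from `donor`."""
    if len(base) != len(donor):
        raise ValueError("base and donor must have the same number of rows")
    patched = [row[:] for row in base]  # copy so `base` is left untouched
    for i in row_ids:
        patched[i] = donor[i][:]
    return patched

# Stand-ins for the base and instruct embedding matrices (vocab=4, dim=2).
base = [[0.0, 0.0], [0.1, 0.1], [0.2, 0.2], [0.3, 0.3]]
donor = [[1.0, 1.0], [1.1, 1.1], [1.2, 1.2], [1.3, 1.3]]

# Assumption for the toy: the last two ids play the role of special tokens.
special_ids = [2, 3]

patched = transplant_rows(base, donor, special_ids)
```

Only the selected rows change; every other row keeps its base value, which is the property that lets the rest of the base model stay untouched.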

The model is unusable for chat in this base state. It is intended to make instruct QLoRA tuning of Llama 3.1 base easy.

There is no need to target the embeddings or lm_head during tuning, since the special token embeddings are already in place.
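A minimal sketch of an adapter configuration that follows this advice, assuming the `peft` library is used for tuning. The helper names `lora_kwargs` and `build_lora_config`, and the rank, alpha, and dropout values, are illustrative assumptions rather than settings recommended by this card. The module names are the standard Llama projection layers.

```python
# Sketch: LoRA settings that adapt only the attention and MLP projections,
# leaving embed_tokens and lm_head frozen. Hyperparameters are placeholders.

ATTENTION_AND_MLP = [
    "q_proj", "k_proj", "v_proj", "o_proj",  # attention projections
    "gate_proj", "up_proj", "down_proj",     # MLP projections
]

def lora_kwargs(rank=16, alpha=32, dropout=0.05):
    """Keyword arguments for peft.LoraConfig (hypothetical helper)."""
    return {
        "r": rank,
        "lora_alpha": alpha,
        "lora_dropout": dropout,
        "target_modules": list(ATTENTION_AND_MLP),
        "modules_to_save": None,  # embed_tokens / lm_head are not trained
        "bias": "none",
        "task_type": "CAUSAL_LM",
    }

def build_lora_config(**overrides):
    """Build a peft.LoraConfig; peft is imported lazily (assumed installed)."""
    from peft import LoraConfig
    return LoraConfig(**{**lora_kwargs(), **overrides})
```

Calling `build_lora_config()` yields a config that can be passed to `peft.get_peft_model` on a 4-bit quantized copy of the model. Leaving `modules_to_save` unset keeps the adapter small, because no full copies of the embedding or output matrices are stored.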

Safetensors
Model size: 71B params
Tensor type: BF16
