Jolly-Q/llma31_base_33_ST


NOT SUITABLE FOR CHAT INFERENCE AS-IS

llama3.1-base

Llama 3.1 base weights with the special token embeddings transplanted from Llama 3.3.
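As a rough illustration of what transplanting embeddings means, the toy sketch below copies selected rows from a donor table into a copy of a base table. The card does not say which tensors were patched or which token ids were involved, so the function name, the tiny matrices, and the choice of ids are all hypothetical stand-ins, not the author's procedure.

```python
# Toy illustration only: tiny nested lists stand in for real embedding tables.
# Every name here is hypothetical and not taken from the model author's code.

def transplant_rows(base_rows, donor_rows, token_ids):
    """Return a copy of base_rows with the rows at token_ids taken from donor_rows."""
    if len(base_rows) != len(donor_rows):
        raise ValueError("both tables must cover the same vocabulary")
    patched = [list(row) for row in base_rows]  # copy so the base table is untouched
    for token_id in token_ids:
        patched[token_id] = list(donor_rows[token_id])
    return patched


# 6-token toy vocabulary; pretend ids 4 and 5 are the special tokens.
base = [[0.0, 0.0] for _ in range(6)]
donor = [[float(i), float(i)] for i in range(6)]
special_ids = [4, 5]

patched = transplant_rows(base, donor, special_ids)
print(patched[3])  # ordinary token: still the base row
print(patched[5])  # special token: now the donor row
```

With real checkpoints the same row copy would presumably be applied to the actual weight tensors, but that detail is an assumption here.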

Not usable for chat in its base state. Intended to make instruct QLoRA tuning of the Llama 3.1 base model easy.

No need to target embeddings or lm_head.
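A minimal sketch of an adapter config that follows this advice, assuming the `peft` library and the usual Llama projection module names in `transformers`. The hyperparameter values are placeholders rather than recommendations from this card, and 4-bit loading of the base weights is not shown.

```python
# Assumption: attention and MLP projection names used by Llama models in transformers.
# embed_tokens and lm_head are deliberately left out.
TARGET_MODULES = [
    "q_proj", "k_proj", "v_proj", "o_proj",
    "gate_proj", "up_proj", "down_proj",
]

# Placeholder hyperparameters, not values from the model card.
LORA_KWARGS = {
    "r": 16,
    "lora_alpha": 32,
    "lora_dropout": 0.05,
    "bias": "none",
    "task_type": "CAUSAL_LM",
    "target_modules": TARGET_MODULES,
}


def build_lora_config():
    """Build a peft LoraConfig from the kwargs above.

    peft is imported lazily so the constants can be inspected without it installed.
    """
    from peft import LoraConfig
    return LoraConfig(**LORA_KWARGS)
```

Note that `modules_to_save` is not set, so the embedding and output layers stay frozen and are not stored with the adapter.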


Safetensors · Model size: 71B params · Tensor type: BF16


Base model: meta-llama/Llama-3.1-70B