ISTA-DASLab/Llama-3.2-1B-Instruct-W4A4-nvfp4-gptq-identity-transform-sft-fp_quant
Updated • 1
None defined yet.
MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning
GSQ: Highly-Accurate Low-Precision Scalar Quantization for LLMs via Gumbel-Softmax Sampling