Instructions to use beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B")

# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B", dtype="auto")

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker

docker model run hf.co/beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B

SGLang

How to use beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B",
		"prompt": "Once upon a time,",
		"max_tokens": 512,
		"temperature": 0.5
	}'

Unsloth Studio new

How to use beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B to start chatting

Load model with FastModel

pip install unsloth
from unsloth import FastModel
model, tokenizer = FastModel.from_pretrained(
    model_name="beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B",
    max_seq_length=2048,
)

Docker Model Runner
How to use beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B with Docker Model Runner:
```
docker model run hf.co/beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B
```
Browse Quantizations to use this model in llama.cpp, Ollama, LM Studio, or any compatible app.

KoAlpaca-RealQA-Solar-Ko-Recovery-11B (QLoRA with Unsloth)

Model Description

Developed by: Lee Junbum (Beomi)
Model type: Instruction Tuned, with beomi/KoAlpaca-RealQA dataset
Language(s) (NLP): Korean Mainly, partially English
License: Apache 2.0
Finetuned from model: beomi/Solar-Ko-Recovery-11B

Model Sources

Training Code (Google Colab, Pro+ A100 40G): https://colab.research.google.com/drive/11Ni8rOBmV1Qh15i7gMWncKjYBEdrJLBt
Inference Code (Google Colab): https://colab.research.google.com/drive/1hEPSHI4aGOn29Y21c6SWJc-y2ECVx3Bz?usp=sharing

Direct Use with Unsloth

# pip install -U hf_transfer unsloth
import os

os.environ["HF_HUB_ENABLE_HF_TRANSFER"] = "1" # download speed upto 1000MB/s

import torch
from unsloth import FastLanguageModel
from transformers import TextStreamer


model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B", # YOUR MODEL YOU USED FOR TRAINING
    max_seq_length = 2048,
    dtype = torch.bfloat16,
    load_in_4bit = True,
)
FastLanguageModel.for_inference(model) # Enable native 2x faster inference

alpaca_prompt = """Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

### Instruction:
{}

### Response:
{}"""

def gen(x):
    inputs = tokenizer(
    [
        alpaca_prompt.format(
            x.strip(), # instruction
            "", # output - leave this blank for generation!
        )
    ], return_tensors = "pt").to("cuda")

    text_streamer = TextStreamer(tokenizer)
    _ = model.generate(**inputs, streamer = text_streamer, max_new_tokens = 512)

Generation Example

Sample 01

gen("안녕하세요")

<s> Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

### Instruction:
안녕하세요

### Response:
안녕하세요! 어떻게 도와드릴까요?</s>

Sample 02

gen("""아래 글을 한국어로 번역해줘.
Dataset Summary

The KoAlpaca-RealQA dataset is a unique Korean instruction dataset designed to closely reflect real user interactions in the Korean language. Unlike conventional Korean instruction datasets that rely heavily on translated prompts, this dataset is composed of authentic Korean instructions derived from real-world use cases. Specifically, the dataset has been curated from user interactions with the ChatKoAlpaca service, which is based on the KoAlpaca model trained between 2023 and 2024.

This dataset provides a more accurate portrayal of typical Korean user behaviors, questions, and language structures, making it highly relevant for developing language models aimed at understanding and responding to Korean speakers. By leveraging GPT4o to generate high-quality answers, KoAlpaca-RealQA aims to offer a robust resource for training models that need to engage with Korean users in a natural and meaningful way.
""")

<s> Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

### Instruction:
아래 글을 한국어로 번역해줘.
Dataset Summary

The KoAlpaca-RealQA dataset is a unique Korean instruction dataset designed to closely reflect real user interactions in the Korean language. Unlike conventional Korean instruction datasets that rely heavily on translated prompts, this dataset is composed of authentic Korean instructions derived from real-world use cases. Specifically, the dataset has been curated from user interactions with the ChatKoAlpaca service, which is based on the KoAlpaca model trained between 2023 and 2024.

This dataset provides a more accurate portrayal of typical Korean user behaviors, questions, and language structures, making it highly relevant for developing language models aimed at understanding and responding to Korean speakers. By leveraging GPT4o to generate high-quality answers, KoAlpaca-RealQA aims to offer a robust resource for training models that need to engage with Korean users in a natural and meaningful way.

### Response:
KoAlpaca-RealQA 데이터셋은 한국어 사용자들의 실제 상호작용을 매우 잘 반영하도록 설계된 독특한 한국어 지시 데이터셋입니다. 번역된 프롬프트에 크게 의존하는 기존의 한국어 지시 데이터셋과 달리, 이 데이터셋은 실제 사용 사례에서 유래된 진정한 한국어 지시로 구성되어 있습니다. 특히, 이 데이터셋은 2023년과 2024년 사이에 훈련된 KoAlpaca 모델을 기반으로 한 ChatKoAlpaca 서비스와의 사용자 상호작용에서 수집되었습니다.

이 데이터셋은 일반적인 한국어 사용자의 행동, 질문 및 언어 구조를 더 정확하게 묘사하여, 한국어 사용자를 이해하고 응답하는 언어 모델을 개발하는 데 매우 유용합니다. GPT4o를 활용하여 고품질의 답변을 생성함으로써, KoAlpaca-RealQA는 자연스럽고 의미 있는 방식으로 한국어 사용자와 상호작용해야 하는 모델을 훈련시키기 위한 강력한 자원을 목표로 하고 있습니다.</s>

Sample 03

gen("""KoAlpaca에 대해 설명해줘.""")

<s> Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.

### Instruction:
KoAlpaca에 대해 설명해줘.

### Response:
KoAlpaca는 한국어 자연어 처리(NLP) 모델로, OpenAI의 GPT-3와 같은 대형 언어 모델(LLM)을 기반으로 합니다. KoAlpaca는 한국어 데이터를 주로 사용하여 학습되었기 때문에 한국어 텍스트를 이해하고 생성하는 데 특화되어 있습니다. 이 모델은 다양한 한국어 응용 프로그램에서 활용될 수 있으며, 예를 들어 대화형 AI, 번역, 요약, 질문 답변 등 여러 분야에서 사용될 수 있습니다.

KoAlpaca는 한국어 사용자에게 보다 자연스럽고 유창한 언어 상호작용을 제공하며, 한국어 문맥을 잘 이해하고 처리할 수 있도록 설계되었습니다. 이러한 모델은 한국어 NLP 연구와 산업에서 중요한 도구로 사용될 수 있습니다.</s>

Downloads last month: -; Downloads are not tracked for this model. How to track

Model tree for beomi/KoAlpaca-RealQA-Solar-Ko-Recovery-11B

Base model

upstage/SOLAR-10.7B-v1.0

Finetuned

beomi/Solar-Ko-Recovery-11B

Finetuned

(5)

this model

Quantizations

1 model

beomi
/

KoAlpaca-RealQA-Solar-Ko-Recovery-11B