Instructions to use nayohan/llama3-instrucTrans-enko-8b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use nayohan/llama3-instrucTrans-enko-8b with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="nayohan/llama3-instrucTrans-enko-8b")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("nayohan/llama3-instrucTrans-enko-8b")
model = AutoModelForCausalLM.from_pretrained("nayohan/llama3-instrucTrans-enko-8b")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Inference
Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use nayohan/llama3-instrucTrans-enko-8b with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "nayohan/llama3-instrucTrans-enko-8b"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "nayohan/llama3-instrucTrans-enko-8b",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/nayohan/llama3-instrucTrans-enko-8b

SGLang

How to use nayohan/llama3-instrucTrans-enko-8b with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "nayohan/llama3-instrucTrans-enko-8b" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "nayohan/llama3-instrucTrans-enko-8b",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "nayohan/llama3-instrucTrans-enko-8b" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "nayohan/llama3-instrucTrans-enko-8b",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use nayohan/llama3-instrucTrans-enko-8b with Docker Model Runner:
```
docker model run hf.co/nayohan/llama3-instrucTrans-enko-8b
```

Evaluation method?

by arthurkim - opened Jun 14, 2024

Discussion

arthurkim

Jun 14, 2024

https://huggingface.co/nayohan/llama3-instrucTrans-enko-8b#%EB%AA%A8%EB%8D%B8-%ED%8F%89%EA%B0%80%EB%B0%A9%EB%B2%95

instrucTrans 모델의 번역 결과를 평가하기 위한 코드도 혹시 있을까요?

텍스트간의 유사도를 평가하는 것인가요?
예를들어 아래 케이스의 경우 ko_ref와 InstrucTrans 간의 유사도 평가를 진행하는건가요?

"en_ref":"This controversy arose around a new advertisement for the latest iPad Pro that Apple released on YouTube on the 7th. The ad shows musical instruments, statues, cameras, and paints being crushed in a press, followed by the appearance of the iPad Pro in their place. It appears to emphasize the new iPad Pro's artificial intelligence features, advanced display, performance, and thickness. Apple mentioned that the newly unveiled iPad Pro is equipped with the latest 'M4' chip and is the thinnest device in Apple's history. The ad faced immediate backlash upon release, as it graphically depicts objects symbolizing creators being crushed. Critics argue that the imagery could be interpreted as technology trampling on human creators. Some have also voiced concerns that it evokes a situation where creators are losing ground due to AI."
"ko_ref":"이번 논란은 애플이 지난 7일 유튜브에 공개한 신형 아이패드 프로 광고를 둘러싸고 불거졌다. 해당 광고 영상은 악기와 조각상, 카메라, 물감 등을 압착기로 짓누른 뒤 그 자리에 아이패드 프로를 등장시키는 내용이었다. 신형 아이패드 프로의 인공지능 기능들과 진화된 디스플레이와 성능, 두께 등을 강조하기 위한 취지로 풀이된다. 애플은 이번에 공개한 아이패드 프로에 신형 ‘M4’ 칩이 탑재되며 두께는 애플의 역대 제품 중 가장 얇다는 설명도 덧붙였다. 광고는 공개 직후 거센 비판에 직면했다. 창작자를 상징하는 물건이 짓눌려지는 과정을 지나치게 적나라하게 묘사한 점이 문제가 됐다. 기술이 인간 창작자를 짓밟는 모습을 묘사한 것으로 해석될 여지가 있다는 문제의식이다. 인공지능(AI)으로 인해 창작자가 설 자리가 줄어드는 상황을 연상시킨다는 목소리도 나왔다."
"InstrucTrans":"이번 논란은 애플이 지난 7일 유튜브에 공개한 최신 아이패드 프로 광고를 중심으로 불거졌다. 이 광고는 악기, 조각상, 카메라, 물감 등을 누르기 시작하는 장면과 함께 그 자리에 아이패드 프로가 등장하는 장면을 보여준다. 이는 새로운 아이패드 프로의 인공지능 기능, 고급 디스플레이, 성능, 두께를 강조하는 것으로 보인다. 애플은 이번에 공개한 아이패드 프로에 최신 'M4' 칩이 탑재됐으며, 애플 역사상 가장 얇은 기기라고 언급했다. 이 광고는 출시하자마자 크리에이터를 상징하는 물건이 파쇄되는 장면이 그대로 그려져 논란이 되고 있다. 비평가들은 이 이미지가 기술이 인간 크리에이터를 짓밟는다는 의미로 해석될 수 있다고 주장한다. 또한 AI로 인해 크리에이터들이 밀리고 있다는 상황을 연상시킨다는 우려의 목소리도 나온다."

nayohan

Owner Jun 21, 2024

안녕하세요. 답장이 늦어 죄송합니다. 평가는 ko_ref와 model prediction에 대해서 ScareBLEU를 사용하였습니다.
실험에 사용한 추론코드와 평가 코드 공유드립니다. 감사합니다.

nayohan

Owner Jun 21, 2024

python inference_translation_eeve.py -g 3 -d "eval_dataset/flores.csv" -m "yanolja/EEVE-Korean-Instruct-10.8B-v1.0"
python inference_translation_seagull.py -g 3 -d "eval_dataset/flores.csv" -m "kuotient/Seagull-13b-translation"
python inference_translation_kullm.py -g 3 -d "eval_dataset/flores.csv" -m "nlpai-lab/KULLM3"
python inference_translation_synatra.py -g 3 -d "eval_dataset/flores.csv" -m "maywell/Synatra-7B-v0.3-Translation"

# python inference_translation_base.py
import os
import argparse

parser = argparse.ArgumentParser()
parser.add_argument("-m", "--model_name", type=str, default="meta-llama/Meta-Llama-3-8B-Instruct")
parser.add_argument("-d", "--dataset_path", type=str,  default="gemini/ko-eng-dataset.csv")
parser.add_argument("-g", "--gpu_id", type=int,  default=0)
args = parser.parse_args()
print(args)
os.environ["CUDA_VISIBLE_DEVICES"]=str(args.gpu_id)

import torch
import evaluate
import pandas as pd

from datasets import load_dataset
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained(args.model_name)
# tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(
    args.model_name,
    # device_map="auto",
    torch_dtype=torch.bfloat16,
).to('cuda')
model.eval()

def apply_template(example):
    SYSTEM_PROMPT=f"당신은 번역기 입니다. 영어를 한국어로 번역하세요." # ours
    conversation = {"messages": [
                        {'role': 'system', 'content': SYSTEM_PROMPT},
                        {'role': 'user', 'content':example["en_ref"]}
                    ]}
    return conversation

# datasets
tc_dataset = load_dataset("csv", data_files=args.dataset_path, split="train")
dataset = tc_dataset.map(apply_template, remove_columns=tc_dataset.features, batched=False, num_proc=64)
print(dataset)

# inference
output_list = []
for idx, data in enumerate(dataset):
    inputs = tokenizer.apply_chat_template(data['messages'],tokenize=True, add_generation_prompt=True, return_tensors='pt').to("cuda")
    # print(tokenizer.batch_decode(inputs))
    outputs = model.generate(inputs,
                             pad_token_id=tokenizer.eos_token_id,
                             max_new_tokens=512)
    output_decode = tokenizer.decode(outputs[0][len(inputs[0]):], skip_special_tokens=True)
    print(f'{idx}:',output_decode)
    output_list.append(output_decode)

df = pd.DataFrame(tc_dataset)
df['ko_pred']=output_list
df = df[['ko_pred', 'ko_ref', 'en_ref', 'source']]

model_name = args.model_name.split('/')[-1]
output_path = 'inference_' + args.dataset_path.split('.')[-2]
print(output_path)
os.makedirs(output_path, exist_ok=True)
df.to_json(f'{output_path}/{model_name}_eval_result.json', lines=True, orient='records', force_ascii=False)

nayohan

Owner Jun 21, 2024

•

edited Jun 21, 2024

python eval_translation.py -i inference_eval_dataset/ko_news_eval40/nllb-finetuned-en2ko_eval_result.json
python eval_translation.py -i inference_eval_dataset/ko_news_eval40/EEVE-Korean-Instruct-10.8B-v1.0_eval_result.json
python eval_translation.py -i inference_eval_dataset/ko_news_eval40/Synatra-7B-v0.3-Translation_eval_result.json
python eval_translation.py -i inference_eval_dataset/ko_news_eval40/KULLM3_eval_result.json

# python eval_translation.py
import os
import argparse
import evaluate
import pandas as pd

parser = argparse.ArgumentParser()
parser.add_argument("-i", "--inference_path", type=str,  default="result/nayohanllama3-8b-it-translation-271k_eval_result.json")
args = parser.parse_args()
print(args)

# evaluate sacrebleu
metric = evaluate.load("sacrebleu")
def compute_metrics(eval_preds):
    decoded_preds, decoded_labels = eval_preds
    result = metric.compute(predictions=decoded_preds, references=decoded_labels)
    result = {"bleu": result["score"]}
    result = {k: round(v, 2) for k, v in result.items()}
    return result

# eval result to json
df = pd.read_json(args.inference_path, lines=True, orient='records')
result = []
for source in df['source'].unique():
    df_source = df[df['source']==source].reset_index(drop=True)
    eval_preds = [df_source['ko_pred'], df_source['ko_ref']]
    eval_result = compute_metrics(eval_preds)
    # print(eval_result)
    eval_result['source'] = source
    result.append(eval_result)

output_df = pd.DataFrame(result, columns=['source', 'bleu'])
output_df = output_df.sort_values(by=['source'])
print(output_df)
output_path = '/'.join(args.inference_path.split('/')[:-1]) + '/eval'
output_file = args.inference_path.split('/')[-1]
os.makedirs(output_path, exist_ok=True)
output_df.to_json(f'{output_path}/{output_file}', lines=True, orient='records', force_ascii=False)

nayohan

Owner Jun 21, 2024

make_eval_dataset.py

import pandas as pd
from datasets import load_dataset

# flores
eval_dataset = load_dataset('traintogpb/aihub-flores-koen-integrated-sparta-30k')
df = pd.DataFrame(eval_dataset['test'])
df = df.drop('ko_ref_xcomet', axis=1)
df.to_csv('eval_dataset/flores.csv', index=False)

# iwlst2023
iwlst_en_ko_ban = load_dataset('shreevigneshs/iwslt-2023-en-ko-train-val-split-0.1', split='f_test')
iwlst_en_ko_zon = load_dataset('shreevigneshs/iwslt-2023-en-ko-train-val-split-0.1', split='if_test')

df = iwlst_en_ko_ban.to_pandas()
df = df[["en", "ko"]]
df.columns=["en_ref", "ko_ref"]
df['source'] = 'iwlst_en_ko_ban'
df.to_csv('iwlst_en_ko_banmal.csv', index=False)#, encoding='utf-8-sig')
print(df)

df = iwlst_en_ko_zon.to_pandas()
df = df[["en", "ko"]]
df.columns=["en_ref", "ko_ref"]
df['source'] = 'iwlst_en_ko_zon'
df.to_csv('iwlst_en_ko_zondae.csv', index=False)#, encoding='utf-8-sig')
print(df)

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment