400M_model_half_epoch

LoRA-finetuned Llama-400M model

Model Details

This model is a LoRA-finetuned version of YongganFu/Llama-400M-12L.

Usage

from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "lxaw/400M_model_half_epoch"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Example usage
input_text = "What is the capital of France?"
inputs = tokenizer(input_text, return_tensors="pt")
outputs = model.generate(inputs.input_ids, max_length=50)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Example Output

Input: What is the capital of France?

Output: What is the capital of France?

The capital of France is Paris, located in the heart of France. It is the largest city in France and is known for its rich history, beautiful architecture, delicious food, and friendly people.

Downloads last month
0
Safetensors
Model size
397M params
Tensor type
F32
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support