You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Zlatorog-12B-Instruct-Beta

This model is a fine-tuned version of zidsi/MistralNemoCPT6 on the custom mix of SFT datasets.

Model description

More information needed

Intended uses & limitations

Research explore and have fun with Slovenian LLM :)

Training and evaluation data

Bad standard Slovenian benchmarks results but sometimes impresssive "real world" prompt responses :)

Reduced hallucinations rate on "Who is ...?" prompts.

Tools use to be evaluated

Up to 16k ctx should work OK, for longer contexts training data would be required to improve CPT Long stage

More information needed

GGUF

The HF model was coverted to GGUF using llama.cpp

Downloads last month
33
GGUF
Model size
12B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

4-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for zidsi/Zlatorog-12B-Instruct-Beta-GGUF

Quantized
(1)
this model

Evaluation results