Zlatorog-12B-Instruct-Beta
This model is a fine-tuned version of zidsi/MistralNemoCPT6 on the custom mix of SFT datasets.
Model description
More information needed
Intended uses & limitations
Research explore and have fun with Slovenian LLM :)
Training and evaluation data
Bad standard Slovenian benchmarks results but sometimes impresssive "real world" prompt responses :)
Reduced hallucinations rate on "Who is ...?" prompts.
Tools use to be evaluated
Up to 16k ctx should work OK, for longer contexts training data would be required to improve CPT Long stage
More information needed
GGUF
The HF model was coverted to GGUF using llama.cpp
- Downloads last month
- 33
Hardware compatibility
Log In
to view the estimation
4-bit
8-bit
16-bit
Model tree for zidsi/Zlatorog-12B-Instruct-Beta-GGUF
Base model
zidsi/MistralNemoCPT6