"error loading model: missing tok_embeddings.weight" with llama.cpp

by gardner - opened Jun 21, 2023

Hi there,
thanks for publishing all these models. I just tried this one out with llama.cpp. I pulled the latest from the repo, rebuilt with CUBLAS enabled, and ran:

./build/bin/main -m /home/user/src/llama.cpp/models/WizardCoder-15B/WizardCoder-15B-1.0.ggmlv3.q4_1.bin -p "Please tell me a story about a robot."

I tried the q4_0.bin file as well with the same result.

$ git log -1
commit 049aa16b8c5c6d086246e4e6b9feb18de4fbd663 (HEAD -> master, origin/master, origin/HEAD)
Author: Georgi Gerganov <[email protected]>
Date:   Tue Jun 20 19:05:54 2023 +0300

    readme : add link to p1

TheBloke

Owner Jun 21, 2023

These files are not compatible with llama.cpp. Please see the README for a list of software they will work with.

gardner changed discussion status to closed Jan 18, 2024
