Thank you for this!

by arvnoodle - opened Sep 4, 2025

Sep 4, 2025

Tried it on VLLM. Pretty much working! I do have a question though, you think there's a possibility that the 405b hermes can be quantized to 4bit too?

cpatonn

cyankiwi org Sep 4, 2025

Thank you for trying my model :) I really would love to quantize the 405b hermes, but it does not fit on my local setup, and financial constraints do not allow me to rent cloud gpu, unfortunately.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment