chatllm.cpp gets support of this model

#3
by J22 - opened

chatllm.cpp supports Qwen3-ASR & Qwen3-ForcedAligner models.

main.exe -m .../qwen3-focedaligner-0.6b.bin --multimedia_file_tags {{ }} -i --set delimiter "|"
    ________          __  __    __    __  ___ 
   / ____/ /_  ____ _/ /_/ /   / /   /  |/  /_________  ____
  / /   / __ \/ __ `/ __/ /   / /   / /|_/ // ___/ __ \/ __ \
 / /___/ / / / /_/ / /_/ /___/ /___/ /  / // /__/ /_/ / /_/ /
 \____/_/ /_/\__,_/\__/_____/_____/_/  /_(_)___/ .___/ .___/
You are served by Qwen3-ForcedAligner,        /_/   /_/
with 601300992 (0.6B) parameters.

You  > {{audio:...\Downloads\asr_zh.wav}}η”šθ‡³|ε‡ΊηŽ°δΊ€ζ˜“|ε‡ δΉŽεœζ»ž|ηš„ζƒ…ε†΅γ€‚
A.I. > 0
00:00:00,400 --> 00:00:00,960
η”šθ‡³

1
00:00:00,960 --> 00:00:02,000
ε‡ΊηŽ°δΊ€ζ˜“

2
00:00:02,000 --> 00:00:02,879
ε‡ δΉŽεœζ»ž

3
00:00:02,879 --> 00:00:03,680
ηš„ζƒ…ε†΅γ€‚

i want to use this on windows, i downloaded the latest chatllm and a mp3 speech of Obama for example, but i don't have any text in output !
please submit links to download chatllm, qwen3-focedaligner-0.6b.bin and the .wav.
can i forgot something ?
thank you

@lbarasc , you can find quantized models here: https://modelscope.cn/models/judd2024/chatllm_quantized_qwen3/files

Or, you can quantize them on your own by using convert.py.

Sign up or log in to comment