chatllm.cpp now supports this model
#3, opened by J22
chatllm.cpp supports Qwen3-ASR & Qwen3-ForcedAligner models.
main.exe -m .../qwen3-focedaligner-0.6b.bin --multimedia_file_tags {{ }} -i --set delimiter "|"
    ________          __  __    __    __  ___
   / ____/ /_  ____ _/ /_/ /   / /   /  |/  /_________  ____
  / /   / __ \/ __ `/ __/ /   / /   / /|_/ // ___/ __ \/ __ \
 / /___/ / / / /_/ / /_/ /___/ /___/ /  / // /__/ /_/ / /_/ /
 \____/_/ /_/\__,_/\__/_____/_____/_/  /_(_)___/ .___/ .___/
You are served by Qwen3-ForcedAligner,        /_/   /_/
with 601300992 (0.6B) parameters.
You > {{audio:...\Downloads\asr_zh.wav}}甚至|出现交易|几乎停滞|的情况。
A.I. > 0
00:00:00,400 --> 00:00:00,960
甚至
1
00:00:00,960 --> 00:00:02,000
出现交易
2
00:00:02,000 --> 00:00:02,879
几乎停滞
3
00:00:02,879 --> 00:00:03,680
的情况。
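For anyone scripting around this, here is a minimal Python sketch of the two ends of the exchange above: composing the prompt (an audio tag followed by transcript segments joined with the delimiter) and reading the SRT-style reply back into structured cues. The names `build_prompt`, `parse_srt`, and `Cue` are hypothetical helpers for illustration, not part of chatllm.cpp. The sketch assumes the tag syntax `{{audio:...}}`, the `|` delimiter, and the `A.I. > ` reply prefix look exactly as in the session shown above.

```python
import re
from typing import List, NamedTuple, Optional, Tuple


class Cue(NamedTuple):
    index: int
    start_ms: int  # cue start, in milliseconds
    end_ms: int    # cue end, in milliseconds
    text: str


# Cue index line; the first one may carry the "A.I. > " prefix (assumed from the transcript).
_INDEX = re.compile(r"(?:A\.I\. >\s*)?(\d+)")
# SRT timing line such as "00:00:00,400 --> 00:00:00,960".
_TIMING = re.compile(
    r"(\d+):(\d{2}):(\d{2}),(\d{3})\s*-->\s*(\d+):(\d{2}):(\d{2}),(\d{3})"
)


def build_prompt(audio_path: str, segments: List[str], delimiter: str = "|") -> str:
    """Hypothetical helper: audio tag followed by delimiter-joined segments."""
    return "{{audio:" + audio_path + "}}" + delimiter.join(segments)


def _millis(h: str, m: str, s: str, ms: str) -> int:
    return ((int(h) * 60 + int(m)) * 60 + int(s)) * 1000 + int(ms)


def _cue_header(lines: List[str], i: int) -> Optional[Tuple[int, int, int]]:
    """Return (index, start_ms, end_ms) if lines[i] and lines[i+1] open a cue."""
    if i + 1 >= len(lines):
        return None
    idx = _INDEX.fullmatch(lines[i])
    timing = _TIMING.fullmatch(lines[i + 1])
    if not (idx and timing):
        return None
    g = timing.groups()
    return int(idx.group(1)), _millis(*g[:4]), _millis(*g[4:])


def parse_srt(output: str) -> List[Cue]:
    """Parse SRT-style cues; blank lines between cues are optional."""
    lines = [ln.strip() for ln in output.splitlines() if ln.strip()]
    cues: List[Cue] = []
    i = 0
    while i < len(lines):
        header = _cue_header(lines, i)
        if header is None:
            i += 1  # skip anything that does not open a cue
            continue
        # Cue text runs until the next cue header or the end of the input.
        j = i + 2
        text: List[str] = []
        while j < len(lines) and _cue_header(lines, j) is None:
            text.append(lines[j])
            j += 1
        cues.append(Cue(header[0], header[1], header[2], " ".join(text)))
        i = j
    return cues


# Timestamps taken from the first two cues above; the cue text is a placeholder.
sample = """A.I. > 0
00:00:00,400 --> 00:00:00,960
first
1
00:00:00,960 --> 00:00:02,000
second
"""
cues = parse_srt(sample)
```

Times are kept as integer milliseconds so comparisons stay exact. Cue text that spans several lines is joined with a single space, which may not be what you want for languages written without spaces.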
I want to use this on Windows. I downloaded the latest chatllm and, as an example, an MP3 of an Obama speech, but I don't get any text in the output!
Could you please share links to download chatllm, qwen3-focedaligner-0.6b.bin, and the .wav file?
Did I forget something?
Thank you.
@lbarasc, you can find quantized models here: https://modelscope.cn/models/judd2024/chatllm_quantized_qwen3/files
Alternatively, you can quantize them yourself using convert.py.