chatllm.cpp gets support of this model

by J22 - opened about 1 month ago

chatllm.cpp supports Qwen3-ASR & Qwen3-ForcedAligner models.

main.exe -m .../qwen3-focedaligner-0.6b.bin --multimedia_file_tags {{ }} -i --set delimiter "|"
    ________          __  __    __    __  ___ 
   / ____/ /_  ____ _/ /_/ /   / /   /  |/  /_________  ____
  / /   / __ \/ __ `/ __/ /   / /   / /|_/ // ___/ __ \/ __ \
 / /___/ / / / /_/ / /_/ /___/ /___/ /  / // /__/ /_/ / /_/ /
 \____/_/ /_/\__,_/\__/_____/_____/_/  /_(_)___/ .___/ .___/
You are served by Qwen3-ForcedAligner,        /_/   /_/
with 601300992 (0.6B) parameters.

You  > {{audio:...\Downloads\asr_zh.wav}}甚至|出现交易|几乎停滞|的情况。
A.I. > 0
00:00:00,400 --> 00:00:00,960
甚至

1
00:00:00,960 --> 00:00:02,000
出现交易

2
00:00:02,000 --> 00:00:02,879
几乎停滞

3
00:00:02,879 --> 00:00:03,680
的情况。

lbarasc

21 days ago

i want to use this on windows, i downloaded the latest chatllm and a mp3 speech of Obama for example, but i don't have any text in output !
please submit links to download chatllm, qwen3-focedaligner-0.6b.bin and the .wav.
can i forgot something ?
thank you

J22

20 days ago

@lbarasc , you can find quantized models here: https://modelscope.cn/models/judd2024/chatllm_quantized_qwen3/files

Or, you can quantize them on your own by using convert.py.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment