| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vlmoe 30B.A3B F16           |  56.89 GiB |    30.53 B | CPU        |      32 |           pp512 |          6.43 ± 1.27 |
| qwen3vlmoe 30B.A3B F16           |  56.89 GiB |    30.53 B | CPU        |      32 |           tg128 |          1.45 ± 0.38 |

build: unknown (0)

| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vlmoe 30B.A3B Q4_K - Medium |  17.28 GiB |    30.53 B | CPU        |      32 |           pp512 |        50.04 ± 11.48 |
| qwen3vlmoe 30B.A3B Q4_K - Medium |  17.28 GiB |    30.53 B | CPU        |      32 |           tg128 |          8.89 ± 2.02 |

build: unknown (0)

| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vlmoe 30B.A3B Q8_0          |  30.25 GiB |    30.53 B | CPU        |      32 |           pp512 |          9.18 ± 2.22 |
| qwen3vlmoe 30B.A3B Q8_0          |  30.25 GiB |    30.53 B | CPU        |      32 |           tg128 |          2.65 ± 1.28 |

build: unknown (0)

| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vlmoe 30B.A3B F16           |  56.89 GiB |    30.53 B | CPU        |      32 |           pp512 |          6.56 ± 1.69 |
| qwen3vlmoe 30B.A3B F16           |  56.89 GiB |    30.53 B | CPU        |      32 |           tg128 |          2.14 ± 1.11 |

build: unknown (0)

| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vlmoe 30B.A3B Q4_K - Medium |  17.28 GiB |    30.53 B | CPU        |      32 |           pp512 |        50.24 ± 12.37 |
| qwen3vlmoe 30B.A3B Q4_K - Medium |  17.28 GiB |    30.53 B | CPU        |      32 |           tg128 |         11.11 ± 2.36 |

build: unknown (0)

| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vlmoe 30B.A3B Q8_0          |  30.25 GiB |    30.53 B | CPU        |      32 |           pp512 |          8.76 ± 1.75 |
| qwen3vlmoe 30B.A3B Q8_0          |  30.25 GiB |    30.53 B | CPU        |      32 |           tg128 |          2.24 ± 0.44 |

build: unknown (0)

| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vl 32B F16                  |  61.03 GiB |    32.76 B | CPU        |      32 |           pp512 |          9.43 ± 0.01 |
| qwen3vl 32B F16                  |  61.03 GiB |    32.76 B | CPU        |      32 |           tg128 |          0.68 ± 0.00 |

build: unknown (0)

| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vl 32B Q4_K - Medium        |  18.40 GiB |    32.76 B | CPU        |      32 |           pp512 |         34.60 ± 0.14 |
| qwen3vl 32B Q4_K - Medium        |  18.40 GiB |    32.76 B | CPU        |      32 |           tg128 |          2.20 ± 0.00 |

build: unknown (0)

| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vl 32B Q8_0                 |  32.42 GiB |    32.76 B | CPU        |      32 |           pp512 |         12.97 ± 0.04 |
| qwen3vl 32B Q8_0                 |  32.42 GiB |    32.76 B | CPU        |      32 |           tg128 |          1.27 ± 0.00 |

build: unknown (0)

| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vl 32B F16                  |  61.03 GiB |    32.76 B | CPU        |      32 |           pp512 |          9.41 ± 0.02 |
| qwen3vl 32B F16                  |  61.03 GiB |    32.76 B | CPU        |      32 |           tg128 |          0.68 ± 0.00 |

build: unknown (0)

| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vl 32B Q4_K - Medium        |  18.40 GiB |    32.76 B | CPU        |      32 |           pp512 |         34.55 ± 0.14 |
| qwen3vl 32B Q4_K - Medium        |  18.40 GiB |    32.76 B | CPU        |      32 |           tg128 |          2.19 ± 0.00 |

build: unknown (0)

| model                            |       size |     params | backend    | threads |            test |                  t/s |
| -------------------------------- | ---------: | ---------: | ---------- | ------: | --------------: | -------------------: |
| qwen3vl 32B Q8_0                 |  32.42 GiB |    32.76 B | CPU        |      32 |           pp512 |         12.99 ± 0.02 |
| qwen3vl 32B Q8_0                 |  32.42 GiB |    32.76 B | CPU        |      32 |           tg128 |          1.27 ± 0.00 |

build: unknown (0)