Default Branch

b828e18c75 · docker : fix vulkan build (#19352) · Updated 2026-02-05 12:10:39 +02:00

Branches

d93ff58322 · models : fix LFM2 tensors · Updated 2025-11-27 14:54:51 +02:00

772
1

05429433a1 · examples: add model-backend-compare tool to compare intermediate device tensors with CPU reference · Updated 2025-11-25 19:05:56 +02:00

794
1

72f80499ee · server : headers cleanup · Updated 2025-11-24 12:50:50 +02:00

854
5

722f9defe9 · vulkan: intel mmv fix attempt · Updated 2025-11-23 11:13:19 +02:00

816
1

c0b9903a1a · more readable · Updated 2025-11-20 18:45:37 +02:00

829
2

6cdda87baf · ci : disable op offload in some tests · Updated 2025-11-20 17:16:50 +02:00

861
3

4a83611773 · Revert "CANN: Add openEuler-cann in build and release (#17192)" · Updated 2025-11-18 11:00:05 +02:00

854
1

dba1cbceb3 · tune for RDNA3 · Updated 2025-11-16 21:21:22 +02:00

869
4

e6dbc81569 · metal : cap threadgroups size of set_rows · Updated 2025-11-10 16:17:09 +02:00

938
1
gg/fa-no-kq-pad
Some checks failed
Check Pre-Tokenizer Hashes / pre-tokenizer-hashes (push) Has been cancelled
Python check requirements.txt / check-requirements (push) Has been cancelled
Python Type-Check / pyright type-check (push) Has been cancelled
Update Operations Documentation / update-ops-docs (push) Has been cancelled

3ad533689c · ggml : remove KQ mask padding · Updated 2025-11-10 14:35:25 +02:00

940
1
compilade/convert-reflinks
Some checks failed
Check Pre-Tokenizer Hashes / pre-tokenizer-hashes (push) Has been cancelled
Python check requirements.txt / check-requirements (push) Has been cancelled
Python Type-Check / pyright type-check (push) Has been cancelled
Update Operations Documentation / update-ops-docs (push) Has been cancelled

2ef41855cf · convert : for FP8, use scale type to decide auto type · Updated 2025-11-07 05:55:53 +02:00

978
16
compilade/convert-safetensors-parse
Some checks failed
Check Pre-Tokenizer Hashes / pre-tokenizer-hashes (push) Has been cancelled
Python check requirements.txt / check-requirements (push) Has been cancelled
Python Type-Check / pyright type-check (push) Has been cancelled
Update Operations Documentation / update-ops-docs (push) Has been cancelled

e996f3aef8 · convert : fix no-lazy dtypes from direct safetensors · Updated 2025-11-07 05:33:09 +02:00

978
3

128118fdbe · convert : use F32 for dequant of pack-quantized tensors · Updated 2025-11-07 04:59:32 +02:00

978
6

23b70f4f70 · Initial plan · Updated 2025-11-04 13:00:12 +02:00

1006
1

79b98dbf96 · Merge branch 'master' into xsn/mtmd_custom_min_max_tokens · Updated 2025-11-02 23:14:03 +02:00

1021
2

d441c31b19 · metal : remove stray return · Updated 2025-11-02 18:24:00 +02:00

1030
9

d7f794eadb · convert : avoid dequantizing mxfp4 for GPT-OSS · Updated 2025-10-24 14:56:26 +03:00

1117
1
compilade/convert-prequant
Some checks failed
Check Pre-Tokenizer Hashes / pre-tokenizer-hashes (push) Has been cancelled
Python check requirements.txt / check-requirements (push) Has been cancelled
Python Type-Check / pyright type-check (push) Has been cancelled
Update Operations Documentation / update-ops-docs (push) Has been cancelled

93fbd407f3 · Merge branch 'master' into compilade/convert-prequant · Updated 2025-10-23 21:23:12 +03:00

1120
6

f0076dc5a0 · metal : adjust .get_alloc_size to be alloc friendly · Updated 2025-10-19 17:20:54 +03:00

1150
1

96f9f391c7 · ggml : fix unaligned access in AMX code · Updated 2025-09-29 10:37:15 +03:00

1330
1