llama.cpp

mirror of https://github.com/ggerganov/llama.cpp.git synced 2026-02-05 13:53:23 +02:00

Files

George e9a859db3c ggml: added cleanups in ggml_quantize_free (#19278)

Add missing cleanup calls for IQ2_S, IQ1_M quantization types and IQ3XS with 512 blocks during quantization cleanup.

2026-02-03 08:43:39 +02:00

.gitignore

2024-07-13 18:12:39 +02:00

CMakeLists.txt

2026-02-01 14:13:38 -08:00