ollama/ml/backend/ggml
Daniel Hiltgen 424810450f
Move quantization to new backend (#10363)
* Move quantization logic to GGML via new backend

This moves the model aware logic to Go code and calls GGMLs quantization code for model creation.

* Remove "add model quantizations"

This is no longer needed now that quantization is implemented in Go+GGML code directly.
2025-05-06 11:20:48 -07:00
..
ggml all: fix cgo compiler warnings on windows (#10563) 2025-05-05 08:02:39 -07:00
ggml.go Move quantization to new backend (#10363) 2025-05-06 11:20:48 -07:00
quantization.go Move quantization to new backend (#10363) 2025-05-06 11:20:48 -07:00
threads.go ollama debug tensor 2025-03-11 14:49:19 -07:00
threads_debug.go ollama debug tensor 2025-03-11 14:49:19 -07:00