ollama

mirror of https://github.com/ollama/ollama.git synced 2025-05-10 18:06:33 +02:00

History

Daniel Hiltgen 424810450f Move quantization to new backend (#10363)	2025-05-06 11:20:48 -07:00

* Move quantization logic to GGML via new backend

This moves the model-aware logic to Go code and calls GGML's quantization code for model creation.

* Remove "add model quantizations"

This is no longer needed now that quantization is implemented in Go+GGML code directly.
ggml	Move quantization to new backend (#10363)	2025-05-06 11:20:48 -07:00
backend.go	next ollama runner (#7913)	2025-02-13 16:31:21 -08:00
