ollama

mirrors/ollama

mirror of https://github.com/ollama/ollama.git synced 2025-05-11 10:26:53 +02:00

Commit graph

Author	SHA1	Message	Date
Daniel Hiltgen	424810450f	Move quantization to new backend (#10363) * Move quantization logic to GGML via new backend This moves the model aware logic to Go code and calls GGMLs quantization code for model creation. * Remove "add model quantizations" This is no longer needed now that quantization is implemented in Go+GGML code directly.	2025-05-06 11:20:48 -07:00
Daniel Hiltgen	ed4e139314	Integration test improvements (#9654) Add some new test coverage for various model architectures, and switch from orca-mini to the small llama model.	2025-04-16 14:25:55 -07:00

2 commits