ollama/fs/ggml
Devon Rifkin 7c94471d38 ggml: more accurate estimates for head count array case
Also standardized the approach by always treatting `HeadCount()` and
`HeadCountKV()` as arrays by filling them with the same value when
they're a scalar in the original GGUF
2025-04-10 16:28:34 -07:00
..
ggml.go ggml: more accurate estimates for head count array case 2025-04-10 16:28:34 -07:00
ggml_test.go fix: add back bf16 support 2025-02-25 19:26:14 +00:00
gguf.go next ollama runner (#7913) 2025-02-13 16:31:21 -08:00
type.go next ollama runner (#7913) 2025-02-13 16:31:21 -08:00