mirror of
https://github.com/ollama/ollama.git
synced 2025-05-11 18:36:41 +02:00
Switch back to subprocessing for llama.cpp
This should resolve a number of memory leak and stability defects by allowing us to isolate llama.cpp in a separate process and shutdown when idle, and gracefully restart if it has problems. This also serves as a first step to be able to run multiple copies to support multiple models concurrently.
This commit is contained in:
parent
3b6a9154dd
commit
58d95cc9bd
35 changed files with 1416 additions and 1910 deletions
6
llm/llm_windows.go
Normal file
6
llm/llm_windows.go
Normal file
|
@ -0,0 +1,6 @@
|
|||
package llm
|
||||
|
||||
import "embed"
|
||||
|
||||
//go:embed build/windows/*/*/bin/*
|
||||
var libEmbed embed.FS
|
Loading…
Add table
Add a link
Reference in a new issue