mirror of
https://github.com/ollama/ollama.git
synced 2025-05-10 18:06:33 +02:00
- Allows specifying whether thinking mode should be on or not - Templates get passed a new option so, e.g., qwen3's template can put `/think` or `/no_think` in the system prompt depending on the value of the setting - Add parsing for thinking blocks in both streaming/non-streaming mode - Update the CLI to make use of these changes TODO: - [ ] Don't parse thinking blocks when the user doesn't explicitly set the option, to maintain backwards compatibility - [ ] Warning on CLI when using a non-thinking/older version of a model (with an old template) - [ ] Wire up capabilities fully - [x] Unify parsing for streaming/non-streaming - [ ] Update templates - [ ] Update python/js libraries - [ ] How to handle differences in models wrt defaults and whether or not the thinking ability can even be controlled. If not specified by the user, should there be a default or should the template be able to check if it was explicitly set? |
||
---|---|---|
.. | ||
internal | ||
testdata/tools | ||
auth.go | ||
create.go | ||
create_test.go | ||
download.go | ||
fixblobs.go | ||
fixblobs_test.go | ||
images.go | ||
images_test.go | ||
layer.go | ||
manifest.go | ||
manifest_test.go | ||
model.go | ||
model_test.go | ||
modelpath.go | ||
modelpath_test.go | ||
prompt.go | ||
prompt_test.go | ||
routes.go | ||
routes_create_test.go | ||
routes_delete_test.go | ||
routes_generate_test.go | ||
routes_list_test.go | ||
routes_test.go | ||
sched.go | ||
sched_test.go | ||
sparse_common.go | ||
sparse_windows.go | ||
thinking.go | ||
thinking_test.go | ||
upload.go |