ollama

mirror of https://github.com/ollama/ollama.git synced 2025-05-11 02:16:36 +02:00

Devon Rifkin 77f4594e80 WIP thinking API support	2025-05-07 16:15:46 -07:00

- Allows specifying whether thinking mode should be on or not
- Templates get passed a new option so, e.g., qwen3's template can put `/think` or `/no_think` in the system prompt depending on the value of the setting
- Add parsing for thinking blocks in both streaming/non-streaming mode
- Update the CLI to make use of these changes

TODO:
- [ ] Don't parse thinking blocks when the user doesn't explicitly set the option, to maintain backwards compatibility
- [ ] Warning on CLI when using a non-thinking/older version of a model (with an old template)
- [ ] Wire up capabilities fully
- [x] Unify parsing for streaming/non-streaming
- [ ] Update templates
- [ ] Update python/js libraries
- [ ] How to handle differences in models wrt defaults and whether or not the thinking ability can even be controlled. If not specified by the user, should there be a default or should the template be able to check if it was explicitly set?
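
The commit message above mentions two mechanisms: a template option that switches between `/think` and `/no_think` in the system prompt, and parsing of thinking blocks out of model output. The following Go sketch illustrates both ideas in isolation. It is not the code in `template.go`; the `Think` field, the `promptData` type, the helper names, and the `<think>`/`</think>` delimiters are all assumptions made for illustration.

```go
package main

import (
	"fmt"
	"strings"
	"text/template"
)

// promptData is a hypothetical stand-in for the values handed to a template.
// The field name Think is an assumption, not taken from the repository.
type promptData struct {
	System string
	Think  bool
}

// systemTmpl appends /think or /no_think to the system prompt
// depending on the (hypothetical) Think option.
const systemTmpl = `{{ .System }}{{ if .Think }} /think{{ else }} /no_think{{ end }}`

// renderSystem executes the template with the given setting.
func renderSystem(system string, think bool) (string, error) {
	t, err := template.New("system").Parse(systemTmpl)
	if err != nil {
		return "", err
	}
	var sb strings.Builder
	if err := t.Execute(&sb, promptData{System: system, Think: think}); err != nil {
		return "", err
	}
	return sb.String(), nil
}

// splitThinking separates the first thinking block from the remaining content.
// The <think>...</think> delimiters are assumed here; real models may differ.
// If no complete block is found, the input is returned unchanged as content.
func splitThinking(s string) (thinking, content string) {
	const openTag, closeTag = "<think>", "</think>"
	start := strings.Index(s, openTag)
	if start < 0 {
		return "", s
	}
	rel := strings.Index(s[start:], closeTag)
	if rel < 0 {
		return "", s
	}
	end := start + rel
	thinking = strings.TrimSpace(s[start+len(openTag) : end])
	content = strings.TrimSpace(s[:start] + s[end+len(closeTag):])
	return thinking, content
}

func main() {
	prompt, err := renderSystem("You are helpful.", false)
	if err != nil {
		panic(err)
	}
	fmt.Println(prompt)

	thinking, content := splitThinking("<think>2+2 is 4</think>The answer is 4.")
	fmt.Println(thinking)
	fmt.Println(content)
}
```

This sketch only handles complete, non-streaming output. A streaming parser would additionally need to buffer partial tags across chunks, which the commit lists as unified with the non-streaming path.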
testdata	templates: add autotemplate for gemma3 (#9880)	2025-03-20 00:15:30 -07:00
alfred.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
alfred.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
alpaca.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
alpaca.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
chatml.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
chatml.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
chatqa.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
chatqa.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
codellama-70b-instruct.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
codellama-70b-instruct.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
command-r.gotmpl	convert: import support for command-r models from safetensors (#6063)	2025-01-15 16:31:22 -08:00
command-r.json	convert: import support for command-r models from safetensors (#6063)	2025-01-15 16:31:22 -08:00
falcon-instruct.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
falcon-instruct.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
gemma-instruct.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
gemma-instruct.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
gemma3-instruct.gotmpl	templates: add autotemplate for gemma3 (#9880)	2025-03-20 00:15:30 -07:00
gemma3-instruct.json	templates: add autotemplate for gemma3 (#9880)	2025-03-20 00:15:30 -07:00
granite-instruct.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
granite-instruct.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
index.json	templates: add autotemplate for gemma3 (#9880)	2025-03-20 00:15:30 -07:00
llama2-chat.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
llama2-chat.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
llama3-instruct.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
llama3-instruct.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
magicoder.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
magicoder.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
mistral-instruct.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
mistral-instruct.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
openchat.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
openchat.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
phi-3.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
phi-3.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
solar-instruct.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
solar-instruct.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
starcoder2-instruct.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
starcoder2-instruct.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
template.go	WIP thinking API support	2025-05-07 16:15:46 -07:00
template_test.go	next ollama runner (#7913)	2025-02-13 16:31:21 -08:00
vicuna.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
vicuna.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00
zephyr.gotmpl	update templates to use messages	2024-08-27 15:44:04 -07:00
zephyr.json	autodetect stop parameters from template	2024-07-12 16:01:23 -07:00