ollama

mirror of https://github.com/ollama/ollama.git synced 2025-05-10 18:06:33 +02:00

History

Devon Rifkin 77f4594e80 WIP thinking API support - Allows specifying whether thinking mode should be on or not - Templates get passed a new option so, e.g., qwen3's template can put `/think` or `/no_think` in the system prompt depending on the value of the setting - Add parsing for thinking blocks in both streaming/non-streaming mode - Update the CLI to make use of these changes TODO: - [ ] Don't parse thinking blocks when the user doesn't explicitly set the option, to maintain backwards compatibility - [ ] Warning on CLI when using a non-thinking/older version of a model (with an old template) - [ ] Wire up capabilities fully - [x] Unify parsing for streaming/non-streaming - [ ] Update templates - [ ] Update python/js libraries - [ ] How to handle differences in models wrt defaults and whether or not the thinking ability can even be controlled. If not specified by the user, should there be a default or should the template be able to check if it was explicitly set?		2025-05-07 16:15:46 -07:00
..
internal	fix superfluous call to WriteHeader	2025-04-25 16:58:49 -07:00
testdata/tools	all: fix typos in documentation, code, and comments (#7021 )	2024-12-10 12:58:06 -08:00
auth.go	fix nil deref in auth.go	2024-07-26 14:14:48 -07:00
create.go	explicitly decode maxarraysize 1024	2025-04-25 16:59:01 -07:00
create_test.go	server: validate local path on safetensor create (#9379 )	2025-02-28 16:10:43 -08:00
download.go	server: organize error types (#9465 )	2025-03-28 11:50:22 -07:00
fixblobs.go	server: replace blob prefix separator from ':' to '-' (#3146 )	2024-03-14 20:18:06 -07:00
fixblobs_test.go	server: replace blob prefix separator from ':' to '-' (#3146 )	2024-03-14 20:18:06 -07:00
images.go	WIP thinking API support	2025-05-07 16:15:46 -07:00
images_test.go	api: return model capabilities from the show endpoint (#10066 )	2025-04-01 15:21:46 -07:00
layer.go	One corrupt manifest should not wedge model operations (#7515 )	2024-11-05 14:21:45 -08:00
manifest.go	One corrupt manifest should not wedge model operations (#7515 )	2024-11-05 14:21:45 -08:00
manifest_test.go	One corrupt manifest should not wedge model operations (#7515 )	2024-11-05 14:21:45 -08:00
model.go	explicitly decode maxarraysize 1024	2025-04-25 16:59:01 -07:00
model_test.go	Update the /api/create endpoint to use JSON (#7935 )	2024-12-31 18:02:30 -08:00
modelpath.go	server: organize error types (#9465 )	2025-03-28 11:50:22 -07:00
modelpath_test.go	server: more support for mixed-case model names (#8017 )	2024-12-11 15:29:59 -08:00
prompt.go	WIP thinking API support	2025-05-07 16:15:46 -07:00
prompt_test.go	WIP thinking API support	2025-05-07 16:15:46 -07:00
routes.go	WIP thinking API support	2025-05-07 16:15:46 -07:00
routes_create_test.go	next ollama runner (#7913 )	2025-02-13 16:31:21 -08:00
routes_delete_test.go	Update the /api/create endpoint to use JSON (#7935 )	2024-12-31 18:02:30 -08:00
routes_generate_test.go	WIP thinking API support	2025-05-07 16:15:46 -07:00
routes_list_test.go	Update the /api/create endpoint to use JSON (#7935 )	2024-12-31 18:02:30 -08:00
routes_test.go	strip out thinking tags in message history for qwen3 & r1 (#10490 )	2025-04-30 13:57:45 -07:00
sched.go	Fix "Stopping..." scheduler hang (#10487 )	2025-04-30 11:26:52 -07:00
sched_test.go	Revert "increase default context length to 4096 (#10364 )"	2025-04-28 16:54:11 -07:00
sparse_common.go	Don't hard fail on sparse setup error	2024-08-09 12:16:19 -07:00
sparse_windows.go	Don't hard fail on sparse setup error	2024-08-09 12:16:19 -07:00
thinking.go	WIP thinking API support	2025-05-07 16:15:46 -07:00
thinking_test.go	WIP thinking API support	2025-05-07 16:15:46 -07:00
upload.go	server: always print upload/download part info (#8832 )	2025-02-04 19:30:49 -08:00