
feat: Add "think" parameter for Ollama #1948


Open
wants to merge 2 commits into main

Conversation

@Ryzhtus (Contributor) commented Jun 14, 2025

Related Issues

Proposed Changes:

Added an additional think parameter in accordance with Ollama’s current interface, and included a test to cover this feature.
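
For reference, the new parameter maps onto the think flag exposed by Ollama's REST API in recent releases (a sketch against the raw API, not this PR's code; payload keys follow Ollama's docs):

import requests

# Minimal call against Ollama's /api/chat endpoint with thinking enabled.
payload = {
    "model": "qwen3:1.7b",  # a thinking-capable model
    "messages": [{"role": "user", "content": "Why is the sky blue?"}],
    "think": True,          # the flag this PR surfaces as a component parameter
    "stream": False,
}
data = requests.post("http://localhost:11434/api/chat", json=payload, timeout=120).json()
print(data["message"]["thinking"])  # intermediate reasoning
print(data["message"]["content"])   # final answer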

How did you test it?

I added unit tests

Checklist

@Ryzhtus Ryzhtus requested a review from a team as a code owner June 14, 2025 16:50
@Ryzhtus Ryzhtus requested review from anakin87 and removed request for a team June 14, 2025 16:50
@github-actions github-actions bot added the integration:ollama and type:documentation labels Jun 14, 2025
@Ryzhtus (Contributor, Author) commented Jun 14, 2025

@anakin87 Hi! What's your opinion on storing the thinking field in the _meta attribute of ChatMessage? In OllamaChatGenerator the response is converted to a ChatMessage object, and its _meta value is formatted to be compatible with the OpenAI API. The problem is that OpenAI's ChatCompletion API doesn't support storing reasoning messages, so there is no appropriate field for it, and adding one breaks compatibility to some extent. Still, having it would probably be useful for users who want to track the reasoning process of their LLMs.
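
For illustration, a minimal sketch of that conversion (the response shape and the "thinking" key follow Ollama's chat API; the helper name is hypothetical):

from typing import Any, Dict

from haystack.dataclasses import ChatMessage

def build_chat_message(ollama_response: Dict[str, Any]) -> ChatMessage:
    # Ollama nests the reply under "message"; when think=True,
    # thinking-capable models also return a "thinking" field there.
    message = ollama_response["message"]
    meta = {
        # keys kept OpenAI-compatible, as the generator does today
        "model": ollama_response.get("model"),
        "finish_reason": ollama_response.get("done_reason"),
        # extra, non-OpenAI key holding the intermediate reasoning
        "thinking": message.get("thinking"),
    }
    return ChatMessage.from_assistant(message["content"], meta=meta)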

@anakin87 (Member) left a comment

Thanks for working on this...

  • I agree with your approach. For the moment, putting the thinking output in ChatMessage._meta.thinking is reasonable.

  • Please rebase your branch, fix conflicts and run tests.

  • I left some other comments.
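
For context, with ChatMessage._meta.thinking a user would read the intermediate output roughly like this (a sketch; the meta key name follows the discussion above):

from haystack.dataclasses import ChatMessage
from haystack_integrations.components.generators.ollama import OllamaChatGenerator

chat_generator = OllamaChatGenerator(model="qwen3:1.7b", think=True)
result = chat_generator.run([ChatMessage.from_user("Why is the sky blue?")])

reply = result["replies"][0]
print(reply.text)                  # the final answer
print(reply.meta.get("thinking"))  # the intermediate "thinking" output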

@@ -156,6 +161,7 @@ def __init__(
url: str = "http://localhost:11434",
generation_kwargs: Optional[Dict[str, Any]] = None,
timeout: int = 120,
think=False,
anakin87 (Member):

I would put this new parameter at the end, to make this change non-breaking.
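
Concretely, the non-breaking signature would look something like this (a sketch keeping only the parameters visible in the diff above):

def __init__(
    self,
    url: str = "http://localhost:11434",
    generation_kwargs: Optional[Dict[str, Any]] = None,
    timeout: int = 120,
    # ... other existing parameters stay in place ...
    think: bool = False,  # appended last, so existing positional calls keep working
):
    ...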

@@ -172,6 +178,8 @@ def __init__(
[Ollama docs](https://github.com/jmorganca/ollama/blob/main/docs/modelfile.md#valid-parameters-and-values).
:param timeout:
The number of seconds before throwing a timeout error from the Ollama API.
:param think:
Enables the model's "thinking" process.
anakin87 (Member):

I would expand this explanation to something like

Suggested change:
- Enables the model's "thinking" process.
+ If True, the model will "think" before producing a response.
+ Only [thinking models](https://ollama.com/search?c=thinking) support this feature.
+ The intermediate "thinking" output can be found in the `meta` property of the returned `ChatMessage`.

@@ -36,6 +36,7 @@ def __init__(
template: Optional[str] = None,
raw: bool = False,
timeout: int = 120,
think: bool = False,
anakin87 (Member):

We are trying to introduce new features in the Chat Generators only. In the long run, we may deprecate Generators and keep only Chat Generators.

For this reason, I would not introduce support for thinking in Generators.

@@ -508,6 +508,17 @@ def test_run_with_chat_history(self):
city.lower() in response["replies"][-1].text.lower() for city in ["Manchester", "Birmingham", "Glasgow"]
)

@pytest.mark.integration
def test_live_run_with_thinking(self):
chat_generator = OllamaChatGenerator(model="qwen3:1.7b", think=True)
anakin87 (Member):

To use this model in an integration test, you should also change the following line:

LLM_FOR_TESTS: "llama3.2:3b"

However, I would recommend using qwen3:0.6b if possible: based on my experiments, it works quite well with our tests and, being very small, it speeds up download and inference times.
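
Putting both remarks together, the finished test might look roughly like this (a sketch; the "thinking" meta key follows the naming discussed above):

@pytest.mark.integration
def test_live_run_with_thinking(self):
    chat_generator = OllamaChatGenerator(model="qwen3:0.6b", think=True)
    message = ChatMessage.from_user("Briefly, why is the sky blue?")

    response = chat_generator.run([message])

    reply = response["replies"][0]
    assert reply.text  # a non-empty final answer
    assert reply.meta.get("thinking")  # intermediate reasoning exposed in meta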

Successfully merging this pull request may close these issues.

Add Ollama's Thinking capabilities