
[DERCBOT-919] Add question condensing LLM and prompt template #1827

Merged

Conversation

@assouktim assouktim commented Jan 17, 2025

Use Case / Functional

Lets the user configure the condensing prompt in their RAG chain. This prompt is used to reformulate the user's query from the latest messages in the dialog history, including chatbot messages, in order to contextualize the query.
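As an illustration of what question condensing does (a hedged sketch, not the actual Tock/orchestrator implementation — the template wording, function name, and `(role, text)` history shape are all assumptions):

```python
# Illustrative condensing prompt template: the LLM is asked to rewrite a
# follow-up question as a standalone question using recent dialog turns.
CONDENSING_PROMPT = (
    "Given the conversation history and a follow-up question, "
    "rephrase the follow-up question as a standalone question.\n\n"
    "History:\n{history}\n\n"
    "Follow-up question: {question}\n"
    "Standalone question:"
)

def build_condensing_prompt(history, question, max_messages=5):
    """Format the last `max_messages` dialog turns (user and bot alike)
    into the condensing prompt sent to the LLM."""
    recent = history[-max_messages:]  # keep only the most recent turns
    rendered = "\n".join(f"{role}: {text}" for role, text in recent)
    return CONDENSING_PROMPT.format(history=rendered, question=question)
```

The condensed standalone question returned by the LLM is then what gets embedded and sent to the vector store, instead of the raw (possibly elliptical) user message.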

For each bot using RAG settings, the following can now also be configured:

  • debugging of RAG messages, which can be enabled or disabled per call using debugEnabled
  • the maximum number of documents retrieved from the vector DB, using maxDocumentsRetrieved; previously this was only possible through a specific vector DB setting configuration in the studio
  • the number of messages taken from the dialog history to contextualize the user query, using maxMessagesFromHistory
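A hypothetical JSON-style payload illustrating the three options above. The field names come from this PR description; the surrounding structure and the sample values are assumptions, not the actual DTO:

```python
# Sketch of a bot's RAG settings fragment with the new fields.
rag_settings = {
    "debugEnabled": False,        # per-call RAG debug traces, off by default
    "maxDocumentsRetrieved": 4,   # cap on documents fetched from the vector DB
    "maxMessagesFromHistory": 5,  # dialog turns used to contextualize the query
}
```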

Technical changes

This pull request includes significant changes to the BotAdminService and related models, primarily focused on enhancing the RAG (Retrieval-Augmented Generation) configuration and sentence generation functionalities. The most important changes include the addition of new settings for question condensing and answering, updates to the data transfer objects (DTOs), and modifications to the validation and deletion processes.

Enhancements to RAG Configuration:

  • Added new settings for questionCondensingLlmSetting, questionCondensingPrompt, questionAnsweringLlmSetting, and questionAnsweringPrompt in BotRAGConfigurationDTO to support more granular control over RAG processes.
  • Updated the deletion logic in BotAdminService to handle the new LLM settings for question condensing and answering.

Sentence Generation Updates:

  • Introduced prompt as a new field in BotSentenceGenerationConfigurationDTO to allow customizable prompts for sentence generation.
  • Modified CompletionService to utilize the new prompt field for generating sentences.

Validation and Testing:

  • Enhanced the validation logic in RAGValidationService to include checks for both question condensing and answering LLM settings.
  • Updated test cases in RAGServiceTest and RAGValidationServiceTest to reflect the new configuration fields and validation logic.
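The validation idea can be sketched as follows (a hedged illustration of checking both LLM settings; the function name, dict-based config, and `provider` field are assumptions, not the actual RAGValidationService API):

```python
# Check that both the condensing and answering LLM settings are present
# and minimally well-formed, accumulating errors instead of failing fast.
def validate_rag_config(config: dict) -> list[str]:
    errors = []
    for key in ("questionCondensingLlmSetting", "questionAnsweringLlmSetting"):
        setting = config.get(key)
        if setting is None:
            errors.append(f"missing {key}")
        elif not setting.get("provider"):
            errors.append(f"{key} has no provider")
    return errors
```

Accumulating errors lets the studio report every configuration problem in one round trip rather than one at a time.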

These changes collectively improve the flexibility and robustness of the BotAdminService, particularly in handling complex RAG configurations and sentence generation scenarios.

Breaking changes

  • tock_gen_ai_orchestrator_rag_debug_enabled is no longer used. Debug mode can only be activated directly from the RAG configuration.
  • tock_gen_ai_orchestrator_dialog_number_messages is no longer used. The number of dialog messages to be taken can be specified directly from the RAG configuration.
  • The VectorStoreSettingBase::k attribute is removed, as this information is now carried by the RAG configuration.

@assouktim assouktim force-pushed the feature/dercbot-919 branch 3 times, most recently from 7c6ddda to 0ac80d7 Compare January 27, 2025 09:44
@assouktim assouktim marked this pull request as ready for review January 27, 2025 09:45
@assouktim assouktim self-assigned this Jan 28, 2025
@assouktim assouktim changed the title [DERCBOT-919] Add condenseQuestion LLM and Prompt [DERCBOT-919] Add question condensing LLM and prompt template Jan 29, 2025
@Benvii Benvii left a comment

OK for me, but some breaking changes need to be documented. As we don't have any issue to put them on, add them to the PR description so the release script will be able to catch them.

We also need to open an issue on the tock helm repository to reference the environment variable changes.

Some files are conflicting; a rebase needs to be done.

@Benvii Benvii left a comment

LGTM, thanks 😀

@scezen scezen self-requested a review February 14, 2025 14:32
@Benvii Benvii left a comment

Sorry, changes need to be made as the documentation has moved; can you check it, please?


The doc is also present in French; please update it too. Sorry, I forgot it during the first review.


This doc doesn't exist at this location anymore. I don't understand why it's not generating a conflict; can you update the new file in FR and EN?

@Benvii Benvii left a comment

Thanks for the changes


Website is published: https://doc.tock.ai/tock/feature/dercbot-919/

@Benvii Benvii merged commit 02dd9e7 into theopenconversationkit:master Feb 24, 2025
2 of 3 checks passed
@Benvii Benvii deleted the feature/dercbot-919 branch February 24, 2025 09:53

🚀 Cleanup completed for PR and associated branch: feature/dercbot-919

@vsct-jburet vsct-jburet added this to the 24.9.8 milestone Mar 5, 2025