[Obs AI Assistant] Add test for `get_dataset_info` #213231

sorenlouv · 2025-03-05T11:52:51Z

Add API test for get_dataset_info
Add apache synthtrace scenario

…g apache logs

elasticmachine · 2025-03-05T13:46:14Z

Pinging @elastic/obs-ai-assistant (Team:Obs AI Assistant)

elasticmachine · 2025-03-05T13:46:14Z

Pinging @elastic/obs-ux-infra_services-team (Team:obs-ux-infra_services)

github-actions · 2025-03-05T13:46:21Z

🤖 GitHub comments

Expand to view the GitHub comments

Just comment with:

/oblt-deploy : Deploy a Kibana instance using the Observability test environments.
run docs-build : Re-trigger the docs validation. (use unformatted text in the comment!)

sorenlouv · 2025-03-05T14:00:10Z

...latform/plugins/shared/observability_ai_assistant/server/functions/get_dataset_info/index.ts

-        };
-      }
+  try {
+    const name = indexPattern === '' ? ['*', '*:*'] : `${indexPattern.split(',')}*`;


@dgieselaar As discussed I've added a trailing wildcard. I think this is better although this still doesn't find zookeeper logs if the user asks "please show me my zookeeper logs" and they use datastreams with default naming, eg. indices will be something like .ds-logs-zookeeper.1-default-2025.03.04-000001 and the data stream will be logs-zookeeper.1-default.

To handle that we'd need both a leading and trailing wildcard (*${indexPattern.split(',')}*). I wouldn't mind that but you had some concerns?

Maybe we can:

first execute with the raw value for index (let's say logs)

if no indices are resolved, execute it with logs*

if still no indices are resolved, execute it with *logs*

we can also always add a *:... additionally.

_resolve/index should be relatively fast so I'm not worried about performance.

_resolve/index should be relatively fast so I'm not worried about performance.

That's also my thinking so in that case, why not just do *logs* up front to avoid the complexity?

Apart from code complexity it could also be confusing to users how it sometimes includes some indices depending on what other indices they have in their system, eg. if they have a datastream called zookeeper-legacy-metrics and another logs-zookeeper.1-default it will only include their logs datastream if they delete the legacy datastream.

That's also my thinking so in that case, why not just do logs up front to avoid the complexity?

Because you get stuff like backing indices. But if we can exclude those (I think we can) it's probably fine.

That's also my thinking so in that case, why not just do logs up front to avoid the complexity?

starting with a more-specific index pattern is not about performance, it's about making sure we don't send a lot of noise to the LLM. We could also filter progressively client side if that makes things easier?

Because you get stuff like backing indices. But if we can exclude those (I think we can) it's probably fine.

When would that be a problem? Say, in the example with "zookeeper" can you imagine an index that includes "zookeeper" but should not be included?

WDYT of this?

That way we do not match backing indices.

elasticmachine · 2025-03-05T15:54:33Z

💚 Build Succeeded

Buildkite Build
Commit: 61e5197
Kibana Serverless Image: docker.elastic.co/kibana-ci/kibana-serverless:pr-213231-61e5197da7a2

Metrics [docs]

✅ unchanged

History

Add test for get_dataset_info and synthtrace scenario for generatin…

124f214

…g apache logs

sorenlouv changed the title ~~Add test for get_dataset_info and synthtrace scenario for generating apache logs~~ [Obs AI Assistant] Add test for get_dataset_info and synthtrace scenario for generating apache logs Mar 5, 2025

sorenlouv added 2 commits March 5, 2025 12:54

Remove logs

51a5281

Replace apache logs with simple logs

f09f5ed

sorenlouv marked this pull request as ready for review March 5, 2025 13:34

sorenlouv requested review from a team as code owners March 5, 2025 13:34

sorenlouv changed the title ~~[Obs AI Assistant] Add test for get_dataset_info and synthtrace scenario for generating apache logs~~ [Obs AI Assistant] Add test for get_dataset_info Mar 5, 2025

Rename synthtrace scenario

ca55808

botelastic bot added ci:project-deploy-observability Create an Observability project Team:Obs AI Assistant Observability AI Assistant Team:obs-ux-infra_services Observability Infrastructure & Services User Experience Team labels Mar 5, 2025

sorenlouv added 2 commits March 5, 2025 14:49

Simplify Apache logs

4d0962c

Change comment

5f49d92

sorenlouv added release_note:skip Skip the PR/issue when compiling release notes v9.0.0 backport:version Backport to applied version labels v9.1.0 v8.19.0 labels Mar 5, 2025

sorenlouv commented Mar 5, 2025

View reviewed changes

sorenlouv added 2 commits March 5, 2025 15:41

Add api test

6d5d3f1

Refactor to helper methods

61e5197

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Obs AI Assistant] Add test for `get_dataset_info` #213231

[Obs AI Assistant] Add test for `get_dataset_info` #213231

sorenlouv commented Mar 5, 2025 •

edited

Loading

elasticmachine commented Mar 5, 2025

elasticmachine commented Mar 5, 2025

github-actions bot commented Mar 5, 2025

sorenlouv Mar 5, 2025

sorenlouv Mar 5, 2025

dgieselaar Mar 5, 2025

sorenlouv Mar 5, 2025

sorenlouv Mar 5, 2025

dgieselaar Mar 5, 2025

dgieselaar Mar 5, 2025

sorenlouv Mar 5, 2025

sorenlouv Mar 5, 2025 •

edited

Loading

sorenlouv Mar 5, 2025

elasticmachine commented Mar 5, 2025 •

edited

Loading

[Obs AI Assistant] Add test for get_dataset_info #213231

Are you sure you want to change the base?

[Obs AI Assistant] Add test for get_dataset_info #213231

Conversation

sorenlouv commented Mar 5, 2025 • edited Loading

elasticmachine commented Mar 5, 2025

elasticmachine commented Mar 5, 2025

github-actions bot commented Mar 5, 2025

🤖 GitHub comments

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sorenlouv Mar 5, 2025 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

elasticmachine commented Mar 5, 2025 • edited Loading

💚 Build Succeeded

Metrics [docs]

History

[Obs AI Assistant] Add test for `get_dataset_info` #213231

[Obs AI Assistant] Add test for `get_dataset_info` #213231

sorenlouv commented Mar 5, 2025 •

edited

Loading

sorenlouv Mar 5, 2025 •

edited

Loading

elasticmachine commented Mar 5, 2025 •

edited

Loading