[Detection Engine] Verify efficient usage of createPointInTimeFinder #211637

yctercero · 2025-02-18T19:09:23Z

In our plugins, the following files use createPointInTimeFinder and createPointInTimeFinderDecryptedAsInternalUser. We should analyze whether we are loading all results into memory and apply mitigations if possible, as described in this ticket: #203017.

x-pack/plugins/lists/server/services/exception_lists/find_value_list_exception_list_items_point_in_time_finder.ts
x-pack/plugins/lists/server/services/exception_lists/find_exception_list_point_in_time_finder.ts
x-pack/plugins/lists/server/services/exception_lists/find_exception_list_items_point_in_time_finder.ts

The text was updated successfully, but these errors were encountered:

elasticmachine · 2025-02-18T19:09:26Z

Pinging @elastic/security-detection-engine (Team:Detection Engine)

elasticmachine · 2025-02-18T19:09:26Z

Pinging @elastic/security-detections-response (Team:Detections and Resp)

elasticmachine · 2025-02-18T19:09:26Z

Pinging @elastic/security-solution (Team: SecuritySolution)

rylnd · 2025-02-19T23:49:23Z

Result of initial audit:

It looks like the delete_list_route does perform an unbounded (size) call to findValueListExceptionListItemsPointInTimeFinder; however it is scoped to a particular list_id, which mitigates the issue somewhat. A perPage of 1000 is applied to the PIT query, but code is still sensitive to a list with a large number of items, as they'd all be loaded into memory.
Again in delete_list_route, there is a similarly unbounded call to findExceptionListPointInTimeFinder. It is used when deleting exception items to verify that the item being deleted does not reference any other exception lists. When a list item is deleted, all exception lists referencing it will be loaded into memory here.
When duplicating exception lists, we call findExceptionListsItemPointInTimeFinder with a maxSize of 10k to retrieve all existing items to be duplicated to the new list. Since we have a set maxSize, I think we can discount this one?

To summarize the above: we do not have any situations where we attempt to retrieve all known lists/items into memory. However, when deleting an exception list, there are two instances where we perform an unbounded retrieval (of items referenced by a value list, and exception lists referenced by those items, respectively) of items linked to the exception list being deleted. Since there is no limit to the number of value lists referenced in an exception list, nor to the number of exception lists referencing a value list, the size is still effectively unbounded.

rylnd · 2025-02-20T17:39:55Z

@yctercero given the above, I can think of a few angles from which to address things:

Add caps on the existing unbounded calls, and throw an error if the queries' size would exceed that. This would bound the situation moving forward, at the expense of forcing users to deal with these large (or largely shared) lists manually (since they could no longer delete them via the API).
Continue with the existing behaviors, but somehow warn the user if they have lists exceeding 10k items/references
Continue with the existing behaviors, but add telemetry around the sizes of these calls

yctercero · 2025-02-24T05:25:50Z

@rylnd thanks so much for doing the analysis of our uses here.

Could you create a ticket for us to follow up on this and add to backlog?

A user could easily have more than 10k value list items. I'm not as worried about the number of value lists referenced in an exception list.

Add caps on the existing unbounded calls, and throw an error if the queries' size would exceed that. This would bound the situation moving forward, at the expense of forcing users to deal with these large (or largely shared) lists manually (since they could no longer delete them via the API).

This would be a breaking change and with value lists, they could very easily reach 10k+ so I'm a bit weary of going down this route.

Continue with the existing behaviors, but somehow warn the user if they have lists exceeding 10k items/references

What exactly would we warn them about? Would we need to create some kind of async route to deal with these instances?

Continue with the existing behaviors, but add telemetry around the sizes of these calls

We are prioritizing adding telemetry in 8.19, it may be worth adding a comment in this ticket to ensure we track this.

rylnd · 2025-02-25T23:33:26Z

@yctercero I created #212460; let me know if that looks good to you.

rylnd self-assigned this Feb 19, 2025

yctercero removed the impact:high Addressing this issue will have a high level of impact on the quality/strength of our product. label Feb 19, 2025

yctercero mentioned this issue Feb 24, 2025

[Meta] Verify efficient usage of createPointInTimeFinder #203017

Open

rylnd mentioned this issue Feb 25, 2025

[Security Solution][Exception Lists] Prevent loading of >10k value list items into memory #212460

Open

yctercero closed this as completed Feb 28, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Detection Engine] Verify efficient usage of createPointInTimeFinder #211637

[Detection Engine] Verify efficient usage of createPointInTimeFinder #211637

yctercero commented Feb 18, 2025 •

edited

Loading

elasticmachine commented Feb 18, 2025

elasticmachine commented Feb 18, 2025

elasticmachine commented Feb 18, 2025

rylnd commented Feb 19, 2025 •

edited

Loading

rylnd commented Feb 20, 2025

yctercero commented Feb 24, 2025

rylnd commented Feb 25, 2025

[Detection Engine] Verify efficient usage of createPointInTimeFinder #211637

[Detection Engine] Verify efficient usage of createPointInTimeFinder #211637

Comments

yctercero commented Feb 18, 2025 • edited Loading

elasticmachine commented Feb 18, 2025

elasticmachine commented Feb 18, 2025

elasticmachine commented Feb 18, 2025

rylnd commented Feb 19, 2025 • edited Loading

rylnd commented Feb 20, 2025

yctercero commented Feb 24, 2025

rylnd commented Feb 25, 2025

yctercero commented Feb 18, 2025 •

edited

Loading

rylnd commented Feb 19, 2025 •

edited

Loading