
Add support for max batch size for connectors #7274


Merged: 13 commits merged into dev on Apr 16, 2025

Conversation

@andrewmcgivery (Contributor) commented Apr 15, 2025

Add support for max batch size for connectors

[Four screenshots attached, 2025-04-15]

Checklist

Complete the checklist (and note appropriate exceptions) before the PR is marked ready-for-review.

  • Changes are compatible [1]
  • Documentation [2] completed
  • Performance impact assessed and acceptable
  • Tests added and passing [3]
    • Unit Tests
    • Integration Tests
    • Manual Tests

Exceptions

Note any exceptions here

Notes

Footnotes

  1. It may be appropriate to bring upcoming changes to the attention of other (impacted) groups. Please endeavour to do this before seeking PR approval. The mechanism for doing this will vary considerably, so use your judgement as to how and when to do this.

  2. Configuration is an important part of many changes. Where applicable please try to document configuration examples.

  3. Tick whichever testing boxes are applicable. If you are adding Manual Tests, please document the manual testing (extensively) in the Exceptions.

@svc-apollo-docs (Collaborator) commented Apr 15, 2025

✅ Docs preview has no changes

The preview was not built because there were no changes.

Build ID: c678b964f6e36d3aeba47100

Contributor

@andrewmcgivery, please consider creating a changeset entry in /.changesets/. These instructions describe the process and tooling.

router-perf bot commented Apr 15, 2025

CI performance tests

  • connectors-const - Connectors stress test that runs with a constant number of users
  • const - Basic stress test that runs with a constant number of users
  • demand-control-instrumented - A copy of the step test, but with demand control monitoring and metrics enabled
  • demand-control-uninstrumented - A copy of the step test, but with demand control monitoring enabled
  • enhanced-signature - Enhanced signature enabled
  • events - Stress test for events with a lot of users and deduplication ENABLED
  • events_big_cap_high_rate - Stress test for events with a lot of users, deduplication enabled and high rate event with a big queue capacity
  • events_big_cap_high_rate_callback - Stress test for events with a lot of users, deduplication enabled and high rate event with a big queue capacity using callback mode
  • events_callback - Stress test for events with a lot of users and deduplication ENABLED in callback mode
  • events_without_dedup - Stress test for events with a lot of users and deduplication DISABLED
  • events_without_dedup_callback - Stress test for events with a lot of users and deduplication DISABLED using callback mode
  • extended-reference-mode - Extended reference mode enabled
  • large-request - Stress test with a 1 MB request payload
  • no-tracing - Basic stress test, no tracing
  • reload - Reload test over a long period of time at a constant rate of users
  • step-jemalloc-tuning - Clone of the basic stress test for jemalloc tuning
  • step-local-metrics - Field stats that are generated from the router rather than FTV1
  • step-with-prometheus - A copy of the step test with the Prometheus metrics exporter enabled
  • step - Basic stress test that steps up the number of users over time
  • xlarge-request - Stress test with 10 MB request payload
  • xxlarge-request - Stress test with 100 MB request payload

@lennyburdette (Contributor) left a comment

i think this warrants an integration test next to the existing batch test (which will require a hand-edited supergraph until we release composition changes). nice work tho!

@@ -8,7 +8,7 @@ pub mod expand;
 mod header;
 mod id;
 mod json_selection;
-mod models;
+pub mod models;
Contributor

let's selectively expose types as pub instead of marking the whole module as pub if possible

Contributor Author

Done!

"supplied 'max_size' field in `@connect` directive's `batch` field is not a positive integer"
))?);
// Convert the int to a usize since it is used for chunking an array later.
// Much better to fail here than at run time.
Contributor

this code is part of the runtime!

Contributor Author

Bad wording on my part 😅

I think what I more meant here is during the request lifecycle? Aka... check this as early as we can (startup?) instead of it failing in the middle of a request.

Contributor Author

Updated the comment... let me know if this makes sense 😅
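The fail-fast conversion being discussed can be sketched as follows. This is only an illustration of the idea settled on above (validate the directive's `max_size` when the configuration is expanded, so a bad value fails at startup rather than mid-request); the function name is hypothetical, not the router's actual code, though the error message mirrors the snippet under review.

```rust
// Hypothetical startup-time check: convert the directive's i64 `max_size`
// into the `usize` needed later for chunking, rejecting zero and negative
// values so an invalid config fails at startup, not during a request.
fn parse_max_size(raw: i64) -> Result<usize, String> {
    usize::try_from(raw)
        .ok()
        .filter(|size| *size > 0)
        .ok_or_else(|| {
            "supplied 'max_size' field in `@connect` directive's `batch` field is not a positive integer"
                .to_string()
        })
}

fn main() {
    assert_eq!(parse_max_size(10), Ok(10));
    assert!(parse_max_size(0).is_err());
    assert!(parse_max_size(-1).is_err());
}
```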

@@ -55,6 +57,15 @@ pub(crate) struct SourceHTTPArguments {
pub(crate) headers: IndexMap<HeaderName, HeaderSource>,
}

/// Settings for the connector when it is doing a $batch entity resolver
#[cfg_attr(test, derive(Debug))]
pub(crate) struct SourceBatchArguments {
Contributor

why is it called Source BatchArguments?

Contributor Author

🤦 Good call lol

// Because we may have multiple batch entities requests, we should add to ENTITIES as the requests come in so it is additive
let entities = data
.entry(ENTITIES)
.or_insert(Value::Array(Vec::with_capacity(count)));
Contributor

this count is for the current chunk. i dunno if there's a way to get the original count of entity references here?

Contributor Author

So... it seems like it's the count of....

let count = responses.len();

Which is calculated prior to looping through the responses. 😕 Weird.


// If we've got a max_size set, chunk the batch into smaller batches. Otherwise, we'll default to just a single batch.
let max_size = connector.batch_settings.as_ref().and_then(|bs| bs.max_size);
let batches = max_size.map_or(vec![batch.clone()], |size| {
Contributor

vec![batch.clone()] seems like an unnecessary clone, but i can't solve it without rust-analyzer 😁

Contributor Author

It seems unnecessary but it complains that it can't be owned by both that line and the closure right below it.... making it borrowed causes all kinds of other errors 😅 I couldn't figure out any alternatives but open to suggestions!

(I tried asking ChatGPT and it said "If you must return owned Vec, then cloning or moving is necessary." 😅)

@lennyburdette (Contributor) commented Apr 16, 2025

    let batches = if let Some(size) = n {
        v.chunks(size).map(|v| v.to_vec()).collect()  
    } else {
        vec![v]
    };

i guess the borrow checker can't know that the closures passed to map_or are mutually exclusive. but an if/else definitely is!

Contributor Author

Well I suppose I could do it that way! That's actually what I originally started with and it didn't feel rust-y so I re-factored to what I have now 😆
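As a self-contained sketch of the if/else suggested above (illustrative names and element types, not the router's actual code): the two arms are visibly mutually exclusive to the borrow checker, so the batch is moved in one arm and chunked in the other with no extra `clone`.

```rust
// `max_size` is assumed to have been validated as positive elsewhere;
// `chunks(0)` would panic.
fn split_batch(batch: Vec<u32>, max_size: Option<usize>) -> Vec<Vec<u32>> {
    if let Some(size) = max_size {
        // Chunk into sub-batches of at most `size` elements.
        batch.chunks(size).map(|chunk| chunk.to_vec()).collect()
    } else {
        // No max size configured: send everything as a single batch.
        vec![batch]
    }
}

fn main() {
    assert_eq!(
        split_batch(vec![1, 2, 3, 4, 5], Some(2)),
        vec![vec![1, 2], vec![3, 4], vec![5]]
    );
    assert_eq!(split_batch(vec![1, 2], None), vec![vec![1, 2]]);
}
```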

ResponseKey::BatchEntity {
selection: selection.clone(),
inputs,
keys: keys.clone(),
Contributor

not for this PR, but we should rename this to key_field_set or something

Contributor Author

In just this function or on request and BatchEntity too?

Contributor

yeah, everywhere. it's a very confusing name

@andrewmcgivery andrewmcgivery marked this pull request as ready for review April 16, 2025 16:32
@andrewmcgivery andrewmcgivery requested review from a team as code owners April 16, 2025 16:32
@andrewmcgivery andrewmcgivery changed the title from "WIP: Add support for max batch size for connectors" to "Add support for max batch size for connectors" Apr 16, 2025
@andrewmcgivery andrewmcgivery merged commit cfad283 into dev Apr 16, 2025
15 checks passed
@andrewmcgivery andrewmcgivery deleted the feature/batchmaxsize branch April 16, 2025 16:51