[Bug]: Datastore/Firestore Bulk Delete template performance degradation #2195

Open
Abacn opened this issue Feb 14, 2025 · 1 comment
Labels
bug, needs triage, p2

Comments

Abacn commented Feb 14, 2025

Related Template(s)

Datastore/Firestore Bulk Delete template

Template Version

2021-12-06-00_rc01

What happened?

The DatastoreToDatastoreDelete template has shown a performance degradation since version "2021-12-06-00_rc01":

- *SDK: 2.32.0*
  - template_version: 2021-11-01-00_RC00
  - time: *3 min 15 sec*

- *SDK: 2.34.0* (data loss bug?)
  - template_version: 2021-12-06-00_rc01
  - time: *27 min 25 sec*

Because this template has long been deprecated, the current recommendation is to stay on version "2021-11-01-00_RC00". To do this, select the template path "gs://dataflow-templates/2021-11-01-00_RC00/Datastore_to_Datastore_Delete" when launching the template.
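For anyone launching from a script rather than the console, here is a minimal sketch of pinning the template path. It only builds the `gcloud dataflow jobs run` command line and does not execute it. The job name, region, project ID, and GQL query are hypothetical placeholders, and the parameter names (`datastoreReadGqlQuery`, `datastoreReadProjectId`, `datastoreDeleteProjectId`) are an assumption taken from the template's public documentation, so please verify them against the version you run.

```python
import shlex

# Values taken from this issue: the last template version without the slowdown.
BUCKET = "gs://dataflow-templates"
PINNED_VERSION = "2021-11-01-00_RC00"
TEMPLATE_NAME = "Datastore_to_Datastore_Delete"


def template_path(version=PINNED_VERSION):
    """Return the GCS path of the template for a given release version."""
    return f"{BUCKET}/{version}/{TEMPLATE_NAME}"


def build_launch_command(job_name, region, parameters, version=PINNED_VERSION):
    """Build (but do not run) a gcloud command that launches the pinned template.

    `parameters` is a dict of template parameters. Values containing commas
    would need gcloud's alternate-delimiter escaping, which this sketch
    does not handle.
    """
    joined = ",".join(f"{key}={value}" for key, value in parameters.items())
    return [
        "gcloud", "dataflow", "jobs", "run", job_name,
        "--gcs-location", template_path(version),
        "--region", region,
        "--parameters", joined,
    ]


if __name__ == "__main__":
    # All values below are hypothetical placeholders.
    command = build_launch_command(
        job_name="bulk-delete-example",
        region="us-central1",
        parameters={
            "datastoreReadGqlQuery": "SELECT * FROM ExampleKind",
            "datastoreReadProjectId": "example-project",
            "datastoreDeleteProjectId": "example-project",
        },
    )
    print(shlex.join(command))
```

Printing the command first makes it easy to confirm that the `--gcs-location` value points at the 2021-11-01-00_RC00 path before actually submitting the job.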

Relevant log output

Abacn added the bug, needs triage, and p2 labels Feb 14, 2025

Abacn commented Feb 14, 2025

I believe the root cause is apache/beam#14713. I checked that the RampupThrottling change is not in effect in 2021-11-01-00_RC00, whereas starting with 2021-12-06-00_rc01 the job graph of the template has changed.
