tele(taskbroker): check zlib compression viability #91693

Merged: john-z-yang merged 1 commit into master from john/check-compress-param on May 15, 2025

Conversation

john-z-yang
Member

No description provided.

github-actions bot added the "Scope: Backend" label (automatically applied to PRs that change backend components) on May 15, 2025
register(
    "taskworker.try_compress.profile_metrics",
    default=False,
    type=Bool,
Member

Should this be a float, so we could sample instead of compressing all or nothing?

Member Author

Sounds good, I made it a float. I also moved the uncompressed length metric outside of this option flag so we can get a better picture of the sizes we're dealing with.
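
One way to read the float option discussed above: treat the value as a sampling rate between 0 and 1, always record the uncompressed size, and only pay for compression on the sampled fraction of messages. The sketch below is a minimal illustration under those assumptions, not the code merged in this PR; `should_sample`, `measure_payload`, the `record` callback, and the short metric names are all hypothetical.

```python
import random
import zlib
from base64 import b64encode
from typing import Callable, Optional


def should_sample(rate: float, rng: Optional[random.Random] = None) -> bool:
    """Return True for roughly `rate` of calls (0.0 = never, 1.0 = always)."""
    source = rng if rng is not None else random
    # random() yields values in [0.0, 1.0), so the comparison is exact at both ends.
    return source.random() < rate


def measure_payload(
    payload: bytes,
    rate: float,
    record: Callable[[str, int], None],
    rng: Optional[random.Random] = None,
) -> None:
    """Record payload sizes; `record` stands in for a metrics client (hypothetical)."""
    # The uncompressed size is cheap to compute, so it is recorded for every message.
    record("uncompressed_bytes", len(payload))
    # Compression costs CPU time, so it only runs for the sampled fraction.
    if should_sample(rate, rng):
        record("compressed_bytes", len(b64encode(zlib.compress(payload))))
```

With a rate of 0.0 only the uncompressed size is recorded; with 1.0 every message is also compressed and measured.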

john-z-yang force-pushed the john/check-compress-param branch from acc3484 to b75f430 on May 15, 2025 16:32
getsentry deleted a comment from codecov bot on May 15, 2025
john-z-yang requested a review from markstory on May 15, 2025 16:35
john-z-yang marked this pull request as ready for review on May 15, 2025 16:35
john-z-yang requested a review from a team as a code owner on May 15, 2025 16:35
start_time = time.perf_counter()
metrics.distribution(
    "profiling.profile_metrics.compressed_bytes",
    len(b64encode(zlib.compress(message.payload.value))),
Member

So we're zipping a base64 encoded msgpack payload? 🤔 Wouldn't zlib.compress() return bytes which also need to be b64 encoded, as not all bytes can be JSON encoded?

Should we be zipping the msgpack, and then b64 encoding the zip? That might entirely defeat the benefits of zip though.

Member Author

Oh I did not know that message.payload.value is a msgpack payload. Why do we b64 encode it?

Ideally we should zip the bytes and then b64 encode the zipped values.

Member

We currently have to b64 encode the msgpack, as not all bytes can be json encoded 😢
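
To illustrate the constraint mentioned here: Python's standard `json` module refuses raw `bytes`, while base64 text round-trips cleanly at the cost of roughly a 4/3 size increase. This is a small sketch using only the standard library; the `raw` value is a made-up stand-in for a binary msgpack payload, not data from this PR.

```python
import json
from base64 import b64decode, b64encode

# Stand-in for a binary payload: every possible byte value once.
raw = bytes(range(256))

# json.dumps raises TypeError for bytes objects.
try:
    json.dumps({"payload": raw})
    raw_is_json_safe = True
except TypeError:
    raw_is_json_safe = False

# base64 maps every 3 input bytes to 4 ASCII characters, which JSON can carry as a string.
encoded = b64encode(raw).decode("ascii")
document = json.dumps({"payload": encoded})

# Decoding reverses both steps and recovers the original bytes.
roundtrip = b64decode(json.loads(document)["payload"])
```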

Member

I think I got myself mixed up here 😵. I see now that you're doing b64(zip(msgpack)), which is as good as we can get for now.

Member Author

all good
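
The ordering settled on in this thread, b64(zip(payload)), can be sketched as follows. This is an illustrative example rather than the PR's code: `compress_for_json` and `decompress_from_json` are hypothetical helper names, and the payload is a repetitive JSON-like byte string used as a stand-in because msgpack is a third-party dependency.

```python
import time
import zlib
from base64 import b64decode, b64encode


def compress_for_json(payload: bytes) -> bytes:
    """Compress first, then base64 encode so the result is JSON-safe ASCII."""
    return b64encode(zlib.compress(payload))


def decompress_from_json(encoded: bytes) -> bytes:
    """Reverse the steps in the opposite order: base64 decode, then decompress."""
    return zlib.decompress(b64decode(encoded))


# Stand-in payload (assumption): highly repetitive, so it compresses well.
payload = b'{"name": "example", "values": [1, 2, 3]}' * 200

# Time the compress-and-encode step, similar in spirit to the perf_counter usage in the diff.
start_time = time.perf_counter()
encoded = compress_for_json(payload)
elapsed_seconds = time.perf_counter() - start_time

# Compare against base64 encoding the uncompressed payload.
plain_b64_size = len(b64encode(payload))
compressed_b64_size = len(encoded)
```

Whether `compressed_b64_size` ends up smaller than `plain_b64_size` depends on how compressible the real payloads are. Base64 adds about a third on top of the compressed size, so data that barely compresses can come out larger than before, which is the kind of question the sampled metrics are meant to answer.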

Member

evanh left a comment

Makes sense to me, excited to see the results!

john-z-yang force-pushed the john/check-compress-param branch from 1acbc4f to 01b0a77 on May 15, 2025 21:36
john-z-yang merged commit 7511c29 into master on May 15, 2025
60 checks passed
john-z-yang deleted the john/check-compress-param branch on May 15, 2025 21:56
Labels
Scope: Backend Automatically applied to PRs that change backend components
3 participants