image-rs: use index for layers store path #902

squarti · 2025-02-10T20:30:52Z

This PR associates an index for a given image layer and uses the index to build the layer store path. This reduces the path length significantly. The next available layer index is computed by reading the existent directories in the data directory. This is done during pull client initialization.

Fixes: #901

mkulke

awesome bug report, thanks. Can you describe in more detail what you are doing in your fix in the PR description?

squarti · 2025-02-11T20:50:23Z

A possible issue with the approach taken is that pull client cannot run in parallel since the layer index is managed in memory. Although, this already seems to be an issue since the meta store is only saved at the end of pull_image.

Xynnn007

Thanks for the fix. Btw I did not find any place that shows mount does not support data longer than 4096 bytes although the behavior shows this.

Another thing I worry about is synchronization issues. Now ImageClient can respond to multiple pull_image requests asynchronously. Each pull_image request will create a PullClient. Assuming multiple pull_image requests are initiated at the same time, it will be possible to obtain multiple identical or close layers_index, and overwriting might occur.

The previous method of using hash values to name layers can avoid this risk assuming that the hashes do not collide. This is possible if an auto-incrementing number is used.

If number is used, do we need to consider using PullClient as part of ImageClient and perform operations such as async_pull_layers and pull_manifest exclusively and atomically?

squarti · 2025-02-12T15:06:24Z

Would moving the counter to ImageClient as above address these issues?

ChengyuZhu6 · 2025-02-13T02:42:01Z

I'm thinking would it be possible to use the footprint instead of the full hash value as the layer store name? For example, changing from run/image-rs/layers/sha256_055008870f0b0eef21dc8f9be651f07c3f52fa33724d899f8d14bded0a4fa38d to run/image-rs/layers/sha256_055008870f0b?

Xynnn007 · 2025-02-13T02:45:13Z

I'm thinking would it be possible to use the footprint instead of the full hash value as the layer store name? For example, changing from run/image-rs/layers/sha256_055008870f0b0eef21dc8f9be651f07c3f52fa33724d899f8d14bded0a4fa38d to run/image-rs/layers/sha256_055008870f0b?

I have considered it before, but considering that the probability of collision will increase exponentially after truncating the hash value, it may cause unexpected bugs in extreme cases. If we use the self-increasing index as proposed, layer collisions can be absolutely avoid.

mkulke · 2025-02-13T10:43:48Z

is there a way to write this code in a way that we can unit-test the construction of the layer storage paths? this would be good also for specifying what we are doing.

Xynnn007

LGTM. @mkulke PTAL

image-rs/src/image.rs

image-rs/src/layer_store.rs

ChengyuZhu6

LGTM, thanks @squarti . While I'm concerned about the ci failure with ubuntu-24.04-arm.

fitzthum · 2025-02-18T16:55:46Z

CI Green. @mkulke are we good to merge this?

mkulke

at the moment we're just logging the errors instead of bailing out. since this is a sensitive area of the code, I would prefer if we would propagate the errors and fail on anything that we encounter and do not expect, i.e. get_layers_index() should return a Result<> since it's a fallible operation.

This PR creates a layer store abstraction which uses an incrementing index as a unique store path for image layers instead of layer digests. This reduces the layer path length significantly allowing more layers to be mounted given the 4096 page size limit of mount command. Fixes: confidential-containers#901 Signed-off-by: Silenio Quarti <silenio_quarti@ca.ibm.com>

mkulke

thanks for accommodating, a few nits.

it's not great that we swallow the errors at the upper layers now, but apparently that is also what we do with the meta store. It's still an improvement because we don't get something undefined on malformed images, but a predictable and unusuable 0-ed out layerstore. if we unwrap we still need to log the errors though (see suggestion), otherwise it won't be fun to debug.

image-rs/src/layer_store.rs

image-rs/src/image.rs

Co-authored-by: Magnus Kulke <mkulke@gmail.com> Signed-off-by: squarti <silenio_quarti@ca.ibm.com>

Signed-off-by: Silenio Quarti <silenio_quarti@ca.ibm.com>

squarti force-pushed the layers-index branch from e4a03f0 to 383fccf Compare February 10, 2025 20:33

squarti marked this pull request as ready for review February 11, 2025 19:29

squarti requested a review from a team as a code owner February 11, 2025 19:29

mkulke reviewed Feb 11, 2025

View reviewed changes

Xynnn007 reviewed Feb 12, 2025

View reviewed changes

squarti force-pushed the layers-index branch 4 times, most recently from 5b30e95 to 2f85db2 Compare February 12, 2025 14:58

squarti force-pushed the layers-index branch 11 times, most recently from 0259c8c to 06eac92 Compare February 12, 2025 22:52

squarti force-pushed the layers-index branch 5 times, most recently from 1e9be67 to 2e2de58 Compare February 13, 2025 19:09

squarti force-pushed the layers-index branch from f693522 to 97eda8f Compare February 13, 2025 20:29

squarti requested review from mkulke and Xynnn007 February 13, 2025 21:01

Xynnn007 approved these changes Feb 14, 2025

View reviewed changes

mkulke reviewed Feb 14, 2025

View reviewed changes

squarti force-pushed the layers-index branch 12 times, most recently from cded759 to 4b1a11c Compare February 14, 2025 21:27

ChengyuZhu6 approved these changes Feb 17, 2025

View reviewed changes

mkulke reviewed Feb 18, 2025

View reviewed changes

squarti force-pushed the layers-index branch from 4b1a11c to 16ae685 Compare February 18, 2025 21:01

mkulke reviewed Feb 19, 2025

View reviewed changes

image-rs/src/layer_store.rs Outdated Show resolved Hide resolved

image-rs/src/image.rs Outdated Show resolved Hide resolved

squarti and others added 3 commits February 19, 2025 09:17

Update image-rs/src/layer_store.rs

c6e24ba

Co-authored-by: Magnus Kulke <mkulke@gmail.com> Signed-off-by: squarti <silenio_quarti@ca.ibm.com>

Update image-rs/src/image.rs

987e61a

Co-authored-by: Magnus Kulke <mkulke@gmail.com> Signed-off-by: squarti <silenio_quarti@ca.ibm.com>

image-rs: fix missing bracket and imports

091d191

Signed-off-by: Silenio Quarti <silenio_quarti@ca.ibm.com>

squarti force-pushed the layers-index branch from 20808d2 to 091d191 Compare February 19, 2025 14:32

mkulke approved these changes Feb 19, 2025

View reviewed changes

mkulke merged commit 7269cfd into confidential-containers:main Feb 19, 2025
8 checks passed

BbolroC mentioned this pull request Feb 21, 2025

Flaky Tests #918

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

image-rs: use index for layers store path #902

image-rs: use index for layers store path #902

squarti commented Feb 10, 2025 •

edited

Loading

mkulke left a comment

squarti commented Feb 11, 2025

Xynnn007 left a comment •

edited

Loading

squarti commented Feb 12, 2025 •

edited

Loading

ChengyuZhu6 commented Feb 13, 2025

Xynnn007 commented Feb 13, 2025

mkulke commented Feb 13, 2025

Xynnn007 left a comment

ChengyuZhu6 left a comment

fitzthum commented Feb 18, 2025

mkulke left a comment

mkulke left a comment

image-rs: use index for layers store path #902

image-rs: use index for layers store path #902

Conversation

squarti commented Feb 10, 2025 • edited Loading

mkulke left a comment

Choose a reason for hiding this comment

squarti commented Feb 11, 2025

Xynnn007 left a comment • edited Loading

Choose a reason for hiding this comment

squarti commented Feb 12, 2025 • edited Loading

ChengyuZhu6 commented Feb 13, 2025

Xynnn007 commented Feb 13, 2025

mkulke commented Feb 13, 2025

Xynnn007 left a comment

Choose a reason for hiding this comment

ChengyuZhu6 left a comment

Choose a reason for hiding this comment

fitzthum commented Feb 18, 2025

mkulke left a comment

Choose a reason for hiding this comment

mkulke left a comment

Choose a reason for hiding this comment

squarti commented Feb 10, 2025 •

edited

Loading

Xynnn007 left a comment •

edited

Loading

squarti commented Feb 12, 2025 •

edited

Loading