
feat(build): Cache pulled artifacts #216

Open
jedevc wants to merge 1 commit into staging from jedevc/cache-registry-data

Conversation

jedevc (Member) commented Mar 20, 2026

If we're pulling the rootfs or kernel from an OCI registry, then we should probably cache it locally. There's no real reason to pull it from the harbor registry over and over again.

It's all content-hashed anyway.


Signed-off-by: Justin Chadwell <justin@unikraft.com>
// WrapCached wraps a File with caching support. If the file has a backing
// provider and descriptor, it returns a new ContentStoreFile that uses a
// pull-through cache at ~/.cache/unikraft. Otherwise, returns the original file.
func WrapCached(ctx context.Context, file spec.File) spec.File {
jedevc (Member, Author) commented:

Hm, we should only cache if it's fetched from a remote registry. If it's a local content store, then probably not.

jedevc (Member, Author) commented:

Is it okay to have no automatic cache cleanup? Probably not.

Copilot AI left a comment

Pull request overview

This PR adds a local pull-through cache for OCI-backed kernel/rootfs artifacts so repeated builds don’t re-pull identical (content-addressed) blobs from the registry.

Changes:

  • Introduces a WrapCached helper that wraps unikraft.com/x/image-spec files with a local containerd content-store cache under the user cache directory.
  • Uses the caching wrapper when assembling images in the builder.
  • Updates unikraft.com/x/image-spec dependency version (and promotes containerd/errdefs to a direct dependency).

Reviewed changes

Copilot reviewed 3 out of 4 changed files in this pull request and generated 2 comments.

  • internal/x/imagespec/cache.go: Implements pull-through caching via a local containerd content store and a proxy content.Provider.
  • internal/builder/build.go: Wraps kernel/initrd artifacts with the new cache-enabled file wrapper.
  • go.mod: Bumps unikraft.com/x/image-spec and makes github.com/containerd/errdefs a direct dependency.
  • go.sum: Updates checksums for the bumped unikraft.com/x/image-spec version.


Comment on lines +95 to +100
if desc.Size <= 0 {
	desc.Size = ra.Size()
}
if desc.Size <= 0 {
	return nil
}

Copilot AI Mar 20, 2026


cacheBlob returns nil even when it decides it cannot write anything (e.g., missing/unknown size). Callers currently interpret nil as "cached successfully". Consider returning a sentinel error or an additional boolean indicating whether a blob was actually written, so callers can safely decide whether to switch to the cache.

Comment on lines +83 to +85
// Close the upstream reader and return from the cache instead
ra.Close()
return p.cache.ReaderAt(ctx, desc)

Copilot AI Mar 20, 2026


In the cache-write success path, this closes the upstream reader and then blindly switches to p.cache.ReaderAt. If cacheBlob was a no-op (e.g., it returns nil when desc.Size <= 0), or if the cache read fails for any reason, this will return an error even though the upstream reader was valid (and has now been closed). Consider only closing upstream after successfully opening the cached reader, and falling back to the upstream reader when cache open fails; alternatively, make cacheBlob return an explicit "cached" boolean so the caller doesn’t assume the blob exists in the cache.

Suggested change

-// Close the upstream reader and return from the cache instead
-ra.Close()
-return p.cache.ReaderAt(ctx, desc)
+// Try to open a new reader from the cache after writing.
+cachedRA, err := p.cache.ReaderAt(ctx, desc)
+if err == nil {
+	// Successfully opened from cache: close upstream and return cached reader.
+	ra.Close()
+	log.G(ctx).Debug().
+		Str("digest", desc.Digest.String()).
+		Msg("content cache read after write")
+	return cachedRA, nil
+}
+// If the cache read after write fails for any reason, fall back to the
+// still-open upstream reader instead of returning an error.
+log.G(ctx).Debug().
+	Err(err).
+	Str("digest", desc.Digest.String()).
+	Msg("content cache read after write failed; using upstream reader")
+return ra, nil
