Adding cached layers for kaniko builds #300

priyawadhwa · 2018-08-21T20:08:19Z

@mattmoor had this super cool idea, which I've copied below for reference:

tl;dr FTL-style caching for kaniko

Today FTL elides recomputing the dependency layer by publishing an image like:

  gcr.io/mattmoor-images/image-to-publish/cache/python-blah-blah:<hash-of-stuff>

... when asked to publish: gcr.io/mattmoor-images/image-to-publish:foo-bar

<hash of stuff> includes the requirements.txt, (should) include the base image version, 
and could include a timestamp (like what day) to enable some level of freshness.


The idea here is that kaniko would, prior to materializing FROM, fast-forward as far as it has cached:

  FROM ubuntu:latest           # This would be resolved to digest (first step in pull anyways)

  RUN apt-get update           # Check cache for hash(^^ digest, hash("apt-get update"))
  RUN apt-get install foo bar  # Check cache for hash(^^ hash, hash("apt-get install foo bar"))

  ADD baz /blah                # Check cache for hash(^^ hash, hash(relevant files))
  USER sockpuppet              # ...
  WORKDIR /app                 # ...

  RUN echo Hello World         # ...


If at any point we miss the cache, we treat the prior hit as the new "FROM" 
and begin evaluating from the miss.

Phase two of this would be to enable the caching layer to simulate non-RUN operations 
(e.g. ADD/COPY/USER/WORKDIR) against the registry API without downloading the base image.
This would enable Dockerfile's like the following to iterate *very* rapidly 
without ever downloading the base or cache (a la FTL):

  FROM ubuntu:latest           # Same digest, different day

  RUN apt-get update           # No change
  RUN apt-get install foo bar  # No change

  ADD baz /blah                # Oh noes, a change, but upload the layer and continue
  USER sockpuppet              # Metadata-only, post a new config
  WORKDIR /app                 # Metadata-only, post a new config

As Matt suggested, I agree that getting started with a prototype for the first phase would be a good starting point. After we have a prototype, we could do some basic benchmarking comparing no-cache kaniko, cached kaniko, and regular "docker build".

The text was updated successfully, but these errors were encountered:

dlorenc · 2018-08-21T20:10:12Z

IIUC phase one of this would be basically equivalent to docker build --cache-from, except we could infer the cached layers and build a slightly larger cache.

Phase two would be an improvement on that for a subset of Dockerfiles.

priyawadhwa added kind/enhancement New feature or request kind/feature-request labels Aug 21, 2018

priyawadhwa self-assigned this Aug 23, 2018

priyawadhwa mentioned this issue Aug 27, 2018

Added a KanikoStage type for each stage of a Dockerfile #320

Merged

imjasonh mentioned this issue Sep 6, 2018

Laundry list of smells from this resource concourse/docker-image-resource#190

Open

priyawadhwa mentioned this issue Sep 14, 2018

Add layer caching to kaniko #353

Merged

priyawadhwa closed this as completed in #353 Sep 24, 2018

This was referenced Oct 3, 2018

Use kaniko to run builds for go 1.11 GoogleCloudPlatform/golang-docker#148

Merged

Use kaniko for building GoogleCloudPlatform/python-runtime#203

Closed

Use kaniko for building GoogleCloudPlatform/php-docker#461

Merged

This was referenced Oct 17, 2018

Use kaniko for builds GoogleCloudPlatform/nodejs-docker#181

Merged

Use kaniko to build and push Dockerfile GoogleCloudPlatform/runtime-builder-java#97

Closed

mattmoor mentioned this issue Nov 18, 2018

kaniko info in readme is incorrect uber-archive/makisu#43

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding cached layers for kaniko builds #300

Adding cached layers for kaniko builds #300

priyawadhwa commented Aug 21, 2018

dlorenc commented Aug 21, 2018

Adding cached layers for kaniko builds #300

Adding cached layers for kaniko builds #300

Comments

priyawadhwa commented Aug 21, 2018

dlorenc commented Aug 21, 2018