remote: re-implement merkle tree buildling #7583

buchgr · 2019-02-28T12:38:05Z

The TreeNodeRepository served us well, but has become a bit dated over
time and it's increasingly hard to understand and to fix bugs. Additionally,
it was written with the initial intend of incrementally updating the merkle
tree between actions, however internal performance analysis has shown
that doing so is 1) hard to implement reliably and 2) would increase
memory consumption multifold.

This is a rewrite of the code that also fixes bugs like #4663. The
merkle tree construction is split into two:

InputTree is an intermediate tree representation of a (sorted) list
of inputs.
MerkleTree implements a visitor pattern over an InputTree and is
responsible for serializing the InputTree to the protobuf merkle
tree that's used by remote caching / execution.

Benchmarks on my local machine have shown this implementation to
consumes between 2-10x less CPU time than the current implementation.
Tests by users have shown equally encouraging speedups [1].

[1] https://groups.google.com/d/msg/bazel-discuss/wPHrqm2z8lU/mzoRF236GQAJ

src/main/java/com/google/devtools/build/lib/remote/RemoteSpawnCache.java

src/main/java/com/google/devtools/build/lib/remote/RemoteSpawnRunner.java

ulfjack · 2019-02-28T12:59:46Z

src/main/java/com/google/devtools/build/lib/remote/merkletree/MerkleTree.java

+      DigestUtil digestUtil) throws IOException {
+    InputTree tree =
+        InputTree.build(inputs, metadataProvider, execRoot, digestUtil);
+    return MerkleTree.build(tree, digestUtil);


s/MerkleTree.//?

ulfjack · 2019-02-28T13:07:54Z

src/main/java/com/google/devtools/build/lib/remote/merkletree/InputTree.java

+    }
+  }
+
+  private final Map<PathFragment, DirectoryNode> tree;


I wonder if there's a way to just take a plain list of ActionInputs and efficiently build the merkle tree directly from that. This is using a map, not a list, and the DirectoryNode instances seem to contain all nested elements.

I believe there is for a lexicographically sorted list and it should be fairly easy to implement. However, what InputTree mostly is for is to expand directory inputs. I think after this change we could move the directory expansion to the SpawnInputExpander and after that we can build a merkle tree directly from a list of ActionInputs.

buchgr · 2019-02-28T21:21:36Z

@benjaminp iirc you mentioned that you are not using the TreeNodeRepository for performance reasons. If so, I'd love to get your feedback about this PR.

benjaminp · 2019-03-01T07:16:23Z

Yeah, we replace the merkle tree hashing with the following:

    Hasher hasher = Hashing.sha256().newHasher();
    Iterable<? extends ActionInput> spawnInputs = spawn.getInputFiles();
    for (ActionInput input : spawnInputs) {
      hasher.putBytes(input.getExecPathString().getBytes(StandardCharsets.UTF_8));
      if (input instanceof VirtualActionInput) {
        ((VirtualActionInput) input).writeTo(Funnels.asOutputStream(hasher));
        continue;
      }
      hasher.putBytes(getMetadataProvider().getMetadata(input).getDigest());
    }

A subtle but important property of the above is that it exploits input aggregation that Bazel has already done. In particular, since runfiles trees are behind a middlemen artifact, you don't end up rehashing every runfile for every action a particular runfiles tree is in. This optimization greatly helps static asset pipelines, which often involve thousands of actions consisting of one 40,000 file tool that converts one input file to one output file. (See #3217 for a related example of how the wonderful world of web static asset processing can stress Bazel.)

It seems it would be hard to easily do better than our implementation in pure hashing efficiency given that it's a straightforward for loop over the inputs. That said, if the performance improvements you mention in the summary hold for us, this PR may be good enough. I do love to drop local patches.

buchgr · 2019-03-01T07:48:24Z

@benjaminp thanks! I suppose we could also use the code snippet you posted for remote caching (which I understand you use), as technically it's really only remote execution that requires a merkle tree. I'd be happy to review a patch :-).

src/main/java/com/google/devtools/build/lib/remote/RemoteSpawnCache.java

src/main/java/com/google/devtools/build/lib/remote/merkletree/InputTree.java

The TreeNodeRepository served us well, but has become a bit dated over time and it's increasingly hard to understand and to fix bugs. Additionally, it was written with the initial intend of incrementally updating the merkle tree between actions, however internal performance analysis has shown that doing so is 1) hard to implement reliably and 2) would increase memory consumption multifold. This is a rewrite of the code that also fixes bugs like bazelbuild#4663. The merkle tree construction is split into two: - InputTree is an intermediate tree representation of a (sorted) list of inputs. - MerkleTree implements a visitor pattern over an InputTree and is responsible for serializing the InputTree to the protobuf merkle tree that's used by remote caching / execution. Benchmarks on my local machine have shown this implementation to consumes between 2-10x less CPU time than the current implementation. Tests by users have shown equally encouraging speedups [1]. [1] https://groups.google.com/d/msg/bazel-discuss/wPHrqm2z8lU/mzoRF236GQAJ

…tory InputTree is a tree representation of a list of action inputs, that also expands directory inputs and contains metadata (hash, size) about each input file. This class will be used in a follw up CL to replace the TreeNodeRepository for building merkle trees for remote caching / execution. Also see #7583. RELNOTES: None PiperOrigin-RevId: 237036900

buchgr added the team-Remote-Exec Issues and PRs for the Execution (Remote) team label Feb 28, 2019

buchgr self-assigned this Feb 28, 2019

buchgr requested review from benjaminp, ulfjack and ishikhman February 28, 2019 12:38

buchgr requested review from ola-rozenfeld and philwo as code owners February 28, 2019 12:38

googlebot added the cla: yes label Feb 28, 2019

ulfjack approved these changes Feb 28, 2019

View reviewed changes

ishikhman reviewed Mar 1, 2019

View reviewed changes

src/main/java/com/google/devtools/build/lib/remote/RemoteSpawnCache.java Outdated Show resolved Hide resolved

src/main/java/com/google/devtools/build/lib/remote/merkletree/InputTree.java Show resolved Hide resolved

ishikhman approved these changes Mar 4, 2019

View reviewed changes

buchgr force-pushed the buchgr-test-merkletree branch from 293d30d to 5d7c238 Compare March 4, 2019 10:05

address comments

f92c6b6

buchgr force-pushed the buchgr-test-merkletree branch from 5d7c238 to f92c6b6 Compare March 4, 2019 10:25

bazel-io closed this in 7f72544 Mar 8, 2019

Globegitter mentioned this pull request May 10, 2019

remote/performance: don't construct a merkle tree for remote caching #4839

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remote: re-implement merkle tree buildling #7583

remote: re-implement merkle tree buildling #7583

buchgr commented Feb 28, 2019

ulfjack Feb 28, 2019

buchgr Mar 4, 2019

ulfjack Feb 28, 2019

buchgr Mar 4, 2019

buchgr commented Feb 28, 2019

benjaminp commented Mar 1, 2019

buchgr commented Mar 1, 2019 •

edited

Loading

remote: re-implement merkle tree buildling #7583

remote: re-implement merkle tree buildling #7583

Conversation

buchgr commented Feb 28, 2019

ulfjack Feb 28, 2019

Choose a reason for hiding this comment

buchgr Mar 4, 2019

Choose a reason for hiding this comment

ulfjack Feb 28, 2019

Choose a reason for hiding this comment

buchgr Mar 4, 2019

Choose a reason for hiding this comment

buchgr commented Feb 28, 2019

benjaminp commented Mar 1, 2019

buchgr commented Mar 1, 2019 • edited Loading

buchgr commented Mar 1, 2019 •

edited

Loading