meshopt compression: grouping buffer views and compression size #337
-
The
This results in a reasonably small number of buffer views per glTF asset, regardless of the number of mesh primitives. For a couple of reasons, though, I'd like to understand the tradeoffs of having far more buffer views (say, one per accessor) instead. I understand this would inflate the size of the JSON data considerably, but is that the only effect? Does it have any implications for the compressed binary payload, e.g. a better compression ratio for larger units of compression? I've done some quick tests trying to check this, and it doesn't seem to matter, but I'm hoping to get a sanity check on that conclusion. Thanks!
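A minimal sketch of the kind of quick test described above, assuming the MeshoptEncoder module from meshoptimizer's npm package and Node's zlib; the data is synthetic and the constants are arbitrary, so the printed sizes only illustrate the per-stream overhead, not any real asset:

```ts
// Hypothetical quick test, not the one actually run for this question: encode the
// same synthetic vertex data either as one meshopt stream or as many small streams
// (a stand-in for "one buffer view per accessor"), and compare raw + gzipped sizes.
import { MeshoptEncoder } from 'meshoptimizer';
import { gzipSync } from 'node:zlib';

const STRIDE = 16;        // bytes per vertex; must be a multiple of 4 for the codec
const VERTEX_COUNT = 64_000;
const CHUNKS = 1_000;     // 1000 small streams of 64 vertices each

await MeshoptEncoder.ready;

// Synthetic, mildly coherent data so the codec has something to exploit.
const data = new Uint8Array(VERTEX_COUNT * STRIDE);
for (let i = 0; i < data.length; i++) data[i] = (i >> 4) & 0xff;

// One large compression unit.
const whole = MeshoptEncoder.encodeVertexBuffer(data, VERTEX_COUNT, STRIDE);

// Many small compression units.
const perChunk = VERTEX_COUNT / CHUNKS;
const parts: Uint8Array[] = [];
for (let i = 0; i < CHUNKS; i++) {
  const slice = data.subarray(i * perChunk * STRIDE, (i + 1) * perChunk * STRIDE);
  parts.push(MeshoptEncoder.encodeVertexBuffer(slice, perChunk, STRIDE));
}

// Concatenate the small streams so they can be gzipped as one payload, like a GLB.
const split = new Uint8Array(parts.reduce((n, p) => n + p.byteLength, 0));
let offset = 0;
for (const p of parts) { split.set(p, offset); offset += p.byteLength; }

console.log('one stream  :', whole.byteLength, 'bytes /', gzipSync(whole).byteLength, 'gzipped');
console.log(`${CHUNKS} streams:`, split.byteLength, 'bytes /', gzipSync(split).byteLength, 'gzipped');
```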
-
So "one per accessor" is an extreme example, in the sense that it's the one with the highest granularity and as such it will see the highest penalty. Here's what you could expect.

First, of course, the glTF JSON data is inflated noticeably. This can be a significant problem on models built out of a lot of small pieces with different materials, or short animations of deep node trees, or other cases like that.

Second, you will see some increase in the binary data as well. For example, both the attribute and the triangle codec have ~16-24 bytes of "padding" (that can store important information), so if you have many small streams, the overhead of that data grows. This may not seem like a big problem, but imagine the animation use case with a lot of dummy single-keyframe curves: now suddenly you are paying ~4 bytes for the input, ~8 bytes for the output, ~24 bytes each of padding for the input/output, some bytes of JSON, etc. The good news is that gzip is likely to compress these extra bytes reasonably well, but they do inflate the size before compression.

Beyond padding, you can also see slightly reduced compression due to the extra "breaks" in possibly related data. This effect is usually comparatively minor, because all of the compression algorithms involved are fairly local, but there are worst cases, such as a lot of single-keyframe tracks with similar outputs.

That's about it from the transmission-size perspective. Modulo JSON waste, you'll probably see comparable sizes after gzip but somewhat inflated sizes before gzip, reflecting the degree to which you end up compressing tiny buffers.

The reason gltfpack packs buffer views aggressively is not just that, however; it's also to maximize loading efficiency. On that front, first, there's some overhead to parse the extra JSON data, and there's some overhead to decompress: you need to copy some data across the JS-WASM-JS boundary, call some JS/WASM functions, etc. That overhead isn't very big, but if we're talking about thousands of small compressed blocks, it adds up. The index codec also has a non-trivial fixed cost per decode (on the order of "initialize 300 bytes of memory", but if you are decompressing a lot of cubes, that may be a problem!). In addition, the attribute codec relies heavily on SIMD for efficiency; it reaches peak performance on ~64 elements and decent performance on ~16 elements, but shorter sequences will not decode at peak throughput. Which, again, isn't a problem if you just have a few tiny sequences, because everything is fast anyway, but can be a problem if you have thousands of 10-element sequences instead of ten sequences of a thousand elements each.

Second, while web loaders create individual WebGL objects for individual primitives, this is fairly wasteful. gltfpack gives the loader the opportunity to minimize memory waste and the associated cost of tracking GL objects, by allowing it to create one GL object per bufferView with a given target usage (glTF semantics make creating one GL buffer per glTF buffer infeasible, because glTF buffers may mix data with different access settings). To get there, it minimizes the number of bufferViews, using a key similar to the one you describe.
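As a rough sketch of that grouping idea, and not gltfpack's actual implementation (which is C++), here is how grouping accessors into shared buffer views might look; the key fields chosen here (bufferView target, byteStride, and meshopt filter) are only an assumed notion of "compatible":

```ts
// Simplified glTF-like records; in a real asset these would come from parsing
// accessors/bufferViews, and the grouping key would likely include more fields.
interface AccessorData {
  name: string;
  target: number;     // 34962 (ARRAY_BUFFER) or 34963 (ELEMENT_ARRAY_BUFFER)
  byteStride: number; // bytes per element
  filter?: string;    // EXT_meshopt_compression filter, if any (assumed key field)
  bytes: Uint8Array;  // raw accessor payload
}

interface PackedView {
  target: number;
  byteStride: number;
  accessors: { name: string; byteOffset: number; count: number }[];
  parts: Uint8Array[]; // concatenated later into one bufferView / compression unit
}

function packBufferViews(accessors: AccessorData[]): PackedView[] {
  const views = new Map<string, PackedView>();
  for (const a of accessors) {
    // Accessors that could share a GL buffer (same target/stride/filter) share a view.
    const key = `${a.target}:${a.byteStride}:${a.filter ?? 'none'}`;
    let view = views.get(key);
    if (!view) {
      view = { target: a.target, byteStride: a.byteStride, accessors: [], parts: [] };
      views.set(key, view);
    }
    const byteOffset = view.parts.reduce((n, p) => n + p.byteLength, 0);
    view.accessors.push({ name: a.name, byteOffset, count: a.bytes.byteLength / a.byteStride });
    view.parts.push(a.bytes);
  }
  return [...views.values()];
}
```

Each packed view would then be one compression unit and, in a loader that respects the bufferView target, one GL buffer.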
Now, whether any of these are particularly interesting things to worry about for glTF-Transform, I'm not sure; but essentially the consideration for gltfpack is that every time you split a buffer view, you lose a bit of transmission size, a bit of decompression time, a bit of loading efficiency, and in some cases a bit of rendering performance due to increased GL object switching [given an optimal glTF loader], so gltfpack makes sure you don't need to pay these costs.
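To put rough numbers on the loading-cost side, here is a small hypothetical timing sketch, assuming Node (for performance.now and top-level await) and the MeshoptEncoder/MeshoptDecoder modules from meshoptimizer's npm package; the absolute timings depend entirely on the platform and WASM build, so the only point is that per-call overhead and short SIMD runs add up when blocks are tiny:

```ts
// Hypothetical timing sketch: decode the same total vertex count as one large block
// vs. many 10-element blocks, to show the effect of per-call and per-stream costs.
import { MeshoptEncoder, MeshoptDecoder } from 'meshoptimizer';

const STRIDE = 16;
const TOTAL = 256_000;
const SMALL = 10; // "thousands of 10-element sequences"

await Promise.all([MeshoptEncoder.ready, MeshoptDecoder.ready]);

const src = new Uint8Array(TOTAL * STRIDE);
for (let i = 0; i < src.length; i++) src[i] = (i >> 4) & 0xff;

const big = MeshoptEncoder.encodeVertexBuffer(src, TOTAL, STRIDE);
// Reuse one tiny encoded block for every small decode; fine for timing purposes.
const tiny = MeshoptEncoder.encodeVertexBuffer(src.subarray(0, SMALL * STRIDE), SMALL, STRIDE);

const out = new Uint8Array(TOTAL * STRIDE);

let t = performance.now();
MeshoptDecoder.decodeVertexBuffer(out, TOTAL, STRIDE, big);
console.log(`1 x ${TOTAL}-element block :`, (performance.now() - t).toFixed(1), 'ms');

t = performance.now();
for (let i = 0; i < TOTAL / SMALL; i++) {
  // One JS->WASM round trip (plus copies) per tiny block.
  const dst = out.subarray(i * SMALL * STRIDE, (i + 1) * SMALL * STRIDE);
  MeshoptDecoder.decodeVertexBuffer(dst, SMALL, STRIDE, tiny);
}
console.log(`${TOTAL / SMALL} x ${SMALL}-element blocks:`, (performance.now() - t).toFixed(1), 'ms');
```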