New string parsing is very gas heavy #16

Closed
ethanfrey opened this issue May 7, 2020 · 4 comments
@ethanfrey
Member

Just adding the checks for special characters increases gas costs for cosmwasm contracts by about 30% (see comments on CosmWasm/cosmwasm#314).

It should be possible to reduce the cost back to roughly the previous level when no special chars are involved. Start with an optimistic parser and fall back to the current safe parser if needed.

Optimistic check:

From the beginning of the string, go through all chars and increment until we hit a \ or ". If \, then we start the safe parsing. If we hit " first, we found the end and can just copy that range into a String. The fast case should be around the same cost as the previous implementation, maybe a bit more for checking for two chars at each step rather than one, but probably less than now.
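A minimal sketch of that fast path (hypothetical function names; `parse_str_safe` stands in for the existing escape-aware parser):

```rust
/// Optimistic fast path: `input` is the bytes after the opening quote.
/// Scan until `"` (end found, copy the raw range) or `\` (fall back).
fn parse_str_optimistic(input: &[u8]) -> Option<String> {
    for (i, &b) in input.iter().enumerate() {
        match b {
            // End of string with no escapes seen: copy the range directly.
            b'"' => return std::str::from_utf8(&input[..i]).ok().map(String::from),
            // Escape sequence found: hand over to the safe parser.
            b'\\' => return parse_str_safe(input),
            _ => {}
        }
    }
    None // unterminated string
}

/// Placeholder for the existing escape-aware parser.
fn parse_str_safe(_input: &[u8]) -> Option<String> {
    unimplemented!()
}
```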

Optimistic encoding:

If all chars are in the range 0x20-0x7f and exclude \ and ", we can do the straight copy. With any other char, we can use the safe method.
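A sketch of the corresponding encoder (hypothetical; escape handling reduced to what JSON requires, with control chars \u-escaped):

```rust
/// Append `s` to `out` as a JSON string. Fast path: if every byte is in
/// 0x20-0x7f and is neither `"` nor `\`, copy the bytes straight through.
fn encode_json_str(s: &str, out: &mut Vec<u8>) {
    out.push(b'"');
    let fast = s
        .bytes()
        .all(|b| (0x20..=0x7f).contains(&b) && b != b'"' && b != b'\\');
    if fast {
        out.extend_from_slice(s.as_bytes()); // straight copy
    } else {
        for c in s.chars() {
            match c {
                '"' => out.extend_from_slice(b"\\\""),
                '\\' => out.extend_from_slice(b"\\\\"),
                c if (c as u32) < 0x20 => {
                    // Control characters must be \u-escaped in JSON.
                    out.extend_from_slice(format!("\\u{:04x}", c as u32).as_bytes());
                }
                c => {
                    let mut buf = [0u8; 4];
                    out.extend_from_slice(c.encode_utf8(&mut buf).as_bytes());
                }
            }
        }
    }
    out.push(b'"');
}
```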

@webmaster128
Member

webmaster128 commented May 8, 2020

There are a bunch of optimizations in serde_json that we can look into. One example is the parse_str return type

pub enum Reference<'b, 'c, T>
where
    T: ?Sized + 'static,
{
    Borrowed(&'b T),
    Copied(&'c T),
}

that avoids copies as long as no unescaping is required. This is then visited like

        match tri!(self.de.read.parse_str(&mut self.de.scratch)) {
            Reference::Borrowed(s) => visitor.visit_borrowed_str(s),
            Reference::Copied(s) => visitor.visit_str(s),
        }

In order to optimize this, it would be very good to know where our visitor is implemented and how calls to visit_string, visit_str and visit_borrowed_str can be utilized. @ethanfrey do you have a clue where that visitor is? Is this auto-derived code we never see?

And there are a few things that serde-json-core established which are not necessarily what we want, like:

// NOTE(serialize_*signed) This is basically the numtoa implementation minus the lookup tables,
// which take 200+ bytes of ROM / Flash
macro_rules! serialize_unsigned {
    ($self:ident, $N:expr, $v:expr) => {{
        let mut buf = [0u8; $N];

@ethanfrey
Member Author

Okay, I was thinking of trivial optimizations to get back where we were before the changes.

Before we touch these deeper optimizations, I would like to take the time to decide if we stick with JSON into the 1.0 release or use a different codec (not worth putting much more time into JSON if we toss it out by 1.0 anyway)

@webmaster128
Member

> Okay, I was thinking of trivial optimizations to get back where we were before the changes.

This is an impossible goal. JSON serialization/deserialization is expensive and there is no way around it. E.g. the whole idea of zero-copy string deserialization in serde-json-core does not respect the nature of JSON.

Even serde_json makes questionable API design decisions: it allows deserializing into &str (pointing into the JSON source), which works for some input data but then suddenly breaks at runtime for other input data.

@maurolacy

Closed by #25.
