Strange results with stack and non-matching alternative #394

golddranks · 2019-06-08T21:53:54Z

I'm getting strange results with the following grammar:

raw_string_inner = { (!PEEK_ALL ~ ANY)* }
raw_string = ${ PUSH("'")+ ~ raw_string_inner ~ POP_ALL }

named_exp = { raw_string ~ "->" ~ ASCII_ALPHA_LOWER }

exp = { named_exp | raw_string }

With test code '' it's a test '' and matching against exp, I'm expecting it to find a match, but it doesn't. Instead, with '' it's a test ''' (unbalanced single quotes), it does find a match.

When I test directly against raw_string, it finds a match. It seems, from the behaviour, that exp, failing to match the first alternative, messes up the state of the stack (one-off error?) when it starts to match the next one.

It might, of course, also be that I'm misunderstanding something about how the stack works, but my general understanding is that once one alternative fails to match, it shouldn't have an effect to matching the next one...?

I haven't looked in the generated code or runtime yet; just gonna start investigating now, but figured that I'd make an issue first in case there's a quick gotcha I haven't taken into account.

The text was updated successfully, but these errors were encountered:

golddranks · 2019-06-08T22:00:11Z

Btw. I got it working with a better grammar that uses the stack in a saner way:

raw_string_inner = { (!PEEK ~ ANY)* }
raw_string = ${ PUSH("'"+) ~ raw_string_inner ~ POP }

named_exp = { raw_string ~ "->" ~ ASCII_ALPHA_LOWER }

exp = { named_exp | raw_string }

However, it still bothers me whether it's a bug; seems like the state of the stack is not the same when trying the next alternative after a failed one.

golddranks · 2019-06-09T04:00:37Z

Okay, tried to instrument the generated parsing code a bit. (Btw. if there is a better way to debug than doing cargo expand and instrumenting by hand, it would be very helpful to mention it in the documentation.) Here is the exp rule parsing function:

#[inline]
#[allow(non_snake_case, unused_variables)]
pub fn exp(
    state: Box<::pest::ParserState<Rule>>,
) -> ::pest::ParseResult<Box<::pest::ParserState<Rule>>> {
    dbg!(&state);
	dbg!(
    state.rule(Rule::exp, |state| {
        dbg!(&state);
        dbg!(state
            .restore_on_err(|state| {
                dbg!(&state);
                dbg!(self::named_exp(state))
            })
            .or_else(|state| {
                dbg!(&state);
                dbg!(state.restore_on_err(|state|
                    self::raw_string(state)
                ))
            }))
    }))
}

I'm getting a trace like this. Indeed, the return value of named_exp shows two pushes and only one pop. Gonna investigate further.

[src/main.rs:121] &state = ParserState {
    position: Position {
        pos: 0,
    },
    queue: [],
    lookahead: None,
    pos_attempts: [],
    neg_attempts: [],
    attempt_pos: 0,
    atomicity: NonAtomic,
    stack: Stack {
        ops: [],
        cache: [],
        snapshots: [],
    },
}
[src/main.rs:124] &state = ParserState {
    position: Position {
        pos: 0,
    },
    queue: [
        Start {
            end_token_index: 0,
            input_pos: 0,
        },
    ],
    lookahead: None,
    pos_attempts: [],
    neg_attempts: [],
    attempt_pos: 0,
    atomicity: NonAtomic,
    stack: Stack {
        ops: [],
        cache: [],
        snapshots: [],
    },
}
[src/main.rs:127] &state = ParserState {
    position: Position {
        pos: 0,
    },
    queue: [
        Start {
            end_token_index: 0,
            input_pos: 0,
        },
    ],
    lookahead: None,
    pos_attempts: [],
    neg_attempts: [],
    attempt_pos: 0,
    atomicity: NonAtomic,
    stack: Stack {
        ops: [],
        cache: [],
        snapshots: [
            0,
        ],
    },
}
[src/main.rs:128] self::named_exp(state) = Err(
    ParserState {
        position: Position {
            pos: 0,
        },
        queue: [
            Start {
                end_token_index: 0,
                input_pos: 0,
            },
        ],
        lookahead: None,
        pos_attempts: [
            raw_string,
        ],
        neg_attempts: [],
        attempt_pos: 0,
        atomicity: NonAtomic,
        stack: Stack {
            ops: [
                Push(
                    Span {
                        str: "\'",
                        start: 0,
                        end: 1,
                    },
                ),
                Push(
                    Span {
                        str: "\'",
                        start: 1,
                        end: 2,
                    },
                ),
                Pop(
                    Span {
                        str: "\'",
                        start: 1,
                        end: 2,
                    },
                ),
            ],
            cache: [
                Span {
                    str: "\'",
                    start: 0,
                    end: 1,
                },
            ],
            snapshots: [
                0,
                1,
            ],
        },
    },
)
[src/main.rs:131] &state = ParserState {
    position: Position {
        pos: 0,
    },
    queue: [
        Start {
            end_token_index: 0,
            input_pos: 0,
        },
    ],
    lookahead: None,
    pos_attempts: [
        raw_string,
    ],
    neg_attempts: [],
    attempt_pos: 0,
    atomicity: NonAtomic,
    stack: Stack {
        ops: [
            Push(
                Span {
                    str: "\'",
                    start: 0,
                    end: 1,
                },
            ),
        ],
        cache: [
            Span {
                str: "\'",
                start: 0,
                end: 1,
            },
        ],
        snapshots: [
            0,
        ],
    },
}
[src/main.rs:132] state.restore_on_err(|state| self::raw_string(state)) = Err(
    ParserState {
        position: Position {
            pos: 0,
        },
        queue: [
            Start {
                end_token_index: 0,
                input_pos: 0,
            },
        ],
        lookahead: None,
        pos_attempts: [
            raw_string,
            raw_string,
        ],
        neg_attempts: [],
        attempt_pos: 0,
        atomicity: NonAtomic,
        stack: Stack {
            ops: [
                Push(
                    Span {
                        str: "\'",
                        start: 0,
                        end: 1,
                    },
                ),
                Push(
                    Span {
                        str: "\'",
                        start: 0,
                        end: 1,
                    },
                ),
            ],
            cache: [
                Span {
                    str: "\'",
                    start: 0,
                    end: 1,
                },
                Span {
                    str: "\'",
                    start: 0,
                    end: 1,
                },
            ],
            snapshots: [
                0,
                1,
            ],
        },
    },
)
[src/main.rs:125] state.restore_on_err(|state|
                         {
                             dbg!(& state);
                             dbg!(self :: named_exp ( state ))
                         }).or_else(|state|
                                        {
                                            dbg!(& state);
                                            dbg!(state . restore_on_err (
                                                 | state | self :: raw_string
                                                 ( state ) ))
                                        }) = Err(
    ParserState {
        position: Position {
            pos: 0,
        },
        queue: [
            Start {
                end_token_index: 0,
                input_pos: 0,
            },
        ],
        lookahead: None,
        pos_attempts: [
            raw_string,
            raw_string,
        ],
        neg_attempts: [],
        attempt_pos: 0,
        atomicity: NonAtomic,
        stack: Stack {
            ops: [
                Push(
                    Span {
                        str: "\'",
                        start: 0,
                        end: 1,
                    },
                ),
                Push(
                    Span {
                        str: "\'",
                        start: 0,
                        end: 1,
                    },
                ),
            ],
            cache: [
                Span {
                    str: "\'",
                    start: 0,
                    end: 1,
                },
                Span {
                    str: "\'",
                    start: 0,
                    end: 1,
                },
            ],
            snapshots: [
                0,
                1,
            ],
        },
    },
)
[src/main.rs:122] state.rule(Rule::exp,
           |state|
               {
                   dbg!(& state);
                   dbg!(state . restore_on_err (
                        | state | {
                        dbg ! ( & state ) ; dbg ! (
                        self :: named_exp ( state ) ) } ) . or_else (
                        | state | {
                        dbg ! ( & state ) ; dbg ! (
                        state . restore_on_err (
                        | state | self :: raw_string ( state ) ) ) } ))
               }) = Err(
    ParserState {
        position: Position {
            pos: 0,
        },
        queue: [],
        lookahead: None,
        pos_attempts: [
            exp,
        ],
        neg_attempts: [],
        attempt_pos: 0,
        atomicity: NonAtomic,
        stack: Stack {
            ops: [
                Push(
                    Span {
                        str: "\'",
                        start: 0,
                        end: 1,
                    },
                ),
                Push(
                    Span {
                        str: "\'",
                        start: 0,
                        end: 1,
                    },
                ),
            ],
            cache: [
                Span {
                    str: "\'",
                    start: 0,
                    end: 1,
                },
                Span {
                    str: "\'",
                    start: 0,
                    end: 1,
                },
            ],
            snapshots: [
                0,
                1,
            ],
        },
    },
)

golddranks · 2019-06-09T06:45:29Z

Allright, got a bit further. It seems that the problem is with snapshots: It saves 0 and 1, but restores only 1 on error, resulting one Push operation staying on stack.

395: Clearing checkpoints in error handler on successful parse r=dragostis a=golddranks * When `restore_on_err` is called, a checkpoint is added. * When parsing fails inside the call, the checkpoint is resumed. Specifically, stack is resumed to the state it was before entering `restore_on_err`. * However, when parsing inside `restore_on_err` succeeds, at the moment, the checkpoint is not cleared on the exit. * This leads to bugs where after returning from `restore_on_err`, another error is encountered, but the checkpoint that was set in `restore_on_err` is incorrectly resumed; this causes the stack to become in inconsistent state. * Fixed the bug by adding a function for clearing checkpoints and calling it on the successful path of `restore_on_err`. * Resolves #394 * Added test for this case. However, the testing infrastructure isn't quite clear for me, so it might be that the test would be better expressed somewhere else, in some other way. Please advise on this. Co-authored-by: Pyry Kontio <[email protected]>

golddranks mentioned this issue Jun 9, 2019

Clearing checkpoints in error handler on successful parse #395

Merged

bors bot closed this as completed in #395 Jun 13, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Strange results with stack and non-matching alternative #394

Strange results with stack and non-matching alternative #394

golddranks commented Jun 8, 2019 •

edited

Loading

golddranks commented Jun 8, 2019

golddranks commented Jun 9, 2019

golddranks commented Jun 9, 2019

Strange results with stack and non-matching alternative #394

Strange results with stack and non-matching alternative #394

Comments

golddranks commented Jun 8, 2019 • edited Loading

golddranks commented Jun 8, 2019

golddranks commented Jun 9, 2019

golddranks commented Jun 9, 2019

golddranks commented Jun 8, 2019 •

edited

Loading