Problem matching the body of annotations like `@Annotation(title, some longer description)` #1325

jjalowie · 2023-08-19T08:07:11Z

What is your question?

I'm trying to write a parser for annotations like @Annotation(title, some longer description). I have trouble achieving consistent parsing of the description rule in the below grammar. Given @Annotation(title, some longer description), the description rule sometimes matches some longer description and sometimes it matches some longer description (note the leading space). I want to always enforce matching without leading whitespaces.

If you're having trouble with your code or grammar

I believe it's either a problem with the below grammar or a bug in the parsing process.

Code reproduction:

# File: parser.py

import lark

grammar = """
start: "@Annotation" "(" title "," description ")"
title: /[^,]+/
description: /[^)]+/

%import common.WS
%ignore WS
"""

text = "@Annotation(title, some longer description)"

parser = lark.Lark(grammar)
ir = parser.parse(text)
print(ir.children[1])

Explain what you're trying to do, and what is obstructing your progress.

It seems to me that ignoring the WS rule sometimes takes precedence over the description rule and sometimes vice versaand it happens randomly. Output of executing the same script a few times gives (again, note the leading space that sometimes gets matched for some longer description:

user@machine:/dir> python parser.py 
Tree(Token('RULE', 'description'), [Token('__ANON_2', ' some longer description')])
user@machine:/dir> python parser.py 
Tree(Token('RULE', 'description'), [Token('__ANON_2', 'some longer description')])
user@machine:/dir> python parser.py 
Tree(Token('RULE', 'description'), [Token('__ANON_2', ' some longer description')])
user@machine:/dir> python parser.py 
Tree(Token('RULE', 'description'), [Token('__ANON_2', 'some longer description')])
user@machine:/dir> python parser.py 
Tree(Token('RULE', 'description'), [Token('__ANON_2', ' some longer description')])

The text was updated successfully, but these errors were encountered:

erezsh · 2023-10-02T21:47:23Z

Should be solved in the latest master.

jjalowie · 2023-11-15T08:57:38Z

Thank you!

jjalowie added the question label Aug 19, 2023

jjalowie changed the title ~~Problem matching arbitrary text in parenthesis~~ Problem matching the body of annotations like @Annotation(title, some longer description) Aug 19, 2023

erezsh mentioned this issue Aug 23, 2023

Earley now uses OrderedSet for better output stability #1327

Merged

erezsh added the Earley Issues regarding the Earley parser label Aug 23, 2023

erezsh closed this as completed Oct 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Problem matching the body of annotations like `@Annotation(title, some longer description)` #1325

Problem matching the body of annotations like `@Annotation(title, some longer description)` #1325

jjalowie commented Aug 19, 2023

erezsh commented Oct 2, 2023

jjalowie commented Nov 15, 2023

Problem matching the body of annotations like @Annotation(title, some longer description) #1325

Problem matching the body of annotations like @Annotation(title, some longer description) #1325

Comments

jjalowie commented Aug 19, 2023

erezsh commented Oct 2, 2023

jjalowie commented Nov 15, 2023

Problem matching the body of annotations like `@Annotation(title, some longer description)` #1325

Problem matching the body of annotations like `@Annotation(title, some longer description)` #1325