Layout a docstring parser base #28

pawamoy · 2020-04-29T19:19:53Z

Related to #25

The idea is to lay things out for allowing contributors to implement different docstrings parsers more easily.

src/pytkdocs/objects.py

src/pytkdocs/parsers/docstrings/__init__.py

shyamd · 2020-04-30T21:33:04Z

If I look at Google implementation, I don't see any use for those book-keeping attributes you set in Parser.__init__. I want to make sure the Parser isn't holding part of the state when it doesn't need to. That's how we get state bleed and weird bugs. Parser doesn't need an __init__ from what I can see.

pawamoy · 2020-04-30T21:46:55Z

The Google parser does use these attributes: _path, _errors, _signature and _return_type are all used throughout most of the methods (parse, read_parameters_section, read_exceptions_section and read_return_section).

I'd like to get rid of this "state" but I don't see how 😕

shyamd · 2020-04-30T21:50:21Z

Then the parse command should clean up that state at the end. I don't mind having Parser level state objects but those should be more generic. Maybe a set of all errors the Parser has encountered kind of thing?

pawamoy · 2020-04-30T22:04:56Z

Then the parse command should clean up that state at the end.

I do that at the beginning of do_parse instead.

Maybe I should simply remove do_parse, and make parse return the errors itself.
But that still doesn't solve the need to store the path, signature and return type, which will be needed for other parsers as well. Maybe helper methods could hide that away.

def parse(docstring, path, signature, return_type):
    self.set_state(path, signature, return_type)
    # do stuff, use self.state.signature and self.state.return_type as needed

    # handle error, self.record_error will prefix with self.state.path
    self.record_error("message")

    # at the end
    return sections, self.pop_state()

Maybe a set of all errors the Parser has encountered kind of thing?

This what _errors is used for. Not sure to understand what you mean 🙂

pawamoy · 2020-04-30T22:10:35Z

Another solution would be use no state at all, and pass all the arguments through every method.

shyamd · 2020-04-30T22:11:43Z

If you have to do that, move those state variables to Google. I think the biggest issue is that Parser is an abstract class that is pretending to have an implementation in it. Those state variables right now are only relevant to the Google implementation.

pawamoy · 2020-04-30T22:36:38Z

I thought it would be convenient to give access to the object path, signature and return type to any parser implementation 😅

I'll see what I can do. Don't hesitate to open a PR based on this branch if you want to give it a try as well 🙂

In any case, thank you for your help!

pawamoy · 2020-05-03T15:25:30Z

Sorry but I don't see how I'm supposed to get rid of the state in the base parser class.

The google parser needs the object path, signature and type, so these values must be passed from Object.parse_all_docstring, which does not know which type of parser it is. It means that if I accept these values in the google parser instead of the base one, every other parser will have to accept them as well. In that case, what's the point of a base class? We could simply let contributors write the whole parsers themselves again and again, freely.

Besides, I think every parser will benefit from having access to the object signature and type, and the object path is required anyway for consistent error reporting.

I will keep the base parser structure like that. I think it makes it easy to write a parser implementation.

shyamd · 2020-05-03T15:48:03Z

Yeah, Let's keep it as is right now. I can make an issue or PR to demonstrate what I mean.

pawamoy · 2020-05-03T15:51:36Z

Alright, thanks 🙂

shyamd reviewed Apr 30, 2020

View reviewed changes

src/pytkdocs/objects.py Outdated Show resolved Hide resolved

src/pytkdocs/parsers/docstrings/__init__.py Outdated Show resolved Hide resolved

src/pytkdocs/parsers/docstrings/__init__.py Outdated Show resolved Hide resolved

pawamoy force-pushed the docstring-parser-base branch from 6271293 to 4692bad Compare April 30, 2020 18:38

pawamoy added 2 commits May 3, 2020 17:11

style: Typos

30e1a7d

fix: Don't allow None for a property's docstring

18acd37

pawamoy force-pushed the docstring-parser-base branch from 4692bad to 8f82958 Compare May 3, 2020 15:11

refactor: Layout a docstring parser base

1ce9802

pawamoy force-pushed the docstring-parser-base branch from 8f82958 to 1ce9802 Compare May 3, 2020 15:17

pawamoy merged commit d427bcc into master May 6, 2020

pawamoy mentioned this pull request May 6, 2020

[FeatureRequest] Support the numpy docstring format #7

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Layout a docstring parser base #28

Layout a docstring parser base #28

pawamoy commented Apr 29, 2020 •

edited

Loading

shyamd commented Apr 30, 2020

pawamoy commented Apr 30, 2020

shyamd commented Apr 30, 2020

pawamoy commented Apr 30, 2020

pawamoy commented Apr 30, 2020

shyamd commented Apr 30, 2020

pawamoy commented Apr 30, 2020

pawamoy commented May 3, 2020

shyamd commented May 3, 2020

pawamoy commented May 3, 2020

Layout a docstring parser base #28

Layout a docstring parser base #28

Conversation

pawamoy commented Apr 29, 2020 • edited Loading

shyamd commented Apr 30, 2020

pawamoy commented Apr 30, 2020

shyamd commented Apr 30, 2020

pawamoy commented Apr 30, 2020

pawamoy commented Apr 30, 2020

shyamd commented Apr 30, 2020

pawamoy commented Apr 30, 2020

pawamoy commented May 3, 2020

shyamd commented May 3, 2020

pawamoy commented May 3, 2020

pawamoy commented Apr 29, 2020 •

edited

Loading