-
Notifications
You must be signed in to change notification settings - Fork 920
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Determine how to improve DataCatalog documentation page: review/propose ticket #2488
Comments
Here are some ideas for improving
|
@merelcht @noklam Here ⬆️ is a rough suggestion of a set of pages to replace the two pages currently in the "Data Catalog" section of the docs (the DataCatalog one is one of the most popular pages). The suggestion is to break into four pages:
What do you think?
The language is also quite dense/confusing in places so the overhaul would also do a set of improvements/additions to make the pages more readable, but this is more about getting your views on re-sectioning it. Happy to discuss in person if it's easier. |
Wow looking at the existing pages is actually quite painful. What a mess 🤯 My thoughts on the breakdown:
That error handling page right now is useless.. We'll have to rethink what information we'd actually like to give here. I think in general we need to make a very clear separation between using the YAML API and the Code API. I would be inclined to move anything Code API to the advanced section. I'm thinking it might be good to add a section on "how to use the catalog" where we describe the YAML API and link to the examples.
If it's possible to clearly separate all Code API vs YAML API sections (it might be hard because we've mixed these a lot in the catalog docs..) then I'd probably move this to advanced use. The main reason for this is that I've more frequently seen beginner users skip the regular Kedro flow and immediately try to use components directly in code and they end up being confused about what to do. If we make it clear that the YAML API is the way to go this might help users understand better how to get started with Kedro and only start using the code APIs when they have a clear need.
To me it makes sense to move the custom dataset section here and then also add the info about versioning a custom dataset to here instead of the general section on versioning. |
Thank you for the proposal.
The error handling section is quite useless indeed, I think we can just remove it. I am fine with the structure, but I would comment [Configuration and the data catalog] isn't very intuitive, it talks a lot about different paths and is confusing. I know Kedro well enough so I understand it, but maybe we need some example explaining how that config merging helps. Or just link it to a more detailed example later.
I don't think this is the first thing users need to learn and would defer it to advance examples.
|
Thank you @merelcht and @noklam With your help, I have this as a possible structure now:
|
This completes this ticket as a review of the docs to propose a restructure. I'll make a separate ticket to execute the changes and will make a draft PR with some of the revisions so that they can be reviewed docs accuracy/correctness improved by the engineering team. |
This ticket picks up where #1742 left off. We need to do a similar piece of work to that done to the Config docs to split it into separate pages for different audience types/goals and structure it better for people needing to find "How to" information.
It may be sensible to move examples out of the narrative docs and back into the API docs too, as suggested by @noklam in #1742.
The first step of this issue is to review what is there, then propose a set of revisions. There can be a new issue to implement those changes
The text was updated successfully, but these errors were encountered: