-
-
Notifications
You must be signed in to change notification settings - Fork 81
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
add initial lint level config #784
Conversation
internet went out right as i opened it, writing from my phone. I will fix the lint tomorrow/once it comes back (I forgot to turn on more pedantic lints when I moved editors). |
src/query.rs
Outdated
/// the required version bump for this lint; see [`SemverQuery`].required_update | ||
pub required_update: Option<RequiredSemverUpdate>, | ||
/// the lint level for the query; see [`SemverQuery`].lint_level | ||
pub lint_level: Option<LintLevel>, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's make sure to document what None
means for both of these, and try to keep all the docs to a consistent style (for example, on whether we use capitalization and sentences or not).
src/query.rs
Outdated
@@ -73,6 +102,9 @@ pub struct SemverQuery { | |||
|
|||
pub required_update: RequiredSemverUpdate, | |||
|
|||
/// The error level for when this lint procs (allow, warn, or deny) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We used occurs
to describe an instance of a lint being triggered earlier. Consistent word choice helps understanding, especially because it allows us to use different word choice in a way that stands out much more.
So consider this instead:
/// The error level for when this lint procs (allow, warn, or deny) | |
/// The error level for when this lint occurs (allow, warn, or deny) |
pub fn apply_override(&mut self, query_override: &QueryOverride) { | ||
if let Some(lint_level) = query_override.lint_level { | ||
self.lint_level = lint_level; | ||
} | ||
|
||
if let Some(required_update) = query_override.required_update { | ||
self.required_update = required_update; | ||
} | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This design doesn't feel right to me.
The overrides feel more like a property of the linter execution and its environment, not the lints themselves.
Consider a situation where the workspace has one set of overrides, and individual crates within it have other overrides. If the overrides are applied to the lints themselves, then with this design we have to iterate over hundreds of lints once per crate to apply its overrides. We also risk bugs — we may fail to apply the overrides correctly because of how many moving parts there are.
This design also seems to make it harder to have optimizations like:
- skip running lints that are
allow
level - skip running lints whose SemVer requirement is already satisfied by the crate bump
etc.
I suspect we'll eventually want to have a two-stage linting process, similar to what cargo-nextest
does for testing: first, analyze the environment to determine what needs to be done, and only then do it. In the first stage, we determine which crates to check, with which crate features, against which versions, with which lints and with which overrides. In the second stage, we execute the plan. In addition to performance gains, this will make it simpler to test our tool's behavior in complex environments: we can test that the correct plan is generated separately from testing that the plan is correctly executed.
We don't have to build that two-stage execution right away, but we should try to make it easier, not harder, to build it in the future. With that in mind, consider keeping the SemverQuery
values immutable and addressing the overrides at a higher level?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A related piece of work-in-progress that might be of interest is this draft PR: #600
It proposes a design for describing "the plan" for what to do if various kinds of problems arise while linting. In the future, we'll want both the "what things run on which crates" plan I described earlier, and the "what if something goes wrong" bit here to be part of the same inspectable, testable plan.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry for the late response - school got a little crazy.
Thanks for the feedback! It definitely makes sense that when checking a crate, we want to have pass in the effective levels of each lint, both for semver requirement and lint level. We can calculate the effective lint level for each crate to be checked and pass it as an argument (mapping from lint name to { required_update, lint_level }
) to check_release
(and add it to the CrateCheck
struct in the proposed API). We could also just pass a map that only has an entry for a given lint if it differs from the default value in the lint file.
Passing all effective values might be easier to use, and to test in the proposed API version, as well as potentially easier to calculate multiple levels of overrides, but passing only ones that are different from the default would save space and execution time, but it might be a little more complicated and potentially susceptible to bugs about calculating and using the right effective version.
Which of those seems like the better idea? Or perhaps something else? Thanks!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No worries at all, as you know I've been quite busy too :)
From a testing perspective, I think it would be easier to pass only the overrides ("different from default"). That way, we won't have to regenerate the snapshot test data for every newly-added lint, like we would have to if all computed values were part of the passed data.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Okay, sounds good! Thanks!
Minor, | ||
Major, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
reordering to make Major > Minor
pub fn apply_override(&mut self, query_override: &QueryOverride) { | ||
if let Some(lint_level) = query_override.lint_level { | ||
self.lint_level = lint_level; | ||
} | ||
|
||
if let Some(required_update) = query_override.required_update { | ||
self.required_update = required_update; | ||
} | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry for the late response - school got a little crazy.
Thanks for the feedback! It definitely makes sense that when checking a crate, we want to have pass in the effective levels of each lint, both for semver requirement and lint level. We can calculate the effective lint level for each crate to be checked and pass it as an argument (mapping from lint name to { required_update, lint_level }
) to check_release
(and add it to the CrateCheck
struct in the proposed API). We could also just pass a map that only has an entry for a given lint if it differs from the default value in the lint file.
Passing all effective values might be easier to use, and to test in the proposed API version, as well as potentially easier to calculate multiple levels of overrides, but passing only ones that are different from the default would save space and execution time, but it might be a little more complicated and potentially susceptible to bugs about calculating and using the right effective version.
Which of those seems like the better idea? Or perhaps something else? Thanks!
2942a15
to
f32df41
Compare
c041cd5
to
6759fcb
Compare
Here's one idea for storing different levels of overrides: have a list (stack) of lint name->override mappings, where overrides at the top of the stack (later indices) have higher precedence than lower ones. This should be more reusable/modular where we can add multiple different levels of precedence and we can save allocations/copies when using workspace-level overrides, for instance. Another option is to use a flattened version of this (so only pass one What do you think? |
(Also, the CI is failing because it is looking for |
It's probably not a good idea to include many kinds of changes not directly related to each other in the same PR, since it makes the code harder to review. Larger and more complex PRs take exponentially longer to review and merge, because I have to keep re-reviewing a larger amount of code on each round-trip and it's more likely there's at least one thing worth changing before merging that large amount of code. I wouldn't recommend that approach. So I'd strongly recommend not changing output formatting in this PR at all. Aim to make the smallest, most targeted PR you possibly can, so we can merge it and move on quickly. |
It might even be a good idea to split this PR into several:
It's quite difficult to review a PR that does hundreds of lines of no-functional-changes refactoring while also adding new functionality in some places. It's often so hard to review such a PR that it's on par difficulty-wise with having the reviewer rewrite it from scratch, which is not a recipe for fast iteration and generally frustrates both PR authors and PR reviewers. |
The stack-of-maps approach sounds good btw 👍 I'd just suggest avoiding situations where two different types have names within 1 character of edit distance from each other, like |
closing because work is being done in other, smaller PRs (looking back, wow this was big) |
This PR adds:
Questions: