-
-
Notifications
You must be signed in to change notification settings - Fork 62
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Proposal for supporting PHP 8.2+ Disjunctive Normal Form Types in the Tokenizer #387
Comments
Could the old class be deprecated/left in place, while usages migrate to a new class that contains these adjustments? |
Not really. It's a monster-class without any modularity.
This is exactly why I would like opinions on the timeline. PHPCS 4.0 will contain various other tokenizer changes too, so would allow for more gradual migration. |
I'm always happy with more new tokens because my sniffs could be more simple :)
DNF don't work currently so I think it could go to PHPCS 3.x. However I think you should choose what is more easy for you. DNF support can wait another 3-4 months if it will make your work on PHPCS easier :) |
Thanks for the complete report, easy to digest! I like the new About the target release, applying the safety-principle, I'd say 4.0, because of the potential breaking impact on already existing Sniffs out there (not sure if there is any reliable way to try to look for existing cases). In our case (Moodle), we still have PHP 8.1 as min requirement in our current dev version, so we are DNFT-free for now. Side-note, only partially related: One thing that always make me think about all these Tokenizer issues/changes... is how others out there (maybe using other type of - AST, ... - parsers) are impacted by PHP changes (more or less than us). Just saying, I don't know/feel that there is any easy route to provide any tokenizer <==> parser conversion easily. Ciao :-) |
I think that the proposed new tokens sound like a good approach since they have a distinct meaning. As for timing, I agree with the above that since it's currently effectively broken, a minor version should be fine. This is at least a bug fix and at most a new feature. That said, I can imagine that it could be considered breaking if someone hacked around these tokens already. But if they did I would hope that they posted an issue on this repo since it seems to me to be missing support? And if it does break a sniff, the fix is easy enough by pinning the previous minor version. You have your finger on the pulse of sniff development, though, so if you think it likely that folks would be badly impacted by this change and there's no way to let them know ahead of time, then a major version is also fine. We've been behind PHP versions for quite a while now and another instance is not going to be the end of the world. |
@kukulich @stronk7 @sirbrillig Thank you for your input so far. Personally, I'm not aware of standards which have work-arounds in place (which is part of the reason for opening this issue: to try and find out). Timing-wise, if nobody speaks up about having put work-arounds in place, I think adding this to the next minor (once the patch is available) would be best as it takes pressure off PHPCS 4.0, for which quite some work still needs to be done, and would allow for the support to reach end-users sooner, especially as it is likely to take a while before external standards will support PHPCS 4.0.
Well, I've been trying to play catch up with the PHPCS 3.8.0 and 3.9.0 releases. Syntax-support wise, this is the last "big" one we're still missing as far as I'm concerned. |
I was going to say PHPCS 4.0, but if you're not aware of any current workarounds, then it makes sense to bring this forward to 3.x. |
As nobody has spoken up about having work-arounds in place already, I intend to start work on this over the next few days and hope to include DNF support in the 3.10.0 release. |
I've just pulled the first PR in a series of 6 PRs related to this. See #461 This first PR adds the tokenizer support for DNF types as outlined above. Once that PR has been merged, I have three PRs lined up to add support for DNF types to the And then finally a PR to handle DNF types in the I'd like to invite everyone to put the PR through the wringer and test it thoroughly to confirm I've caught all the edge cases and/or to find those edge cases I've still missed. Leaving a comment saying you've tested the PR with your sniffs is appreciated as it will strengthen confidence in the implementation. |
FYI: I've merged the PR with the Tokenizer changes to improve the chances of people using the I've also pulled the changes to the |
FYI: I've merged the above mentioned PRs updating the I've also now pulled PR #478 which updates the one sniff which needed updating (AFAICS). |
Just checking in to see if anyone has been testing the implementation with their external standard and if there is any feedback. If not, I intend to release PHPCS 3.10.0 later this week with this change included. |
The problem
PHP 8.2 introduced Disjunctive Normal Form Types, which now adds parentheses to the tokens allowed within type declarations.
This is currently not yet supported by PHP_CodeSniffer and the
PHP_CodeSniffer\Tokenizers\PHP
class will not always handle tokens within DNF types correctly, which can lead to false positives/false negatives being reported for sniffs and incorrect fixes being made.Typical issues in the Tokenizer:
array
keyword when used in a type declaration may not be retokenized fromT_ARRAY
toT_STRING
.true
/false
/null
keywords when used in a type declaration may not be retokenized fromT_STRING
toT_TRUE
/T_FALSE
/T_NULL
.&
and|
characters may not be retokenized toT_TYPE_UNION
/T_TYPE_INTERSECTION
.Typical resulting sniff issues:
array
keyword may try to act on thearray
keyword when used in a DNF type.&
and|
characters in DNF types.These issues also impact the functioning of the following PHPCS native helper functions and by extension all sniffs which use these methods:
File::getMethodParameters()
(parameter types)File::getMethodProperties()
(return types)File::getMemberProperties()
(property types)File::isReference()
Compounding the Problem
Aside from the above, standards may want to treat (spacing around) parentheses in DNF types differently from (spacing around) parentheses used in other places in a code base.
Proposed Solution
To support DNF types the
PHP_CodeSniffer\Tokenizers\PHP
class will need significant updates.When making these updates, I propose to include a change to retokenize the open and close parentheses when used in DNF types as
T_TYPE_OPEN_PARENTHESIS
andT_TYPE_CLOSE_PARENTHESIS
respectively.Having separate, PHPCS native tokens for the DNF parentheses will allow for sniff developers to decide on a case-by-case basis whether a sniff should or should not act on DNF parentheses.
This change also follows in the footsteps of the retokenization of the
&
and|
tokens (when used in types) toT_TYPE_*
tokens.As part of this change, the new
T_TYPE_OPEN_PARENTHESIS
andT_TYPE_CLOSE_PARENTHESIS
tokens will need to be handled correctly by thePHP_CodeSniffer\Tokenizers\Tokenizer
class and tokens should get theparenthesis_opener
,parenthesis_closer
andnested_parenthesis
tokens, same as if the tokens were the "normal"T_OPEN_PARENTHESIS
andT_CLOSE_PARENTHESIS
tokens.Open question
As PHP 8.2 has been out for a while, some external standards may have implemented work-arounds for the current state of things.
So, aside from opinions on the above proposal itself, I would also like to hear opinions on when this change should be made.
/cc @asispts @dingo-d @fredden @GaryJones @greg0ire @kukulich @michalbundyra @Ocramius @sirbrillig @stronk7 @weierophinney @wimg
Related issues
The text was updated successfully, but these errors were encountered: