[sqllab] Fix limit parsing bug when using limit-offset comma notation #7912

villebro · 2019-07-22T14:26:57Z

SUMMARY

Currently parsing of limit from query with limit-offset comma notation (LIMIT <offset>, <limit>) incorrectly assumes reversed order (LIMIT <limit>, <offset>). This fixes the parsing logic and accompanying unit tests.

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

TEST PLAN

Test in SqlLab
CI/Unit tests

REVIEWERS

@john-bodley (I checked git blame and saw that you had worked on this recently)

john-bodley

Thanks for fixing this. One small comment but otherwise LGTM.

john-bodley · 2019-07-22T16:16:58Z

superset/sql_parse.py

@@ -182,7 +182,7 @@ def _extract_limit_from_query(self, statement):
            _, token = statement.token_next(idx=idx)
            if token:
                if isinstance(token, IdentifierList):
-                    _, token = token.token_next(idx=-1)
+                    token = token.tokens[-1]


Could you use token_next(..., reverse=True). This skips comments, whitespace etc. and thus is probably more robust.

I agree using the native methods would be preferable, and I actually tried to go down that path, but it seems the reverse argument is in fact private (_reverse), and didn't seem to work similar to sorted(..., reverse=True). Will see if I can come up with a more robust solution.

Can you check my new proposal @john-bodley ?

codecov-io · 2019-07-23T19:42:52Z

Codecov Report

Merging #7912 into master will decrease coverage by 7.76%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master    #7912      +/-   ##
==========================================
- Coverage   73.67%   65.91%   -7.77%     
==========================================
  Files         111      465     +354     
  Lines       11695    22257   +10562     
  Branches        0     2425    +2425     
==========================================
+ Hits         8616    14670    +6054     
- Misses       3079     7466    +4387     
- Partials        0      121     +121

Impacted Files	Coverage Δ
superset/sql_parse.py	`99.2% <100%> (ø)`	⬆️
superset/connectors/druid/models.py	`82.2% <0%> (ø)`	⬆️
superset/viz.py	`71.77% <0%> (ø)`	⬆️
superset/assets/src/components/Checkbox.jsx	`100% <0%> (ø)`
...ations/deckgl/layers/Polygon/PolygonChartPlugin.js	`0% <0%> (ø)`
...ets/src/dashboard/components/dnd/DragDroppable.jsx	`94.59% <0%> (ø)`
...c/visualizations/deckgl/layers/Polygon/Polygon.jsx	`0% <0%> (ø)`
superset/assets/src/components/EditableTitle.jsx	`81.53% <0%> (ø)`
superset/assets/src/setup/setupPlugins.js	`0% <0%> (ø)`
...t/assets/src/components/InfoTooltipWithTrigger.jsx	`41.66% <0%> (ø)`
... and 352 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 2221445...4ad9ae1. Read the comment docs.

john-bodley · 2019-07-23T21:22:20Z

superset/sql_parse.py

@@ -182,7 +182,10 @@ def _extract_limit_from_query(self, statement):
            _, token = statement.token_next(idx=idx)
            if token:
                if isinstance(token, IdentifierList):
-                    _, token = token.token_next(idx=-1)
+                    # In case of "LIMIT <offset>, <limit>", find comma and extract


@villebro this seems good. I just checked the source code to verify that the IdentifierList contains the , punctuation and thus idx will not be None.

…apache#7912) * Fix limit parsing bug when using limit-offset comma notation * Use native sqlparse semantics to find limit * black

pull-request-size bot added the size/XS label Jul 22, 2019

john-bodley reviewed Jul 22, 2019

View reviewed changes

villebro added 3 commits July 23, 2019 21:45

Fix limit parsing bug when using limit-offset comma notation

8cced92

Use native sqlparse semantics to find limit

991ae37

black

4ad9ae1

john-bodley approved these changes Jul 23, 2019

View reviewed changes

villebro merged commit 72d1011 into apache:master Jul 24, 2019

mistercrunch added 🏷️ bot A label used by `supersetbot` to keep track of which PR where auto-tagged with release labels 🚢 0.34.0 labels Feb 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[sqllab] Fix limit parsing bug when using limit-offset comma notation #7912

[sqllab] Fix limit parsing bug when using limit-offset comma notation #7912

villebro commented Jul 22, 2019

john-bodley left a comment

john-bodley Jul 22, 2019

villebro Jul 22, 2019

villebro Jul 23, 2019

codecov-io commented Jul 23, 2019

john-bodley Jul 23, 2019

[sqllab] Fix limit parsing bug when using limit-offset comma notation #7912

[sqllab] Fix limit parsing bug when using limit-offset comma notation #7912

Conversation

villebro commented Jul 22, 2019

CATEGORY

SUMMARY

BEFORE/AFTER SCREENSHOTS OR ANIMATED GIF

TEST PLAN

REVIEWERS

john-bodley left a comment

Choose a reason for hiding this comment

john-bodley Jul 22, 2019

Choose a reason for hiding this comment

villebro Jul 22, 2019

Choose a reason for hiding this comment

villebro Jul 23, 2019

Choose a reason for hiding this comment

codecov-io commented Jul 23, 2019

Codecov Report

john-bodley Jul 23, 2019

Choose a reason for hiding this comment