Improve tokenize error handling #104169
Comments
pablogsal added a commit to pablogsal/cpython that referenced this issue on May 4, 2023
pablogsal added a commit that referenced this issue on May 4, 2023
CC: @lysnikolaou

lysnikolaou added a commit to lysnikolaou/cpython that referenced this issue on Oct 11, 2023
* The lexer, which includes the actual lexeme-producing logic, goes into the `lexer` directory.
* The wrappers, one per input mode (file, string, utf-8, and readline), go into the `tokenizer` directory and include the logic for creating a lexer instance and managing the buffer for the different modes.
lysnikolaou added a commit that referenced this issue on Oct 11, 2023
* The lexer, which includes the actual lexeme-producing logic, goes into the `lexer` directory.
* The wrappers, one per input mode (file, string, utf-8, and readline), go into the `tokenizer` directory and include the logic for creating a lexer instance and managing the buffer for the different modes.

Co-authored-by: Pablo Galindo <[email protected]>
Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com>
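The split described in the commit message above can be illustrated with a small, hypothetical sketch. The names `Lexer`, `string_tokenizer`, and `file_tokenizer` are illustrative only (they are not CPython's actual internal C APIs): a single lexeme-producing core, plus thin per-input-mode wrappers that only build and manage the input buffer.

```python
# Hypothetical sketch of the lexer/wrapper split; names are
# illustrative, not CPython's real internal APIs.
import io


class Lexer:
    """Core lexeme-producing logic, independent of input mode."""

    def __init__(self, readline):
        self._readline = readline  # any callable returning one line

    def tokens(self):
        # Trivial whitespace-splitting stand-in for real lexing rules.
        while True:
            line = self._readline()
            if not line:
                return
            yield from line.split()


def string_tokenizer(text):
    """Wrapper for string input: builds the buffer, hands off to Lexer."""
    return Lexer(io.StringIO(text).readline).tokens()


def file_tokenizer(path):
    """Wrapper for file input: manages the file buffer for the Lexer."""
    f = open(path, encoding="utf-8")
    return Lexer(f.readline).tokens()


print(list(string_tokenizer("a b\nc")))  # → ['a', 'b', 'c']
```

Each wrapper owns its input-mode-specific buffering, while the lexeme-producing loop lives in exactly one place, which is the point of the directory split.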
lysnikolaou added a commit to lysnikolaou/cpython that referenced this issue on Oct 11, 2023
lysnikolaou added a commit that referenced this issue on Oct 12, 2023
* Fix test_peg_generator after tokenizer refactoring
* Remove references to tokenizer.c in comments etc.
Ready to close? Thanks!
Glyphack pushed a commit to Glyphack/cpython that referenced this issue on Sep 2, 2024
…10684)

* The lexer, which includes the actual lexeme-producing logic, goes into the `lexer` directory.
* The wrappers, one per input mode (file, string, utf-8, and readline), go into the `tokenizer` directory and include the logic for creating a lexer instance and managing the buffer for the different modes.

Co-authored-by: Pablo Galindo <[email protected]>
Co-authored-by: blurb-it[bot] <43283697+blurb-it[bot]@users.noreply.github.com>
Glyphack pushed a commit to Glyphack/cpython that referenced this issue on Sep 2, 2024
…ython#110727)

* Fix test_peg_generator after tokenizer refactoring
* Remove references to tokenizer.c in comments etc.
There have been quite a lot of instances where poor error handling in the tokenizer leads to crashes or to errors being overwritten. We should not rely on a continuous stream of patches for every individual issue; instead, we should improve the situation with more robust infrastructure.
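As a minimal illustration of the kind of failure mode this is about (the input string here is an assumed example, not taken from a linked report): feeding the `tokenize` module source with an unterminated bracket should surface a catchable error, never crash the interpreter or silently replace the real error.

```python
import io
import tokenize

# Assumed example input: an expression cut off mid-parenthesis.
source = "(1 +"

try:
    # Drain the token stream; the unterminated '(' is only detected
    # when the lexer hits end of input.
    tokens = list(tokenize.generate_tokens(io.StringIO(source).readline))
    raised = False
except (tokenize.TokenError, SyntaxError):
    # Which exception type is raised has varied across CPython versions,
    # so both are caught here; the point is that a Python-level error
    # is reported rather than the process crashing.
    raised = True

print(raised)
```

The robustness goal in this issue is that every such malformed input ends in a well-defined Python exception like the one above, with the original error preserved rather than overwritten.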
Linked PRs