Flesh out the regex search APIs #35

maxbrunsfeld · 2017-09-29T18:53:24Z

Problem

While @leroix and I were looking into the performance of the new findWordsWithSubsequence method and its use in autocomplete-plus, we noticed that a huge amount of time was being spent in the Cursor.getCurrentWordBufferRange method.

There are a few reasons why this method is slow:

It uses the old TextBuffer.scanInRange API rather than a native search API from superstring.
The scanInRange APIs are especially inefficient when they are passed RegExps that can potentially match across line breaks. Currently, RegExps that contain negated character classes (e.g. [^x] are assumed to be potentially multi-line.
The word-regex used by getCurrentWordBufferRange uses a negated character class based on the editor.nonWordCharacters config setting.

Solution

We should optimize scanInRange, ideally using superstring's native search functionality.

First step

This PR expands the set of search APIs. The final list of search APIs will be as follows:

Before actually changing TextBuffer.scanInRange, I'm going to update Atom to use the native API just for Cursor.getCurrentWordBufferRange. That will fix the most immediate performance problem. Then later we can take on the more risky task of updating scanInRange.

/cc @nathansobo

winstliu · 2017-09-29T19:10:23Z

Ooh, this should really help performance for bracket-matcher's HTML tag matching as well!

maxbrunsfeld · 2017-09-29T19:21:31Z

this should really help performance for bracket-matcher's HTML year matching

Yeah, I've noticed some lag in bracket-matcher's searching. For bracket matcher we could probably even use the async search APIs, since the highlight doesn't need to appear synchronously.

nathansobo · 2017-09-29T19:47:32Z

@maxbrunsfeld Is this the slowness of Cursor.getCurrentWordBufferRange something you just noticed while profiling or is it actually in a code path related to autocompletion? I'm surprised this method gets called super frequently. I'm excited to see these optimizations coming.

maxbrunsfeld · 2017-09-29T19:52:49Z

Is this the slowness of Cursor.getCurrentWordBufferRange something you just noticed while profiling or is it actually in a code path related to autocompletion?

It's in the code path for autocompletion (with both the old and new providers) because we need to avoid returning the word under the cursor as an autocomplete suggestion.

nathansobo · 2017-09-29T22:58:43Z

Yee haa!

maxbrunsfeld added 2 commits September 29, 2017 11:38

Add async TextBuffer.findAll method

ea9216a

Use node 8 on circle and appveyor

95504df

maxbrunsfeld force-pushed the mb-find-in-range branch from a8fe43e to 95504df Compare September 29, 2017 19:00

Add range-based regex search APIs

46443cf

maxbrunsfeld force-pushed the mb-find-in-range branch from 4668d0a to 46443cf Compare September 29, 2017 21:07

maxbrunsfeld merged commit 7a24823 into master Sep 29, 2017

maxbrunsfeld deleted the mb-find-in-range branch September 29, 2017 21:13

maxbrunsfeld mentioned this pull request Sep 29, 2017

Fix slowness in Cursor.getCurrentWordBufferRange atom/atom#15776

Merged

maxbrunsfeld mentioned this pull request Nov 8, 2017

Handle regexes with unicode escape sequences in .find and .findAll #43

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Flesh out the regex search APIs #35

Flesh out the regex search APIs #35

maxbrunsfeld commented Sep 29, 2017 •

edited

Loading

winstliu commented Sep 29, 2017 •

edited

Loading

maxbrunsfeld commented Sep 29, 2017

nathansobo commented Sep 29, 2017

maxbrunsfeld commented Sep 29, 2017

nathansobo commented Sep 29, 2017

Flesh out the regex search APIs #35

Flesh out the regex search APIs #35

Conversation

maxbrunsfeld commented Sep 29, 2017 • edited Loading

Problem

Solution

First step

winstliu commented Sep 29, 2017 • edited Loading

maxbrunsfeld commented Sep 29, 2017

nathansobo commented Sep 29, 2017

maxbrunsfeld commented Sep 29, 2017

nathansobo commented Sep 29, 2017

maxbrunsfeld commented Sep 29, 2017 •

edited

Loading

winstliu commented Sep 29, 2017 •

edited

Loading