-
Notifications
You must be signed in to change notification settings - Fork 30.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
node: improve GetActiveRequests performance #3375
Conversation
Is "index" the right word to use here? you're just appending and not actually using an index |
While performance improvements are nice, this definitely adds more code than it removes. Is the use case here continuous monitoring? This is still an undocumented API... for what reason I don't know though, seems it would help to make it public (I've used it too, and find it very useful, but that's usually been limited to shutdown-time). |
@rvagg That's an artifact of an early implementation. I'll find a better name. @ronkorving The use case is to not have a crappy implementation. This is a technique that I plan to expand through core. Search for LOC should have little to no affect on a PR. Added complexity I can understand, and can agree with based on circumstance. This though is fairly straightforward. |
Is there any way we could help the V8 team (I say we.. as if I really could) to make Object::Set as fast as JIT compiled JavaScript? |
|
||
for (auto w : *env->req_wrap_queue()) { | ||
if (w->persistent().IsEmpty() == false) { | ||
argv[i++ % 5] = w->object(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you use ARRAY_SIZE(argv)
instead of hard-coding it in several places? If it gets unwieldy, I suggest writing it as:
static const size_t argc = 5;
Local<Value> argv[argc];
// ...
argv[i++ % argc] = ...;
This could be extended to numerous other places but I assume that's your plan anyway. :-) |
1d1139e
to
1e42a7f
Compare
@bnoordhuis Comments addressed (I think). Yes, I am planning on extending this across core but figured this would be a good low impact place to start off. :) |
for (auto w : *env->req_wrap_queue()) { | ||
if (w->persistent().IsEmpty() == false) { | ||
argv[i++ % argc] = w->object(); | ||
if ((i % argc) == 0) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Shouldn't this check that i > 0? It's going to make a superfluous JS call now if I read it right.
EDIT: Never mind, didn't read it right. It's never zero.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
this is the most succinct way I found, but even looking back at this over the weekend I needed to remember what the logic was doing. if you have something more readable in mind I'm open to suggestions. :)
1e42a7f
to
2c6de7a
Compare
@bnoordhuis made most suggested changes and added a simple test. |
ary->Set(i++, w->object()); | ||
if (i % argc > 0) { | ||
HandleScope scope(env->isolate()); | ||
fn->Call(ctx, ary, i % argc, argv).ToLocalChecked(); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is the repetition I mean. I'd do the i % argc
only once and cache the result in a const size_t remainder
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ah, got it. was trying to make it work w/ the for loop above.
2c6de7a
to
7daff51
Compare
@bnoordhuis comment addressed. |
Nice change, although it makes me sad that Object::Set is so slow. LGTM so long as CI is green and |
v8 is faster at setting object properties in JS than C++. Even when it requires calling into JS from native code. Make process._getActiveRequests() faster by doing this when populating the array containing request objects. Simple benchmark: for (let i = 0; i < 22; i++) fs.open(__filename, 'r', function() { }); let t = process.hrtime(); for (let i = 0; i < 1e6; i++) process._getActiveRequests(); t = process.hrtime(t); console.log((t[0] * 1e9 + t[1]) / 1e6); Results between the two: Previous: 4406 ns/op Patched: 690 ns/op 5.4x faster
7daff51
to
d1dd435
Compare
@bnoordhuis @jasnell |
LGTM if CI is green |
LGTM |
v8 is faster at setting object properties in JS than C++. Even when it requires calling into JS from native code. Make process._getActiveRequests() faster by doing this when populating the array containing request objects. Simple benchmark: for (let i = 0; i < 22; i++) fs.open(__filename, 'r', function() { }); let t = process.hrtime(); for (let i = 0; i < 1e6; i++) process._getActiveRequests(); t = process.hrtime(t); console.log((t[0] * 1e9 + t[1]) / 1e6); Results between the two: Previous: 4406 ns/op Patched: 690 ns/op 5.4x faster PR-URL: #3375 Reviewed-By: James Snell <[email protected]> Reviewed-By: Ben Noordhuis <[email protected]>
None of the failures are related to the PR. Landed on 494227b. Thanks much! |
v8 is faster at setting object properties in JS than C++. Even when it requires calling into JS from native code. Make process._getActiveRequests() faster by doing this when populating the array containing request objects. Simple benchmark: for (let i = 0; i < 22; i++) fs.open(__filename, 'r', function() { }); let t = process.hrtime(); for (let i = 0; i < 1e6; i++) process._getActiveRequests(); t = process.hrtime(t); console.log((t[0] * 1e9 + t[1]) / 1e6); Results between the two: Previous: 4406 ns/op Patched: 690 ns/op 5.4x faster PR-URL: #3375 Reviewed-By: James Snell <[email protected]> Reviewed-By: Ben Noordhuis <[email protected]>
@trevnorris what are your thought regarding adding this commit to LTS? |
@trevnorris ping |
@thealphanerd oop. sorry. It's a micro-performance optimization. Merging will doubtfully prevent future conflicts. So, can land but don't think it's necessary. |
v8 is faster at setting object properties in JS than C++. Even when it
requires calling into JS from native code. Make
process._getActiveRequests() faster by doing this when populating the
array containing request objects.
Simple benchmark:
Results between the two:
R=@bnoordhuis
Have another addition of improving the same for active handles, but wanted to solicit feedback on the approach early.