Ensure `parse_utf8` has length of string passed in when available #99834

kiroxas · 2024-11-29T13:40:27Z

As seen in #99826, `parse_utf8` is faster when it doesn't have to find the size of the string first.
So if the calling code already has the length, make sure to pass it along.
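
To illustrate the idea, here is a minimal sketch, not the engine's code. `input_length`, `parse_known_buffer`, and `g_length_scans` are hypothetical names invented for this example. It assumes a parser front end where a negative length means "unknown" and forces an extra pass over the input.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <string>

// Counts how many times the callee had to measure the input itself.
static int g_length_scans = 0;

// Hypothetical parser front end (not Godot's API): a negative `len`
// means "unknown", which forces an extra strlen pass over the input.
std::size_t input_length(const char *src, int len = -1) {
    assert(src != nullptr);
    if (len >= 0) {
        return static_cast<std::size_t>(len); // caller already knew the size
    }
    ++g_length_scans;
    return std::strlen(src);
}

// Call site that already owns the size, e.g. a buffer filled from a file.
std::size_t parse_known_buffer(const std::string &buffer) {
    return input_length(buffer.data(), static_cast<int>(buffer.size()));
}
```

Calling `input_length("abc")` increments the scan counter, while `parse_known_buffer` does not, because the size travels with the buffer.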

Ivorforce

This might change behavior: the current code terminates on NULL even if the actual string length is larger. Not that I'm against it, just noting. The tests will probably show whether it still works.

platform/windows/windows_utils.cpp

kiroxas · 2024-11-29T16:28:19Z

> This might change behavior: the current code terminates on NULL even if the actual string length is larger. Not that I'm against it, just noting. The tests will probably show whether it still works.

It should not. The length is just a hint that lets us allocate a destination buffer large enough to hold the result (the ratio is 1 to 1 if the input is pure ASCII, and can drop to one code point per 4 bytes if it contains only 4-byte code points). In the conversion algorithm itself it is only used as a stop condition. If we had iterated past the length, that would have been a bug, because the call site I changed allocates `length` bytes just before the call. If the old code stopped on null, the new code stops on null too, even when that happens before `length` is reached. The loop condition is the same in the current implementation and in the PR.
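
A minimal sketch of that stop condition, assuming a simplified decoder rather than the actual `parse_utf8` implementation. `decode_utf8` is a hypothetical name, and the sketch stops at the first malformed sequence instead of substituting a replacement character.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Simplified illustration only: `len < 0` means "no length known".
std::u32string decode_utf8(const char *src, int len = -1) {
    assert(src != nullptr);
    const unsigned char *p = reinterpret_cast<const unsigned char *>(src);
    const unsigned char *limit = (len >= 0) ? p + len : nullptr;

    std::u32string out;
    if (len > 0) {
        // Upper bound on the output size: at most one code point per input byte.
        out.reserve(static_cast<std::size_t>(len));
    }

    // The NUL check applies with or without a length; the limit only
    // adds an earlier exit when the caller supplied one.
    while (p != limit && *p != 0) {
        const unsigned char lead = *p;
        int n = 0;
        char32_t cp = 0;
        if (lead < 0x80) { n = 1; cp = lead; }
        else if ((lead & 0xE0) == 0xC0) { n = 2; cp = lead & 0x1F; }
        else if ((lead & 0xF0) == 0xE0) { n = 3; cp = lead & 0x0F; }
        else if ((lead & 0xF8) == 0xF0) { n = 4; cp = lead & 0x07; }
        else break; // invalid lead byte: stop decoding

        if (limit != nullptr && limit - p < n) break; // sequence cut off by the length

        bool ok = true;
        for (int i = 1; i < n; ++i) {
            // A NUL or any non-continuation byte ends the sequence early.
            if ((p[i] & 0xC0) != 0x80) { ok = false; break; }
            cp = (cp << 6) | (p[i] & 0x3F);
        }
        if (!ok) break;

        out.push_back(cp);
        p += n;
    }
    return out;
}
```

With this shape, passing a length larger than the NUL-terminated content still stops at the NUL, and passing no length at all gives the same result for a NUL-terminated input.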

platform/windows/windows_utils.cpp

Repiteo · 2024-12-03T20:47:58Z

Thanks!

kiroxas requested review from a team as code owners November 29, 2024 13:40

Ivorforce reviewed Nov 29, 2024

platform/windows/windows_utils.cpp (outdated, resolved)

bruvzg reviewed Nov 29, 2024

platform/windows/windows_utils.cpp (outdated, resolved)

Chaosus added enhancement topic:core labels Nov 30, 2024

Chaosus added this to the 4.4 milestone Nov 30, 2024

AThousandShips modified the milestones: 4.4, 4.x Nov 30, 2024

When calling code has length of string, pass it to parse_utf8

83d4bde

kiroxas force-pushed the passLengthToParseUTF8 branch from 3a37be1 to 83d4bde Compare December 1, 2024 07:31

kiroxas mentioned this pull request Dec 1, 2024

Avoid duplicated utf8() calls #99893

Merged

bruvzg approved these changes Dec 3, 2024

akien-mga added the performance label Dec 3, 2024

akien-mga modified the milestones: 4.x, 4.4 Dec 3, 2024

AThousandShips approved these changes Dec 3, 2024

Repiteo merged commit 1719f8e into godotengine:master Dec 3, 2024
20 checks passed

kiroxas deleted the passLengthToParseUTF8 branch December 18, 2024 10:20

DmitriySalnikov mentioned this pull request Jan 31, 2025

.pdb rename error for rust godot crate in version 4.3 #102206

Open
