toke.c dont call libc's memcmp() to test 1 byte in Perl_scan_str() #23533

bulk88 · 2025-08-03T13:54:25Z

delim_byte_len is almost always 1, open_delim_str is almost always '"' or ''' or something similar. I'm not sure which exact string of PP code will make delim_byte_len not be 1, but it would be too rare to optimize for but still must be supported.

Just test the char directly if its length of 1. Invoking libc memcmp() requires 4 ABI inputs on any CPU, and while most of the code paths above the memEQ() lines are constants directly initialized inside Perl_scan_str(), one branch uses "utf8_to_uv_or_die(,,&delim_byte_len)" which optimizes to Perl_utf8_to_uvchr_buf_helper(,,,&delim_byte_len) making the value in STRLEN delim_byte_len unbounded according to all CC. All CCs must assume the value Perl_utf8_to_uvchr_buf_helper() put inside delim_byte_len could be a 4.7GB DVD or 25GB BD .iso file.

Put the retval of SvGROW() to use.

Don't let C auto var delim_byte_len escape with "&" op thru utf8_to_uv_or_die(). Var delim_byte_len can never be stored in a register again by any CC if it escapes and must be reread from C stack after ever possible call if it escapes.

This set of changes does not require a perldelta entry.

delim_byte_len is almost always 1, open_delim_str is almost always '"' or '\'' or something similar. I'm not sure which exact string of PP code will make delim_byte_len not be 1, but it would be too rare to optimize for but still must be supported. Just test the char directly if its length of 1. Invoking libc memcmp() requires 4 ABI inputs on any CPU, and while most of the code paths above the memEQ() lines are constants directly initialized inside Perl_scan_str(), one branch uses "utf8_to_uv_or_die(,,&delim_byte_len)" which optimizes to Perl_utf8_to_uvchr_buf_helper(,,,&delim_byte_len) making the value in STRLEN delim_byte_len unbounded according to all CC. All CCs must assume the value Perl_utf8_to_uvchr_buf_helper() put inside delim_byte_len could be a 4.7GB DVD or 25GB BD .iso file. Put the retval of SvGROW() to use. Don't let C auto var delim_byte_len escape with "&" op thru utf8_to_uv_or_die(). Var delim_byte_len can never be stored in a register again by any CC if it escapes and must be reread from C stack after ever possible call if it escapes.

leonerd

LGTM.

Commit message is reasonably descriptive and readable as to what the change actually is.

khwilliamson

My primary issue with this is that "What is it about this code that makes it different from other uses of memEQ in general.?" Or rather, "If this is a good idea here, why not do it in memEQ and get the benefits everywhere?" And for that matter, how do we know that memcmp isn't an inline function that does the same thing? I have looked at various libc sources in the past, and recall this sort of thing being done. It's a reasonable thing to do

The commit title is fine, and the message is better than recent ones, but I don't think it is good enough. There are misspellings and short cuts like "thru" which may not be known except to native English speakers. The message must be easily comprehensible to people without the specialized knowledge it assumes, nor the jargon.

It also is wrong. Perl_utf8_to_uvchr_buf_helper is only called in unusual circumstances. And so all that second paragraph that I don't understand is based on false premises.

khwilliamson · 2025-08-08T20:35:50Z

toke.c

        /* set 'to' to the next character in the sv's string */
-        to = SvPVX(sv)+SvCUR(sv);
+        to = pv + SvCUR(sv);


This is good, but I think it would be better in a separate commit; one that wouldn't be reverted if the rest turned out to be.

khwilliamson · 2025-08-08T20:37:47Z

toke.c

-                {
+                    &&   ((delim_byte_len == 1
+                            ? (s[1] == open_delim_str[0]
+                                || s[1] == close_delim_str[0])


I'm unhappy that this and elsewhere reformats code that corresponds to Perl Best Practices to not conform.

khwilliamson · 2025-08-09T15:57:31Z

It occurred to me that maybe there could be a memEQ_multi_byte_likely() to avoid the extra conditional for cases where we think it more likely that a full memcmp is required

The maximum length returned to advance is 14. Would an ASSUME in the inline function tell the compilers they don't have to plan for a full Size_t value. We could also change the parameter on the internal-only helper function so that it is a U8.

leonerd approved these changes Aug 7, 2025

View reviewed changes

khwilliamson requested changes Aug 8, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

toke.c dont call libc's memcmp() to test 1 byte in Perl_scan_str() #23533

toke.c dont call libc's memcmp() to test 1 byte in Perl_scan_str() #23533

Uh oh!

bulk88 commented Aug 3, 2025

Uh oh!

leonerd left a comment

Uh oh!

khwilliamson left a comment

Uh oh!

khwilliamson Aug 8, 2025

Uh oh!

khwilliamson Aug 8, 2025

Uh oh!

khwilliamson commented Aug 9, 2025

Uh oh!

Uh oh!

toke.c dont call libc's memcmp() to test 1 byte in Perl_scan_str() #23533

Are you sure you want to change the base?

toke.c dont call libc's memcmp() to test 1 byte in Perl_scan_str() #23533

Uh oh!

Conversation

bulk88 commented Aug 3, 2025

Uh oh!

leonerd left a comment

Choose a reason for hiding this comment

Uh oh!

khwilliamson left a comment

Choose a reason for hiding this comment

Uh oh!

khwilliamson Aug 8, 2025

Choose a reason for hiding this comment

Uh oh!

khwilliamson Aug 8, 2025

Choose a reason for hiding this comment

Uh oh!

khwilliamson commented Aug 9, 2025

Uh oh!

Uh oh!