Page MenuHomeFreeBSD

awk: avoid walking off end of some invalid format strings
ClosedPublic

Authored by emaste on Apr 14 2023, 2:51 PM.
Tags
None
Referenced Files
Unknown Object (File)
Fri, Jan 17, 8:29 AM
Unknown Object (File)
Tue, Jan 7, 7:04 PM
Unknown Object (File)
Tue, Jan 7, 6:11 PM
Unknown Object (File)
Tue, Jan 7, 6:02 PM
Unknown Object (File)
Mon, Jan 6, 7:05 AM
Unknown Object (File)
Mon, Jan 6, 6:40 AM
Unknown Object (File)
Nov 28 2024, 6:14 AM
Unknown Object (File)
Nov 28 2024, 5:48 AM
Subscribers

Details

Summary
This is a minimal change to avoid the invalid memory access; a more
comprehensive change ought to happen upstream.

Reported by:    phk
Sponsored by:   The FreeBSD Foundation

Diff Detail

Lint
Lint Skipped
Unit
Tests Skipped

Event Timeline

emaste created this revision.

I don't understand why you did what you did worked... Perhaps a sentence or two explaining why that's the right place to do this.

Also, what does the test case printf("%#") or %- or other pathology do? Or some other unterminated sequence?

contrib/one-true-awk/run.c
851

Why not have that

if (*(s+1) == '\0') break;

here, either before or after this if.

contrib/one-true-awk/run.c
851

That would exit the loop before detecting and reporting the error if the string is just "%", and would not fix the error phk reported for e.g. %1

I don't understand why you did what you did worked... Perhaps a sentence or two explaining why that's the right place to do this.

This was the simplest way to avoid walking off the end of the string, if we've already consumed everything. I don't think it's the "right" place to do this, it's just the least invasive.

Add a comment about the break condition.

contrib/one-true-awk/run.c
861

This s++ is the offending increment, moving s to the \0. But we want to keep the warnings at line 916/921, so we don't want to exit the loop here.

small improvement: move the \0 check into the offending loop

markj added inline comments.
contrib/one-true-awk/run.c
861

Should the order of these checks be reversed? That is, don't you want to know that *s != '\0' before checking whether *(s+1) != '\0'?

sample output w/ this change:

$ ./a.out 'BEGIN {printf("%3u\n", 123)}'
123

$ ./a.out 'BEGIN {printf("%3u\n")}'
./a.out: not enough args in printf(%3u
)
 source line number 1

 $ ./a.out 'BEGIN {printf("%3\n", 123)}'
./a.out: weird printf conversion %3
 source line number 1
%3123

$ ./a.out 'BEGIN {printf("%")}'
./a.out: weird printf conversion 
 source line number 1
./a.out: not enough args in printf(%)
 source line number 1
contrib/one-true-awk/run.c
861

Hmm, yes - I did it in this order because I think we don't want the t++ for the case that we lack the conversion character.

In practice I think it's actually OK (although unintentional) but is indeed confusing. What about

for (t = fmt; *s != '\0' && *(s+1) != '\0'; s++) {
        *t++ = *s;

(I say it's OK because on the first loop iteration *s is not \0 (from line 845), and so it's fine to examine *(s+1) on each pass.)

This looks good to me. I'm neutral on order per mark.

This revision is now accepted and ready to land.Apr 14 2023, 4:56 PM

third try: just turn missing conversion specifier into a fatal error

This revision now requires review to proceed.Apr 14 2023, 5:00 PM
contrib/one-true-awk/run.c
916

will add break; as well to keep Coverity or other tools happy

thanks for making this much better.

This revision is now accepted and ready to land.Apr 14 2023, 5:10 PM