Parsing awk is tricky | An awk implementation

submited by
Style Pass
2024-09-02 03:30:05

If you only know what the Awk book (by A, K, & W, 1st or 2nd Ed.) says, you’d be missing some details. For example, the Awk book and many other documents say a for statement is:

But try this: awk 'BEGIN{for (; x++ < 3; print x);}' (using gawk, nawk, nnawk, or goawk; mawk and bbawk (busybox awk) fail.)

This works because expression₁ and expression₃ are actually simple_statement which can be an expression, delete statement, or print statement. (This is in both the yacc grammar for nawk/nnawk and in the POSIX reference grammar.) Why? Maybe no one knows. Maybe it somehow made the syntax better for yacc. I asked gawk maintainer Arnold Robbins about this, and he suggested

But even if it could be useful, very few users would ever know about it because it’s undocumented outside the formal grammar.

Consider this: awk 'BEGIN{if (1 < 2 && x = 3) print x}'. According to the POSIX grammar this should be a syntax error, but it prints 3.

Leave a Comment