I had bookmarked this “simple regex language” a while ago, but when I revisited it this morning I was struck by just how neat the project is: begin with any of (digit, letter, one of “._%+-”) once or more, literally “@”, any of (digit, letter, one of “.-”) once or more, literally “.”, […]
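For comparison, here is a rough hand translation of the visible part of that query into a conventional regex, using Python's `re` module. It is only a sketch: the excerpt is cut off, so the rest of the pattern is left out, and it assumes that "letter" means `a-z` and "digit" means `0-9`. The names `SRL_PREFIX` and `matches_prefix` are made up for illustration.

```python
import re

# Hand translation of the visible part of the query only; the excerpt is
# truncated, so whatever follows the final literal "." is deliberately omitted.
# Assumption: "letter" means a-z and "digit" means 0-9.
SRL_PREFIX = re.compile(
    r"^[0-9a-z._%+-]+"  # begin with any of (digit, letter, one of "._%+-") once or more
    r"@"                # literally "@"
    r"[0-9a-z.-]+"      # any of (digit, letter, one of ".-") once or more
    r"\."               # literally "."
)


def matches_prefix(text: str) -> bool:
    """Return True if text starts with something the partial pattern accepts."""
    return SRL_PREFIX.match(text) is not None
```

Calling `matches_prefix("john.doe+tag@example.com")` should succeed, while a string with nothing before the “@” should not. The terse character classes are exactly what the spelled-out query is trying to make readable.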
Delightful post about a paper on Hyperscan, a mad fast streaming regex engine
Capturing subexpressions: at one stage we had an experimental product that (in block mode only, not streaming) could accurately model libpcre’s capturing semantics. This worked by scanning the data backwards with a backwards version of the pattern, making a trace of the states visited, and tracing forwards through the backward state trace in order to […]