20 05 2013
Why \d is slower than [0-9]
I learned an interesting thing today about regular expressions via this stackoverflow question.
\d, commonly used as a shorthand for digits (which we usually think of as
0-9) actually checks against all valid unicode digits.
Given that, it makes sense why
\d in a regular expression is slower, since it has to check against all possible digit types. In C# you can limit the regular expression to use ECMAScript standards which doesn’t include the full unicode subset of digits.
While I’m neither the question asker nor answerer, I wanted to share since this is something I didn’t know about before.