How can I safely validate an untrusted regex in Perl?

前端 未结 2 1491
情歌与酒
情歌与酒 2021-01-19 15:20

This answer explains that to validate an arbitrary regular expression, one simply uses eval:

while (<>) {
    eval \"qr/$_/;\"
    print $         


        
相关标签:
2条回答
  • 2021-01-19 15:58

    There is some discussion about this over at The Monastery.

    TLDR: use re::engine::RE2 (-strict => 1);

    Make sure at add (-strict => 1) to your use statement or re::engine::RE2 will fall back to perl's re.

    The following is a quote from junyer, owner of the project on github.

    RE2 was designed and implemented with an explicit goal of being able to handle regular expressions from untrusted users without risk. One of its primary guarantees is that the match time is linear in the length of the input string. It was also written with production concerns in mind: the parser, the compiler and the execution engines limit their memory usage by working within a configurable budget – failing gracefully when exhausted – and they avoid stack overflow by eschewing recursion.

    0 讨论(0)
  • 2021-01-19 16:01

    The solution is simply to change

    eval("qr/$_/")
    

    to

    eval("qr/\$_/")
    

    This can be written more clearly as follows:

    eval('qr/$_/')
    

    But that's still not optimal. The following would be far better as it doesn't involve generating and compiling Perl code at run-time:

    eval { qr/$_/ }
    

    Note that neither solution protects you from denial of service attacks. It's quite easy to write a pattern that will take longer than the life of the universe to complete. To hand that situation, yYou could execute the regex match in a child for which CPU ulimit has been set.

    0 讨论(0)
提交回复
热议问题