Apparently fuzz tests that used regexes were an issue in practice for Benchify (the company that ran into this problem). From the blog post:
Benchify observed that the model was much more likely to generate a test with no false positives when using string methods instead of regexes, even if the test coverage wasn’t as extensive.
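To make the distinction concrete, here is a minimal sketch of the same format check written both ways. It is not taken from Benchify's post: the function names and the price-format property are hypothetical, chosen only to show how a small regex slip (an unescaped `$`) can make a check reject correct output, which is the kind of false positive described above.

```python
import re

# Hypothetical property: a formatted price should look like "$12.34".

def looks_like_price_buggy_regex(s: str) -> bool:
    # Illustrative mistake: the unescaped "$" is an end-of-string anchor,
    # so this pattern cannot match a string that starts with a dollar sign.
    return re.match(r"$\d+.\d{2}", s) is not None

def looks_like_price_regex(s: str) -> bool:
    # Corrected regex: "$" and "." are escaped and the whole string must match.
    return re.fullmatch(r"\$[0-9]+\.[0-9]{2}", s) is not None

def looks_like_price_str(s: str) -> bool:
    # Same check using only string methods; there is nothing to escape or anchor.
    if not s.startswith("$"):
        return False
    dollars, sep, cents = s[1:].partition(".")
    return (
        sep == "."
        and dollars.isascii() and dollars.isdigit()
        and len(cents) == 2
        and cents.isascii() and cents.isdigit()
    )
```

A test built on the buggy regex would flag a correct value such as `"$12.34"` as a failure, while the string-method version accepts it. The string version is more verbose, and it says nothing about how often such slips occur in practice; it only shows one way they can arise.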