I'm trying to find an efficient way to split very large text files (gigabytes in size, millions to billions of lines) based on a regex. The regex contains capturing groups that must