I have a text file that contains tweets per line, that need to be altered for a machine learning format. Im using python and basic unix text manipulation (regex) to achieve
If you needed this as an all regex, then have a look at my solution here Changing lines of text into binary type pattern