There is this example, specifically this part of it:
text_tokenizer tok(text,boost::char_separator(" \\t\\n.,;:!?\'\\"-")); unsig