Newsletter3K is a good python Library for News content extraction. It works mostly well .I want to extract names after first "by" word in visible text