Does anyone of an algorithm that extracts contents from a webpage? like instapaper?
there is an open source application that parses the text of an article out from any webpage
https://github.com/jiminoc/goose/wiki
should do the trick