I\'m currently using a CrawlSpider to look for any links and therefore follow them.
In order to crawl urls without HREF tags (plain text) i\'m extracting them and the