I am using scrapy to scrap blogs and then store the data in mongodb. At first i got the InvalidDocument Exception. So obvious to me is that the data is not in the right encoding
First, when you do "somestring".encode(...), isn't changing "somestring", but it returns a new encoded string, so you should use something like:
"somestring".encode(...)
"somestring"
item['author'] = item['author'].encode('utf-8', 'strict')
and the same for the other fields.