Looking for dataset to test FULLTEXT style searches on [closed]
I am looking for a corpus of text to run some trial fulltext style data searches across. Either something I can download, or a system that generates it. Something a bit more random would be better e.g. 1,000,000 wikipedia articles in a format easy to insert into a 2 column database (id, text). Any ideas or suggestions? I'll throw this out there since I'm familiar with it - Prosper.com makes their member loan listings available for analysis through an XML export . The export would have about 50,000 loan requests with descriptions and over 1,000,000 member profiles (although many of those are