I am looking for twitter or other social networking sites dataset for my project. I currently have the CAW 2.0 twitter dataset but it only contains tweets of users. I want a dat
Try the following three datasets:
Contains around 97 milllion tweets:
http://demeter.inf.ed.ac.uk/index.php?option=com_content&view=article&id=2:test-post-for-twitter&catid=1:twitter&Itemid=2
ed note: the dataset previously linked above is no longer available because of a request from Twitter to remove it.
Contains user graph of 47 million users:
http://an.kaist.ac.kr/traces/WWW2010.html
Following dataset contains network as well as tweets, however the data was collected by snowball sampling or something hence the friends network is not uniform. It has around 10 million tweets you can mail the researcher for even more data.
http://www.public.asu.edu/~mdechoud/datasets.html
Though have a look at the license the data is distributed under.
Hope this helps, Also can you tell me what kind of work are planning with this dataset? I have few hadoop / pig scripts to use with dataset