Tag
crawling
2 datasets-
TechTC - Technion Repository of Text Categorization Data Sets
Offsite — The Technion Repository of Text Categorization Datasets provides a large number of diverse test collections for use in text categorization research. -
Samples of Facebook users and Facebook user application installations
Offsite — Two representative samples of (~1 million) Facebook users collected in April 2009 with friend list, privacy settings and network membership for each user. One representative sample of ~13K Facebook application installations by ~300K users. Two samples of weighted college Facebook users collected in Oct 2010.