3 datasets
  • BlogoCenter Data Sets

    Offsite — The following datasets are available: a real-web dataset containing hash values of the content of 353,739 web pages collected over a period of six months (Feb. 1999 – July 1999). The same real-web dataset formatted in three columns (web_site, web_page, change_history). Change history is a sequence of bits: 1 means that the specific page has changed between the respective visits ...
  • OpenCalais API

    Offsite — The OpenCalais Web Service automatically creates rich semantic metadata for the content you submit – in well under a second. Using natural language processing (NLP), machine learning and other methods, Calais analyzes your document and finds the entities within it. But Calais goes well beyond classic entity identification and returns the facts and events hidden within ...
  • FindTheBest.com Blog Software listing

    Offsite — Find and compare the best blog software programs by name, programming language, price, and more.
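
The three-column layout described for the BlogoCenter real-web dataset (web_site, web_page, change_history) could be read roughly as follows. This is only a minimal sketch: the whitespace delimiter, the sample row, and the names `PageHistory`, `parse_change_history`, and `change_rate` are assumptions made for illustration and are not taken from the dataset's own documentation.

```python
from typing import List, NamedTuple


class PageHistory(NamedTuple):
    web_site: str
    web_page: str
    changes: List[int]  # 1 = page changed between two consecutive visits


def parse_change_history(line: str) -> PageHistory:
    """Parse one row of the ASSUMED whitespace-separated three-column layout."""
    web_site, web_page, bits = line.split()
    if set(bits) - {"0", "1"}:
        raise ValueError(f"change history must contain only 0/1: {bits!r}")
    return PageHistory(web_site, web_page, [int(b) for b in bits])


def change_rate(history: PageHistory) -> float:
    """Fraction of visit intervals in which the page changed."""
    if not history.changes:
        return 0.0
    return sum(history.changes) / len(history.changes)


if __name__ == "__main__":
    # Hypothetical sample row, not real data from the dataset.
    row = parse_change_history("example.com /index.html 0110")
    print(row.web_site, row.web_page, row.changes, change_rate(row))
```

If the actual files use a different separator (tabs, commas) or encode the bit sequence differently, only `parse_change_history` would need to change.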