Tag

metadata

18 datasets
  • The Whitburn Project: 120 Years of Music Chart History

    Offsite — For the last ten years, obsessive record collectors in Usenet have been working on the Whitburn Project — a huge undertaking to preserve and share high-quality recordings of every popular song since the 1890s. To assist their efforts, they’ve created a spreadsheet of 37,000 songs and 112 columns of raw data, including each song’s duration, beats-per-minute, songwriters, ...
  • MusicBrainz

    Offsite — MusicBrainz is a user-maintained open music community that collects, and makes available to the public, music metadata, including information about artists, release groups, releases, tracks, labels and the many relationships between them. The database also contains a full history of all the changes that the MusicBrainz community has made to the music metadata. The music ...
  • Document Metadata Based on a Sample of Web Documents from the Open Directory

    Offsite — DMOZ100k06 is a large research data set about document metadata based on a random sample of 100,000 web documents from the Open Directory combined with data retrieved from the social bookmarking service delicious.com, the content rating system ICRA, and the search engine Google. The data set is freely available for other research. Michael G. Noll
  • Amsterdam Museum Data Set (RDF)

    Offsite — The Amsterdam Museum dataset describes more than 70,000 cultural heritage objects related to the city of Amsterdam described by the museum. The metadata was retrieved from an XML Web API of the museum’s Adlib collection database and converted to RDF compliant with the Europeana Data Model (EDM). This makes the Amsterdam Museum data the first of its kind to be officially ...
  • MIDAS - Heritage project

    Offsite — From the website: > What is MIDAS? > MIDAS sets out an agreed list of the items or ‘units’ of information that should be included in an inventory or other systematic record of the historic environment. These units of information are grouped together under broad headings or ‘information schemes’. These cover areas such as Monument Character, Events, People and ...
  • Open Media Database

    Offsite — About “omdb (open media database) is a free database for film media. There is no set editorial staff, but rather a large number of movie addicts and lovers who volunteer their time to provide material and develop the site. Anybody can add or change existing information on omdb once they have done the quick and simple task of signing up for their user login name. ...
  • Biblios.net - the world's largest database of freely-licensed library records

    Offsite — About > The beta test environment for LibLime’s new cataloging service, ‡biblios.net, is now available! > ‡biblios.net is a subscription-based, hosted version of the open-source ‡biblios metadata editor that we released earlier this year. In addition to the editor, ‡biblios.net includes some extended community features such as integrated real-time chat, forums, and ...
  • Discogs Release

    Free Download — This dataset consists of a collection of Infoboxes from Wikipedia on the topic of Discogs Release. Wikipedia describes Discogs, short for discographies, as a website and database of information about audio recordings, including commercial releases, promotional releases, and bootleg or off-label releases. The Discogs servers, currently hosted under the domain name ...
  • Audioscrobbler Data

    Offsite — Description “Much of the data available to view on Last.fm is available in several formats through the Audioscrobbler Web Services API.” Format Data variously available in Plain, XML, XSPF, iCal and RSS. License “All web services here are for non-commercial use only under the Creative Commons Attribution-NonCommercial-ShareAlike License. If you want to use these ...
  • Lyricsfly Lyrics REST API

    Offsite — Application Programming Interface is available to anyone who wishes to use our database for their own music project, website or program. If you currently use the web to search out lyrics or use code tricks to access other lyrics websites to display relevant lyrics text for your content you can now have a reliable source without the hassle. example code for php: ...
  • Airborne Antarctic Ozone Experiment (AAOE-87)

    Offsite — This data is from the Airborne Antarctic Ozone Experiment (AAOE) which was based in Punta Arenas, Chile during August and September 1987. The data was primarily collected onboard the NASA ER-2 and DC-8 aircraft, along with ozonesonde data collected at four Antarctic stations: Halley Bay, McMurdo, Palmer Station, and the South Pole. The experiment tested the chemical and ...
  • International Music Database Project (IMDBP)

    Offsite — About > IMDBP strives to categorize every single piece of music ever written in a format that is: 1. Flexible, extensible; 2. Thorough, uncompromising detail; 3. Efficient and intuitive to use for the average user, including the elimination of duplicate information entry and other potential inconsistencies
  • Wikipedia XML Data

    Offsite — This data set contains a complete copy of all Wikimedia wikis, in the form of wikitext source and metadata embedded in XML as provided by the Wikimedia Foundation. The data set will be updated every month and the 3 previous months will always be available for use. We will list previous snapshots in the text of this description.
  • DBPedia

    Offsite — ,DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. The DBpedia knowledge base currently describes more than 2.6 million things, including at least 213,000 persons, 328,000 places, 57,000 music albums, 36,000 films, 20,000 companies. The knowledge base consists of 274 million pieces of ...
  • Wikipedia Extraction (WEX)

    Offsite — The Freebase Wikipedia Extraction (WEX) is a processed dump of the English language Wikipedia. The wiki markup for each article is transformed into machine-readable XML, and common relational features such as templates, infoboxes, categories, article sections, and redirects are extracted intabular form. Freebase WEX is provided as a set of database tables in TSV format ...
  • Infochimps Site Metadata Dump

    Free Download — A full dump of all the metadata in the Infochimps repository. Includes complete information on collections, datasets, sources, licenses, tags, and fields.
  • OpenCalais API

    Offsite — The OpenCalais Web Service automatically creates rich semantic metadata for the content you submit – in well under a second. Using natural language processing (NLP), machine learning and other methods, Calais analyzes your document and finds the entities within it. But, Calais goes well beyond classic entity identification and returns the facts and events hidden within ...
  • OpenDover API

    Offsite — OpenDover is the leading webservice that lets you tag your documents based on sentiments and emotions found in your documents. The OpenDover API can handle different ways of sentiment tagging, depending on what your needs are, or what the content is that you provide via the API. The OpenDover knowledge base consists of thousands of opinion words, domain-related words and ...