Tag

computer_science

3 datasets
  • New SwetoDblp RDF dataset released with 11M triples

    Offsite — The LSDIS (Large Scale Distributed Information Systems) lab at the University of Georgia has released a new version of the SwetoDblp dataset. SwetoDblp is a large-size ontology (spin-off of SWETO ontology) focused on bibliography data of Computer Science publications where the main data source is DBLP (Digital Bibliography & Library Project). The dataset has about 11M ...
  • LSDIS : SwetoDblp

    Offsite — SwetoDblp is a large-size ontology (spin-off of SWETO ontology) focused on bibliography data of Computer Science publications where the main data source is DBLP (Digital Bibliography & Library Project). SwetoDblp was created from a large XML document available at DBLP’s website and other datasets that are used to add relationships to other entities such as Publishers, ...
  • Duplicate Detection, Record Linkage, and Identity Uncertainty: Datasets

    Offsite — The following datasets have been provided for evaluating duplicate detection, record linkage, and identity uncertainty systems. Several of these are not yet available for downloading; please contact the authors. The datasets include a segmented citation dataset based on the Cora research paper search engine, a collection of 864 restaurant records from the Fodor’s and ...