• Media type: E-article; Other publication
  • Title: Two approaches to the dataset interlinking recommendation problem
  • Contributors: Lopes, Giseli Rabello [author]; Leme, Luiz André P. Paes [author]; Pereira Nunes, Bernardo [author]; Casanova, Marco Antonio [author]; Dietze, Stefan [author]; Benatallah, Boualem [author]; Bestavros, Azer [author]; Manolopoulos, Yannis [author]; Vakali, Athena [author]; Zhang, Yanchun [author]
  • Published: Heidelberg : Springer Verlag, 2014
  • Published in: Web Information Systems Engineering – WISE 2014 ; Lecture Notes in Computer Science ; 8786
  • Edition: accepted version
  • Language: English
  • DOI: https://doi.org/10.15488/1380; https://doi.org/10.1007/978-3-319-11749-2_25
  • ISBN: 978-3-319-11748-5; 978-3-319-11749-2
  • Keywords: Data interlinking ; Recommender systems ; Bayesian networks ; Classification (of information) ; Rank scores ; Real-world datasets ; Web of Data ; Exploratory search ; Linked Data ; Social networks ; Bayesian classifier ; Social networking (online)
  • Origin:
  • Notes: This data source also contains holdings records that do not lead to a full text.
  • Description: Whenever a dataset t is published on the Web of Data, an exploratory search over existing datasets must be performed to identify those datasets that are potential candidates to be interlinked with t. This paper introduces and compares two approaches to address the dataset interlinking recommendation problem, respectively based on Bayesian classifiers and on Social Network Analysis techniques. Both approaches define rank score functions that explore the vocabularies, classes and properties that the datasets use, in addition to the known dataset links. After extensive experiments using real-world datasets, the results show that the rank score functions achieve a mean average precision of around 60%. Intuitively, this means that the exploratory search for datasets to be interlinked with t might be limited to just the top-ranked datasets, reducing the cost of the dataset interlinking process. The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-11749-2_25. ; EC/FP7/LinkedUp ; CNPq/160326/2012-5 ; CNPq/303332/2013-1 ; CNPq/557128/2009-9 ; FAPERJ/E-26/170028/2008 ; FAPERJ/E-26/103.070/2011 ; FAPERJ/E-26/101.382/2014 ; CAPES/1410827
  • Access status: Open access