De-duplicating the OpenAIRE Scholarly Communication Big Graph
Main Authors: | Atzori, Claudio, Manghi, Paolo, Bardi, Alessia |
---|---|
Format: | Proceeding poster Journal |
Bahasa: | eng |
Terbitan: |
, 2018
|
Subjects: | |
Online Access: |
https://zenodo.org/record/1489140 |
Daftar Isi:
- The OpenAIRE infrastructure populates a scholarly communication big graph interlinking metadata objects of publications, datasets, software, organizations, funders, and projects.In order to de-duplicate this graph, OpenAIRE has developed GDup , an integrated, scalable, general-purpose system for entity deduplication over big information graphs. GDup offers functionalities to realize a fully-fledged entity deduplication workflow over a generic input graph, inclusive of Ground Truth support, end-user feedback, and strategies for identifying and merging duplicates to obtain an output disambiguated graph.