TDC Tools: streamlining information retrieval applications thanks to a tabular Document-Concept representation of the biomedical literature

Main Authors: Moreau, Erwan; Hardiman, Orla; Heverin, Mark; O'Sullivan, Declan
Format: info publication-preprint Journal
Published: 2022
Online Access: https://zenodo.org/record/6380693
Table of Contents:
  • Motivation: Several methods can be used to obtain the content of biomedical literature articles annotated with standardized concepts. Various information retrieval (IR) applications exploit these resources, e.g. Literature-Based Discovery (LBD). Such applications often require a complex processing pipeline to transform the raw literature into an appropriate structured representation. Results: TDC Tools offer an intermediate level of representation intended to facilitate the design and implementation of IR systems that exploit a concept-based view of the literature. Conceptually, this intermediate representation decouples the data extraction part from the task-specific exploitation part, thus improving the modularity, interoperability and ultimately the reusability of such systems. Availability: The software and its dependencies are published under an open-source license and provided with detailed instructions: https://github.com/erwanm/tdc-tools.