Open PHACTS: solutions and the Foundation
Main Author: | Willighagen, Egon |
---|---|
Format: | info Proceeding eJournal |
Bahasa: | eng |
Terbitan: |
, 2014
|
Online Access: |
https://zenodo.org/record/3588454 |
Daftar Isi:
- Open PHACTS is a five year project of the Innovative Medicines Initiative (IMI), ending in February 2016. It aims to reduce the barriers to drug discovery in industry, academia and for small businesses. The Open PHACTS consortium is building a freely available platform, integrating data from a variety of information resources, and providing tools and services to query these integrated data to support life sciences research. Currently, pharmaceutical companies expend significant and often duplicated efforts aligning and integrating internal information with public data sources. This process is difficult and inefficient and the vast majority of data sources cannot easily interoperate, often requiring additional steps to map identifiers or manually curate and correct the content. Open PHACTS is creating a precompetitive infrastructure to make these data integration approaches available both to industry and to academia and smaller companies, who have historically not had access to large-scale integrated data resources. Here we give an overview of the resulting Open PHACTS Discovery Platform, the semantic web solutions used in this, and describe the integration of the data into dedicated and generic data analysis tools. The platform consists of components that communicate with each other using open standards and cover the full data lifecycle, from data loading to data sharing. Solutions underlying the platform include those for data provenance, data normalization, data standardization, and data access. In particular, we have developed minimal reporting standards for provenance, technologies to express the level of equivalence of entities from different databases, a database identifier mapping infrastructure based on semantic web technologies, unit and end point normalization, as well as chemical structure normalization. For this we use open ontologies (BioAssay Ontology, QUDT, CHEMINF, etc), standards (RDF, SPARQL, REST, etc), and proposed solutions as outlined in published specifications. On top of these approaches, user oriented solutions have been developed based on a number of research questions selected by the pharmaceutical industry(3). Example questions include: “Give me all oxidoreductase inhibitors active <100><1 μm”. The OPS platform provides a uniform route by which these questions can be addressed, exposed to the user by a novel pharmaceutical web service platform, called the Linked Data API (LDA). As well as provided an API and web-portal to access integrated data, the Open PHACTS platform also supports an ecosystem of third-party applications addressing specialised needs such as polypharmacology, hit-selection, target validation and knowledge discovery. Additionally, more generic integrations have been developed too, like client libraries to the LDA in various programming languages, such as JavaScript, and Scala.