Automatic process for data ingestion into the OpenCitations collections
Scientific-Disciplinary Group
01/INFO-01 - Informatics
Description
OpenCitations is an Open Science infrastructure that provides a large body of bibliographic metadata and scholarly citation data, with quality and coverage comparable to proprietary services such as Web of Science and Scopus. The primary objective of this work is to progressively improve the data ingestion procedure until it is fully automatic. The procedure covers four stages: preprocessing the sources; converting new data and ingesting it into the OpenCitations databases; verifying both the correct execution of the process and the data it produces; and publishing new versions of the collections in specific repositories and application contexts. The specific goal is to identify a suitable framework and use it to implement an automatic, flexible, and extensible pipeline that integrates with the technologies already used in the OpenCitations infrastructure.
Job posting website
Funding body
ALMA MATER STUDIORUM - UNIVERSITA' DI BOLOGNA - DIPARTIMENTO DI FILOLOGIA CLASSICA E ITALIANISTICA
How to apply
Other
Selection process
The original posting is available on the MUR website.