Automatic process for data ingestion into the OpenCitations collections

Position: Research appointment (pre-doc)
Institute: University of Bologna
Posted on: 23/02/2026
Deadline: 23/03/2026

Scientific-Disciplinary Group

01/INFO-01 - Informatics

Description

OpenCitations is an Open Science infrastructure that provides a large body of bibliographic metadata and scholarly citation data, with quality and coverage comparable to proprietary services such as Web of Science and Scopus. The primary objective of this work is to improve the data ingestion procedure until it is fully automatic. The procedure covers the preprocessing of sources, the conversion and ingestion of new data into the OpenCitations databases, the verification that the process ran correctly and produced valid data, and finally the publication of new versions of the collections in specific repositories and application contexts. The specific goal is to identify a suitable framework and use it to implement an automatic, flexible, and extensible pipeline that integrates with the technologies already in use in the OpenCitations infrastructure.
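
As a purely illustrative sketch of what a staged, extensible pipeline of this kind could look like, the Python below chains a preprocessing, a conversion, and a verification stage. Everything in it is an assumption made for illustration: the `Pipeline` class, the stage functions, and the `citing`/`cited` record fields are hypothetical and do not reflect the actual OpenCitations software or data model. Ingestion into a database and publication are left out.

```python
from dataclasses import dataclass, field
from typing import Callable

# Hypothetical record shape: {"citing": <identifier>, "cited": <identifier>}.
Record = dict[str, str]
Stage = Callable[[list[Record]], list[Record]]


@dataclass
class Pipeline:
    """Runs named stages in order; each stage maps records to records."""
    stages: list[tuple[str, Stage]] = field(default_factory=list)

    def add(self, name: str, stage: Stage) -> "Pipeline":
        self.stages.append((name, stage))
        return self  # returning self allows chained .add() calls

    def run(self, records: list[Record]) -> list[Record]:
        for name, stage in self.stages:
            records = stage(records)
            print(f"{name}: {len(records)} record(s)")
        return records


def preprocess(records: list[Record]) -> list[Record]:
    # Keep only records that name both ends of the citation.
    return [r for r in records if r.get("citing") and r.get("cited")]


def convert(records: list[Record]) -> list[Record]:
    # Normalise identifiers (trim, lowercase) and drop duplicates in order.
    seen, out = set(), []
    for r in records:
        key = (r["citing"].strip().lower(), r["cited"].strip().lower())
        if key not in seen:
            seen.add(key)
            out.append({"citing": key[0], "cited": key[1]})
    return out


def verify(records: list[Record]) -> list[Record]:
    # Stop the run if a record cites itself (an assumed sanity check).
    for r in records:
        if r["citing"] == r["cited"]:
            raise ValueError(f"self-citation: {r['citing']}")
    return records


def build_pipeline() -> Pipeline:
    return (
        Pipeline()
        .add("preprocess", preprocess)
        .add("convert", convert)
        .add("verify", verify)
    )
```

Because each stage is a plain function from records to records, a new source format or an extra check can be added with one more `.add(...)` call, which is one simple way to obtain the flexibility and extensibility the description asks for.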

Funding body

ALMA MATER STUDIORUM - UNIVERSITÀ DI BOLOGNA - DIPARTIMENTO DI FILOLOGIA CLASSICA E ITALIANISTICA

How to apply

Other

Selection process

To apply for research grants, fill out the form available at the following address: https://bandi.unibo.it/ricerca/incarichi-di-ricerca