An Interactive Overview of Key Resources and Demonstrations
Cellar is the common data repository of the Publications Office of the European Union. Digital publications and metadata are stored in and disseminated via Cellar so that both humans and machines can use them. To serve users transparently, Cellar stores multilingual publications and metadata, is open to all EU citizens, and provides machine-readable data.
The Publications Office aims to provide a unified, central repository, Cellar, to store and disseminate all EU publications and data. Key objectives include:
Cellar receives publications and their associated metadata as packages based on the Metadata Encoding and Transmission Standard (METS). It stores and disseminates digital publications and metadata for various EU services.
The Cellar infrastructure is built around high-availability Virtuoso installations.
A publication can be requested via a RESTful interface, selecting it by linguistic version and format.
http://publications.europa.eu/resource/{ps-name}/{ps-id}?language={dec-lang}

{ps-name}
: A valid production system name.

{ps-id}
: A valid production system id, identifying a work.

Accept:{mime-type}
: e.g., application/pdf, application/msword, application/rdf+xml, etc.

Accept-Language:{acc-lang}
: A 3-char ISO 639-3 language code.

Accept-Max-Cs-Size:{size}
: Max content stream size in bytes.

A collection of publications can be requested via a RESTful interface.
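As a rough illustration of both request styles (the single-publication request above and the collection request detailed next), the sketch below assembles the URL and headers with Python's standard library. The function names, the placeholder identifiers `example-ps` and `example-id`, and the manifestation type `pdf` are assumptions made for illustration; the source does not say what `{dec-lang}` accepts, so it is treated here as an optional value passed through unchanged.

```python
from urllib.parse import quote, urlencode
from urllib.request import Request, urlopen

BASE_URL = "http://publications.europa.eu/resource"


def build_cellar_request(ps_name, ps_id, accept, acc_lang,
                         dec_lang=None, max_cs_size=None):
    """Return (url, headers) for a Cellar resource request.

    accept      -- value of the Accept header (a MIME type)
    acc_lang    -- 3-char ISO 639-3 code for Accept-Language
    dec_lang    -- optional value for the ?language= query parameter
    max_cs_size -- optional max content stream size in bytes
    """
    url = f"{BASE_URL}/{quote(ps_name, safe='')}/{quote(ps_id, safe='')}"
    if dec_lang is not None:
        url += "?" + urlencode({"language": dec_lang})
    headers = {"Accept": accept, "Accept-Language": acc_lang}
    if max_cs_size is not None:
        headers["Accept-Max-Cs-Size"] = str(max_cs_size)
    return url, headers


def fetch(url, headers, timeout=30):
    """Send the request and return the raw response body (network call)."""
    with urlopen(Request(url, headers=headers), timeout=timeout) as resp:
        return resp.read()


# Single publication as PDF (identifiers are placeholders, not real works).
single_url, single_headers = build_cellar_request(
    "example-ps", "example-id", "application/pdf", "eng",
    dec_lang="eng", max_cs_size=5_000_000,
)

# Collection as a ZIP archive; the mtype value "pdf" is an assumed example.
coll_url, coll_headers = build_cellar_request(
    "example-ps", "example-id", "application/zip;mtype=pdf", "eng",
)
```

Keeping request construction separate from `fetch` lets the URL and headers be inspected or logged before anything is sent over the network.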
http://publications.europa.eu/resource/{ps-name}/{ps-id}?language={dec-lang}

{ps-name}
: A valid production system name.

{ps-id}
: A valid production system id, identifying a work.

Accept:{mime-type}
: e.g., application/list;mtype={manifestation-type} or application/zip;mtype={manifestation-type}.

Accept-Language:{acc-lang}
: A 3-char ISO 639-3 language code.

Cellar is part of a wider ecosystem that includes several key components handling the publication lifecycle.
IMMCbuilder creates an IMMC package containing digital publications and metadata.
CERES receives and validates IMMC packages, generating METS packages for Cellar.
Digital content is stored in Cellar's file system, RDBMS, and knowledge graph.
Dissemination occurs via EUR-Lex and the OP Portal using RSS, REST API, and SPARQL interfaces.
Archivist sends publications and metadata to EUDOR, the long-term archive.
Source | Datasets | Triples (est.) | Growth |
---|---|---|---|
EU Legal Content | EU Law (EUR-Lex), Treaties, Case Law, International Agreements | Large-scale | Continuous growth with new legislation and case law |
Publications | Official Journals, General Publications, Research Papers | Large-scale | Stable and continuous |
Metadata & Vocabularies | FRBR-based metadata, EuroVoc, Named Authority Lists (NALs), Thesauri | Medium-scale | Ongoing updates and enrichments |
Total | — | Billions+ (inferred from system scale) | Sustained growth across all data types |
Further details are available in the Cellar Stats Comparison and KGC Presentation.
Data Type | Description | Access |
---|---|---|
Public Legal Data | Querying for specific legal acts, their status, entry-into-force dates, and inter-relations. | SPARQL Endpoint |
Bibliographic Data | Retrieving EU publications and documents based on the WEMI (Work, Expression, Manifestation, Item) model. | SPARQL Endpoint |
Controlled Vocabularies | Accessing and navigating concepts from NALs like EuroVoc, corporate bodies, and subject matter directories. | SPARQL Endpoint |
Entity Data | Retrieving structured information on individuals and organizations from sources like the EU Whoiswho. | SPARQL Endpoint |
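To make the SPARQL access path in the table more concrete, here is a minimal sketch of a controlled-vocabulary lookup. It assumes the endpoint lives at `http://publications.europa.eu/webapi/rdf/sparql` and that vocabulary concepts are exposed as SKOS (`skos:Concept` with `skos:prefLabel`); neither detail is stated in this overview, and the function names are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumption: endpoint address is not given in this overview.
SPARQL_ENDPOINT = "http://publications.europa.eu/webapi/rdf/sparql"


def concept_label_query(term, lang="en", limit=10):
    """Build a SPARQL query for SKOS concepts whose label contains `term`."""
    if not lang.isalpha():
        raise ValueError("lang must be a plain language tag such as 'en'")
    # Escape backslashes and quotes so the term is a safe string literal.
    needle = term.lower().replace("\\", "\\\\").replace('"', '\\"')
    return (
        "PREFIX skos: <http://www.w3.org/2004/02/skos/core#>\n"
        "SELECT ?concept ?label WHERE {\n"
        "  ?concept a skos:Concept ;\n"
        "           skos:prefLabel ?label .\n"
        f'  FILTER (lang(?label) = "{lang}" && CONTAINS(LCASE(STR(?label)), "{needle}"))\n'
        "}\n"
        f"LIMIT {int(limit)}"
    )


def build_sparql_request(query, endpoint=SPARQL_ENDPOINT):
    """Return (url, headers) for a SPARQL Protocol GET request."""
    url = endpoint + "?" + urlencode({"query": query})
    headers = {"Accept": "application/sparql-results+json"}
    return url, headers


def run_query(query, endpoint=SPARQL_ENDPOINT, timeout=30):
    """Execute the query (network call) and return the result bindings."""
    url, headers = build_sparql_request(query, endpoint)
    with urlopen(Request(url, headers=headers), timeout=timeout) as resp:
        return json.load(resp)["results"]["bindings"]
```

Calling `run_query(concept_label_query("fisheries"))` would return a list of bindings, each with `concept` and `label` entries, provided the assumptions above hold for the live endpoint.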
Instance | Reasoning |
---|---|
Production | To handle the high volume of public queries and data serving requirements for the live Cellar service. Ensures high availability and performance. |
Non-Production | For development, testing, data loading, and quality assurance without impacting the production environment. |
Cello is an OPAL-based prototype that demonstrates an AI agent (assistant) defined in natural language using Markdown, with loose coupling between a selected LLM and the Cellar SPARQL endpoint.
In this demo, we allow the OpenLink AI Layer (OPAL) to generate its own queries.