Big Data For All

We make messy, fragmented, and biased data reliable and transparent. By merging public and private sources into secure data-sharing spaces, we give AI the trustworthy foundations it needs. The result: reliable analytics and scalable AI solutions that power smarter decisions in operations, planning, research, and markets.

Data Sharing Spaces

Our Data Sharing Spaces implement the European Interoperability Framework (EIF) to connect diverse datasets on an as-needed, as-permitted basis. They enable organisations to share trustworthy data securely while respecting legal and technical differences.

Future-Proofing Systems

Regenerative, AI-supported data spaces that not only store clean data but also preserve the know-how to repair, modernise, and extend the life of legacy systems.

Eviota

Connected financial and sustainability reporting

Listen Local

Listen Local is a trustworthy, ethical AI-powered system that aims to help great artists in small organizations and small countries using big data.

Accomplishments

TextileBase

Reprex

Sep 2024 – Present

TextileBase links museum, archive, and research records on garments into a multilingual knowledge graph to support dress history, cultural heritage, and sustainable fashion.

FUDSS

Reprex

Mar 2025 – Present

We built the Finno-Ugric Data Sharing Space as a federated platform that solves some of the hardest data challenges — linking scattered, multilingual, public and private collections while respecting local knowledge and legal frameworks. By stress-testing our methods in this complex cultural setting, we developed tools directly applicable to equally difficult business data problems across borders and industries.

Smart Policy Documents Monitor National Policy

Slovak Ministry of Culture

Mar 2023 – Dec 2025 Bratislava, Slovakia

Reprex’s Smart Policy Documents technology will be used to monitor the national cultural and creative industry policies of the Slovak Republic.

Finalist, Winner of Audience Prize

The Hague Innovators Challenge 2022

Sep 2022 – Nov 2022 The Hague, Netherlands

Our automated data observatory concept, supported by a data-sharing space, won the audience prize.

OpenMusE

Open Music Europe Consortium

Jul 2022 – Dec 2025 Berlin, Germany

We initiated a Horizon Europe Research and Innovation action to launch our research automation tools from data collection to dissemination for scientific, business and policy partners.

Eviota

Inova+ & European Music Council

Jul 2022 – Feb 2023 Brussels, Belgium

Awarded project in the Green Recovery section of MusicAire.

Trustworthy AI Systems Hub

Competition Policy Centre, University of East Anglia & partners

Feb 2022 – Apr 2023 London, United Kingdom

With our Digital Music Observatory and Listen Local, we are partners in identifying potential adverse effects of AI-driven, autonomous music recommendation systems on market competition.

Cultural Creative Sectors Industries Data Observatory

University of Amsterdam

Oct 2021 – Present Amsterdam, Netherlands

A first attempt to replicate and extend our beachhead product, the Digital Music Observatory, to serve the film, fashion, book, design, and gaming industries.

Listen Local Slovakia Demo App & Feasibility Study

Slovak Arts Council & SOZA

Sep 2020 – Dec 2020 Bratislava, Slovakia

Formalizing our automated data observatory product and our bridgehead into the music industry.

AI+Blockchain Product/Market Fit Validation

University X

Oct 2020 – Dec 2020 Delft, Zuid-Holland, Netherlands

Formalizing our automated data observatory product and our bridgehead into the music industry.

Central European Music Industry Report

CEEMID, state51 group

Oct 2020 – Dec 2020 Brussels, Belgium

A 12-country music data harmonization and reporting project with a best-practice report.

Software

tuRtle

Synchronize datasets with global knowledge hubs

Jan 2024 – Present

The new, very early-stage tuRtle package helps annotate datasets created in the R statistical environment with the Resource Description Framework's Turtle language, so that they can be linked across the Internet.

dataset

Synchronize datasets with global knowledge hubs

Jun 2022 – Present

The primary aim of dataset is to create well-referenced, well-described, interoperable datasets that can be easily placed on global knowledge graphs using the W3C DataSet and RDF definitions, or synchronized via APIs that follow the Statistical Data and Metadata eXchange standards.

eviota

Connect financial and environment accounts, create financial + ESG reports

Jul 2022 – Dec 2025

Our minimum viable product will create CSRD sustainability reports.

statcodelists

Make your data codes understood globally

Aug 2023 – Present

Use the standardized codebooks of the Statistical Data and Metadata eXchange for international, language-independent, machine-to-machine data exchanges.

regions

Create more granular statistics from raw survey data in any EU country.

May 2020 – Present

Malta and Germany are hard to compare. Compare provinces with provinces, regions with regions.

retroharmonize

Harmonize question banks, recycle answers from past surveys

Jul 2020 – Present

Never start a questionnaire from scratch. Recycle questions from question banks and answers from open data repositories, and let your respondents add theirs.

iotables

Create economic or environmental impact assessments in any EU country.

Nov 2018 – Present

Tax, employment, and greenhouse gas multipliers; induced effects; policy scenarios.

Featured Publications

Daniel Antal, Kata Gábor, Ieva Pigozne, Bogáta Tímár

February 2026 In DHNB

Federating Open Knowledge through Wikibase: The Case of The Finno-Ugric Data Sharing Space

This paper presents the design and early implementation of the Finno-Ugric Data Sharing Space (DSS), a multilingual, community-driven prototype for linking cultural heritage data across institutional and geographic boundaries.

Daniel Antal

September 2025

Green Paper on AI, Data Governance, and Metadata Policies for Europe’s Music Ecosystem

This document situates the Open Music Observatory as a central reference point. The Observatory is a prototype of a modern European Music Observatory developed by the OpenMusE consortium, currently populated with data on economy, diversity, society, and innovation, and operating multiple federated modules.

Daniel Antal, Ieva Pigozne

September 2025 In Culture Crossroads

Linking Garments to Knowledge: TextileBase as an Interdisciplinary Graph for Dress and Textile Research

This article introduces TextileBase, a multilingual knowledge graph that connects dispersed data on garments from museums, archives, and libraries. By transforming artefact records, photographs, and texts into interoperable knowledge statements, it enables interdisciplinary research across dress history, ethnography, and sustainable fashion. The preprint demonstrates early results using Baltic and Finno-Ugric datasets and shows how TextileBase improves searchability, semantic interoperability, and reuse of cultural heritage data.

Daniel Antal, Anna Márta Mester

April 2025

Open Music Registers

Technical report on Open Music Registers, demonstrating a federated infrastructure for harmonising music-related registers into an interoperable dataspace.

Daniel Antal

December 2024 In STC

A szlovák adatkicserélési tér magyarországi föderációjának lehetőségei (Possibilities for a Hungarian Federation of the Slovak Data-Sharing Space)

The Slovak Comprehensive Music Database (SKCMDb) provides a trustworthy, interoperable register of music created in or connected to Slovakia, making national and minority repertoires visible and accessible through linked databases that bridge public institutions and private rights organisations.

Martin Senfleben, Thomas Margoni, Daniel Antal, Balazs Bodó, Stef van Gompel, Christian Handke, Martin Kretschmer, Joost Poort, João Quintais, Sebastian Felix Schwemer

February 2021 In JIPITEC

Ensuring the Visibility and Accessibility of European Creative Content on the World Market: The Need for Copyright Data Improvement in the Light of New Technologies

This influential article analyses how Europe can strengthen the visibility and accessibility of its cultural and creative works by improving copyright data infrastructures. It highlights the risks of poor metadata, the opportunities of Article 17 of the CDSM Directive, and the importance of trustworthy systems for licensing and remuneration. The music sector, where fragmented metadata leads to lost royalties and unfair competition, provides key examples. The work continues to inform our projects on trustworthy AI, data governance, and cultural data spaces.

Daniel Antal, Dáša Bulíková (translator)

December 2020 Published by SOZA.

Feasibility Study On Promoting Slovak Music In Slovakia & Abroad

Why are the total market shares of Slovak music relatively low in both the domestic and foreign markets? How can we measure the market share of Slovak music in these markets? We offer some answers and solutions based on empirical research and on the creation of a database and an AI application.

Balazs Bodó, Daniel Antal, Zoltan Puha

December 2020

Can scholarly pirate libraries bridge the knowledge access gap? An empirical study on the structural conditions of book piracy in global and European academia

The paper examines Library Genesis (LG), the biggest piratical scholarly library on the internet, which provides copyright-infringing access to more than 2.5 million scientific monographs, edited volumes, and textbooks. It uses advanced statistical methods to explain why researchers around the globe use copyright-infringing knowledge resources. The analysis draws on a large usage dataset from LG, as well as data from the World Bank, Eurostat, and Eurobarometer, to identify the role of macroeconomic factors, such as R&D and higher education spending, GDP, and researcher density, in scholarly copyright-infringing activities.