We are a Netherlands-based start-up company that makes big data reliable and accountable, delivering trustworthy analytics and AI solutions. We validate multiple data sources and are able to merge private and proprietary data with open data. We bring novel insight into policy and business problems, as well as scientific research. Our work addresses the potentially negative effects of black-box proprietary algorithms. Our diverse team is particularly experienced in music, the creative industries, and digital humanities, where data is scattered in small organizations.

The Data Sisyphus
Poor metadata management causes many repeated tasks, errors, non-billable hours and uncredited work.
Open Data
Open data cannot be just ‘downloaded’. It is not ready-to-use, and often not even public.
Trustworthy AI
What can go wrong with the algorithm? Finding unwanted outcomes and correcting them in complex systems.
Research Automation
Repeated data processing and validation steps are best performed, documented, and logged by computers.
Data Curation
Data sits everywhere and it is not easy to find even at home. Our curators know where to dig.
Professional Data Processing
Uncut diamonds need to be polished. Data is only potential information, raw and unprocessed.
Metadata: Documentation & Codebooks
Adding FAIR metadata greatly increases the value of data. We use DataCite and SDMX statistical coding.
Data-as-Service

Reusable, easy-to-import, interoperable, always fresh data in tidy formats with a modern API.

Our flagship demo projects are the Listen Local ethical music recommendation system and the Demo Music Observatory, the data integration and knowledge sharing platform on which it is based. We have validated our product/market fit in the prestigious Yes!Delft AI+Blockchain Lab. We are members of the Dutch AI Coalition and participate in the work of the European AI Alliance.

See our services: data curation, open data access, survey harmonization, reproducible research and validated trustworthy AI applications.

Download our introduction.

Follow news about us or the more comprehensive Data & Lyrics blog.

Contact us.

Services

Automated Data Observatories

From open data and open-source statistical software to data-as-service.

Digital Music Observatory
Our first observatory, with seven years of data sharing history, a model for the European Music Observatory.
Competition Data Observatory
Our youngest, early-stage prototype observatory for computational antitrust.
Green Deal Data Observatory
An ambitious project to connect environmental sensory data, political and policy survey data with socio-economic indicators.
Economy Data Observatory
An incubator for socio-economic data observatories. Its first offspring is the Competition Data Observatory.
Competition Data Observatory
Our observatory monitors certain segments of the European economy and develops tools for computational antitrust in Europe.
Digital Music Observatory
The Digital Music Observatory is a fully automated, open source, open data observatory that links public datasets in order to provide a comprehensive view of the European music industry.

Team

Founders

Andrés García Molina

Data Scientist & Ethnomusicologist

Daniel Antal

Co-founder

Istvan Simon

Reproducible Business Workflows

Reka Szentirmay

Co-founder

Team

Emily Hansell Clark

Digital Humanities

Kátya Nagy

Music Research Assistant

Stef Koenis

Data Scientist & Ethnomusicologist

Recent News

For more posts, visit our blog, Data & Lyrics.

Projects

Accomplishments

AI & Blockchain Validation Lab
Our Demo Music Observatory & Listen Local projects are developed in the music accelerator of Music Moves Europe
Dutch AI Coalition
Joined the Dutch AI PPP
AI & Blockchain Validation Lab
Formulated informed blockchain models, hypotheses, and use cases.
Reprex is founded, replacing the CEEMID project (2014-2020)

Open-Source Software

Our peer-reviewed, open source statistical software packages

We believe that transparency is the key to the highest data quality. We use only open-source software, and we open up the critical elements of our software for peer review.

  • We use open-source software, so there is no vendor lock-in.

  • Our data products go through many automated (unit) tests, replacing countless hours of error-prone human validation.

  • The critical elements of our code go through external validation and peer review by computational statisticians and data scientists.

Quickly discover relevant content by filtering our software releases.

Recent Publications

Publications featuring our datasets and technology

Quickly discover relevant content by filtering publications.

Contact