We are a Netherlands-based start-up company that brings the benefits of big data to small organizations. In order to help small organizations remain competitive in the era of big data, we provide access to trustworthy AI, validation of third-party data, and strategies to combat the potentially negative effects of biased corporate sales and marketing algorithms. Our diverse team is particularly experienced in music, the creative industries, and digital humanities.
Our flagship demo projects are the Listen Local ethical music recommendation system based on our Demo Music Observatory data integration and knowledge sharing platform. We have validated our product/market fit in the prestigious Yes!Delft AI+Blockchain Lab
See our services: data curation, open data access, survey harmonization, reproducible research and validated trustworthy AI applications.
Download our introduction.
Follow news about us or the more comprehensive Data & Lyrics blog.
Contact us .
For more posts, visit our blog Data&Lyrics
Our demo projects
Our peer-reviewed, open source statistical software packages
We believe that transparency is the key to the highest data quality. We use only open source software. We open up the critical elements of our software for peer-review.
We use open-source software, there is no vendor lock-in.
Our data products go through many, automated (unit) tests, replacing countless error-prone human validation working hours.
The critical elements of our code go through external validation and peer-review by computational statisticians and data scientists.
The goal of retroharmonize is to facilitate retrospective (ex-post) harmonization of data, particularly survey data, in a reproducible manner.
Our paper argues that fair competition in music streaming is restricted by the nature of the remuneration arrangements between creators and the streaming platforms, the role of playlists, and the strong negotiating power of the major labels. It concludes that urgent consideration should be given to a user-centric payment system, as well as greater transparency of the factors underpinning playlist creation and of negotiated agreements.
While the US have already taken steps to provide an integrated data space for music as of 1 January 2021, the EU is facing major obstacles not only in the field of music but also in other creative industry sectors. Weighing costs and benefits, there can be little doubt that new data improvement initiatives and sufficient investment in a better copyright data infrastructure should play a central role in EU copyright policy. A trade-off between data harmonisation and interoperability on the one hand, and transparency and accountability of content recommender systems on the other, could pave the way for successful new initiatives.
The topic of the paper is Library Genesis (LG), the biggest piratical scholarly library on the internet, which provides copyright infringing access to more than 2.5 million scientific monographs, edited volumes, and textbooks. The paper uses advanced statistical methods to explain why researchers around the globe use copyright infringing knowledge resources. The analysis is based on a huge usage dataset from LG, as well as data from the World Bank, Eurostat, and Eurobarometer, to identify the role of macroeconomic factors, such as R&D and higher education spending, GDP, researcher density in scholarly copyright infringing activities.
Publications featuring our datasets and technology
Our peer-reviewed, open source statistical software packages