Reprex is an impact startup based in the Netherlands specializing in making data reliable and accountable while delivering trustworthy analytics and AI solutions. Our diverse team works from several locations and countries; our ideal candidates are located in South Holland (the Hague/Rotterdam/Delft/Leiden). That said, we are open to candidates from any location. We bridge industry and academia through various advanced statistical and ethical AI verification projects: while our R&D partners are leading universities, we aim to deploy our solutions in business environments.
We welcome candidates from all backgrounds and mother tongues who are proficient in R and have a good working knowledge of the English language.
Reprex team members follow the Contributor Covenant, “pledg[ing] to make participation in our community a harassment-free experience for everyone, regardless of age, body size, visible or invisible disability, ethnicity, sex characteristics, gender identity and expression, level of experience, education, socio-economic status, nationality, personal appearance, race, religion, or sexual identity and orientation.”
Our internal communication platform is Github for software development and Keybase, an open-source alternative to Slack. You do not have to be a master of Github Action and Github products but you must be able to make pull requests, commits, and solve issues on this platform. To apply to any of the positions below, please send an email with a link to your résumé and a brief description of your interest. You can ask questions on Keybase, too.
We are looking for intermediate or advanced R users with a passion for open data and open science for the maintenance of our CRAN-released R packages and the development of further packages. We are pursuing a hybrid model, providing the R community with open-source packages, and engaging in paid work that utilizes this software in commercial or academic environments.
Our ideal candidate(s) are
a) at least intermediate-level R programmers or possess domain-specific knowledge relevant to our packages, or
b) advanced in R programming and agnostic to actual packages
c) excited to maintain and develop one or more of our packages
All of our packages follow the modernization of the R language and are built on rlang and vctrs. All the packages use the tidyverse as a dependency, which creates a consistent user interface (i.e. dplyr, tidyr, tidyselect.)
You will need a free Keybase account and free Github account for group working. If you do not have a GitHub account, you will be expected to make an account and accomplish the free practices on this link. This is as an essential skill to work on open collaboration, open science or open source projects.
Please download this file (bio_template.md) and edit it according to the instruction in the file (they are invisible here on the website) in a text editor, such as NotePad, WordPad, VIM and send it back to as an .md file or .txt file. If you are not familiar with text editors, here is a bio_template.docx or bio_template.rtf version.
In the bio template write a motivation letter: what would you like to learn and what would you like to do as your tiny project at Reprex? See a few examples at the end of this page (below our requirements) to get inspired!
You will work up to 20 hours a week: please state in your application how many hours you would like to work. We will provide some flexibility for tests, exams, holidays. You will partly work remotely, and partly based in our inclusive environment of Apollo 14, The Hague’s Impact Hub for impact startups.
Add a profile photo (at least 600px wide, not bigger than 3-4 megabytles). We will make publish this picture in black and white behind a circle mask. Make sure that the crop of your face is not too tight in a circle. To avoid different formats of portraits, they will be behind a circle mask, as on the partners page. We will delete your photo if you will not be our intern, if yes, we will place it among our contributors profiles.
Save the file as
your_name.rtf and add the small profile profile photo as
your_name.jpg to an email that you send to the attention of the Reprex team to the email address:
Text file ❓ Photo❓Yes, we want to see if you are able to work with different, non-Microsoft/non-Apple specific, interoperable files, and if you are a human who got to this point in reading 😊 . We will delete all files ♻ of those candidates who will not become interns and make it to our team️.
In the email, please write a few sentences about your motivation, availability and place some links to your work (GitHub repos or things you have done in R.)
Please visit our dataobservatory.eu github page, and check out our advice for new contributors.
iotables: an R package for reproducible input-output analysis, economic, and environmental impact assessment. The domain specific knowledge is input-output economics, multiplier analysis, and environmental impact analysis. A working knowledge of SNA or an interest in macro-finance is a plus. We develop this application within the rOpenGov community and the rOpenSci community. The application has various uses in banking, insurance, music industry, and policy design.
retroharmonize: an R package for retrospective survey harmonization and survey recycling. The domain specific knowledge is an interest in international, multi-language surveys, longitudinal surveys, and the reuse of survey data. We develop this application within the rOpenGov community and the rOpenSci community. The application has various uses in survey harmonization, data integration, and survey design.
regions: an R package for adjusting sub-national boundaries for the making of regional statistics. While the U.S. has relatively stable sub-national boundaries (the US postal codes), most nations change their internal boundaries very frequently. Currently, regions tracks these changes in Europe, but our package could and should be extended to all ISO-conforming sub-national boundaries globally. An ideal domain-specific interest is geography, cartography, and/or small-area statistics. The package is currently not developed actively, but we expect it to be developed in a small-area statistics context, or for surveying withing a regional component.
dataset creates datasets from standared R objects (data.fame, data.table, tibble, or well-structured lists like json) that are highly interoperable and can be placed into relational databases, semantic web applications, archives, repositories. They follow the FAIR principles: they are findable, accessible, interoperable and reusable. They contain the entire processing history of the dataset for reproducability.
The goal of statcodelists is to promote the reuse and exchange of statistical information and related metadata with making the internationally standardized SDMX code lists available for the R user.
We are also contributing to a range of packages relevant for music analysis, open data access and open science data access and we are planning the release of new open source and non-open-source products.
We are looking for individual(s) who can resolve issues via Github. Time commitments are flexible and compensation is commensurate with experience and skill.
We are looking for a contract-based Shiny developer who can create engaging, user-friendly multi-language Shiny interfaces to our R products. We are interested in working with candidates with experience in Shiny development and/or deployment skills, in particular, the ability to dockerize and deploy in the cloud. Currently we deploy on AWS and Netlify, but potentially we may need to deploy on other cloud servers.
Our Shiny applications have multiple users:
Music organizations and music researchers connected to our Digital Music Observatory
Sustainable finance, sustainable reporting, and climate mitigation policy experts related to our Green Deal Data Observatory
Antitrust experts, antitrust authorities, and merger analysts associated with our Competition Data Observatory
Various creative industry stakeholders related to our Cultural and Creative Sectors Industries Data Observatory, mainly related to book publishing and film production.
Some of our applications are expected to be able to communicate with various Rest APIs, e.g.: the Eurostat and Spotify Rest APIs.
Our applications must work with several language; buttons, alternate texts, and descriptions must be parameterized and available for localization. The visual elements must follow simple visual structures and a unified colour palette.
Attribution: the female avatar is designed by Andy Horvath - Flaticon.
Any computer science/statistics/social sciences/digital humanities
last year Master students, PhD students are welcome, too.