Open Source Program Offices (OSPOs) help organizations adopt open source solutions to digital challenges. They grew out of technology companies' realization that dedicated teams of open source enthusiasts could effectively guide their developers through open source software development, from licensing to design patterns and code reusability.

At the Vermont Research Open-Source Program Office (VERSO), we help research groups adopt best practices in research software development. As an OSPO, we share the belief that academia can benefit from having a small team of open source enthusiasts that can effectively guide research groups in navigating open source software development.

We recognize that research groups operate under different constraints and may value good coding practices differently than industry does. After all, research groups are not tech companies. That said, academia can learn tremendously from how modern organizations handle the challenges of hyperscale technologies, while recognizing that industry practices do not always apply to research.

This blog will show how and when adopting best practices in research software development creates value. We showcase research projects through scientific data essays, then use blog posts as a kind of "behind the scenes" to explain what open source practices proved useful and why. We don't shy away from showing Python code, as you will see below. But we make an effort to stay at a relatively high level, accessible to most.

A first case study: principled data processing using dagster

One approach to messy, collaborative projects is principled data processing (PDP). It is an approach to the data life cycle where researchers adopt a modular workflow: they write scripts that take some input, perform one atomic task, and produce a single output. This logic is encoded in the project structure, which subsequently serves as documentation for reproducibility and maintainability ("the code is the documentation"). PDP exemplifies what we mean by "good practices" in research software engineering, and offers an alternative to the common research habit of hacking together duct-tape software solutions that "just work".
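To make this concrete, here is a toy sketch of a single PDP-style task (the file names are hypothetical, not taken from any actual project): one script, one input, one atomic transformation, one output.

import pandas as pd

# One PDP task: read a single input, do one atomic thing, write a single output.
INPUT = "input/faculty_raw.csv"          # hypothetical input file
OUTPUT = "output/faculty_clean.parquet"  # hypothetical output file

def main():
    df = pd.read_csv(INPUT)
    df = df.dropna(subset=["name"])      # the atomic task: drop rows without a name
    df.to_parquet(OUTPUT, index=False)   # a single, well-identified output

if __name__ == "__main__":
    main()

A project is then just a collection of such tasks, and the directory layout tells you at a glance what feeds into what.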

We show how to implement PDP using the data orchestration tool dagster. We take as an example the open academic analytics project, which aims to provide a big-picture view of the UVM research ecosystem at the group level. The immediate contribution of this project is a web interface where users can explore how scientific productivity coevolves with collaborations for faculty at the University of Vermont. One question we have while building this interface is the following:

What research groups are there at the University of Vermont? Do they collaborate with each other?

With that in hand, we can ask more specific questions, such as how these groups manage recurrent research software development challenges, and perhaps step in to help. More theoretically, we are interested in scientific questions, such as how collaborator diversity, in terms of age and institution, shapes the research content of a given faculty member (whom we call the "ego").

The open academic analytics project is messy; we hand-annotate faculty on whether they have research groups or not (which requires philosophical legwork), we call the OpenAlex API to access research metadata (which is noisy and imperfect), we wrangle data to build relevant features, and we build models to answer scientific questions. To manage and visualize the dependency graph between these tasks, we use dagster. As a data orchestration tool, dagster lets us build this dependency graph as a set of assets that can be schematized as follows:

[Figure: data pipeline]

To illustrate how dagster works, we focus on the steps going from raw data to the paper timeline chart, where node size is proportional to the number of citations. Our first asset, uvm_profs_2023, checks the availability of and downloads an upstream file that lives in our complex-datasets repository:

[Figure: paper pipeline]
import requests
import pandas as pd
from dagster import asset, get_dagster_logger
from dagster import MaterializeResult, MetadataValue
from config import config

@asset(
    group_name="import",
    description="📋 UVM Professors 2023 dataset from Vermont Complex Systems",
)
def uvm_profs_2023():
    """
    Fetches and processes UVM professors dataset from Vermont Complex Systems.
    """
    logger = get_dagster_logger()
    
    base_url = 'https://vermont-complex-systems.github.io'
    dataset_url = f"{base_url}/datasets/data/academic-research-groups.csv"
    output_file = config.data_raw_path / config.uvm_profs_2023_file

    try:
        logger.info(f"Checking availability: {dataset_url}")
        head_response = requests.head(dataset_url, timeout=10)
        
        if head_response.status_code == 200:
            file_size = head_response.headers.get('content-length', 'Unknown')
            last_modified = head_response.headers.get('last-modified', 'Unknown')
            
            logger.info(f"✓ Dataset available")
            logger.info(f"  File size: {file_size} bytes")
            logger.info(f"  Last modified: {last_modified}")
        else:
            raise Exception(f"Dataset not available. HTTP {head_response.status_code}")
    
    except Exception as e:
        logger.warning(f"Availability check failed: {e}")
        logger.info("Proceeding with download attempt...")
    
    # Download and process
    try:
        df = pd.read_csv(dataset_url)
        
        expected_columns = ['department', 'college', 'inst_ipeds_id', 'year']
        if not all(col in df.columns for col in expected_columns):
            raise Exception(f"Missing expected columns. Found: {list(df.columns)}")
                
        # Keep UVM rows (IPEDS ID 231174) for the 2023 snapshot
        df = df[(df.inst_ipeds_id == 231174) & (df.year == 2023)]

        # Reorder columns for pipeline consistency
        column_order = [
            'oa_display_name', 'is_prof', 'group_size', 'perceived_as_male', 
            'host_dept', 'college', 'has_research_group', 'oa_uid', 'group_url', 'first_pub_year',
            'payroll_name', 'position', 'notes'
        ]

        df = df[column_order]
        
        # Save to file
        df.to_parquet(output_file, index=False)

        return MaterializeResult(
            metadata={
                "external_file": MetadataValue.path(str(dataset_url)), 
                "output_file": MetadataValue.path(str(output_file))
            }
        )
    except Exception as e:
        logger.error(f"Failed to download/parse CSV: {e}")
        raise

First, note that implementing dagster is a matter of adding decorators to functions, which makes it easy to integrate into an existing pipeline. The PDP philosophy of writing input-output tasks is something we enforce ourselves. Our input is dataset_url (whose availability we check upstream) and our output is a single parquet file under ./data/raw/ (the exact path and filename are stored in our config.py file). The function performs basic filtering and checking, which we keep track of using get_dagster_logger() (distinguishing between info, warning, and error logs), and it returns a MaterializeResult object, which represents a successful materialization of the asset. On our local dagster server, the group_name, description, and other metadata show up nicely, as follows:

[Figure: paper pipeline]

This is documentation for collaborators and our future selves! By having a simple input-output structure, we should be able to tell a story just by looking at this high-level summary of our data pipeline. If something happens to uvm_profs_2023, we know academic_publications and coauthor_cache should be rematerialized as a result.
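Concretely, those dependencies are declared through the deps argument of the asset decorator, and a Definitions object gathers the assets so that dagster dev can serve the graph locally. Here is a minimal sketch (the body of academic_publications is elided and may differ from the real implementation in the repository):

from dagster import asset, Definitions

@asset(deps=["uvm_profs_2023"])
def academic_publications():
    """Sketch: fetch publication metadata for the professors selected upstream."""
    ...

# uvm_profs_2023 is the asset defined above; `dagster dev` picks up this
# Definitions object and serves the asset graph in the local UI.
defs = Definitions(assets=[uvm_profs_2023, academic_publications])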

Without going into the implementation details of academic_publications, note that it is a thin wrapper over the OpenAlex API, whose role is to fetch the data without applying any transformation to it. We do implement a caching strategy using duckdb to reduce our API calls to OpenAlex, as we want to calculate the academic age of all selected authors and coauthors. To do so, we need to retrieve, for each author, the first and last publication years, which is somewhat expensive. See the code for more details.
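To give a flavor of what such a cache can look like, here is a minimal sketch; the table name, schema, and the fetch_pub_years helper are illustrative rather than the project's actual code. The idea is simply to hit the OpenAlex API only when an author is not already in a local duckdb table:

import duckdb

con = duckdb.connect("oa_cache.duckdb")  # hypothetical cache file
con.execute("""
    CREATE TABLE IF NOT EXISTS author_years (
        aid VARCHAR PRIMARY KEY,
        first_pub_year INTEGER,
        last_pub_year INTEGER
    )
""")

def get_pub_years(aid):
    """Return (first, last) publication years for an author, using the cache when possible."""
    row = con.execute(
        "SELECT first_pub_year, last_pub_year FROM author_years WHERE aid = ?", [aid]
    ).fetchone()
    if row is not None:
        return row  # cache hit: no API call needed

    first, last = fetch_pub_years(aid)  # hypothetical helper that queries OpenAlex
    con.execute("INSERT INTO author_years VALUES (?, ?, ?)", [aid, first, last])
    return first, last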

We look at the final step, where we do the data wrangling:

[Figure: paper pipeline]
@asset(
    deps=["academic_publications"],
    group_name="export",
    description="📊 Clean and prepare publication data for analysis dashboard"
)
def paper():
    """Process papers for visualization - filtering, deduplication, and enrichment"""
    
    # input-output
    input_file = config.data_raw_path / config.paper_output_file
    output_file = config.data_export_path / config.paper_output_file

    df = pd.read_parquet(input_file)
    
    # Apply all transformations
    df_processed = (df
                    .pipe(filter_no_title)
                    .pipe(prepare_for_deduplication)
                    .pipe(deduplicate_papers)
                    .pipe(filter_work_type)
                    .pipe(filter_mislabeled_title)
                    .pipe(calculate_number_authors)
                    .pipe(add_citation_percentiles)      
                   )
    
    # Generate summary statistics
    work_type_dist = df_processed.work_type.value_counts().to_dict()
    year_range = f"{int(df_processed.pub_year.min())}-{int(df_processed.pub_year.max())}"
    avg_coauthors = float(df_processed.nb_coauthors.mean())
    unique_authors = int(df_processed.ego_aid.nunique())
    
    # Save processed data
    df_processed.to_parquet(output_file)

    return MaterializeResult(
        metadata={
            "papers_processed": MetadataValue.int(len(df_processed)),
            "unique_authors": MetadataValue.int(unique_authors),
            "year_range": MetadataValue.text(year_range),
            "work_type_distribution": MetadataValue.json(work_type_dist),
            "avg_coauthors_per_paper": MetadataValue.float(avg_coauthors),
            "input_file": MetadataValue.path(str(config.data_raw_path / config.paper_output_file)),
            "output_file": MetadataValue.path(str(output_file)),
            "dashboard_ready": MetadataValue.bool(True),
            "research_value": MetadataValue.md(
                "**Publication timeline dataset** ready for dashboard visualization. "
                "Shows researcher productivity, collaboration breadth, and career progression."
            )
        }
    )

We take advantage of pandas' functional style, piping helper functions in a specific order. Even though we don't show those functions here, you should have a clear idea of what is happening. We also keep track of summary statistics in the process, for debugging purposes. For instance, if you look at the current year_range field, you will note that we have authors whose records go back to 1912. Some faculty are old, but not that old. We will come back at some point to see what is going on with that author.
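As an illustration, a helper like filter_no_title takes a dataframe and returns a dataframe, which is what makes the .pipe chain read like a recipe. The body below is a plausible sketch rather than the actual implementation, and it assumes the column is called title:

import pandas as pd

def filter_no_title(df: pd.DataFrame) -> pd.DataFrame:
    """Drop works with a missing or empty title."""
    return df[df.title.notna() & (df.title.str.strip() != "")]

Because every helper shares this dataframe-in, dataframe-out signature, reordering, removing, or unit testing a step is straightforward.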

With that in hand, we have all the necessary data to produce our paper chart component. In our web interface, we load this parquet file using duckdb as well, with as much of our computation as possible precomputed in that final preprocessing step. As such, we have a clear path from raw data to the front-end component.
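One nice property of duckdb is that it can query the parquet file directly, from Python as well as from the browser. A query along these lines (the file path and the cited_by_count column are illustrative, not the exact names used in the project) is enough to feed the timeline chart:

import duckdb

# Hypothetical query: papers and citations per year for the timeline chart
timeline = duckdb.query("""
    SELECT pub_year, COUNT(*) AS n_papers, SUM(cited_by_count) AS citations
    FROM 'data/export/paper.parquet'
    GROUP BY pub_year
    ORDER BY pub_year
""").df()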

Conclusion

This post sought to briefly introduce how to use dagster to implement PDP, going through the necessary steps to produce our paper timeline chart. In the process, we showed how to ensure data availability by having an asset that checks for it, how to write code as documentation, and how to use a functional style of programming to keep track of our data wrangling. It is only a beginning. With dagster, we can easily automate the pipeline so that our web components stay updated. We can also check whether upstream files have changed, and only run when necessary. This becomes especially handy when we have modeling steps, such as our Stan model, that are more computationally expensive to run. We will show all of that in future posts, stay tuned!
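In the meantime, here is a small taste of what automating the pipeline can look like; a sketch where the job name and cron expression are arbitrary, and the resulting objects would be added to the project's Definitions:

from dagster import ScheduleDefinition, define_asset_job

# A job that rematerializes every asset in the graph, run on a weekly schedule.
refresh_job = define_asset_job("refresh_all_assets", selection="*")
weekly_refresh = ScheduleDefinition(job=refresh_job, cron_schedule="0 6 * * 1")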