January was a busy month - here's why!

Remember 2019? It was the first year of piloting our collaborative data curation service in the Data Curation Network (DCN). Our ten partner institutions submitted 74 datasets from their overall deposits that year (see below) to be matched with a data curator with domain and software expertise, and 95% of these datasets were successfully matched to one of our DCN curators for expert review of quality and FAIRness. At our busiest, the DCN handled 11 datasets in one month (September), while for the rest of the year we averaged 6 datasets per month (see below).

We are very proud of how smoothly the network ran in its first pilot year, and of how many researchers were impacted (249!), but we wanted to test our capacity with more datasets. So rather than each partner institution choosing a subset of datasets to send to the network for curation, we tried something different in January 2020, which we called “January Jam”.

January Jam!

For January 2020 – typically a busy month for dataset submissions due to the winter break – we asked each of our ten partners to submit every dataset they received to the DCN for curation matchmaking. Since Dryad has a much higher volume than the rest of the repositories, they chose to submit only 1 dataset per work day. Partners could still decide to curate a dataset locally (e.g., for a repeat submitter or a tight deadline), but this approach gave us a better picture of the overall demand and curation effort across the network.

This experiment went really well! In January 2020, 44 datasets passed through the DCN – more than half of the 74 we saw in all of 2019! While 8 of these datasets were curated locally, the other 36 (~82%) were successfully matched to a DCN curator at another institution.
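For anyone who wants to check the arithmetic behind these figures, here is a minimal sketch in Python. It uses only the counts quoted in this post; the variable names are our own and purely illustrative.

```python
# Counts quoted in this post (variable names are illustrative)
datasets_2019 = 74        # datasets submitted to the DCN in all of 2019
datasets_jan_2020 = 44    # datasets that passed through the DCN in January 2020
curated_locally = 8       # January datasets kept by the submitting institution

# Datasets matched to a DCN curator at another institution
matched_to_dcn = datasets_jan_2020 - curated_locally
match_rate = matched_to_dcn / datasets_jan_2020

# January 2020 volume compared with the whole of 2019
share_of_2019 = datasets_jan_2020 / datasets_2019

print(f"Matched to a DCN curator: {matched_to_dcn} ({match_rate:.0%})")
print(f"January 2020 volume relative to all of 2019: {share_of_2019:.0%}")
```

One month of January Jam works out to roughly 59% of the previous year's total volume.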

Datasets submitted during the January Jam event show a more representative sample of the domains and data types at each of our DCN institutions (see figures below).

DCN curators typically commit 5% FTE time to the DCN project. In January our curators logged 74.9 curation hours (43% of their commitment). This would seem to indicate that we haven’t reached our full capacity yet. However, our curation capacity for any given discipline, data type or file format will probably not match up perfectly with the datasets we receive in any given month. We must also consider the availability of our curators (e.g., vacation, existing workload) to complete a curation assignment by the deadline. This makes calculating our maximum operating capacity very tricky!
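As a rough back-of-the-envelope illustration, the two figures above imply a total monthly commitment that can be backed out as follows. This is only a sketch: the post does not state the number of curators or the hours in a full-time month, so the total is inferred from the rounded 43% figure and should be read as approximate.

```python
# Figures quoted in this post
hours_logged = 74.9          # curation hours logged in January 2020
share_of_commitment = 0.43   # logged hours as a fraction of committed time (rounded in the post)

# Inferred, not reported: total committed hours implied by the two figures above
implied_commitment = hours_logged / share_of_commitment
unused_hours = implied_commitment - hours_logged

print(f"Implied monthly commitment: {implied_commitment:.1f} hours")
print(f"Committed hours not logged: {unused_hours:.1f} hours")
```

This is an aggregate number only. It says nothing about whether the unused hours belong to curators whose discipline, data type or file format expertise matches the datasets that arrive, which is exactly why the real capacity is harder to pin down.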

Going Forward?

Things went well in January – tracking the full picture of data curated across the network was a valuable addition to our implementation pilot. Therefore, we decided to continue this experiment into February, and see how it goes from there!

See more details about individual datasets on our website!
