Versioning / Timestamping Practices for DataSHIELD nodes

aakkoc · 15 January 2024 08:35

This is a thread where I would like to ask some best practices.

In DataSHIELD projects I am involved in, there is an interest to have version snapshots available on each node. The demand is, if we release any results as part of a study, we should be able to rewind data tables on the server back to the point in time that this result was obtained.

At this time, two approaches are apparent to us:

Setting a data cut-off date, feasible as long as we have a column indicating data collection or upload time. A repeat of the work is possible so long as no data is removed and the data cut-off date is saved. (i.e. We can use the same January 1st 2023 cut-off in 2033). There might be some workarounds needed whenever new columns are introduced to the table.
Creating a new table with a timestamp each time there is a major data revision, f.x. Project/Table_202401 for January, Project/Table_202403 for March etc. This costs further storage space and also warrants constant communication with analysts as to which table version they should target. However, unlike the 1st approach this accomodates for mistakes that may have been made in the past; so if for example an individual was mis-added in an earlier snapshot these versioned tables will always retain that information.

What exactly has been the lifecycle pattern of your dataSHIELD nodes?

Topic		Replies	Views
Experience of calculating time differences via DataSHIELD Developer support	2	452	5 May 2020
DataSHIELD v5.1 Now available Releases new-version , software-release	7	771	29 May 2020
DataSHIELD disclosure settings - migration of pages to new wiki Statistical development disclosure , datashield-wiki , wiki	3	137	12 March 2024
Save the Date - 2021 DataSHIELD Conference Old news conference	3	598	17 September 2021
Archiving process for repos in DataSHIELD github Operational management github , repository , archive	3	15	12 August 2024

Versioning / Timestamping Practices for DataSHIELD nodes

Related topics