Data and Code for "The paradox of competition: How funding models could undermine the uptake of data sharing practices"

This repository contains the simulation code, the simulation output, and the analysis code and output for the preprint named above.

The model was coded in NetLogo, and is available from the file data_sharing_policies.nlogo.

Code for the generation of the networks is available in the self-contained notebook network_generation/00-network-generation.qmd.

The analysis pipeline consisted of three steps:

1. Run the model using the scripts in batch_commands.
2. Pre-process the output files from the simulation to prepare them for analysis in Spark (files 01-move-to-parquet.R and 02-pivot-columns.R in pre_process).
3. Analyse various parts of the model with the analysis notebooks in analysis.

Due to size constraints, we share only the outputs from steps (1) and (3). The intermediate files from step (2) are larger than the files from step (1) and contain the same content.

A short note on naming conventions: the paper describes four types of networks, but their names differ slightly from the names used in the code, because the naming became more precise over the course of the analysis. The output files map to the networks reported in the paper as follows:

"vary_incentives.csv.bz2" = no network.
"vary_incentives_individuals_clustered.csv.bz2" = high clustering.
"vary_incentives_individuals_fragmented.csv.bz2" = low clustering.
"vary_incentives_individuals_random network.csv.bz2" = random network.
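
When working with these files programmatically, it can help to keep the mapping in one place. The following is a minimal sketch in Python using only the standard library; it is not part of the repository. The names NETWORK_LABELS, network_label and first_row are hypothetical helpers. The layout of the CSV files is not described here, so first_row simply returns whatever the first row contains, which may be metadata rather than column names.

```python
import bz2
import csv
from pathlib import Path

# Mapping copied from the list above: output file name -> network name in the paper.
NETWORK_LABELS = {
    "vary_incentives.csv.bz2": "no network",
    "vary_incentives_individuals_clustered.csv.bz2": "high clustering",
    "vary_incentives_individuals_fragmented.csv.bz2": "low clustering",
    "vary_incentives_individuals_random network.csv.bz2": "random network",
}


def network_label(path):
    """Return the paper's network name for a simulation output file.

    Only the file name is used, so the file may live in any directory.
    Raises KeyError for file names that are not in the mapping.
    """
    return NETWORK_LABELS[Path(path).name]


def first_row(path):
    """Return the first row of a bz2-compressed CSV as a list of strings.

    The file is decompressed as a stream, so only the beginning is read.
    Whether this row holds column names or metadata is an assumption
    to check against the actual files.
    """
    with bz2.open(path, mode="rt", newline="") as handle:
        return next(csv.reader(handle))
```

Calling first_row on one of the four files is a cheap way to inspect it before loading the full output, which may be large.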

Sensitivity analysis

The repo also contains outputs and analysis notebooks for the sensitivity analysis. The analysis was done in Spark due to the large file sizes. We share three "packages" of data that we used for the sensitivity analysis:

gain-sensitivity-data.tar.bz2
sigma-sensitivity-data.tar.bz2
costs-sensitivity-data.tar.bz2

These archives contain two files each: a general file for the sensitivity analysis, and one with individual-level data. Both are stored as .parquet files.

The files in these archives are already processed (similar to step 2 above), so they can be analysed directly with the notebooks 10-Figure-1-sensitivity.qmd, 10-Figure-2-sensitivity.qmd, and 10-Figure-3-sensitivity.qmd, which are available under analysis.
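
Before unpacking an archive, its contents can be inspected without extracting anything. The sketch below uses Python's standard tarfile module and is not part of the repository. SENSITIVITY_ARCHIVES and list_members are hypothetical names, and the entry names inside the archives are not documented here, so the function returns them as found.

```python
import tarfile

# Archive names as listed above.
SENSITIVITY_ARCHIVES = [
    "gain-sensitivity-data.tar.bz2",
    "sigma-sensitivity-data.tar.bz2",
    "costs-sensitivity-data.tar.bz2",
]


def list_members(archive_path):
    """Return the sorted entry names of a .tar.bz2 archive without extracting it."""
    with tarfile.open(archive_path, mode="r:bz2") as archive:
        return sorted(archive.getnames())
```

If Spark wrote the .parquet outputs as directories of part files, an archive will list more than two entries; this is an assumption to check against the actual listing.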
