Presenters-0063: Difference between revisions

From CSDMS

Latest revision as of 16:12, 6 August 2018

CSDMS 2017 annual meeting: Modeling Coupled Earth and Human Systems - The Dynamic Duo


Good enough practices for reproducible scientific computation



Allen Lee

Arizona State University, United States
allen.lee@asu.edu


Abstract
[[CSDMS meeting abstract presentation::How difficult would it be to create a transparent, fully reproducible codebase that can be downloaded from a trusted digital repository, compiled, and then run with minimal effort? How about a codebase that can be understood, reparameterized with coherent alternative assumptions, and analyzed by other researchers or future graduate students? As of this writing, it appears to be quite hard just to locate the code needed to build or run an executable [1] [2], and the problem only gets worse over time as software and system dependencies evolve. Luckily, there are many Good Enough [3] practices that can be added incrementally to your development workflow to help others understand, properly review, and build on the software artifacts that a growing number of research findings depend on.
This clinic will cover practical tips, workflows, and tools to help you create reproducible [4] and citable [5] computational pipelines while avoiding common pitfalls. We will go over good practices for version control, documentation, and data and metadata management, and demonstrate how CoMSES Net is using emerging technologies like Docker containerization to facilitate reproducible computational pipelines. Other topics (depending on participant interest and experience) include automated testing, continuous integration, and modular components / microservices.

[1] - http://reproducibility.cs.arizona.edu/
[2] - https://cbie.asu.edu/practice-archiving-model-code-agent-based-models
[3] - https://swcarpentry.github.io/good-enough-practices-in-scientific-computing
[4] - https://www.practicereproducibleresearch.org
[5] - https://www.force11.org/group/software-citation-working-group
]]
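As a small illustration of the data and metadata management theme in the abstract, the following is a minimal sketch of a run manifest: a record of the interpreter, platform, installed packages, parameters, and input file checksums behind a result. It is not taken from the clinic materials; the function names `sha256_of` and `build_manifest` and the manifest layout are assumptions made for this example, and it uses only the Python standard library (Python 3.8 or later).

```python
import hashlib
import json
import platform
import sys
from importlib import metadata


def sha256_of(path):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(input_paths, parameters):
    """Collect facts another researcher would need to rerun an analysis.

    Hypothetical helper: the keys below are illustrative, not a standard.
    """
    packages = {}
    for dist in metadata.distributions():
        name = dist.metadata["Name"]
        if name:  # skip distributions with missing metadata
            packages[name] = dist.version
    return {
        "python": sys.version.split()[0],
        "platform": platform.platform(),
        "packages": dict(sorted(packages.items())),
        "parameters": parameters,
        "inputs": {str(p): sha256_of(p) for p in input_paths},
    }


def write_manifest(manifest, path):
    """Save the manifest as JSON so it can be archived next to the results."""
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(manifest, handle, indent=2, sort_keys=True)
```

Calling `write_manifest(build_manifest(["data/input.csv"], {"seed": 42}), "manifest.json")` (with a hypothetical input path and parameter) would store the record alongside the outputs. Committing that file to version control with the code is one low-effort way to document what a given result depended on.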



Please acknowledge the original contributors when you are using this material. If there are any copyright issues, please let us know (CSDMSweb@colorado.edu) and we will respond as soon as possible.

Of interest for:
  • Cyberinformatics and Numerics Working Group
  • Human Dimensions Focus Research Group