Skip to content

Commit

Permalink
Browse files Browse the repository at this point in the history
  • Loading branch information
josephholler committed Aug 12, 2024
2 parents 066e190 + c37b499 commit ec28d0b
Show file tree
Hide file tree
Showing 7 changed files with 30 additions and 61 deletions.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,3 @@
# Workshop-SDSS-2023

Website: https://hegsrr.github.io/Workshop-SDSS-2023/
Website: https://hegsrr.github.io/Workshop-SDSS-2024/
9 changes: 6 additions & 3 deletions docs/_config.yml
Original file line number Diff line number Diff line change
@@ -1,10 +1,13 @@
title: Reproducing and Replicating Spatial Data Science
author: "Joseph Holler"
title: How to start teaching replicability in spatial data science courses
author:
- "Peter Kedron"
- "Joseph Holler"
- "Andrew Trgovac"

# The `>` after `description:` means to ignore line-breaks until next key.
# If you want to omit the line-break after the end of text, use `>-` instead.
description: >
SDSS 2023 Workshop on Reproducing and Replicating Spatial Data Science: a Tutorial with Breakout Discussion
SDSS 2024 Tutorial workshop on teaching replicability in spatial data science courses
# Build settings

Expand Down
45 changes: 11 additions & 34 deletions docs/about.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,39 +3,16 @@ layout: page
title: About
---

Scientific research is increasingly expected to be reproducible as a matter of transparency and public trust in research, such that other researchers can use the same data and methods to produce the same results.
Reproducibility and replicability are also integral to the mechanisms of self-correction and theory development in science.
Reproduction studies are needed to evaluate the internal validity of prior research findings.
Replication studies with different data and in different geographic contexts are needed to assess the external validity and generalizability of prior research claims.
Researchers are increasingly motivated to adopt reproducible research practices to meet expectations of publishers and funding agencies and to expand the broader impacts of their work.
Researchers and students are motivated to reproduce and replicate prior studies in order to learn from their methods, assess their validity, and extend from or build upon their prior work.
There are concurrent and interdependent needs to develop:
A consensus is emerging across the sciences that the reproducibility of research must be improved and that more independent reproductions and replications are needed to assess the state of existing knowledge.
A similar view is emerging among GIScientists and spatial data scientists. Multiple authors have called for improvements in the transparency and accessibility of procedures and data, the adoption of open science practices, and the execution of reproduction and replication studies. However, a parallel consensus has yet to develop about how to teach reproducibility in spatial data science.
Educators have no competency model to use to structure their curricula, no set of best practices to inform their class exercises, and few model courses to emulate at their own institutions.

1. infrastructure to facilitate reproducibility and open science,
2. exemplar cases of reproduction and replication studies in spatial data science, and
3. a reproducibility and replicability curriculum.
In this workshop, we introduce a series of materials designed to help educators introduce reproducibility into their spatial data science courses and institutional curricula.
At the course scale, we discuss how to present reproduction and replication as dynamic processes of critical evaluation and scientific knowledge creation.
We conceptually link reproduction attempts with the problem-based learning model, and provide an actionable framework educators can use to integrate R&R into a semester course.
We also present how this framework can be scaled to fit different course levels and instructional modalities.
Drawing on our experience attempting reproductions and replications and using R&R projects in the classroom, we discuss common challenges faced when implementing this pedagogical approach and the path to successfully publishing such studies.

We will present working prototypes of infrastructure, exemplar cases, and curriculum
developed over the first two years of the National Science Foundation award, *Transforming theory-building and STEM education through reproductions and replications in the geographical sciences*.
Following presentations, we will form breakout groups to discuss applications to individual research programs and future steps for scaling up reproducible research practices in spatial data science.

We will present *infrastructure* in the form a Git repository template for reproducible research compendia and handbook.
The template research compendium and handbook help guide the research process and organize research materials while maximizing reproducibility, and may be applied to both individual and collaborative research; and to original studies or reproduction/replication studies.
The compendium includes space for project-level metadata and organizing documentation, intellectual property license, and preferred citation.
The directory structure organizes space for protocols and code, proprietary and public data, metadata, results, documents, and manuscripts.
We include templates for pre-analysis plans and post-analysis reports and guidance for registrations and integration with the Open Science Foundation (OSF).
Finally, we include sample Rmarkdown and Python Jupyter Notebook files with useful code and structure for maximizing reproducibility.
Our tutorial introduction to the *infrastructure* will be sequenced and paired with our *approach* to conducting and publishing reproduction and replication studies.
This infrastructure has been developed and refined over the course of completing seven reproduction or replication studies with undergraduate students in three courses, graduate students in two courses, and numerous graduate and undergraduate research assistants and independent researchers.

We will present our *infrastructure* using an *exemplar case* of a spatial data science study we have reproduced with our students.
The study models best practices for open and reproducible science while highlighting the contributions and advantages of reproduction studies.
The case also highlights approaches and advantages to using reproduction studies as an integral component of spatial data science *curriculum*.
The majority of the reproduction study has been implemented, written, and presented by students.
The infrastructure (template and handbook), exemplar case, and course curriculum are all being made available to the public with open access licensing so that tutorial participants can review and reuse the materials in their own scholarship.

We will conclude the tutorial with *breakout groups* to discuss:

1. sharing questions or concerns about our approach to reproducibility and replicability (R&R),
2. developing action plans for adoption in current research and teaching programs, and
3. collectively discussing next steps to scale up R&R in spatial data science.
At the curricular scale, we introduce a competency rubric for reproducibility that outlines essential skills students should acquire and levels of achievement to measure success.
We also highlight a recently developed evaluation tool to help educators assess their existing materials.
Finally, we highlight some of our own ongoing projects and invite those interested to join our wider effort to enhance the reproducibility of spatial data science.
4 changes: 0 additions & 4 deletions docs/evaluation.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,3 @@ The timing of the surveys will be:
We will share survey results here.

You may read the [informed consent document](assets/IRB_Consent.pdf) for the survey instrument.

At the beginning of the workshop, 54 participants reported their engagement with eleven open and reproducible research practices. Results are shown below.

![open and reproducible research practices prior to workshop](assets/orrp_status_prior.png)
17 changes: 6 additions & 11 deletions docs/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,20 +5,15 @@ title: Attend

# Fourth Spatial Data Science Symposium

Website: [sdss2023.spatial-data-science.net](http://sdss2023.spatial-data-science.net/)
Date: September 5, 2023
Time: 11:30 am to 12:30 am EST
Location: Virtual
Website: [sdss2024.spatial-data-science.net](http://sdss2024.spatial-data-science.net/)
Date: October 23-24, 2024
Time: t.b.d.
Location: Virtual

## Expected Participation

We anticipate that this tutorial will be interesting and valuable for the following groups:

- Graduate students interested in learning methods from the literature and either publishing reproduction or replication reports or extending prior research in their own master's theses or PhD dissertations.
- Faculty and career researchers interested in adopting more open and reproducible research practices for their own original work, or simply crafting more competitive data management plans and working more efficiently and accurately.
- Professors interested in teaching reproduction or replication studies in their courses or as part of their advising and mentoring
- Graduate advisors and research mentors interested in training their research assistants / advisees on more reproducible practices
- Journal editors interested in publishing reproduction or replication studies, or more thoroughly incorporating reproducibility into their review of original research
Primarily, the tutorial is intended for **educators** interested in incorporating lessons on reproducibility and replicability into their courses, or attempting to reproduce published work with their students and scholars with administrative responsibilities for **curriculum development** within their department, or across their institution.
Secondarily, the tutorial will be useful for **students** and **researchers** interested in improving the reproducibility of their own spatial data science research.

## Acknowledgement

Expand Down
2 changes: 1 addition & 1 deletion docs/resources.md
Original file line number Diff line number Diff line change
Expand Up @@ -14,7 +14,7 @@ title: Resources

## Video

<iframe width="560" height="315" src="https://www.youtube.com/embed/SMfEq0jKf6U?si=tyBsYOPvflC9Zejc" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" allowfullscreen></iframe>
to be posted...

## Related research products

Expand Down
12 changes: 5 additions & 7 deletions docs/team.md
Original file line number Diff line number Diff line change
Expand Up @@ -3,10 +3,8 @@ layout: page
title: Team
---

| Name | GitHub | Email | College | ORCID |
| :--: | :--: | :--: | :--: | :--: |
| Joseph Holler\* | [josephholler](https://josephholler.github.io) | [email protected] | Middlebury College | <a href="https://orcid.org/0000-0002-2381-2699">0000-0002-2381-2699 <img alt="ORCID logo" src="https://info.orcid.org/wp-content/uploads/2019/11/orcid_16x16.png" width="16" height="16" /></a> |
| Peter Kedron | [Peter-Kedron](https://peterkedron.com/) | | University of California, Santa Barbara | <a href="https://orcid.org/0000-0002-1093-3416">0000-0002-1093-3416 <img alt="ORCID logo" src="https://info.orcid.org/wp-content/uploads/2019/11/orcid_16x16.png" width="16" height="16" /></a> |
| Sarah Bardin | [SarahBardin](https://github.com/SarahBardin) | [email protected] | Arizona State University | <a href="https://orcid.org/0000-0001-8657-1725">0000-0001-8657-1725 <img alt="ORCID logo" src="https://info.orcid.org/wp-content/uploads/2019/11/orcid_16x16.png" width="16" height="16" /></a> |

\* Workshop organizer
| Name | Email | College | ORCID |
| :--: | :--: | :--: | :--: |
| [Peter Kedron](https://peterkedron.com/) | [email protected] | University of California, Santa Barbara | <a href="https://orcid.org/0000-0002-1093-3416">0000-0002-1093-3416 <img alt="ORCID logo" src="https://info.orcid.org/wp-content/uploads/2019/11/orcid_16x16.png" width="16" height="16" /></a> |
| [Joseph Holler](https://josephholler.github.io) | [email protected] | Middlebury College | <a href="https://orcid.org/0000-0002-2381-2699">0000-0002-2381-2699 <img alt="ORCID logo" src="https://info.orcid.org/wp-content/uploads/2019/11/orcid_16x16.png" width="16" height="16" /></a> |
| [Andrew Trgovac](https://www.andrewtrgovac.com/) | [email protected] | Arizona State University | <a href="https://orcid.org/0000-0002-1390-9868">0000-0002-1390-9868 <img alt="ORCID logo" src="https://info.orcid.org/wp-content/uploads/2019/11/orcid_16x16.png" width="16" height="16" /></a> |

0 comments on commit ec28d0b

Please sign in to comment.