Skip to content
This repository has been archived by the owner on Oct 25, 2018. It is now read-only.

metaissue: create exercises for all sections #6

Open
14 tasks
satra opened this issue May 20, 2018 · 1 comment
Open
14 tasks

metaissue: create exercises for all sections #6

satra opened this issue May 20, 2018 · 1 comment
Assignees

Comments

@satra
Copy link
Contributor

satra commented May 20, 2018

  1. FAIR Data - BIDS datasets
    talk 1: Intro to FAIR
  • Intro: 16 attributes of FAIR - e.g. Is there a clear license, what is a PID, What is meant by metadata, … link attributes for 2 modules below. No real exercise here to save time
    talk 2: Standardization and BIDS
  • exercise: dicom to BIDS conversion exercise: basic conversion (point towards ReproIn in next section)- but manual at this point
    talk 3: FAIR Metadata: searching and using Scicrunch
  • exercise: BIDS metadata - participants.tsv and semantic annotation. Manual lookup and pyNIDM
    talk 4: Brief Intro to NIDM
  • exercise: NIDM conversion tool to create sidecar file. BiDS2NIDM and Sparql query
  1. Computational basis
    talk 1: Shell: Getting around the “black box”
  • Exercise: Basic manipulations of command line history - turning your shell into your “notebook”
    talk 2: (Neuro)Debian/Git/GitAnnex/DataLad: Distributions and Version Control
  • Exercise: Installation of sample Debian and DataLad packages, and introspection of their provenance
    talk 3: ReproEnv: Virtual machines/Containers, Neurodocker
  • Exercise: Run containers - Create different environments
  1. Neuroimaging Workflows
    talk 1: ReproIn: BIDS datasets straight from the MR scanner
  • Exercise: Use of HeuDiConv/ReproIn and DataLad for basic neuroimaging study with complete and unambiguous provenance tracking of all actions
    talk 2: ReproFlow: Reusable scripts and environments, PROV
  • Exercise: Run, rinse, and repeat
    talk 3: ReproTest: Variability sources (analysis models, operating systems, software versions)
  • Exercise: Run analysis with different environments/different datasets
  1. Statistics for reproducibility
    Assumes we have a csv file with say 100 subjects and columns like: “age, sex, pheno1, pheno2… “
    talk 1: evil p-value : what they are - and are not
  • Exercise: test with
    talk 2:
  • Exercise:
    talk 3:
  • Exercise:

16:00-16:30 Conclusion & Getting Feedback
Nina Preuss, Preuss Enterprises, United States

  • Surveys
@satra
Copy link
Contributor Author

satra commented May 21, 2018

in general, perhaps 10 min talks + 15 min hands on exercises, which should leave a bit of slop factor and time for questions in each section.

yarikoptic pushed a commit that referenced this issue Jun 17, 2018
Correct copy pasted solution for datalad-run + minor typo
@yarikoptic yarikoptic removed their assignment Dec 19, 2022
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants