Commit

Update Neurips2021 dataset loader (#447)
* Add validation batches to train set

* Add comment
KaiWaldrant authored Apr 29, 2024
1 parent 6db0442 commit b6b85a3
Showing 1 changed file with 2 additions and 1 deletion.
3 changes: 2 additions & 1 deletion src/datasets/loaders/openproblems_neurips2021_bmmc/script.py
@@ -48,7 +48,8 @@ def convert_matrix(adata):
 if "is_train" not in adata.obs.columns:
   batch_info = adata.obs["batch"]
   batch_categories = batch_info.dtype.categories
-  train = ["s1d1", "s2d1", "s2d4", "s3d6", "s3d1"]
+  # From https://github.com/openproblems-bio/neurips2021_multimodal_viash/blob/75281c039ab98b459edbf52058a18597e710ed4d/src/common/datasets/process_inhouse_datasets/script.R#L14-L17
+  train = ["s1d1", "s1d2", "s2d1", "s2d4", "s3d1", "s3d6", "s3d7"]
   adata.obs["is_train"] = [ "train" if x in train else "test" for x in batch_info ]
 
 # Construct Modality datasets
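To illustrate the effect of this change, here is a minimal sketch of the train/test labelling applied to a small stand-in table. It is not the loader itself: the `label_split` helper, the `obs` DataFrame, and the batch name "s4d1" (used only as an example of a batch absent from both lists) are assumptions made for illustration. The two batch lists are copied from the diff above.

```python
import pandas as pd

# Batch lists copied from the diff: before and after this commit.
OLD_TRAIN = ["s1d1", "s2d1", "s2d4", "s3d6", "s3d1"]
NEW_TRAIN = ["s1d1", "s1d2", "s2d1", "s2d4", "s3d1", "s3d6", "s3d7"]


def label_split(batch, train_batches):
    """Hypothetical helper: "train" for cells from a listed batch, else "test"."""
    train = set(train_batches)  # set membership is O(1) per cell
    return ["train" if b in train else "test" for b in batch]


# Stand-in for adata.obs with a categorical "batch" column (illustrative data).
obs = pd.DataFrame({"batch": pd.Categorical(["s1d1", "s1d2", "s3d7", "s4d1"])})

before = label_split(obs["batch"], OLD_TRAIN)
after = label_split(obs["batch"], NEW_TRAIN)

# Only add the column when it is missing, mirroring the guard in the diff.
if "is_train" not in obs.columns:
    obs["is_train"] = after
```

With the old list, cells from "s1d2" and "s3d7" are labelled "test"; with the updated list they are labelled "train", and only batches outside the list remain "test".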
