
Masked autoencoder pre-training for virtual staining models #67

Merged: 56 commits merged into main from fcmae on Apr 8, 2024

Conversation

@ziw-liu ziw-liu (Collaborator) commented Feb 8, 2024

No description provided.

@edyoshikun edyoshikun self-requested a review February 24, 2024 19:04
@ziw-liu ziw-liu added the enhancement (New feature or request) label Mar 28, 2024
@ziw-liu ziw-liu marked this pull request as ready for review March 28, 2024 23:13
@edyoshikun edyoshikun (Contributor) left a comment


This branch has worked well for both 2.2D and 3D LUNeXT training/testing, so I am happy with it:

  • Combined and concatenated data loaders work (see the sketch below)
  • Augmentations show no issues
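
For reference, here is a minimal sketch of what "concatenated" vs. "combined" loading means, written with plain PyTorch and Lightning primitives (assuming Lightning ≥ 2.0); it is a hypothetical illustration, not the data modules added in this PR:

```python
# Hypothetical sketch of "concatenated" vs. "combined" loading using plain
# PyTorch/Lightning primitives; not the data modules added in this PR.
import torch
from torch.utils.data import ConcatDataset, DataLoader, TensorDataset
from lightning.pytorch.utilities import CombinedLoader

# Two toy datasets standing in for different source datasets/FOVs.
ds_a = TensorDataset(torch.randn(8, 1, 32, 32))
ds_b = TensorDataset(torch.randn(12, 1, 32, 32))

# "Concatenated": one dataset, one loader, batches mix samples from both sources.
concat_loader = DataLoader(ConcatDataset([ds_a, ds_b]), batch_size=4, shuffle=True)
for (batch,) in concat_loader:
    pass  # batch: (4, 1, 32, 32), drawn from the union of ds_a and ds_b

# "Combined": separate loaders iterated together, e.g. for multi-dataset validation.
combined = CombinedLoader(
    {"a": DataLoader(ds_a, batch_size=4), "b": DataLoader(ds_b, batch_size=4)},
    mode="max_size_cycle",
)
for batch, batch_idx, dataloader_idx in combined:
    pass  # batch: {"a": ..., "b": ...}, one sub-batch per dataset
```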

@mattersoflight (Member) commented Apr 8, 2024

@ziw-liu Looks like this is now the de facto branch for both training recipes: end-to-end training and pre-training + fine-tuning. Can you confirm if that is the case? Please merge in main if the merge won't break the current experiments.

Combined and concatenated data loaders are also valuable for infection phenotyping work. cc: @Soorya19Pradeep

@ziw-liu (Collaborator, Author) commented Apr 8, 2024

> Looks like this is now the de facto branch for both training recipes: end-to-end training and pre-training + fine-tuning. Can you confirm if that is the case? Please merge in main if the merge won't break the current experiments.

This is the case. However, note that I didn't do comprehensive backwards-compatibility testing, so it could break previously trained models.
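
For illustration, the two recipes can be sketched with placeholder modules; `Encoder` and `VirtualStainingNet` below are hypothetical stand-ins, not this repository's models:

```python
# Hypothetical sketch of the two training recipes; the classes below are
# placeholders, not this repository's models.
import torch
from torch import nn


class Encoder(nn.Module):
    """Stand-in for the convolutional encoder shared by both recipes."""

    def __init__(self) -> None:
        super().__init__()
        self.features = nn.Sequential(nn.Conv2d(1, 16, 3, padding=1), nn.GELU())

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.features(x)


class VirtualStainingNet(nn.Module):
    """Stand-in for an encoder-decoder virtual staining model."""

    def __init__(self) -> None:
        super().__init__()
        self.encoder = Encoder()
        self.decoder = nn.Conv2d(16, 1, 3, padding=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.decoder(self.encoder(x))


# Recipe 1: end-to-end training, everything starts from random initialization.
model = VirtualStainingNet()

# Recipe 2: pre-training + fine-tuning, the encoder is initialized from a
# masked-autoencoder pre-training run before supervised fine-tuning.
pretrained_encoder = Encoder()  # stand-in for an FCMAE-pretrained encoder
model.encoder.load_state_dict(pretrained_encoder.state_dict())
```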

@ziw-liu ziw-liu merged commit 0536d29 into main Apr 8, 2024
3 checks passed
@ziw-liu ziw-liu deleted the fcmae branch April 8, 2024 16:22
@mattersoflight (Member) commented Apr 8, 2024

> it could break previously trained models.

👍🏼 Since we are tracking the key hyper-parameters with configs, we can retrain the models we need.
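
A toy sketch of what config-tracked retraining means in practice (the file name and schema are hypothetical, not this repository's actual config format):

```python
# Hypothetical illustration of retraining from a tracked config; the file name
# and schema are placeholders, not this repository's config format.
import json

config = {"model": {"backbone": "fcmae", "in_channels": 1}, "trainer": {"max_epochs": 50}}
with open("train_config.json", "w") as f:
    json.dump(config, f, indent=2)

# Later, reload the same hyper-parameters to retrain an equivalent model.
with open("train_config.json") as f:
    restored = json.load(f)
assert restored == config
```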

edyoshikun added a commit that referenced this pull request Jun 12, 2024
* refactor data loading into its own module

* update type annotations

* move the logging module out

* move old logging into utils

* rename tests to match module name

* bump torch

* draft fcmae encoder

* add stem to the encoder

* wip: masked stem layernorm

* wip: patchify masked features for linear

* use mlp from timm

* hack: POC training script for FCMAE

* fix mask for fitting

* remove training script

* default architecture

* fine-tuning options

* fix cli for finetuning

* draft combined data module

* fix import

* manual validation loss reduction

* update linting
new black version has different rules

* update development guide

* update type hints

* bump iohub

* draft ctmc v1 dataset

* update tests

* move test_data

* remove path conversion

* configurable normalizations (#68)

* initial commit adding the normalization.

* adding dataset_statistics to each fov to facilitate the configurable augmentations

* fix indentation

* ruff

* test preprocessing

* remove redundant field

* cleanup

---------

Co-authored-by: Ziwen Liu <[email protected]>

* fix ctmc dataloading

* add example ctmc v1 loading script

* changing the normalization and augmentations default from None to empty list.

* invert intensity transform

* concatenated data module

* subsample videos

* livecell dataset

* all sample fields are optional

* fix multi-dataloader validation

* lint

* fixing preprocessing for varying array shapes (i.e. the aics dataset)

* update loading scripts

* fix CombineMode

* compose normalizations for predict and test stages

* black

* fix normalization in example config

* fix collate when multi-sample transform is not used

* ddp caching fixes

* fix caching when using combined loader

* move log values to GPU before syncing
Lightning-AI/pytorch-lightning#18803

* removing normalize_source from configs.

* typing fixes

* fix test data path

* fix test dataset

* add docstring for ConcatDataModule

* format

---------

Co-authored-by: Eduardo Hirata-Miyasaki <[email protected]>
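
Several of the commits above ("draft fcmae encoder", "wip: patchify masked features for linear", "fix mask for fitting") build up the fully convolutional masked autoencoder (FCMAE) pre-training path. Below is a minimal, hypothetical sketch of the random patch-masking idea behind it, not the encoder implemented in this PR:

```python
# Hypothetical sketch of FCMAE-style random patch masking; not the encoder
# implemented in this PR.
import torch


def random_patch_mask(
    batch: int, height: int, width: int, patch: int = 16, mask_ratio: float = 0.5
) -> torch.Tensor:
    """Keep-mask of shape (B, 1, H, W): 1 where patches stay visible, 0 where masked."""
    gh, gw = height // patch, width // patch
    noise = torch.rand(batch, gh * gw)
    n_keep = int(gh * gw * (1 - mask_ratio))
    # Rank the noise per sample and keep the n_keep lowest-ranked patches.
    keep = noise.argsort(dim=1).argsort(dim=1) < n_keep
    mask = keep.reshape(batch, 1, gh, gw).float()
    # Upsample the patch grid back to pixel resolution.
    return mask.repeat_interleave(patch, dim=2).repeat_interleave(patch, dim=3)


x = torch.randn(2, 1, 64, 64)                                  # phase images
mask = random_patch_mask(2, 64, 64, patch=16, mask_ratio=0.5)
x_visible = x * mask                                           # encoder sees only visible patches
# The decoder reconstructs the hidden patches; the loss is computed on the
# masked region, e.g. ((recon - x) ** 2 * (1 - mask)).mean()
```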
Labels
enhancement (New feature or request)

Development
Merging this pull request may close: Variable input size training and data pooling

3 participants