add paths for yaml
davidackerman committed Feb 14, 2024
1 parent 9574af5 commit fed8a6d
Showing 2 changed files with 34 additions and 37 deletions.
37 changes: 14 additions & 23 deletions examples/distance_task/liver_peroxisome.ipynb
"metadata": {},
"source": [
"## Config Store\n",
"To define where the data goes, create a dacapo.yaml configuration file either in `~/.config/dacapo/dacapo.yaml` or in `./dacapo.yaml`. Here is a template:\n",
"To define where the data goes, create a dacapo.yaml configuration file. Here is a template:\n",
"```yaml \n",
"mongodbhost: mongodb://dbuser:dbpass@dburl:dbport/\n",
"mongodbname: dacapo\n",
"runs_base_dir: /path/to/my/data/storage\n",
"The runs_base_dir defines where your on-disk data will be stored. The mongodbhost and mongodbname define the mongodb host and database that will store your cloud data. If you want to store everything on disk, replace mongodbhost and mongodbname with a single type `files` and everything will be saved to disk:\n",
"The `runs_base_dir` defines where your on-disk data will be stored. The `mongodbhost` and `mongodbname` define the mongodb host and database that will store your cloud data. If you want to store everything on disk, replace `mongodbhost` and `mongodbname` with a single type: files and everything will be saved to disk:\n",
"```yaml \n",
"type: files\n",
"runs_base_dir: /path/to/my/data/storage\n",
"cell_type": "code",
"execution_count": 7,
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"from dacapo.train import train_run\n",
"source": [
"If you want to start your run on some compute cluster, you might want to use the command line interface: dacapo train -r {}. This makes it particularly convenient to run on compute nodes where you can specify specific compute requirements."
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": []
"metadata": {
34 changes: 20 additions & 14 deletions examples/distance_task/
These are then combined in a single **dacapo.experiments.Run** that includes your starting point (whether you want to start training from scratch or continue off of a previously trained model) and stopping criterion (the number of iterations you want to train).

## Environment setup
If you have not already done so, you will need to install DaCapo. We recommend you do this by first creating a new environment and then installing DaCapo using pip.
If you have not already done so, you will need to install DaCapo. You can do this by first creating a new environment and then installing DaCapo using pip.

conda create -n dacapo python=3.10
@@ -35,24 +35,25 @@ cd dacapo
pip install -e .

Be sure to select this environment in your Jupyter notebook or JupyterLab.

## Config Store
To define where the data goes, create a dacapo.yaml configuration file either in `~/.config/dacapo/dacapo.yaml` or in `./dacapo.yaml`. Here is a template:

To define where the data goes, create a dacapo.yaml configuration file. Here is a template:
mongodbhost: mongodb://dbuser:dbpass@dburl:dbport/
mongodbname: dacapo
runs_base_dir: /path/to/my/data/storage
The runs_base_dir defines where your on-disk data will be stored. The mongodbhost and mongodbname define the mongodb host and database that will store your cloud data. If you want to store everything on disk, replace mongodbhost and mongodbname with a single type `files` and everything will be saved to disk:

The `runs_base_dir` defines where your on-disk data will be stored. The `mongodbhost` and `mongodbname` define the mongodb host and database that will store your cloud data. If you want to store everything on disk, replace `mongodbhost` and `mongodbname` with a single type: files and everything will be saved to disk:

type: files
runs_base_dir: /path/to/my/data/storage
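
As a rough illustration of the disk-only template above, the following sketch writes a `dacapo.yaml` with `type: files` using only the Python standard library. The helper name `write_files_config` and the temporary directory are assumptions made for this example and are not part of DaCapo; in practice the file would go in whichever location your DaCapo setup reads its configuration from.

```python
import tempfile
from pathlib import Path


def write_files_config(config_path: Path, runs_base_dir: str) -> str:
    """Write a disk-only dacapo.yaml (hypothetical helper) and return its text."""
    # Two keys only, matching the "type: files" template shown above.
    text = f"type: files\nruns_base_dir: {runs_base_dir}\n"
    config_path.parent.mkdir(parents=True, exist_ok=True)
    config_path.write_text(text)
    return text


# Demonstrate in a throwaway directory so no real configuration is overwritten.
with tempfile.TemporaryDirectory() as tmp:
    config_path = Path(tmp) / "dacapo.yaml"
    written = write_files_config(config_path, "/path/to/my/data/storage")
    read_back = config_path.read_text()

print(written)
```

Replace `/path/to/my/data/storage` with a directory you can write to before using the resulting file.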

from import create_config_store
@@ -357,13 +358,13 @@ from import Run
start_config = None
# Uncomment to start from a pretrained model
# start_config = StartConfig(
# "setup04",
# "best",
# )
start_config = StartConfig(
iterations = 200000
validation_interval = 5000
validation_interval = 5
repetitions = 3
for i in range(repetitions):
run_config = RunConfig(
@@ -392,9 +393,9 @@ for i in range(repetitions):


## Train
@@ -410,3 +411,8 @@ train_run(run)

If you want to start your run on some compute cluster, you might want to use the command line interface: dacapo train -r {}. This makes it particularly convenient to run on compute nodes where you can specify specific compute requirements.
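
For cluster submission, it can help to generate one command line per run. The sketch below assumes that the `{}` in `dacapo train -r {}` stands for the run name; the run names listed are hypothetical placeholders, and the helper `build_train_command` is not part of DaCapo. It only builds the command strings and does not launch anything.

```python
import shlex


def build_train_command(run_name: str) -> str:
    """Return a shell-safe `dacapo train -r <run_name>` command string."""
    # shlex.join quotes arguments that contain spaces or shell metacharacters.
    return shlex.join(["dacapo", "train", "-r", run_name])


# Hypothetical run names; substitute the names of the runs you created.
run_names = ["example_run_0", "example_run_1", "example_run_2"]
commands = [build_train_command(name) for name in run_names]

for command in commands:
    print(command)
```

Each resulting string can then be passed to your scheduler's submission command together with the compute requirements you need.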


