Data folder¶
BioSegment works with datasets located in a ROOT_DATA_FOLDER
folder, mostly a mounted network drive. Users can access the network drive directly to import or export files with their own tools and workflows.
Structure¶
- ROOT_DATA_FOLDER
- setup.json
- JSON file containing configurations the backend reads during initialization of the database. Not required. Used in development. See the example at
scripts/example_setup.json
.
- JSON file containing configurations the backend reads during initialization of the database. Not required. Used in development. See the example at
- EM/
- {dataset_name} e.g. EMBL
- raw/
- {pngs}
- raw/
- {dataset_name} e.g. EMBL
- models/
- {model_name} e.g. unet_2d
- = output folder of neuralnets training
- saved model e.g. best_checkpoint.pytorch
- {model_name} e.g. unet_2d
- segmentations/
- {dataset_name}
- labels/
- = ground truth labels of the dataset
- {pngs}
- {segmentation_name}
- = output folder of neuralnets inference
- {pngs}
- labels/
- {dataset_name}
- annotations/
- {dataset_name}
- {annotation_name}
- saved annotations e.g. annotations.json
- {annotation_name}
- {dataset_name}
- setup.json
New dataset¶
Adding a new dataset requires creating a folder with the dataset title in the EM/
folder e.g. {ROOT_DATA_FOLDER}/EM/{NEW_DATASET_NAME}
. Then the dataset files need to be set in this folder e.g. a single .tiff
or multiple .png
files.