V00.01.00003 release (#356)

* cleaning up files which are no longer needed * fixes after removing forking workflow (#322) * PR to resolve merge issues * updated main build as well * added ability to read in git branch name directly * manually updated the other files * fixed number of classes for main build tests (#327) * fixed number of classes for main build tests * corrected DATASET.ROOT in builds * added dev build script * Fixes for development inside the docker container (#335) * Fix the mound command for the HRNet pretrained model in the docker readme * Properly catch InvalidGitRepository exception * make repo paths consistent with non-docker runs -- this way configs paths do not need to be changed * Properly catch InvalidGitRepository exception in train.py * Readme update (#337) * README updates * Removing user specific path from config Authored-by: Fatemeh Zamanian <Fatemeh.Zamanian@microsoft.com> * Fixing #324 and #325 (#338) * update colormap to a non-discrete one -- fixes #324 * fix mask_to_disk to normalize by n_classes * changes to test.py * Updating data.py * bug fix * increased timeout time for main_build * retrigger build * retrigger the build * increase timeout * fixes 318 (#339) * finished 318 * increased checkerboard test timeout * fix 333 (#340) * added label correction to train gradient * changing the gradient data generator to take inline/crossline argument conssistent with the patchloader * changing variable name to be more descriptive Co-authored-by: maxkazmsft <maxkaz@microsoft.com> * bug fix to model predictions (#345) * replace hrnet with seresnet in experiments - provides stable default model (#343) Co-authored-by: yalaudah <yazeed.alaudah@microsoft.com> Co-authored-by: Fatemeh <fazamani@microsoft.com>
microsoft · Jun 8, 2020 · 15d45fb · 15d45fb
1 parent 904157c
commit 15d45fb
Show file tree

Hide file tree

Showing 30 changed files with 538 additions and 160 deletions.
diff --git a/README.md b/README.md
@@ -3,12 +3,13 @@
 
 This repository shows you how to perform seismic imaging and interpretation on Azure. It empowers geophysicists and data scientists to run seismic experiments using state-of-art DSL-based PDE solvers and segmentation algorithms on Azure.  
 
+
 The repository provides sample notebooks, data loaders for seismic data, utilities, and out-of-the-box ML pipelines, organized as follows:
 - **sample notebooks**: these can be found in the `examples` folder - they are standard Jupyter notebooks which highlight how to use the codebase by walking the user through a set of pre-made examples
 - **experiments**: the goal is to provide runnable Python scripts that train and test (score) our machine learning models in the `experiments` folder. The models themselves are swappable, meaning a single train script can be used to run a different model on the same dataset by simply swapping out the configuration file which defines the model. 
-- **pip installable utilities**: we provide `cv_lib` and `deepseismic_interpretation` utilities (more info below) which are used by both sample notebooks and experiments mentioned above
+- **pip installable utilities**: we provide `cv_lib` and `interpretation` utilities (more info below) which are used by both sample notebooks and experiments mentioned above
 
-DeepSeismic currently focuses on Seismic Interpretation (3D segmentation aka facies classification) with experimental code provided around Seismic Imaging in the contrib folder.
+DeepSeismic currently focuses on Seismic Interpretation (mainly facies classification) with experimental code provided around Seismic Imaging in the contrib folder.
 
 ### Quick Start
 
@@ -26,7 +27,7 @@ If you run into any problems, chances are your problem has already been solved i
 The notebook is designed to be run in demo mode by default using a pre-trained model in under 5 minutes on any reasonable Deep Learning GPU such as nVidia K80/P40/P100/V100/TitanV.
 
 ### Azure Machine Learning
-[Azure Machine Learning](https://docs.microsoft.com/en-us/azure/machine-learning/) enables you to train and deploy your machine learning models and pipelines at scale, ane leverage open-source Python frameworks, such as PyTorch, TensorFlow, and scikit-learn. If you are looking at getting started with using the code in this repository with Azure Machine Learning, refer to [Azure Machine Learning How-to](https://github.com/Azure/MachineLearningNotebooks/tree/master/how-to-use-azureml) to get started.
+[Azure Machine Learning](https://docs.microsoft.com/en-us/azure/machine-learning/) enables you to train and deploy your machine learning models and pipelines at scale, and leverage open-source Python frameworks, such as PyTorch, TensorFlow, and scikit-learn. If you are looking at getting started with using the code in this repository with Azure Machine Learning, refer to [Azure Machine Learning How-to](https://github.com/Azure/MachineLearningNotebooks/tree/master/how-to-use-azureml) to get started.
 
 ## Interpretation
 For seismic interpretation, the repository consists of extensible machine learning pipelines, that shows how you can leverage state-of-the-art segmentation algorithms (UNet, SEResNET, HRNet) for seismic interpretation.
@@ -120,9 +121,12 @@ To prepare the data for the experiments (e.g. split into train/val/test), please
 cd scripts
 
 # For patch-based experiments
-python prepare_dutchf3.py split_train_val patch --data_dir=${data_dir} --label_file=train/train_labels.npy --output_dir=splits \
+python prepare_dutchf3.py split_train_val patch --data_dir=${data_dir}/data --label_file=train/train_labels.npy --output_dir=splits \
 --stride=50 --patch_size=100 --split_direction=both
 
+# For section-based experiments
+python prepare_dutchf3.py split_train_val section --data-dir=${data_dir}/data --label_file=train/train_labels.npy --output_dir=splits \ --split_direction=both
+
 # go back to repo root
 cd ..
 ```
@@ -164,7 +168,7 @@ We use [YACS](https://github.com/rbgirshick/yacs) configuration library to manag
 - __yml config files__ - YAML configuration files under `configs/` are typically created one for each experiment. These are meant to be used for repeatable experiment runs and reproducible settings. Each configuration file only overrides the options that are changing in that experiment (e.g. options loaded from `defaults.py` during an experiment run will be overridden by arguments loaded from the yaml file). As an example, to use yml configuration file with the training script, run:
 
     ```
-    python train.py --cfg "configs/hrnet.yaml"
+    python train.py --cfg "configs/seresnet_unet.yaml"
     ```
 
 - __command line__ - Finally, options can be passed in through `options` argument, and those will override arguments loaded from the configuration file. We created CLIs for all our scripts (using Python Fire library), so you can pass these options via command-line arguments, like so:
@@ -229,8 +233,8 @@ This section contains benchmarks of different algorithms for seismic interpretat
 
 
 #### Reproduce benchmarks
-In order to reproduce the benchmarks, you will need to navigate to the [experiments](experiments) folder. In there, each of the experiments are split into different folders. To run the Netherlands F3 experiment navigate to the [dutchf3_patch/local](experiments/dutchf3_patch/local) folder. In there is a training script [([train.sh](experiments/dutchf3_patch/local/train.sh))
-which will run the training for any configuration you pass in. Once you have run the training you will need to run the [test.sh](experiments/dutchf3_patch/local/test.sh) script. Make sure you specify
+In order to reproduce the benchmarks, you will need to navigate to the [experiments](experiments) folder. In there, each of the experiments are split into different folders. To run the Netherlands F3 experiment navigate to the [dutchf3_patch/local](experiments/interpretation/dutchf3_patch/local) folder. In there is a training script [([train.sh](experiments/interpretation/dutchf3_patch/local/train.sh))
+which will run the training for any configuration you pass in. Once you have run the training you will need to run the [test.sh](experiments/interpretation/dutchf3_patch/local/test.sh) script. Make sure you specify
 the path to the best performing model from your training run, either by passing it in as an argument or altering the YACS config file. 
 
 ## Contributing

diff --git a/contrib/experiments/interpretation/dutchf3_patch/distributed/train.py b/contrib/experiments/interpretation/dutchf3_patch/distributed/train.py
@@ -2,10 +2,10 @@
 # Licensed under the MIT License.
 #
 # To Run on 2 GPUs
-# python -m torch.distributed.launch --nproc_per_node=2 train.py --cfg "configs/hrnet.yaml"
+# python -m torch.distributed.launch --nproc_per_node=2 train.py --cfg "configs/seresnet_unet.yaml"
 #
 # To Test:
-# python -m torch.distributed.launch --nproc_per_node=2 train.py TRAIN.END_EPOCH 1 TRAIN.SNAPSHOTS 1 --cfg "configs/hrnet.yaml" --debug
+# python -m torch.distributed.launch --nproc_per_node=2 train.py TRAIN.END_EPOCH 1 TRAIN.SNAPSHOTS 1 --cfg "configs/seresnet_unet.yaml" --debug
 #
 # /* spell-checker: disable */
 """Train models on Dutch F3 dataset
@@ -138,7 +138,7 @@ def run(*options, cfg=None, local_rank=0, debug=False):
         stride=config.TRAIN.STRIDE,
         patch_size=config.TRAIN.PATCH_SIZE,
         augmentations=train_aug,
-    )    
+    )
 
     val_set = TrainPatchLoader(
         config.DATASET.ROOT,
@@ -154,10 +154,10 @@ def run(*options, cfg=None, local_rank=0, debug=False):
 
     if debug:
         val_set = data.Subset(val_set, range(config.VALIDATION.BATCH_SIZE_PER_GPU))
-        train_set = data.Subset(train_set, range(config.TRAIN.BATCH_SIZE_PER_GPU*2))
-    
+        train_set = data.Subset(train_set, range(config.TRAIN.BATCH_SIZE_PER_GPU * 2))
+
     logger.info(f"Training examples {len(train_set)}")
-    logger.info(f"Validation examples {len(val_set)}")    
+    logger.info(f"Validation examples {len(val_set)}")
 
     train_sampler = torch.utils.data.distributed.DistributedSampler(train_set, num_replicas=world_size, rank=local_rank)
 
@@ -193,7 +193,7 @@ def run(*options, cfg=None, local_rank=0, debug=False):
 
     model = torch.nn.parallel.DistributedDataParallel(model, device_ids=[device], find_unused_parameters=True)
 
-    snapshot_duration = epochs_per_cycle * len(train_loader) if not debug else 2*len(train_loader)
+    snapshot_duration = epochs_per_cycle * len(train_loader) if not debug else 2 * len(train_loader)
 
     warmup_duration = 5 * len(train_loader)
 

diff --git a/contrib/experiments/interpretation/dutchf3_patch/distributed/train.sh b/contrib/experiments/interpretation/dutchf3_patch/distributed/train.sh
@@ -1,3 +1,3 @@
 #!/bin/bash
 export PYTHONPATH=/data/home/mat/repos/DeepSeismic/interpretation:$PYTHONPATH
-python -m torch.distributed.launch --nproc_per_node=8 train.py --cfg configs/hrnet.yaml
+python -m torch.distributed.launch --nproc_per_node=8 train.py --cfg configs/seresnet_unet.yaml
diff --git a/contrib/experiments/interpretation/dutchf3_voxel/README.md b/contrib/experiments/interpretation/dutchf3_voxel/README.md
@@ -1,13 +1,13 @@
-First, make sure that `${HOME}/data/dutch_f3` folder exists and you have write access.
+First, make sure that `${HOME}/data/dutch` folder exists and you have write access.
 
 Next, to get the main input dataset which is the [Dutch F3 dataset](https://terranubis.com/datainfo/Netherlands-Offshore-F3-Block-Complete), 
 navigate to [MalenoV](https://github.com/bolgebrygg/MalenoV) project website and follow the links (which will lead to 
 [this](https://drive.google.com/drive/folders/0B7brcf-eGK8CbGhBdmZoUnhiTWs) download). Save this file as 
-`${HOME}/data/dutch_f3/data.segy`
+`${HOME}/data/dutch/data.segy`
 
 To download the train and validation masks, from the root of the repo, run
 ```bash
-./contrib/scripts/get_F3_voxel.sh ${HOME}/data/dutch_f3
+./contrib/scripts/get_F3_voxel.sh ${HOME}/data/dutch
 ```
 
 This will also download train and validation masks to the same location as data.segy.

diff --git a/contrib/experiments/interpretation/dutchf3_voxel/configs/texture_net.yaml b/contrib/experiments/interpretation/dutchf3_voxel/configs/texture_net.yaml
@@ -15,7 +15,7 @@ WINDOW_SIZE: 65
 
 DATASET:
   NUM_CLASSES: 2
-  ROOT: /home/maxkaz/data/dutchf3
+  ROOT: /home/username/data/dutchf3
   FILENAME: data.segy  
 
 MODEL:

diff --git a/contrib/experiments/interpretation/dutchf3_voxel/train.py b/contrib/experiments/interpretation/dutchf3_voxel/train.py
@@ -158,7 +158,7 @@ def run(*options, cfg=None):
     def _select_pred_and_mask(model_out):
         # receive a tuple of (x, y_pred), y
         # so actually in line 51 of
-        # cv_lib/cv_lib/segmentation/dutch_f3/metrics/__init__.py
+        # cv_lib/cv_lib/segmentation/dutch/metrics/__init__.py
         # we do the following line, so here we just select the model
         # _, y_pred = torch.max(model_out[0].squeeze(), 1, keepdim=True)
         y_pred = model_out[0].squeeze()

diff --git a/contrib/experiments/interpretation/penobscot/local/test.py b/contrib/experiments/interpretation/penobscot/local/test.py
@@ -2,7 +2,7 @@
 # Licensed under the MIT License.
 #
 # To Test:
-# python test.py TRAIN.END_EPOCH 1 TRAIN.SNAPSHOTS 1 --cfg "configs/hrnet.yaml" --debug
+# python test.py TRAIN.END_EPOCH 1 TRAIN.SNAPSHOTS 1 --cfg "configs/seresnet_unet.yaml" --debug
 #
 # /* spell-checker: disable */
 """Train models on Penobscot dataset
@@ -244,9 +244,7 @@ def _select_max(pred_tensor):
     def _tensor_to_numpy(pred_tensor):
         return pred_tensor.squeeze().cpu().numpy()
 
-    transform_func = compose(
-        np_to_tb, decode_segmap, _tensor_to_numpy,
-    )
+    transform_func = compose(np_to_tb, decode_segmap, _tensor_to_numpy,)
 
     transform_pred = compose(transform_func, _select_max)
 

diff --git a/contrib/experiments/interpretation/penobscot/local/train.py b/contrib/experiments/interpretation/penobscot/local/train.py
@@ -2,7 +2,7 @@
 # Licensed under the MIT License.
 #
 # To Test:
-# python train.py TRAIN.END_EPOCH 1 TRAIN.SNAPSHOTS 1 --cfg "configs/hrnet.yaml" --debug
+# python train.py TRAIN.END_EPOCH 1 TRAIN.SNAPSHOTS 1 --cfg "configs/seresnet_unet.yaml" --debug
 #
 # /* spell-checker: disable */
 """Train models on Penobscot dataset
@@ -43,6 +43,7 @@
 
 mask_value = 255
 
+
 def _prepare_batch(batch, device=None, non_blocking=False):
     x, y, ids, patch_locations = batch
     return (
@@ -253,9 +254,7 @@ def _select_max(pred_tensor):
     def _tensor_to_numpy(pred_tensor):
         return pred_tensor.squeeze().cpu().numpy()
 
-    transform_func = compose(
-        np_to_tb, decode_segmap, _tensor_to_numpy,
-    )
+    transform_func = compose(np_to_tb, decode_segmap, _tensor_to_numpy,)
 
     transform_pred = compose(transform_func, _select_max)
 

diff --git a/contrib/experiments/interpretation/voxel2pixel/test_parallel.py b/contrib/experiments/interpretation/voxel2pixel/test_parallel.py
@@ -245,7 +245,7 @@ def main_worker(gpu, ngpus_per_node, args):
 
 
 parser = argparse.ArgumentParser(description="Seismic Distributed Scoring")
-parser.add_argument("-d", "--data", default="/home/maxkaz/data/dutchf3", type=str, help="default dataset folder name")
+parser.add_argument("-d", "--data", default="/home/username/data/dutchf3", type=str, help="default dataset folder name")
 parser.add_argument(
     "-s",
     "--slice",

diff --git a/contrib/experiments/interpretation/voxel2pixel/train.py b/contrib/experiments/interpretation/voxel2pixel/train.py
@@ -15,7 +15,7 @@
 import utils
 
 # Parameters
-ROOT_PATH = "/home/maxkaz/data/dutchf3"
+ROOT_PATH = "/home/username/data/dutchf3"
 INPUT_VOXEL = "data.segy"
 TRAIN_MASK = "inline_339.png"
 VAL_MASK = "inline_405.png"

diff --git a/cv_lib/cv_lib/utils.py b/cv_lib/cv_lib/utils.py
@@ -1,4 +1,3 @@
-
 # Copyright (c) Microsoft Corporation.
 # Licensed under the MIT License.
 
@@ -8,6 +7,7 @@
 import numpy as np
 from matplotlib import pyplot as plt
 
+
 def normalize(array):
     """
     Normalizes a segmentation mask array to be in [0,1] range
@@ -16,12 +16,19 @@ def normalize(array):
     min = array.min()
     return (array - min) / (array.max() - min)
 
-def mask_to_disk(mask, fname, cmap_name="Paired"):
+
+def mask_to_disk(mask, fname, n_classes, cmap_name="rainbow"):
     """
     write segmentation mask to disk using a particular colormap
+    mask (float): this contains the predicted labels in the range [0, n_classes].
+    fname (str): of the the image to be saved 
+    n_classes (int): total number of classes in the dataset
+    cmap_name (str): name of the matplotlib colormap to be used. The default "rainbow"
+        colormap works well for any number of classes. 
     """
     cmap = plt.get_cmap(cmap_name)
-    Image.fromarray(cmap(normalize(mask), bytes=True)).save(fname)
+    Image.fromarray(cmap(mask / n_classes, bytes=True)).save(fname)
+
 
 def image_to_disk(mask, fname, cmap_name="seismic"):
     """
@@ -30,7 +37,8 @@ def image_to_disk(mask, fname, cmap_name="seismic"):
     cmap = plt.get_cmap(cmap_name)
     Image.fromarray(cmap(normalize(mask), bytes=True)).save(fname)
 
-def decode_segmap(label_mask, colormap_name="Paired"):
+
+def decode_segmap(label_mask, colormap_name="rainbow"):
     """
     Decode segmentation class labels into a colour image
         Args:
@@ -48,6 +56,7 @@ def decode_segmap(label_mask, colormap_name="Paired"):
 
     return out
 
+
 def load_log_configuration(log_config_file):
     """
     Loads logging configuration from the given configuration file.

diff --git a/docker/Dockerfile b/docker/Dockerfile
@@ -25,7 +25,8 @@ RUN wget --quiet https://repo.anaconda.com/miniconda/Miniconda3-latest-Linux-x86
     wget --quiet https://github.com/microsoft/seismic-deeplearning/archive/master.zip -O master.zip && \
     unzip master.zip && rm master.zip
 
-RUN cd seismic-deeplearning-master && \
+RUN mv seismic-deeplearning-master seismic-deeplearning && \
+    cd seismic-deeplearning && \
     conda env create -n seismic-interpretation --file environment/anaconda/local/environment.yml && \
     source activate seismic-interpretation && \
     python -m ipykernel install --user --name seismic-interpretation && \
@@ -34,7 +35,7 @@ RUN cd seismic-deeplearning-master && \
 
 # TODO: add back in later when Penobscot notebook is available
 # Download Penobscot dataset:
-# RUN cd seismic-deeplearning-master && \
+# RUN cd seismic-deeplearning && \
 #    data_dir="/home/username/data/penobscot" && \
 #    mkdir -p "$data_dir" && \
 #    ./scripts/download_penobscot.sh "$data_dir"  && \
@@ -44,7 +45,7 @@ RUN cd seismic-deeplearning-master && \
 #    cd ..
 
 # Download F3 dataset:
-RUN cd seismic-deeplearning-master && \
+RUN cd seismic-deeplearning && \
     data_dir="/home/username/data/dutch" && \
     mkdir -p "$data_dir" && \
     ./scripts/download_dutch_f3.sh "$data_dir" && \

diff --git a/docker/README.md b/docker/README.md
@@ -1,14 +1,14 @@
-This Docker image allows the user to run the notebooks in this repository on any operating system without having to setup the environment or install anything other than the Docker engine. For instructions on how to install the Docker engine, click [here](https://www.docker.com/get-started). 
+This Docker image allows the user to run the notebooks in this repository on any Unix based operating system without having to setup the environment or install anything other than the Docker engine. We recommend using [Azure Data Science Virtual Machine (DSVM) for Linux (Ubuntu)](https://docs.microsoft.com/en-us/azure/machine-learning/data-science-virtual-machine/dsvm-ubuntu-intro) as outlined [here](../README.md#compute-environment). For instructions on how to install the Docker engine, click [here](https://www.docker.com/get-started). 
 
 # Download the HRNet model: 
 
-To run the [`HRNet_Penobscot_demo_notebook.ipynb`](https://github.com/microsoft/seismic-deeplearning/blob/master/examples/interpretation/notebooks/HRNet_Penobscot_demo_notebook.ipynb), you will need to manually download the [HRNet-W48-C](https://1drv.ms/u/s!Aus8VCZ_C_33dKvqI6pBZlifgJk) pretrained model. You can follow the instructions [here.](https://github.com/microsoft/seismic-deeplearning#hrnet). 
+To run the [`Dutch_F3_patch_model_training_and_evaluation.ipynb`](https://github.com/microsoft/seismic-deeplearning/blob/master/examples/interpretation/notebooks/Dutch_F3_patch_model_training_and_evaluation.ipynb), you will need to manually download the [HRNet-W48-C](https://1drv.ms/u/s!Aus8VCZ_C_33dKvqI6pBZlifgJk) pretrained model. You can follow the instructions [here.](../README.md#pretrained-models). 
 
 If you are using an Azure Virtual Machine to run this code, you can download the model to your local machine, and then copy it to your Azure VM through the command below. Please make sure you update the `<azureuser>` and `<azurehost>` feilds.
 ```bash
 scp hrnetv2_w48_imagenet_pretrained.pth <azureuser>@<azurehost>:/home/<azureuser>/seismic-deeplearning/docker/hrnetv2_w48_imagenet_pretrained.pth
 ```
-Once you have the model downloaded (ideally under the `docker` directory), you can process to build the Docker image. 
+Once you have the model downloaded (ideally under the `docker` directory), you can proceed to build the Docker image. 
 
 # Build the Docker image:
 
@@ -22,7 +22,7 @@ This process will take a few minutes to complete.
 # Run the Docker image:
 Once the Docker image is built, you can run it anytime using the following command:
 ```bash
-sudo docker run --rm -it -p 9000:9000 -p 9001:9001 --gpus=all --shm-size 11G --mount type=bind,source=$PWD/hrnetv2_w48_imagenet_pretrained.pth,target=/home/models/hrnetv2_w48_imagenet_pretrained.pth seismic-deeplearning
+sudo docker run --rm -it -p 9000:9000 -p 9001:9001 --gpus=all --shm-size 11G --mount type=bind,source=$PWD/hrnetv2_w48_imagenet_pretrained.pth,target=/home/username/seismic-deeplearning/docker/hrnetv2_w48_imagenet_pretrained.pth seismic-deeplearning
 ```
 If you have saved the pretrained model in a different directory, make sure you replace `$PWD/hrnetv2_w48_imagenet_pretrained.pth` with the **absolute** path to the pretrained HRNet model. The command above will run a Jupyter Lab instance that you can access by clicking on the link in your terminal. You can then navigate to the notebook or script that you would like to run. 
 

diff --git a/environment/anaconda/local/environment.yml b/environment/anaconda/local/environment.yml
@@ -38,3 +38,5 @@ dependencies:
     - scipy==1.1.0
     - jupytext==1.3.0
     - validators
+    - pyyaml
+
diff --git a/examples/interpretation/README.md b/examples/interpretation/README.md
@@ -1,3 +1,5 @@
-The folder contains notebook examples illustrating the use of segmentation algorithms on openly available datasets. Make sure you have followed the [set up instructions](../README.md) before running these examples. We provide the following notebook examples 
+The folder contains notebook examples illustrating the use of segmentation algorithms on openly available datasets. Make sure you have followed the [set up instructions](../../README.md) before running these examples. We provide the following notebook examples 
 
 * [Dutch F3 dataset](notebooks/Dutch_F3_patch_model_training_and_evaluation.ipynb): This notebook illustrates section and patch based segmentation approaches on the [Dutch F3](https://terranubis.com/datainfo/Netherlands-Offshore-F3-Block-Complete) open dataset. This notebook uses denconvolution based segmentation algorithm on 2D patches. The notebook will guide you through visualization of the input volume, setting up model training and evaluation. 
+
+To understand the configuration files and the dafault parameters refer to this [section in the top level README](../../README.md#configuration-files)