
EchoDFKD

Overview

EchoDFKD is a framework for training and evaluating models using only interactions with other existing models, without human labels or real data. It is based on data-free knowledge distillation: a student model is trained to mimic the behavior of a teacher model on a synthetic dataset generated by another trained model. The framework is designed to be general and can be applied to any domain where trained models are easier to obtain than the labeled data used to train them. Here, we apply it to echocardiography, for the task of left ventricle segmentation.
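For intuition, here is a minimal sketch of one distillation step (illustrative only, not the repository's exact code; the student and teacher interfaces and the loss choice are assumptions):

import torch
import torch.nn.functional as F

# Minimal data-free KD sketch: the student learns to match the teacher's
# soft segmentation masks on synthetic clips, so no human labels are needed.
# `student` and `teacher` are assumed to map a clip tensor to logits.
# (The actual pipeline precomputes teacher targets first; see core/.)
def distillation_step(student, teacher, synthetic_clip, optimizer):
    with torch.no_grad():
        soft_target = torch.sigmoid(teacher(synthetic_clip))  # teacher pseudo-labels
    loss = F.binary_cross_entropy_with_logits(student(synthetic_clip), soft_target)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()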

Installation

Prerequisites

Ensure you have the following installed:

  • Python 3.6+
  • PyTorch 1.10+
  • OpenCV
  • PyTorch Lightning
  • NumPy
  • Pandas

You can install all these packages using the provided requirements.txt file:

python3 -m venv ~/echodfk
source ~/echodfk/bin/activate
pip install -r requirements.txt

Prepare models

You will need to download teacher model weights. In our experiments we use the trained model from the EchoNet-Dynamic project, available at https://github.com/douyang/EchoNetDynamic/releases/download/v1.0.0/deeplabv3_resnet50_random.pt . Don't hesitate to try other teacher models. Place the weight files in models/your_teacher_name (for instance, models/echonet_deeplabV3).
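As a loading sketch (assuming the checkpoint follows torchvision's DeepLabV3-ResNet50 layout with a single output channel; adjust if your teacher differs):

import torch
import torchvision

# Sketch: instantiate a DeepLabV3-ResNet50 with one output channel (binary
# LV segmentation) and load the downloaded weights.
model = torchvision.models.segmentation.deeplabv3_resnet50(num_classes=1, aux_loss=True)
ckpt = torch.load("models/echonet_deeplabV3/deeplabv3_resnet50_random.pt", map_location="cpu")
state_dict = ckpt.get("state_dict", ckpt)  # some checkpoints wrap weights in 'state_dict'
# Checkpoints saved from nn.DataParallel may prefix keys with 'module.'.
state_dict = {k.replace("module.", "", 1): v for k, v in state_dict.items()}
model.load_state_dict(state_dict, strict=False)  # strict=False tolerates aux-head mismatches
model.eval()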

Prepare Your Datasets

You can download a synthetic dataset from https://huggingface.co/HReynaud (or generate your own synthetic dataset). If you want to run the experiments that measure performance on the EchoNet-Dynamic dataset, you also need to download that dataset from the EchoNet-Dynamic website; it is free, but you need to request access. Recently, the dataset has also been made available on Kaggle.
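For example, a dataset snapshot can be fetched with the huggingface_hub client (the repo_id below is a placeholder; substitute the actual dataset repository under that account):

from huggingface_hub import snapshot_download

# Placeholder repo_id: replace with the actual dataset repository found
# under https://huggingface.co/HReynaud.
snapshot_download(
    repo_id="HReynaud/<synthetic-dataset>",
    repo_type="dataset",
    local_dir="a4c-video-dir/Videos_synthetic",
)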

Configure Paths and Hyperparameters

You may need to change the paths in the settings.py file to match your local setup. For the directory containing the videos, you can simply set an environment variable, for example export VIDEO_DIR="/home/your_name/example_path/a4c-video-dir/Videos/", and the code will find the videos automatically.
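For reference, a hypothetical way for settings.py to resolve such a variable (names here are illustrative, not necessarily the actual code):

import os

# Illustrative sketch: read the video directory from the environment,
# falling back to the in-repo default layout.
VIDEO_DIR = os.environ.get("VIDEO_DIR", "a4c-video-dir/Videos/")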

You can also change the hyperparameters in the hyperparameters directory.

Run the pipeline

The pipeline follows these steps:

  1. Production of a synthetic dataset
  2. Production of targets on the synthetic dataset
  3. Training of the student model
  4. Inference
  5. Model evaluation
  6. Visuals

You can run the pipeline by executing the following command:

python core/run_all.py

Or you can run each step separately if you prefer; the commands below sketch one possible sequence.
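The steps map onto the scripts in core/ (see the directory structure below). Exact script arguments may vary, so check each script's options, but a run might look like:

python core/produce_targets.py
python core/train.py
python core/inference.py
python core/evaluate_LVEF.py
python core/evaluate_DICE.py
python core/evaluate_aFD.py
python core/create_visuals.py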

Directory Structure

The repository is structured as follows:

EchoDFKD/
│
├── a4c-video-dir/             # Directory containing video files and related data
│   ├── FileList.csv           # contains volumes & EF, and train/val/test split for real data
│   ├── synthetic_FileList.csv # contains volumes & EF, and train/val/test split for synthetic data
│   ├── Videos/                # Dir containing real clips in AVI format (converted from DICOM)
│   ├── Videos_synthetic/      # Dir containing synthetic AVI videos
│   └── VolumeTracings.csv     # File from EchoNet-Dynamic containing human labels
│
├── ConvLSTM_Segmentation/     # Subrepo containing the student model architecture
│   └── ...
│
├── core/                               # Whole pipeline
│   ├── produce_targets.py              # Produces targets for the synthetic dataset (step 2)
│   ├── train.py                        # Trains the student model (step 3)
│   ├── inference.py                    # Performs inference on the test dataset (step 4)
│   ├── evaluate_LVEF.py                # Evaluates the student model on the test set (step 5, part 1)
│   ├── evaluate_DICE.py                # Evaluates the student model on the test set (step 5, part 2)
│   ├── evaluate_aFD.py                 # Evaluates the student model on the test set (step 5, part 3)
│   ├── create_visuals.py               # Creates visuals for the student model (step 6)
│   └── create_synthetic_dataset.py     # WIP: would be step 1, producing the synthetic dataset
│
├── data/                               # Will store large intermediate files
│   └── ...
│
├── echoclip/                           # Echoclip related data/feature files
│   └── ...
│
├── echonet_a4c_example.py              # Defines the Example class, representing a clip
│
├── echonet_deeplab_dir/
│   └── size.csv                        # ED & ES labelled frame numbers for each video
│
├── examples_and_vizualisation/ 
│   ├── Study_labels.ipynb              # Visualize labels produced by humans
│   └── Study_EchoCLIP_outputs.ipynb    # Visualize EchoCLIP-based phase inference
│
├── hyperparameters/                    # Hyperparameter configurations
│   └── ...
│
├── models/                             # Directory for storing model weights and hyperparams
│   └── ...
│
├── Output/                             # Directory for model outputs
│   └── ...
│
└── settings.py                         # Constants, paths, settings

Results on the LV segmentation task

Here is an improved version of the figure from the paper showing the DICE scores of our models. Following recent work, we fit the experimental curve with a more flexible model than a linear one, which better captures the saturation that occurs when the dataset size becomes the limiting factor.

[Figure: segmentation performance (DICE scores) versus dataset size]
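As an illustration of this kind of saturating fit (the exact functional form used in the paper is not reproduced here, and the data points below are placeholders, not real results):

import numpy as np
from scipy.optimize import curve_fit

# Illustrative only: one common saturating form is an exponential approach
# to a plateau; the paper's functional form may differ.
def saturating(n, a, b, c):
    return a - b * np.exp(-n / c)

# Placeholder points for demonstration purposes (not real results).
n = np.array([64, 256, 1024, 4096, 16384], dtype=float)
dice = saturating(n, 0.91, 0.08, 2000.0)

params, _ = curve_fit(saturating, n, dice, p0=[0.9, 0.1, 1000.0])
print(f"estimated plateau DICE: {params[0]:.3f}")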
