This is the official implementation, with training code, for the thesis Cell Morphology Based Diagnosis of Cancer using Convolutional Neural Networks: CellNet. For technical details, please refer to:
Cell Morphology Based Diagnosis of Cancer using Convolutional Neural Networks: CellNet
Qiang Li*, Corin Otesteanu*, Manfred Claassen*
Paper: in preparation
[Software Report] [CellNetSoftware Video] [Research Grant Page]
These are the reproduction results from this repository. All training/testing LSF log files from the ETH Zurich Leonhard cluster can be downloaded from our lsf file, and all original data used to generate the data-analysis graphs can be downloaded from the all data file.
Comparison in terms of detection/segmentation accuracy with YOLO-based methods (Redmon & Farhadi, 2018) (He et al., 2019). We selected 850 representative images from the Sezary syndrome dataset for training, consisting of noise images and typical cell images (manually labeled HD cell images and SS cell images). In the evaluation stage, we used 723 images (308 HD cell images, 306 SS cell images, and 109 noise images). This split is intended to approximate the actual cell data distribution, since noise images are less frequent than cell images in the real Sezary dataset. It is worth noting that AttentionNet* denotes the combination of the algorithms mentioned above, including GBCIOU segmentation, K-means++ clustering in pre-processing, and 13 × 13 and 26 × 26 YOLO output layers, in contrast to the original YOLO, which is widely used only for detection or object localization without segmentation. TP means a cell detected as a cell, FP means noise detected as a cell, and TN means a noise image correctly labeled as noise. mAP refers to mean Average Precision.
Model | TP (cell detected as cell) | FP (noise detected as cell) | TN (noise detected as noise) | No detection | mAP |
---|---|---|---|---|---|
YOLOV3-tiny | 63.19% | 0.91% | 87.16% | 33.05% | 0.55 |
AttentionNet* Solution | 96.25% | 11% | 80.73% | 1.93% | 0.88 |
TF-Yolo with Kmean++ Clustering | 91.20% | 9.17% | 66.05% | 11.20% | 0.73 |
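For reference, the K-means++ clustering mentioned above is typically used to derive YOLO-style anchor priors from the labeled bounding boxes. Below is a minimal sketch using scikit-learn over box widths and heights; this is an illustration under assumptions (placeholder box values, plain Euclidean distance, 6 anchors split across the two output scales), not the exact preprocessing script in this repository, which may use an IoU-based distance instead.

```python
# Hedged sketch: derive YOLO-style anchor priors with k-means++ over box sizes.
# Not the repository's exact preprocessing; boxes below are placeholder values.
import numpy as np
from sklearn.cluster import KMeans

# (width, height) of labeled cell bounding boxes, normalized to the input size.
# In practice this array would contain every labeled box in the training set.
boxes = np.array([
    [0.10, 0.12], [0.08, 0.09], [0.22, 0.25], [0.18, 0.20],
    [0.30, 0.28], [0.12, 0.15], [0.09, 0.11], [0.25, 0.27],
    [0.16, 0.17], [0.28, 0.30], [0.11, 0.13], [0.20, 0.22],
])

n_anchors = 6  # e.g. 3 anchors for each of the 13x13 and 26x26 output layers
kmeans = KMeans(n_clusters=n_anchors, init="k-means++", n_init=10, random_state=0)
kmeans.fit(boxes)

# Sort anchors by area so they can be assigned to the two output scales.
anchors = kmeans.cluster_centers_
anchors = anchors[np.argsort(anchors[:, 0] * anchors[:, 1])]
print(anchors)
```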
Evaluate CellNet performance on CIFAR-10
This is the boxplot of ResNet-18, OurNet, and GhostNet on CIFAR-10 without AttentionNet. Because every image in this dataset is only 32 × 32 pixels, it is hard to train a good segmentor with AttentionNet to filter out the other artifacts in the image. As illustrated, even without AttentionNet preprocessing, our net already achieves the best performance.
Model | Weights (million) | Top-1 Val Acc. (%) | FLOPs (million) |
---|---|---|---|
VGG-16 | 15 | 93.6 | 313 |
ResNet-18 | 11 | 91.96 | 180 |
GhostNet | 5.18 | 91.45 | 141 |
OurNet | 2.91 | 92.45 | 41.7 |
The CIFAR-10 dataset consists of 60,000 32 × 32 color images in 10 classes, with 50,000 training images and 10,000 test images. A common data augmentation scheme, including random crop and mirroring, is adopted as well.
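A standard way to implement this augmentation scheme (random crop with padding plus horizontal mirroring) with torchvision is sketched below; this is the common recipe with commonly used normalization constants, not necessarily the exact pipeline used in this repository.

```python
# Hedged sketch of the standard CIFAR-10 augmentation: random crop + mirroring.
import torchvision
import torchvision.transforms as transforms

train_transform = transforms.Compose([
    transforms.RandomCrop(32, padding=4),      # random 32x32 crop from a padded image
    transforms.RandomHorizontalFlip(),         # random mirroring
    transforms.ToTensor(),
    transforms.Normalize((0.4914, 0.4822, 0.4465),   # commonly used CIFAR-10 mean
                         (0.2470, 0.2435, 0.2616)),  # commonly used CIFAR-10 std
])

train_set = torchvision.datasets.CIFAR10(
    root="./data", train=True, download=True, transform=train_transform)
```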
Note:
- Speed is tested on the ETH Zurich Leonhard cluster.
- You will see AttentionNet and ghostresNet in several places in the paper; do not be confused: ghostresNet = CellNet in the paper, just a nickname :).
Evaluate CellNet performance on Pneumonia Dataset
On the benchmark pneumonia dataset, the Pneumonia/Normal classification validation accuracy of our net converges to nearly 91.785%, better than GhostNet and ResNet-18. In addition, our net converges after around 80 epochs, compared to Inception V3, which reaches 88.0% after 7000 epochs.
Model | Weights (million) | Top-1 Val Acc. (%) | FLOPs (million) |
---|---|---|---|
InceptionV3 | 23.81 | 88 | 540 |
ResNet-18 | 11 | 87.50 | 180 |
GhostNet | 5.18 | 88.69 | 141 |
OurNet | 2.91 | 91.78 | 41.7 |
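The Top-1 Val Acc. figures reported in these tables correspond to a standard validation loop; a minimal sketch is shown below, with the model, dataloader, and device names as placeholders rather than this repository's actual evaluation script.

```python
# Hedged sketch of a top-1 validation accuracy loop (names are placeholders).
import torch

@torch.no_grad()
def top1_accuracy(model, val_loader, device="cuda"):
    model.eval()
    correct, total = 0, 0
    for images, labels in val_loader:
        images, labels = images.to(device), labels.to(device)
        logits = model(images)
        preds = logits.argmax(dim=1)              # top-1 prediction per image
        correct += (preds == labels).sum().item()
        total += labels.size(0)
    return 100.0 * correct / total                # accuracy in percent
```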
Evaluate CellNet performance on Sezary Syndrome Dataset
ResNet-18 [17] and ShuffleNet V2 [25] have so far been the most representative, best-performing models on the Sezary syndrome dataset. Our net achieves higher classification performance (e.g. 95.638% top-1 accuracy) than ResNet-18 [17], ShuffleNet V2 [25], and GhostNet [16], with fewer weights and lower computational cost.
Model | Weights (million) | Top-1 Val Acc. (%) | FLOPs (million) |
---|---|---|---|
ResNet-18 | 11 | 95.28 | 180 |
GhostNet | 5.18 | 93.411 | 141 |
OurNet | 2.91 | 95.638 | 41.7 |
ShuffleNet V2 | 1.4 | 83.868 | 41 |
Note:
- Speed is tested on the ETH Zurich Leonhard cluster.
- Performance is tested with AttentionNet preprocessing.
- This is the I-chart (summary report) of OurNet, ResNet-18, and ShuffleNet without AttentionNet.
- This is the time-series plot of ShuffleNet V2, ResNet-18, and GhostNet validation accuracy on the Sezary syndrome dataset with AttentionNet preprocessing.
Evaluate CellNet performance on COVID-19 Dataset
To help medical scientists, we built this COVID-19 CT dataset. It is based on the initial COVID-19 Image Data Collection, which contains only 123 frontal-view X-rays. We also collected data from the newest publications in the European Journal of Radiology and gathered nearly 1583 healthy lung CT/X-ray images as comparative data from recently available resources and publications.
Model | Weights (million) | Top-1 Val Acc. (%) | FLOPs (million) |
---|---|---|---|
ResNet-18 | 11 | 94.389 | 180 |
GhostNet | 5.18 | 92.739 | 141 |
OurNet | 2.91 | 94.719 | 41.7 |
MobileNet V2 | 3.4 | 95.38 | 301 |
Vgg11_BN | 13.28 | 87.129 | 132.87 |
DenseNet121 | 7.98 | 95.71 | 283 |
AlexNet | 60.95 | 0 | 727 |
SqueezeNet V2 | -- | 0 | 40 |
Note:
- -- denotes not provided.
- Speed is tested on the ETH Zurich Leonhard cluster.
- Performance is tested without AttentionNet preprocessing.
- This is the I-chart (summary report) of OurNet, ResNet-18, and ShuffleNet without AttentionNet.
Comparison of state-of-the-art methods trained on the COVID-19 dataset. Our model has 2.91 million weights, compared to DenseNet121 with 7.98 million weights and MobileNet V2 with 3.4 million weights and 301 million FLOPs; considering the higher complexity and parameter counts of the other SOTA nets, our net is very competitive on classification tasks for biomedical datasets.
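The weight counts above can be checked directly from a model's parameters; FLOPs/MACs usually require an external profiler. The sketch below uses a torchvision model as a stand-in and the third-party thop package as an assumption; it is not part of this repository.

```python
# Hedged sketch: count trainable weights (in millions) for any PyTorch model.
import torch
import torchvision.models as models

model = models.mobilenet_v2(num_classes=2)   # stand-in for CellNet / OurNet
n_params = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"weights: {n_params / 1e6:.2f} M")

# MACs/FLOPs typically need a profiler, e.g. the third-party `thop` package
# (assumption: thop is installed; it is not bundled with this repository).
try:
    from thop import profile
    macs, params = profile(model, inputs=(torch.randn(1, 3, 224, 224),))
    print(f"MACs: {macs / 1e6:.1f} M")
except ImportError:
    pass
```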
Evaluate AttentionNet performance on Sezary Syndrome Dataset with Saliency Map
To better visualize the performance of AttentionNet and demonstrate its necessity, we wrote a saliency script to generate attention maps. ResNet-18 puts more attention outside the ROI, while VGG and our net focus more on the ROI. AttentionNet plays a vital role in eliminating artifacts, forcing the models to focus more on the cell itself.
Note:
- For more attention maps see saliencymap folder.
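For illustration, a vanilla gradient-based saliency map can be generated as sketched below. This is a minimal sketch of the general technique, not necessarily the exact script in the saliencymap folder; the function name and arguments are placeholders.

```python
# Hedged sketch of a vanilla-gradient saliency map for one input image.
import torch

def saliency_map(model, image, target_class=None):
    """image: tensor of shape (1, C, H, W), already normalized."""
    model.eval()
    image = image.clone().requires_grad_(True)
    logits = model(image)
    if target_class is None:
        target_class = logits.argmax(dim=1).item()
    logits[0, target_class].backward()        # gradient of class score w.r.t. pixels
    # Saliency = maximum absolute gradient across color channels.
    return image.grad.abs().max(dim=1)[0].squeeze(0)   # shape (H, W)
```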
Saliency-map comparison grid. Columns: original picture, after AttentionNet segmentation, OurNet with AttentionNet, OurNet without AttentionNet, ResNet-18 with AttentionNet, ResNet-18 without AttentionNet, VGG-16 with AttentionNet. Original pictures: hd070916_2 (7102).png, hd070916_2 (7558).png, hd1 (3697).png, hd1 (4550).png, hd17_5 (1876).png, hd1 (4400).png, hd3 (1).png, ss2_8 (117).png, ss1_2 (270).png, ss2_8 (142).png, ss2_8 (468).png.
Prediction with our best CellNet weights trained so far on the Non-cerebriform dataset. As shown in the figure, TP and TN achieve the highest overall scores on the HD/SS folders with larger image counts. Moreover, average accuracy reaches 99.53%-96.51% on HD images and 92.19%-98.78% on SS images, but some small folders obtain only 38.29%-37.48% on SS1 and SS2, and 40.17% on the SS6_B folder.
After further fine-tuning (starting from the best weights trained so far plus a new subset of the Non-cerebriform data, with mini-batch size 679, trained for around 100 epochs), we tested the performance again. As shown, the accuracy on the SS1, SS2, and SS6_B folders surprisingly improves to 64.34%, 82.64%, and 96.91%, respectively. A sketch of this fine-tuning step is given after the next paragraph.
This is the comparison between CellNet and ResNet-18 on the Non-cerebriform dataset with fine-tuning. As illustrated, our net has comparable accuracy, even higher on some folders.
Prediction with our best CellNet weights trained so far on the cerebriform dataset. As shown in the figure, TP and TN achieve accuracy (in %) comparable to ResNet-18.
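The fine-tuning step described above follows the standard pattern of resuming from the best checkpoint and continuing training on the new subset. The sketch below is illustrative only: `build_cellnet`, `finetune_loader`, the checkpoint path, and the optimizer settings are placeholders, not this repository's actual training script.

```python
# Hedged sketch of the fine-tuning step: resume from the best checkpoint and
# continue training on the new subset (names and paths are placeholders).
import torch
import torch.nn as nn

model = build_cellnet()                                   # placeholder constructor
model.load_state_dict(torch.load("best_cellnet.pth"))     # best weights so far
model.train()

optimizer = torch.optim.SGD(model.parameters(), lr=1e-3, momentum=0.9)
criterion = nn.CrossEntropyLoss()

for epoch in range(100):                                  # ~100 fine-tuning epochs
    # finetune_loader yields mini-batches of the new Non-cerebriform subset
    # (the mini-batch size, e.g. 679, is set when constructing the loader).
    for images, labels in finetune_loader:
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()
```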
Our software is now uploaded to the nash cloud as well and supports further training from pretrained weights. All the prediction log files can be checked here: lsg file for you to check.
Want to try it on your own dataset with our CellNet? No problem! These are all the commands:
Take a look at our CellNet software framework. Our CellNet won 2nd place in the Top AI Camp DeeCamp2020 Medical Track.
With the power of Qt and the high efficiency of Python, using PyQt/PySide for desktop development is a wonderful plus for demonstrating our software. The common Qt/PyQt/PySide-based GUI development methods are: QWidget + QSS, QtWebkit + HTML + CSS + JS, and Qt Quick. All three technologies can efficiently and quickly produce cross-platform desktop software. Qt's recommended development method is Qt Quick, which uses the JSON-like language QML for rapid development. It is easy to learn, extensible, and widely used in Ubuntu, Linux Deepin, and other Linux desktop application development. It gives the developer a rapid development framework, lets them put more effort into the corresponding business logic, and makes it easy to build framework prototypes quickly.
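For orientation, a minimal Qt Quick entry point driven from Python looks roughly like the sketch below. This is a generic PySide2 example under assumptions (the file main.qml is a placeholder), not the actual launcher of the CellNet software.

```python
# Hedged sketch of a minimal Qt Quick launcher in Python (PySide2).
# "main.qml" is a placeholder; this is not the CellNet software's entry point.
import sys
from PySide2.QtGui import QGuiApplication
from PySide2.QtQml import QQmlApplicationEngine
from PySide2.QtCore import QUrl

app = QGuiApplication(sys.argv)
engine = QQmlApplicationEngine()
engine.load(QUrl.fromLocalFile("main.qml"))   # the QML file declares the UI
if not engine.rootObjects():                  # QML failed to load
    sys.exit(-1)
sys.exit(app.exec_())
```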
The proposed software structure diagram. To better demonstrate our model's diagnostic performance, we selected classic medical benchmark datasets from Kaggle competitions, such as the melanoma dataset, the diabetic retinopathy dataset, and the actinic keratosis, vascular lesion, dermatofibroma, and squamous cell carcinoma datasets. Meanwhile, we selected nearly 11 representative classification networks, enabling users to choose the diagnostic network that fits their own dataset. Besides, we inherit both computer vision classification networks and classic NLP classification networks. We develop desktop applications and open APIs to facilitate a better user experience, and ETH Leonhard and MegEngine jointly provide our computing power.
All software copyright licensed by Qiang Li.
@inproceedings{Qiang21ICLRW,
author = {Qiang Li and Lily Xu and Corin Otesteanu},
title = {All you need is Cell Attention: A Cell Annotation Tool for Single-Cell Morphology Data},
booktitle = {AI4PH Workshop on ICLR},
year = {2021}
}