Ching-Hsun Tseng, Shao-Ju Chien, Po-Shen Wang, Shin-Jye Lee, Wei-Huan Hu, Bin Pu, Xiao-jun Zeng
Motion mode (M-mode) recording is an essential part of echocardiography to measure cardiac dimension and function. However, the current diagnosis cannot build an automatic scheme, as there are three fundamental obstructs: Firstly, there is no open dataset available to build the automation for ensuring constant results and bridging M-mode echocardiography with real-time instance segmentation (RIS); Secondly, the examination is involving the time-consuming manual labelling upon M-mode echocardiograms; Thirdly, as objects in echocardiograms occupy a significant portion of pixels, the limited receptive field in existing backbones (e.g., ResNet) composed from multiple convolution layers are inefficient to cover the period of a valve movement. Existing non-local attentions (NL) compromise being unable real-time with a high computation overhead or losing information from a simplified version of the non-local block. Therefore, we proposed RAMEM, a real-time automatic M-mode echocardiography measurement scheme, contributes three aspects to answer the problems: 1) provide MEIS, a dataset of M-mode echocardiograms for instance segmentation, to enable consistent results and support the development of an automatic scheme; 2) propose panel attention, local-to-global efficient attention by pixel-unshuffling, embedding with updated UPANets V2 in a RIS scheme toward big object detection with global receptive field; 3) develop and implement AMEM, an efficient algorithm of automatic M-mode echocardiography measurement enabling fast and accurate automatic labelling among diagnosis. The experimental results show that RAMEM surpasses existing RIS backbones (with non-local attention) in PASCAL 2012 SBD and human performances in real-time MEIS tested. The code of MEIS and dataset are available at https://github.com/hanktseng131415go/RAME.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Instance Segmentation | PASCAL VOC 2012 | FLOPs (G) | 100.85 | RAMEM UPANet80 V2 |
| Instance Segmentation | PASCAL VOC 2012 | Frame (fps) | 60.93 | RAMEM UPANet80 V2 |
| Instance Segmentation | PASCAL VOC 2012 | Size (M) | 40.32 | RAMEM UPANet80 V2 |
| Instance Segmentation | PASCAL VOC 2012 | avgAP (mask AP + box AP) | 42.69 | RAMEM UPANet80 V2 |
| Instance Segmentation | PASCAL VOC 2012 | boxAP | 42.96 | RAMEM UPANet80 V2 |
| Instance Segmentation | PASCAL VOC 2012 | maskAP | 42.42 | RAMEM UPANet80 V2 |
| Instance Segmentation | PASCAL VOC 2012 | FLOPs (G) | 48.26 | maYOLACT ResNet50 |
| Instance Segmentation | PASCAL VOC 2012 | Frame (fps) | 81.27 | maYOLACT ResNet50 |
| Instance Segmentation | PASCAL VOC 2012 | Size (M) | 30.41 | maYOLACT ResNet50 |
| Instance Segmentation | PASCAL VOC 2012 | avgAP (mask AP + box AP) | 37.39 | maYOLACT ResNet50 |
| Instance Segmentation | PASCAL VOC 2012 | boxAP | 37.5 | maYOLACT ResNet50 |
| Instance Segmentation | PASCAL VOC 2012 | maskAP | 37.27 | maYOLACT ResNet50 |
| Instance Segmentation | PASCAL VOC 2012 | FLOPs (G) | 48.26 | YOLACT ResNet50 |
| Instance Segmentation | PASCAL VOC 2012 | Frame (fps) | 81.11 | YOLACT ResNet50 |
| Instance Segmentation | PASCAL VOC 2012 | Size (M) | 30.41 | YOLACT ResNet50 |
| Instance Segmentation | PASCAL VOC 2012 | avgAP (mask AP + box AP) | 35.73 | YOLACT ResNet50 |
| Instance Segmentation | PASCAL VOC 2012 | boxAP | 36.65 | YOLACT ResNet50 |
| Instance Segmentation | PASCAL VOC 2012 | maskAP | 35.12 | YOLACT ResNet50 |
| Instance Segmentation | MEIS | FLOPs (G) | 0.4826 | maYOLACT ResNet50 |
| Instance Segmentation | MEIS | Frame (fps) | 36.13 | maYOLACT ResNet50 |
| Instance Segmentation | MEIS | Size (M) | 30.38 | maYOLACT ResNet50 |
| Instance Segmentation | MEIS | avgAP (mask AP + box AP) | 46.29 | maYOLACT ResNet50 |
| Instance Segmentation | MEIS | boxAP | 49.59 | maYOLACT ResNet50 |
| Instance Segmentation | MEIS | maskAP | 42.99 | maYOLACT ResNet50 |
| Instance Segmentation | MEIS | FLOPs (G) | 100.85 | RAMEM UPANet80 V2 |
| Instance Segmentation | MEIS | Frame (fps) | 52.22 | RAMEM UPANet80 V2 |
| Instance Segmentation | MEIS | Size (M) | 40.28 | RAMEM UPANet80 V2 |
| Instance Segmentation | MEIS | avgAP (mask AP + box AP) | 47.15 | RAMEM UPANet80 V2 |
| Instance Segmentation | MEIS | boxAP | 51.2 | RAMEM UPANet80 V2 |
| Instance Segmentation | MEIS | maskAP | 43.09 | RAMEM UPANet80 V2 |