TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HUMUS-Net: Hybrid unrolled multi-scale network architectur...

HUMUS-Net: Hybrid unrolled multi-scale network architecture for accelerated MRI reconstruction

Zalan Fabian, Berk Tınaz, Mahdi Soltanolkotabi

2022-03-15AnatomyMRI Reconstruction
PaperPDFCode(official)Code(official)

Abstract

In accelerated MRI reconstruction, the anatomy of a patient is recovered from a set of under-sampled and noisy measurements. Deep learning approaches have been proven to be successful in solving this ill-posed inverse problem and are capable of producing very high quality reconstructions. However, current architectures heavily rely on convolutions, that are content-independent and have difficulties modeling long-range dependencies in images. Recently, Transformers, the workhorse of contemporary natural language processing, have emerged as powerful building blocks for a multitude of vision tasks. These models split input images into non-overlapping patches, embed the patches into lower-dimensional tokens and utilize a self-attention mechanism that does not suffer from the aforementioned weaknesses of convolutional architectures. However, Transformers incur extremely high compute and memory cost when 1) the input image resolution is high and 2) when the image needs to be split into a large number of patches to preserve fine detail information, both of which are typical in low-level vision problems such as MRI reconstruction, having a compounding effect. To tackle these challenges, we propose HUMUS-Net, a hybrid architecture that combines the beneficial implicit bias and efficiency of convolutions with the power of Transformer blocks in an unrolled and multi-scale network. HUMUS-Net extracts high-resolution features via convolutional blocks and refines low-resolution features via a novel Transformer-based multi-scale feature extractor. Features from both levels are then synthesized into a high-resolution output reconstruction. Our network establishes new state of the art on the largest publicly available MRI dataset, the fastMRI dataset. We further demonstrate the performance of HUMUS-Net on two other popular MRI datasets and perform fine-grained ablation studies to validate our design.

Results

TaskDatasetMetricValueModel
Image ReconstructionfastMRI Knee 8xPSNR37.3HUMUS-Net (train+val data)
Image ReconstructionfastMRI Knee 8xSSIM0.8945HUMUS-Net (train+val data)
Image ReconstructionfastMRI Knee 8xPSNR37HUMUS-Net (train only)
Image ReconstructionfastMRI Knee 8xSSIM0.8936HUMUS-Net (train only)
Image ReconstructionfastMRI Knee Val 8x NMSE0.0086HUMUS-Net-L
Image ReconstructionfastMRI Knee Val 8x PSNR37.45HUMUS-Net-L
Image ReconstructionfastMRI Knee Val 8x Params (M)228HUMUS-Net-L
Image ReconstructionfastMRI Knee Val 8x SSIM0.8955HUMUS-Net-L
Image ReconstructionfastMRI Knee Val 8x NMSE0.009HUMUS-Net
Image ReconstructionfastMRI Knee Val 8x PSNR37.2HUMUS-Net
Image ReconstructionfastMRI Knee Val 8x Params (M)109HUMUS-Net
Image ReconstructionfastMRI Knee Val 8x SSIM0.8946HUMUS-Net
MRI ReconstructionfastMRI Knee 8xPSNR37.3HUMUS-Net (train+val data)
MRI ReconstructionfastMRI Knee 8xSSIM0.8945HUMUS-Net (train+val data)
MRI ReconstructionfastMRI Knee 8xPSNR37HUMUS-Net (train only)
MRI ReconstructionfastMRI Knee 8xSSIM0.8936HUMUS-Net (train only)
MRI ReconstructionfastMRI Knee Val 8x NMSE0.0086HUMUS-Net-L
MRI ReconstructionfastMRI Knee Val 8x PSNR37.45HUMUS-Net-L
MRI ReconstructionfastMRI Knee Val 8x Params (M)228HUMUS-Net-L
MRI ReconstructionfastMRI Knee Val 8x SSIM0.8955HUMUS-Net-L
MRI ReconstructionfastMRI Knee Val 8x NMSE0.009HUMUS-Net
MRI ReconstructionfastMRI Knee Val 8x PSNR37.2HUMUS-Net
MRI ReconstructionfastMRI Knee Val 8x Params (M)109HUMUS-Net
MRI ReconstructionfastMRI Knee Val 8x SSIM0.8946HUMUS-Net

Related Papers

Are Vision Foundation Models Ready for Out-of-the-Box Medical Image Registration?2025-07-15PSAT: Pediatric Segmentation Approaches via Adult Augmentations and Transfer Learning2025-07-08SV-DRR: High-Fidelity Novel View X-Ray Synthesis Using Diffusion Model2025-07-07Causal-SAM-LLM: Large Language Models as Causal Reasoners for Robust Medical Segmentation2025-07-04Pose-Star: Anatomy-Aware Editing for Open-World Fashion Images2025-07-04MDPG: Multi-domain Diffusion Prior Guidance for MRI Reconstruction2025-06-30X-SiT: Inherently Interpretable Surface Vision Transformers for Dementia Diagnosis2025-06-25Volumetric segmentation of muscle compartments using in vivo imaging and architectural validation in human finger flexors2025-06-25