Longguang Wang, Yingqian Wang, Zhengfa Liang, Zaiping Lin, Jungang Yang, Wei An, Yulan Guo
Stereo image pairs can be used to improve the performance of super-resolution (SR) since additional information is provided from a second viewpoint. However, it is challenging to incorporate this information for SR since disparities between stereo images vary significantly. In this paper, we propose a parallax-attention stereo superresolution network (PASSRnet) to integrate the information from a stereo image pair for SR. Specifically, we introduce a parallax-attention mechanism with a global receptive field along the epipolar line to handle different stereo images with large disparity variations. We also propose a new and the largest dataset for stereo image SR (namely, Flickr1024). Extensive experiments demonstrate that the parallax-attention mechanism can capture correspondence between stereo images to improve SR performance with a small computational and memory cost. Comparative results show that our PASSRnet achieves the state-of-the-art performance on the Middlebury, KITTI 2012 and KITTI 2015 datasets.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Super-Resolution | Middlebury - 4x upscaling | PSNR | 28.63 | PASSRnet |
| Super-Resolution | KITTI 2012 - 4x upscaling | PSNR | 26.26 | PASSRnet |
| Super-Resolution | KITTI 2015 - 2x upscaling | PSNR | 29.78 | PASSRnet |
| Super-Resolution | KITTI 2012 - 2x upscaling | PSNR | 30.65 | PASSRnet |
| Super-Resolution | Middlebury - 2x upscaling | PSNR | 34.05 | PASSRnet |
| Super-Resolution | KITTI 2015 - 4x upscaling | PSNR | 25.43 | PASSRnet |
| Super-Resolution | Middlebury - 4x upscaling | PSNR | 28.72 | PASSRnet |
| Super-Resolution | Middlebury - 2x upscaling | PSNR | 34.23 | PASSRnet |
| Super-Resolution | KITTI2012 - 4x upscaling | PSNR | 26.34 | PASSRnet |
| Super-Resolution | Flickr1024 - 2x upscaling | PSNR | 28.38 | PASSRnet |
| Super-Resolution | Flickr1024 - 4x upscaling | PSNR | 23.31 | PASSRnet |
| Super-Resolution | KITTI2015 - 2x upscaling | PSNR | 30.6 | PASSRnet |
| Super-Resolution | KITTI2012 - 2x upscaling | PSNR | 30.81 | PASSRnet |
| Super-Resolution | KITTI2015 - 4x upscaling | PSNR | 26.08 | PASSRnet |
| Image Super-Resolution | Middlebury - 4x upscaling | PSNR | 28.63 | PASSRnet |
| Image Super-Resolution | KITTI 2012 - 4x upscaling | PSNR | 26.26 | PASSRnet |
| Image Super-Resolution | KITTI 2015 - 2x upscaling | PSNR | 29.78 | PASSRnet |
| Image Super-Resolution | KITTI 2012 - 2x upscaling | PSNR | 30.65 | PASSRnet |
| Image Super-Resolution | Middlebury - 2x upscaling | PSNR | 34.05 | PASSRnet |
| Image Super-Resolution | KITTI 2015 - 4x upscaling | PSNR | 25.43 | PASSRnet |
| Image Super-Resolution | Middlebury - 4x upscaling | PSNR | 28.72 | PASSRnet |
| Image Super-Resolution | Middlebury - 2x upscaling | PSNR | 34.23 | PASSRnet |
| Image Super-Resolution | KITTI2012 - 4x upscaling | PSNR | 26.34 | PASSRnet |
| Image Super-Resolution | Flickr1024 - 2x upscaling | PSNR | 28.38 | PASSRnet |
| Image Super-Resolution | Flickr1024 - 4x upscaling | PSNR | 23.31 | PASSRnet |
| Image Super-Resolution | KITTI2015 - 2x upscaling | PSNR | 30.6 | PASSRnet |
| Image Super-Resolution | KITTI2012 - 2x upscaling | PSNR | 30.81 | PASSRnet |
| Image Super-Resolution | KITTI2015 - 4x upscaling | PSNR | 26.08 | PASSRnet |
| 3D Object Super-Resolution | Middlebury - 4x upscaling | PSNR | 28.63 | PASSRnet |
| 3D Object Super-Resolution | KITTI 2012 - 4x upscaling | PSNR | 26.26 | PASSRnet |
| 3D Object Super-Resolution | KITTI 2015 - 2x upscaling | PSNR | 29.78 | PASSRnet |
| 3D Object Super-Resolution | KITTI 2012 - 2x upscaling | PSNR | 30.65 | PASSRnet |
| 3D Object Super-Resolution | Middlebury - 2x upscaling | PSNR | 34.05 | PASSRnet |
| 3D Object Super-Resolution | KITTI 2015 - 4x upscaling | PSNR | 25.43 | PASSRnet |
| 3D Object Super-Resolution | Middlebury - 4x upscaling | PSNR | 28.72 | PASSRnet |
| 3D Object Super-Resolution | Middlebury - 2x upscaling | PSNR | 34.23 | PASSRnet |
| 3D Object Super-Resolution | KITTI2012 - 4x upscaling | PSNR | 26.34 | PASSRnet |
| 3D Object Super-Resolution | Flickr1024 - 2x upscaling | PSNR | 28.38 | PASSRnet |
| 3D Object Super-Resolution | Flickr1024 - 4x upscaling | PSNR | 23.31 | PASSRnet |
| 3D Object Super-Resolution | KITTI2015 - 2x upscaling | PSNR | 30.6 | PASSRnet |
| 3D Object Super-Resolution | KITTI2012 - 2x upscaling | PSNR | 30.81 | PASSRnet |
| 3D Object Super-Resolution | KITTI2015 - 4x upscaling | PSNR | 26.08 | PASSRnet |
| 16k | Middlebury - 4x upscaling | PSNR | 28.63 | PASSRnet |
| 16k | KITTI 2012 - 4x upscaling | PSNR | 26.26 | PASSRnet |
| 16k | KITTI 2015 - 2x upscaling | PSNR | 29.78 | PASSRnet |
| 16k | KITTI 2012 - 2x upscaling | PSNR | 30.65 | PASSRnet |
| 16k | Middlebury - 2x upscaling | PSNR | 34.05 | PASSRnet |
| 16k | KITTI 2015 - 4x upscaling | PSNR | 25.43 | PASSRnet |
| 16k | Middlebury - 4x upscaling | PSNR | 28.72 | PASSRnet |
| 16k | Middlebury - 2x upscaling | PSNR | 34.23 | PASSRnet |
| 16k | KITTI2012 - 4x upscaling | PSNR | 26.34 | PASSRnet |
| 16k | Flickr1024 - 2x upscaling | PSNR | 28.38 | PASSRnet |
| 16k | Flickr1024 - 4x upscaling | PSNR | 23.31 | PASSRnet |
| 16k | KITTI2015 - 2x upscaling | PSNR | 30.6 | PASSRnet |
| 16k | KITTI2012 - 2x upscaling | PSNR | 30.81 | PASSRnet |
| 16k | KITTI2015 - 4x upscaling | PSNR | 26.08 | PASSRnet |