TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/NeuriCam: Key-Frame Video Super-Resolution and Colorizatio...

NeuriCam: Key-Frame Video Super-Resolution and Colorization for IoT Cameras

Bandhav Veluri, Collin Pernu, Ali Saffari, Joshua Smith, Michael Taylor, Shyamnath Gollakota

2022-07-25Super-ResolutionVideo Super-ResolutionKey-Frame-based Video Super-Resolution (K = 15)Colorization
PaperPDFCode(official)

Abstract

We present NeuriCam, a novel deep learning-based system to achieve video capture from low-power dual-mode IoT camera systems. Our idea is to design a dual-mode camera system where the first mode is low-power (1.1 mW) but only outputs grey-scale, low resolution, and noisy video and the second mode consumes much higher power (100 mW) but outputs color and higher resolution images. To reduce total energy consumption, we heavily duty cycle the high power mode to output an image only once every second. The data for this camera system is then wirelessly sent to a nearby plugged-in gateway, where we run our real-time neural network decoder to reconstruct a higher-resolution color video. To achieve this, we introduce an attention feature filter mechanism that assigns different weights to different features, based on the correlation between the feature map and the contents of the input frame at each spatial location. We design a wireless hardware prototype using off-the-shelf cameras and address practical issues including packet loss and perspective mismatch. Our evaluations show that our dual-camera approach reduces energy consumption by 7x compared to existing systems. Further, our model achieves an average greyscale PSNR gain of 3.7 dB over prior single and dual-camera video super-resolution methods and 5.6 dB RGB gain over prior color propagation methods. Open-source code: https://github.com/vb000/NeuriCam.

Results

TaskDatasetMetricValueModel
Super-ResolutionUDM10 - 4x upscalingPSNR42.03NeuriCam-net
Super-ResolutionUDM10 - 4x upscalingSSIM0.9814NeuriCam-net
Super-ResolutionVid4 - 4x upscalingPSNR31.36NeuriCam-net
Super-ResolutionVid4 - 4x upscalingSSIM0.933NeuriCam-net
3D Human Pose EstimationUDM10 - 4x upscalingPSNR42.03NeuriCam-net
3D Human Pose EstimationUDM10 - 4x upscalingSSIM0.9814NeuriCam-net
3D Human Pose EstimationVid4 - 4x upscalingPSNR31.36NeuriCam-net
3D Human Pose EstimationVid4 - 4x upscalingSSIM0.933NeuriCam-net
VideoUDM10 - 4x upscalingPSNR42.03NeuriCam-net
VideoUDM10 - 4x upscalingSSIM0.9814NeuriCam-net
VideoVid4 - 4x upscalingPSNR31.36NeuriCam-net
VideoVid4 - 4x upscalingSSIM0.933NeuriCam-net
Pose EstimationUDM10 - 4x upscalingPSNR42.03NeuriCam-net
Pose EstimationUDM10 - 4x upscalingSSIM0.9814NeuriCam-net
Pose EstimationVid4 - 4x upscalingPSNR31.36NeuriCam-net
Pose EstimationVid4 - 4x upscalingSSIM0.933NeuriCam-net
3DUDM10 - 4x upscalingPSNR42.03NeuriCam-net
3DUDM10 - 4x upscalingSSIM0.9814NeuriCam-net
3DVid4 - 4x upscalingPSNR31.36NeuriCam-net
3DVid4 - 4x upscalingSSIM0.933NeuriCam-net
3D Face AnimationUDM10 - 4x upscalingPSNR42.03NeuriCam-net
3D Face AnimationUDM10 - 4x upscalingSSIM0.9814NeuriCam-net
3D Face AnimationVid4 - 4x upscalingPSNR31.36NeuriCam-net
3D Face AnimationVid4 - 4x upscalingSSIM0.933NeuriCam-net
2D Human Pose EstimationUDM10 - 4x upscalingPSNR42.03NeuriCam-net
2D Human Pose EstimationUDM10 - 4x upscalingSSIM0.9814NeuriCam-net
2D Human Pose EstimationVid4 - 4x upscalingPSNR31.36NeuriCam-net
2D Human Pose EstimationVid4 - 4x upscalingSSIM0.933NeuriCam-net
3D Absolute Human Pose EstimationUDM10 - 4x upscalingPSNR42.03NeuriCam-net
3D Absolute Human Pose EstimationUDM10 - 4x upscalingSSIM0.9814NeuriCam-net
3D Absolute Human Pose EstimationVid4 - 4x upscalingPSNR31.36NeuriCam-net
3D Absolute Human Pose EstimationVid4 - 4x upscalingSSIM0.933NeuriCam-net
Video Super-ResolutionUDM10 - 4x upscalingPSNR42.03NeuriCam-net
Video Super-ResolutionUDM10 - 4x upscalingSSIM0.9814NeuriCam-net
Video Super-ResolutionVid4 - 4x upscalingPSNR31.36NeuriCam-net
Video Super-ResolutionVid4 - 4x upscalingSSIM0.933NeuriCam-net
3D Object Super-ResolutionUDM10 - 4x upscalingPSNR42.03NeuriCam-net
3D Object Super-ResolutionUDM10 - 4x upscalingSSIM0.9814NeuriCam-net
3D Object Super-ResolutionVid4 - 4x upscalingPSNR31.36NeuriCam-net
3D Object Super-ResolutionVid4 - 4x upscalingSSIM0.933NeuriCam-net
1 Image, 2*2 StitchiUDM10 - 4x upscalingPSNR42.03NeuriCam-net
1 Image, 2*2 StitchiUDM10 - 4x upscalingSSIM0.9814NeuriCam-net
1 Image, 2*2 StitchiVid4 - 4x upscalingPSNR31.36NeuriCam-net
1 Image, 2*2 StitchiVid4 - 4x upscalingSSIM0.933NeuriCam-net

Related Papers

SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution2025-07-14PanoDiff-SR: Synthesizing Dental Panoramic Radiographs using Diffusion and Super-resolution2025-07-12HNOSeg-XS: Extremely Small Hartley Neural Operator for Efficient and Resolution-Robust 3D Image Segmentation2025-07-104KAgent: Agentic Any Image to 4K Super-Resolution2025-07-09EAMamba: Efficient All-Around Vision State Space Model for Image Restoration2025-06-27Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models2025-06-25MTSIC: Multi-stage Transformer-based GAN for Spectral Infrared Image Colorization2025-06-21