TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/Hierarchical Feature Fusion

Hierarchical Feature Fusion

Computer VisionIntroduced 200064 papers
Source Paper

Description

Hierarchical Feature Fusion (HFF) is a feature fusion method employed in ESP and EESP image model blocks for degridding. In the ESP module, concatenating the outputs of dilated convolutions gives the ESP module a large effective receptive field, but it introduces unwanted checkerboard or gridding artifacts. To address the gridding artifact in ESP, the feature maps obtained using kernels of different dilation rates are hierarchically added before concatenating them (HFF). This solution is simple and effective and does not increase the complexity of the ESP module.

Papers Using This Method

OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning2025-05-31BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing2025-03-17MSV-Mamba: A Multiscale Vision Mamba Network for Echocardiography Segmentation2025-01-13HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection2025-01-10Self-Paced Learning Strategy with Easy Sample Prior Based on Confidence for the Flying Bird Object Detection Model Training2024-12-09HIP: Hierarchical Point Modeling and Pre-training for Visual Information Extraction2024-11-02InstructBioMol: Advancing Biomolecule Understanding and Design Following Human Instructions2024-10-10ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech2024-09-24GraspMamba: A Mamba-based Language-driven Grasp Detection Framework with Hierarchical Feature Learning2024-09-22Coherence influx is indispensable for quantum reservoir computing2024-09-19ESPnet-EZ: Python-only ESPnet for Easy Fine-tuning and Integration2024-09-14The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization2024-07-23Modality-Order Matters! A Novel Hierarchical Feature Fusion Method for CoSAm: A Code-Switched Autism Corpus2024-07-19Advanced Multimodal Deep Learning Architecture for Image-Text Matching2024-06-13ESP: Extro-Spective Prediction for Long-term Behavior Reasoning in Emergency Scenarios2024-05-07A cost minimization approach to fix the vocabulary size in a tokenizer for an End-to-End ASR system2024-04-29LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism2024-04-15Towards Efficient Resume Understanding: A Multi-Granularity Multi-Modal Pre-Training Approach2024-04-13Extending echo state property for quantum reservoir computing2024-03-05Synthesizing Environment-Specific People in Photographs2023-12-22