TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/Highway networks

Highway networks

GeneralIntroduced 200024 papers
Source Paper

Description

There is plenty of theoretical and empirical evidence that depth of neural networks is a crucial ingredient for their success. However, network training becomes more difficult with increasing depth and training of very deep networks remains an open problem. In this extended abstract, we introduce a new architecture designed to ease gradient-based training of very deep networks. We refer to networks with this architecture as highway networks, since they allow unimpeded information flow across several layers on "information highways". The architecture is characterized by the use of gating units which learn to regulate the flow of information through a network. Highway networks with hundreds of layers can be trained directly using stochastic gradient descent and with a variety of activation functions, opening up the possibility of studying extremely deep and efficient architectures.

Papers Using This Method

To Stay or to Bypass: Unraveling Mainline Vehicles' Aggregate Strategic Decision-Making at Highway Weaving Ramps2025-05-13Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free2025-05-10Deconstructing Recurrence, Attention, and Gating: Investigating the transferability of Transformers and Gated Recurrent Neural Networks in forecasting of dynamical systems2024-10-03Highway Networks for Improved Surface Reconstruction: The Role of Residuals and Weight Updates2024-07-11Computer vision-based model for detecting turning lane features on Florida's public roadways2024-06-13Deep Learning-Based Vehicle Speed Prediction for Ecological Adaptive Cruise Control in Urban and Highway Scenarios2022-11-30Multi-task recommendation system for scientific papers with high-way networks2022-04-21Transfer Learning with Graph Neural Networks for Short-Term Highway Traffic Forecasting2020-04-17Graph-Partitioning-Based Diffusion Convolutional Recurrent Neural Network for Large-Scale Traffic Forecasting2019-09-24Investigating the effect of residual and highway connections in speech enhancement models2018-10-22Batch-normalized Recurrent Highway Networks2018-09-26Semi-tied Units for Efficient Gating in LSTM and Highway Networks2018-06-18Humor Recognition Using Deep Learning2018-06-01Avoiding degradation in deep feed-forward networks by phasing out skip-connections2018-01-01Language Modeling with Recurrent Highway Hypernetworks2017-12-01Exploiting Nontrivial Connectivity for Automatic Speech Recognition2017-11-28Lattice Recurrent Unit: Improving Convergence and Statistical Efficiency for Sequence Modeling2017-10-06Application of a Hybrid Bi-LSTM-CRF model to the task of Russian Named Entity Recognition2017-09-27Language Modeling with Highway LSTM2017-09-19Early Improving Recurrent Elastic Highway Network2017-08-14