- The paper introduces a feedback-based learning architecture that iteratively refines predictions to deliver early outputs and conform to hierarchical taxonomies.
- It employs a convolutional LSTM framework to generate coarse-to-fine representations and enable episodic curriculum learning within a single inference cycle.
- Empirical evaluations on datasets like CIFAR-100, Stanford Cars, and MPII Human Pose demonstrate competitive results and notable improvements in joint localization accuracy.
Feedback Networks: A Comprehensive Examination
The paper "Feedback Networks" investigates the advantages of replacing the feedforward structure that dominates convolutional neural networks (ConvNets) with a feedback-based approach. The authors identify several intrinsic benefits of incorporating feedback mechanisms into neural networks: the ability to make early predictions, compliance with label taxonomies, and a natural form of curriculum learning.
Core Contributions and Insights
The authors propose a generalized feedback-based learning architecture instantiated through existing recurrent neural networks (RNNs). The primary contributions can be summarized as follows:
- Feedback Mechanisms: Unlike feedforward networks, feedback networks predict outcomes iteratively. This results in several advantageous properties:
- Early Predictions: Outputs can be estimated at a fraction of the total inference time, which is crucial in latency-sensitive applications such as robotics and autonomous driving.
- Taxonomy Compliance: Feedback networks naturally produce predictions conforming to a hierarchical structure in label spaces. This characteristic is attributed to their iterative learning approach, which first determines coarse categories before refining to more granular classes.
- Episodic Curriculum Learning: Traditional curriculum learning orders training samples from easy to hard. Feedback networks instead admit a curriculum within a single inference pass: the sequence of iterative predictions moves from an easier, coarser version of the task to a harder, finer-grained one.
- Representation Formation: Feedback networks inherently form representations differently compared to feedforward models. The iterative nature results in a coarse-to-fine representation scheme, as opposed to the typical feature evolution from low-level to high-level in feedforward networks.
- Architecture and Implementation: The feedback network is realized with a convolutional LSTM (Long Short-Term Memory) core. The authors analyze how long each feedback loop should be (the physical depth of the recurring module) and the effect of skip connections in time; empirical results suggest that effective feedback networks balance feedback loop lengths to optimize performance.
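The core mechanism above, a shared recurrent module that re-reads the same input at every iteration and emits a prediction each time, can be sketched as follows. This is a minimal illustrative toy, not the paper's ConvLSTM implementation: the fully connected cell, the dimensions, and names like `feedback_step` and `readout` are all assumptions made for clarity.

```python
import numpy as np

rng = np.random.default_rng(0)

D, H, C, T = 8, 16, 4, 3        # input dim, hidden dim, classes, iterations
W_in = rng.normal(scale=0.1, size=(H, D))
W_rec = rng.normal(scale=0.1, size=(H, H))
W_out = rng.normal(scale=0.1, size=(C, H))

def feedback_step(x, h):
    """One feedback iteration: the shared cell re-reads the same input x
    together with its own previous state h (the feedback connection)."""
    return np.tanh(W_in @ x + W_rec @ h)

def readout(h):
    """Per-iteration prediction head: softmax over class scores."""
    z = W_out @ h
    e = np.exp(z - z.max())
    return e / e.sum()

def predict(x, max_iters=T):
    """Run the loop, emitting a prediction after EVERY iteration,
    so a caller can stop early and still get an output (early prediction)."""
    h = np.zeros(H)
    preds = []
    for _ in range(max_iters):
        h = feedback_step(x, h)
        preds.append(readout(h))
    return preds

x = rng.normal(size=D)
preds = predict(x)
print(len(preds))  # one prediction per iteration
```

The key design point is that, unlike a feedforward stack of distinct layers, a single set of weights is applied repeatedly, so inference depth (number of iterations) is decoupled from parameter count, and each iteration yields a usable output.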
Empirical Evaluation
The proposed architecture was evaluated using several datasets, prominently CIFAR-100, Stanford Cars, and MPII Human Pose. The feedback networks consistently demonstrated competitive final results, matching or outperforming contemporary feedforward networks despite introducing early prediction capabilities. The advantages were both practical, in terms of explicit metric improvements, and conceptual, providing a robust alternative to existing paradigms.
- Early Prediction: Feedback networks achieved significant accuracy at intermediate stages of computation, setting them apart from feedforward models, whose early layers yield weaker, less class-discriminative representations.
- Taxonomic Prediction: Experiments confirmed that feedback networks produce more hierarchy-compliant outputs. This was visually evidenced by a novel timed-tSNE visualization method that highlighted how representations change across iterations.
- Human Pose Estimation: Further validation was performed on the MPII Human Pose benchmark, applying feedback mechanisms on top of a state-of-the-art Hourglass network. This approach achieved notable improvements in joint localization accuracy, showcasing the flexibility of feedback architectures in regression tasks.
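Training a network that predicts at every iteration requires supervising each iteration's output. A hedged sketch of such an objective is a weighted sum of cross-entropy losses, one per feedback iteration, with later (finer) predictions weighted more heavily; the exponential weighting `gamma` and the function name are illustrative assumptions, not values from the paper.

```python
import numpy as np

def episodic_loss(preds, target, gamma=0.5):
    """Cross-entropy summed over iterations, weighting later predictions
    more heavily so early iterations may solve an easier, coarser version
    of the task — an episodic curriculum within one inference pass."""
    T = len(preds)
    weights = np.array([gamma ** (T - 1 - t) for t in range(T)])  # last weight = 1
    ces = np.array([-np.log(p[target] + 1e-12) for p in preds])
    return float(np.sum(weights * ces))

# Two-iteration toy example: the prediction sharpens toward class 0.
preds = [np.array([0.5, 0.5]), np.array([0.9, 0.1])]
loss = episodic_loss(preds, target=0)
print(round(loss, 4))  # 0.4519
```

Because every iteration receives a (down-weighted) gradient signal, intermediate outputs are trained to be usable on their own, which is what makes early prediction meaningful at test time.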
Implications and Future Directions
This paper demonstrates the practicality of feedback in neural network architectures through an iterative prediction mechanism. The findings suggest promising avenues for future research, particularly in applications where early prediction is critical or hierarchical classification structures are beneficial. Hardware and software optimized for such feedback operations could further accentuate their advantages.
Future work could extend this research by exploring the integration of additional neural network design elements, such as stochastic depth and ensemble methods, to widen the applicability of feedback networks across diverse domains in machine learning and artificial intelligence. Such advancements would not only theoretically refine feedback networks but also substantially impact their adoption in real-world systems.
Overall, this examination affirms the merit of feedback mechanisms as a competitive and potent paradigm in the landscape of neural network design, paralleling or potentially surpassing the effectiveness of established feedforward models.