Papers

Topics

Authors

Recent

View all

Assistant

AI Research Assistant

Well-researched responses based on relevant abstracts and paper content.

Custom Instructions Pro

Preferences or requirements that you'd like Emergent Mind to consider when generating responses.

Gemini 2.5 Flash

Gemini 2.5 Flash 89 tok/s

Gemini 2.5 Pro 53 tok/s Pro

GPT-5 Medium 26 tok/s Pro

GPT-5 High 25 tok/s Pro

GPT-4o 93 tok/s Pro

Kimi K2 221 tok/s Pro

GPT OSS 120B 457 tok/s Pro

Claude Sonnet 4 38 tok/s Pro

2000 character limit reached

A filter based approach for inbetweening (1706.03497v1)

Published 12 Jun 2017 in cs.CV and cs.GR

Abstract: We present a filter based approach for inbetweening. We train a convolutional neural network to generate intermediate frames. This network aim to generate smooth animation of line drawings. Our method can process scanned images directly. Our method does not need to compute correspondence of lines and topological changes explicitly. We experiment our method with real animation production data. The results show that our method can generate intermediate frames partially.

Citations (5)

View on Semantic Scholar

Summary

The paper presents a novel CNN architecture that processes rasterized line drawings without needing explicit line correspondences to generate intermediate frames.
It employs a dual-resolution strategy with a weighted mean squared error loss to reduce computational load while preserving image detail.
Experiments on TV animation data demonstrated a fourfold frame increase for similar inputs, highlighting its potential in high frame rate productions.

A Filter-Based Approach for Inbetweening

This paper presents an innovative approach to the animation task known as "inbetweening," which is the process of generating intermediate frames between two given frames to create a smooth transition. The central contribution of this research is the development and application of a convolutional neural network (CNN) that is specifically designed to operate on rasterized line drawing data for generating these intermediate frames.

The proposed method offers several technical advancements over existing methodologies. Notably, it does not require the explicit computation of line correspondence or handle topological changes between frames, which often complicates traditional approaches. This simplification is made possible through the architecture of the CNN and the training strategy adopted. The network accommodates both low-resolution and high-resolution images, effectively reducing computational burdens while maintaining output quality.

Key Methodological Features:

The CNN comprises two distinct networks: a low-resolution network that processes downscaled inputs, and a high-resolution network for the final output, thus balancing computational demand with resolution fidelity.
The loss function employed is a weighted mean squared error, with specific weightings designed to emphasize the fidelity of line regions in the target image, enhancing the effectiveness of the training process.
Extensive data augmentation techniques, including random translations and rotations, are employed to improve the robustness of the network for various frame inputs.

Results and Experiments:

The proposed CNN was evaluated using real-world animation data from television productions. It successfully demonstrated its capability to generate intermediate frames when input frames were sufficiently similar, as evidenced by increased frame counts by approximately fourfold for some sequences. The method was precise enough to be partially effective, although the authors acknowledge its limitations in handling significantly dissimilar input frames comprehensively.

Implications and Future Work:

Despite its contributions, the method's applicability faces certain restrictions. It may not directly reduce the workload of artists in traditional inbetweening tasks, primarily because of its partial success with dissimilar inputs and the current inability to handle complex transformations and artistic nuances. However, it holds potential utility in projects requiring high frame rate animations, such as those with requirements exceeding 24 frames per second or those involving slow-motion effects.

Future research directions suggested by the authors include optimizing data augmentation techniques by incorporating motion information to reduce redundancy and adjusting the CNN channel numbers to balance training progression and output quality better. Incorporating extensive training on a larger variety of cuts for enhanced generalization capabilities is also recommended.

This research underscores a potentially significant shift in computational animation, especially as advancements in CNN architectures continue to evolve, potentially leading to more sophisticated and versatile applications in animation and related fields.