
In Context Learning with Vision Transformers: Case Study (2505.20872v1)

Published 27 May 2025 in cs.CV, cs.AI, and cs.LG

Abstract: Large transformer models have been shown to be capable of performing in-context learning: given examples in a prompt together with a query, they can perform few-shot, one-shot, or zero-shot learning to output the corresponding answer to that query. Of particular interest to us, these transformer models have been shown to be capable of learning general classes of functions, such as linear functions and small 2-layer neural networks, on random data (Garg et al., 2023). We aim to extend this to the image space and analyze their capability to in-context learn more complex functions on images, such as convolutional neural networks.
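
To make the setup concrete, below is a minimal sketch of the kind of in-context function-learning prompt the abstract refers to, in the style of Garg et al. (2023): a hidden function is sampled per task, input-output examples are interleaved into one sequence, and the model must predict the output for a final query input. The function name, dimensions, and token encoding here are illustrative assumptions, not the paper's actual code.

```python
# Sketch of an in-context prompt for learning a random linear function,
# assuming a Garg et al. (2023)-style formulation. Names and shapes are
# illustrative, not taken from the paper.
import torch

def make_linear_task_prompt(n_examples=10, dim=8):
    """Sample a hidden linear function f(x) = w . x and build a prompt
    (x_1, f(x_1), ..., x_k, f(x_k), x_query); the model must infer f
    from the in-context examples and predict f(x_query)."""
    w = torch.randn(dim)                    # hidden task: random linear weights
    xs = torch.randn(n_examples + 1, dim)   # last row is the query input
    ys = xs @ w                             # targets f(x_i) for every input

    prompt = []
    for x, y in zip(xs[:-1], ys[:-1]):
        prompt.append(x)                                 # input token
        prompt.append(torch.full((dim,), y.item()))      # target broadcast to a token
    prompt.append(xs[-1])                   # query token ends the prompt
    query_target = ys[-1]                   # held out; to be predicted in context
    return torch.stack(prompt), query_target
```

A transformer trained on many such prompts, each with a freshly sampled w, can then be evaluated on whether it recovers the underlying function for unseen weights; the extension the abstract describes would replace the random linear f with image-space functions such as small convolutional networks.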
