
G^3: Geolocation via Guidebook Grounding (2211.15521v1)

Published 28 Nov 2022 in cs.CV and cs.CL

Abstract: We demonstrate how language can improve geolocation: the task of predicting the location where an image was taken. Here we study explicit knowledge from human-written guidebooks that describe the salient and class-discriminative visual features humans use for geolocation. We propose the task of Geolocation via Guidebook Grounding that uses a dataset of StreetView images from a diverse set of locations and an associated textual guidebook for GeoGuessr, a popular interactive geolocation game. Our approach predicts a country for each image by attending over the clues automatically extracted from the guidebook. Supervising attention with country-level pseudo labels achieves the best performance. Our approach substantially outperforms a state-of-the-art image-only geolocation method, with an improvement of over 5% in Top-1 accuracy. Our dataset and code can be found at https://github.com/g-luo/geolocation_via_guidebook_grounding.
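The attention step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes the image and each guidebook clue are already embedded as fixed-length vectors, uses plain scaled dot-product attention, and assumes each clue carries a pseudo label given by the set of countries it mentions. The function names `attend` and `attention_supervision_loss` are hypothetical, as is the exact form of the loss.

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of floats.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(image_vec, clue_vecs):
    # Scaled dot-product attention of one image embedding over clue embeddings.
    # Assumption: image and clue embeddings share the same dimension d.
    d = len(image_vec)
    scores = [
        sum(i * c for i, c in zip(image_vec, clue)) / math.sqrt(d)
        for clue in clue_vecs
    ]
    weights = softmax(scores)
    # Attention-weighted sum of the clue embeddings.
    context = [
        sum(w * clue[k] for w, clue in zip(weights, clue_vecs))
        for k in range(d)
    ]
    return weights, context

def attention_supervision_loss(weights, clue_countries, true_country):
    # Hypothetical pseudo-label loss: negative log of the attention mass
    # placed on clues whose pseudo label includes the image's country.
    matching = [w for w, cs in zip(weights, clue_countries) if true_country in cs]
    if not matching:
        return 0.0  # no clue mentions this country, so nothing to supervise
    return -math.log(sum(matching))

# Toy data (made up for illustration only).
image_vec = [1.0, 0.0]
clue_vecs = [[2.0, 0.0], [0.0, 2.0]]
clue_countries = [{"Japan"}, {"Brazil"}]

weights, context = attend(image_vec, clue_vecs)
loss_japan = attention_supervision_loss(weights, clue_countries, "Japan")
loss_brazil = attention_supervision_loss(weights, clue_countries, "Brazil")
```

In a full model the `context` vector would typically be combined with the image features and passed to a country classifier, with the attention loss added to the classification loss; those details are left out here because the abstract does not specify them.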

Authors (5)
  1. Grace Luo (11 papers)
  2. Giscard Biamby (8 papers)
  3. Trevor Darrell (324 papers)
  4. Daniel Fried (69 papers)
  5. Anna Rohrbach (53 papers)
Citations (6)
