Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Synthetic dataset of ID and Travel Document (2401.01858v1)

Published 3 Jan 2024 in cs.CV

Abstract: This paper presents a new synthetic dataset of ID and travel documents, called SIDTD. The SIDTD dataset is created to help training and evaluating forged ID documents detection systems. Such a dataset has become a necessity as ID documents contain personal information and a public dataset of real documents can not be released. Moreover, forged documents are scarce, compared to legit ones, and the way they are generated varies from one fraudster to another resulting in a class of high intra-variability. In this paper we trained state-of-the-art models on this dataset and we compare them to the performance achieved in larger, but private, datasets. The creation of this dataset will help to document image analysis community to progress in the task of ID document verification.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (14)
  1. Catherine De Bolle and et al. Internet organised crime thread assesment (iocta). EUROPOL, 2020.
  2. MIDV-2020: A comprehensive benchmark dataset for identity document analysis. CoRR, abs/2107.00396, 2021.
  3. Synthetic id card image generation for improving presentation attack detection, 2022.
  4. Information technology - biometric presentation attack detection - part 1: Framework. Technical report.
  5. Smartdoc 2017 video capture: Mobile document acquisition in video mode. In 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), volume 4, pages 11–16. IEEE, 2017.
  6. A style-based generator architecture for generative adversarial networks. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 4401–4410, 2019.
  7. Efficientnet: Rethinking model scaling for convolutional neural networks. In International conference on machine learning, pages 6105–6114.
  8. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
  9. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929, 2020.
  10. TransFG: A transformer architecture for fine-grained recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 852–860, 2022.
  11. Recurrent comparator with attention models to detect counterfeit documents. In 2019 International Conference on Document Analysis and Recognition, ICDAR 2019, Sydney, Australia, September 20-25, 2019, pages 1332–1337. IEEE, 2019.
  12. Deep coattention-based comparator for relative representation learning in person re-identification. IEEE transactions on neural networks and learning systems, 32(2):722–735, 2020.
  13. Attention is all you need. Advances in neural information processing systems, 30, 2017.
  14. Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee, 2009.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com