ConGeo: Robust Cross-View Geo-Localization Across Ground View Variations

Mi, Li; Xu, Chang; Castillo-Navarro, Javiera; Montariol, Syrielle; Yang, Wen; Bosselut, Antoine; Tuia, Devis

doi:10.1007/978-3-031-72630-9_13

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 15072)

Included in the following conference series: European Conference on Computer Vision

Abstract

Cross-view geo-localization aims at localizing a ground-level query image by matching it to its corresponding geo-referenced aerial view. In real-world scenarios, the task requires accommodating diverse ground images captured by users with varying orientations and reduced fields of view (FoVs). However, existing learning pipelines are orientation-specific or FoV-specific, demanding separate model training for different ground view variations. Such models heavily depend on the North-aligned spatial correspondence and predefined FoVs in the training data, compromising their robustness across different settings. To tackle this challenge, we propose ConGeo, a single- and cross-view Contrastive method for Geo-localization: it enhances robustness and consistency in feature representations to improve a model’s invariance to orientation and its resilience to FoV variations, by enforcing proximity between ground view variations of the same location. As a generic learning objective for cross-view geo-localization, when integrated into state-of-the-art pipelines, ConGeo significantly boosts the performance of three base models on four geo-localization benchmarks for diverse ground view variations and outperforms competing methods that train separate models for each ground view variation.
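
The idea of enforcing proximity between ground view variations of the same location can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the temperature value, the use of a symmetric InfoNCE loss, and the way the three terms are summed are all assumptions made for illustration. A ground view variation is simulated here by circularly shifting a 360-degree panorama (unknown orientation) and cropping it (limited FoV).

```python
import numpy as np

def ground_view_variation(pano, shift_deg, fov_deg):
    """Simulate an unknown orientation and a limited FoV (illustrative only).

    pano: array of shape (H, W, C) whose width W spans 360 degrees.
    """
    _, w, _ = pano.shape
    shift_px = int(round(shift_deg / 360.0 * w))
    rotated = np.roll(pano, shift_px, axis=1)      # circular shift = rotation
    fov_px = max(1, int(round(fov_deg / 360.0 * w)))
    return rotated[:, :fov_px]                     # keep the first fov_deg degrees

def l2_normalize(x, eps=1e-12):
    return x / (np.linalg.norm(x, axis=1, keepdims=True) + eps)

def info_nce(a, b, temperature=0.1):
    """Symmetric InfoNCE: row i of `a` and row i of `b` form the positive pair."""
    a, b = l2_normalize(a), l2_normalize(b)
    logits = a @ b.T / temperature

    def cross_entropy_on_diagonal(l):
        l = l - l.max(axis=1, keepdims=True)       # numerical stability
        log_prob = l - np.log(np.exp(l).sum(axis=1, keepdims=True))
        return -np.mean(np.diag(log_prob))

    return 0.5 * (cross_entropy_on_diagonal(logits)
                  + cross_entropy_on_diagonal(logits.T))

def combined_contrastive_loss(ground, ground_var, aerial, temperature=0.1):
    """Hypothetical combination of cross-view and single-view terms.

    ground, ground_var, aerial: (N, D) embeddings; row i in each array
    belongs to the same location.
    """
    cross_view = info_nce(ground, aerial, temperature)
    cross_view_var = info_nce(ground_var, aerial, temperature)
    single_view = info_nce(ground, ground_var, temperature)
    return cross_view + cross_view_var + single_view
```

In this sketch the single-view term pulls the embedding of a ground image toward the embedding of its rotated or cropped variation, while the cross-view terms pull both toward the aerial embedding of the same location. The embeddings would come from whichever base model the objective is attached to.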

L. Mi and C. Xu—Equal contribution.

Notes

1. Thanks to its flexible learning objective, ConGeo can be plugged into other existing geo-localization models. In this section, we use Sample4Geo [4] as an example. In the experiments, we also plug ConGeo into TransGeo [34] and SAIG-D [37].

References

1. Arularasu, A., Kulkarni, P.P., Nayak, G.K., Shah, M.: Robust image geolocalization. Technical report (2023)
2. Bromley, J., Guyon, I., LeCun, Y., Säckinger, E., Shah, R.: Signature verification using a “SIAMESE” time delay neural network. In: Advances in Neural Information Processing Systems, vol. 6 (1993)
3. Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations. In: International Conference on Machine Learning (2020)
4. Deuser, F., Habel, K., Oswald, N.: Sample4Geo: hard negative sampling for cross-view geo-localisation. In: IEEE International Conference on Computer Vision, pp. 16847–16856 (2023)
5. Fervers, F., Bullinger, S., Bodensteiner, C., Arens, M., Stiefelhagen, R.: Uncertainty-aware vision-based metric cross-view geolocalization. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 21621–21631 (2023)
6. Guo, Y., Choi, M., Li, K., Boussaid, F., Bennamoun, M.: Soft exemplar highlighting for cross-view image-based geo-localization. IEEE Trans. Image Process. 31, 2094–2105 (2022)
7. He, K., Fan, H., Wu, Y., Xie, S., Girshick, R.: Momentum contrast for unsupervised visual representation learning. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 9729–9738 (2020)
8. Hu, S., Feng, M., Nguyen, R.M., Lee, G.H.: CVM-Net: cross-view matching network for image-based ground-to-aerial geo-localization. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 7258–7267 (2018)
9. Khosla, P., et al.: Supervised contrastive learning. In: Advances in Neural Information Processing Systems (2020)
10. Li, A., Hu, H., Mirowski, P., Farajtabar, M.: Cross-view policy learning for street navigation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 8100–8109 (2019)
11. Liang, Z., Jiang, W., Hu, H., Zhu, J.: Learning to contrast the counterfactual samples for robust visual question answering. In: EMNLP, pp. 3285–3292 (2020)
12. Lin, T.Y., Belongie, S., Hays, J.: Cross-view image geolocalization. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 891–898 (2013)
13. Liu, L., Li, H.: Lending orientation to neural networks for cross-view geo-localization. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 5624–5633 (2019)
14. Liu, Z., Mao, H., Wu, C.Y., Feichtenhofer, C., Darrell, T., Xie, S.: A convnet for the 2020s. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 11976–11986 (2022)
15. Mu, N., Kirillov, A., Wagner, D., Xie, S.: SLIP: self-supervision meets language-image pre-training. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13686, pp. 529–544. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-19809-0_30
16. Oord, A.v.d., Li, Y., Vinyals, O.: Representation learning with contrastive predictive coding. arXiv preprint arXiv:1807.03748 (2018)
17. Peyré, G., Cuturi, M., et al.: Computational optimal transport: with applications to data science. Found. Trends Mach. Learn. 11(5–6), 355–607 (2019)
18. Radford, A., et al.: Learning transferable visual models from natural language supervision. In: International Conference on Machine Learning, pp. 8748–8763 (2021)
19. Rodrigues, R., Tani, M.: Global assists local: Effective aerial representations for field of view constrained image geo-localization. In: IEEE Workshops on Applications of Computer Vision, pp. 3871–3879 (2022)
20. Shi, Y., Liu, L., Yu, X., Li, H.: Spatial-aware feature aggregation for image based cross-view geo-localization. In: Advances in Neural Information Processing Systems, vol. 32 (2019)
21. Shi, Y., Yu, X., Campbell, D., Li, H.: Where am I looking at? Joint location and orientation estimation by cross-view matching. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 4064–4072 (2020)
22. Shi, Y., Yu, X., Liu, L., Zhang, T., Li, H.: Optimal feature transport for cross-view image geo-localization. In: AAAI Conference on Artificial Intelligence, vol. 34, pp. 11990–11997 (2020)
23. Tian, Y., Sun, C., Poole, B., Krishnan, D., Schmid, C., Isola, P.: What makes for good views for contrastive learning? Adv. Neural. Inf. Process. Syst. 33, 6827–6839 (2020)
24. Wang, P., Han, K., Wei, X.S., Zhang, L., Wang, L.: Contrastive learning based hybrid networks for long-tailed image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 943–952 (2021)
25. Wang, T., Zheng, Z., Yan, C., Zhang, J., Sun, Y., Zheng, B., Yang, Y.: Each part matters: local patterns facilitate cross-view geo-localization. IEEE Trans. Circuits Syst. Video Technol. 32(2), 867–879 (2021)
26. Workman, S., Souvenir, R., Jacobs, N.: Wide-area image geolocalization with aerial reference imagery. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3961–3969 (2015)
27. Xia, Z., Booij, O., Manfredi, M., Kooij, J.F.: Visual cross-view metric localization with dense uncertainty estimates. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds.) ECCV 2022. LNCS, vol. 13699, pp. 90–106. Springer, Cham (2022). https://doi.org/10.1007/978-3-031-19842-7_6
28. Xie, E., et al.: DETCO: unsupervised contrastive learning for object detection. In: IEEE International Conference on Computer Vision, pp. 8392–8401 (2021)
29. Yang, H., Lu, X., Zhu, Y.: Cross-view geo-localization with layer-to-layer transformer. Adv. Neural. Inf. Process. Syst. 34, 29009–29020 (2021)
30. Zamir, A.R., Shah, M.: Accurate image localization based on google maps street view. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 255–268. Springer, Heidelberg (2010). https://doi.org/10.1007/978-3-642-15561-1_19
31. Zhai, M., Bessinger, Z., Workman, S., Jacobs, N.: Predicting ground-level scene layout from aerial imagery. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 867–875 (2017)
32. Zhang, X., Li, X., Sultani, W., Zhou, Y., Wshah, S.: Cross-view geo-localization via learning disentangled geometric layout correspondence. In: AAAI Conference on Artificial Intelligence, vol. 37, pp. 3480–3488 (2023)
33. Zheng, Z., Wei, Y., Yang, Y.: University-1652: a multi-view multi-source benchmark for drone-based geo-localization. In: ACM International Conference on Multimedia, pp. 1395–1403 (2020)
34. Zhu, S., Shah, M., Chen, C.: TransGeo: transformer is all you need for cross-view image geo-localization. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1162–1171 (2022)
35. Zhu, S., Yang, T., Chen, C.: Revisiting street-to-aerial view image geo-localization and orientation estimation. In: IEEE Workshops on Applications of Computer Vision, pp. 756–765 (2021)
36. Zhu, S., Yang, T., Chen, C.: VIGOR: cross-view image geo-localization beyond one-to-one retrieval. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3640–3649 (2021)
37. Zhu, Y., Yang, H., Lu, Y., Huang, Q.: Simple, effective and general: a new backbone for cross-view image geo-localization. arXiv preprint arXiv:2302.01572 (2023)

Acknowledgments

We thank the anonymous reviewers for their constructive and thoughtful comments. We thank Haoyuan Li, Zimin Xia, Gencer Sümbül, Silin Gao, Valérie Zermatten, Zeming Chen, Tianqing Fang, Gaston Lenczner, Kuangyi Chen, Robin Zbinden, Giacomo May, Riccardo Ricci, Emanuele Dalsasso, and Sepideh Mamooler for providing helpful feedback on earlier versions of this work. We acknowledge the support from the CSC and EPFL Science Seed Fund and the support in part by the National Natural Science Foundation of China (NSFC) under Grant 62271355. AB gratefully acknowledges the support of the Swiss National Science Foundation (No. 215390), Innosuisse (PFFS-21-29), the EPFL Center for Imaging, Sony Group Corporation, and the Allen Institute for AI.

Author information

Authors and Affiliations

EPFL, Lausanne, Switzerland
Li Mi, Chang Xu, Javiera Castillo-Navarro, Syrielle Montariol, Antoine Bosselut & Devis Tuia
Wuhan University, Wuhan, China
Chang Xu & Wen Yang

Corresponding author

Correspondence to Chang Xu.

Editor information

Editors and Affiliations

University of Birmingham, Birmingham, UK
Aleš Leonardis
University of Trento, Trento, Italy
Elisa Ricci
Technical University of Darmstadt, Darmstadt, Germany
Stefan Roth
Princeton University, Princeton, NJ, USA
Olga Russakovsky
Czech Technical University in Prague, Prague, Czech Republic
Torsten Sattler
École des Ponts ParisTech, Marne-la-Vallée, France
Gül Varol

1 Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (pdf 23414 KB)

Cite this paper

Mi, L. et al. (2025). ConGeo: Robust Cross-View Geo-Localization Across Ground View Variations. In: Leonardis, A., Ricci, E., Roth, S., Russakovsky, O., Sattler, T., Varol, G. (eds) Computer Vision – ECCV 2024. ECCV 2024. Lecture Notes in Computer Science, vol 15072. Springer, Cham. https://doi.org/10.1007/978-3-031-72630-9_13

DOI: https://doi.org/10.1007/978-3-031-72630-9_13
Published: 05 December 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-72629-3
Online ISBN: 978-3-031-72630-9
eBook Packages: Computer Science, Computer Science (R0)
