Vision-Language Foundation Models

How to Efficiently Adapt Foundation Models to Remote Sensing for Object Detection?

Vision-Language Models (VLMs) trained on millions of images excel at generalization in various domains. However, a significant …

Clément Barbier, Baptiste Abeloos, Stéphane Herbin