aerial-d
Collection
Generalized Referring Expression Segmentation on Aerial Photos
•
5 items
•
Updated
QLoRA-finetuned Gemma3-12B for generating natural referring expressions in aerial imagery. Distilled from 500 OpenAI o3 samples (~238× cheaper than direct o3 usage).
google/gemma-3-12b-itUsed in the dataset generation pipeline to enhance rule-based expressions:
# Start vLLM server
vllm serve luisml77/gemma-aerial-12b --port 8000
# Run enhancement (in another terminal)
cd datagen
python pipeline/7_vllm_enhance.py
Full pipeline at GitHub.
python gemma3_lora_finetune.py \
--enhanced_data_dir enhanced_annotations_o3_dual \
--output_dir ./gemma-aerial-12b \
--lora_r 64 --lora_alpha 16
@article{marnoto2025aeriald,
title={Generalized Referring Expression Segmentation on Aerial Photos},
author={Marnoto, Luís Pedro Soares},
journal={IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing (J-STARS)},
year={2025},
note={Submitted}
}