Attribute Guidance With Inherent Pseudo-label For Occluded Person Re-identification

2508.04998v1 cs.CV 2025-08-09

Authors:

Rui Zhi, Zhen Yang, Haiyang Zhang

Summary

**Summary** Person re-identification (Re-ID) is the task of matching images of people captured by different cameras. Occluded Re-ID focuses on scenarios in which part of a person's body is hidden. Conventional use of pre-trained vision-language models struggles in these cases because the models attend only to global semantic features, which lowers recognition accuracy when visibility is insufficient or limited. We propose Attribute-Guide ReID (AG-ReID), a new model that uses existing pre-trained models to extract fine-grained attributes efficiently, without additional data or annotation. AG-ReID works in two stages: it first generates pseudo-labels for attributes, and then applies a dual-guidance strategy that weights global and fine-grained features. We evaluated AG-ReID on several popular Re-ID datasets, showing its advantage in handling occluded body parts and subtle features while maintaining strong performance in the remaining scenarios.

Abstract

Person re-identification (Re-ID) aims to match person images across different camera views, with occluded Re-ID addressing scenarios where pedestrians are partially visible. While pre-trained vision-language models have shown effectiveness in Re-ID tasks, they face significant challenges in occluded scenarios by focusing on holistic image semantics while neglecting fine-grained attribute information. This limitation becomes particularly evident when dealing with partially occluded pedestrians or when distinguishing between individuals with subtle appearance differences. To address this limitation, we propose Attribute-Guide ReID (AG-ReID), a novel framework that leverages pre-trained models' inherent capabilities to extract fine-grained semantic attributes without additional data or annotations. Our framework operates through a two-stage process: first generating attribute pseudo-labels that capture subtle visual characteristics, then introducing a dual-guidance mechanism that combines holistic and fine-grained attribute information to enhance image feature extraction. Extensive experiments demonstrate that AG-ReID achieves state-of-the-art results on multiple widely-used Re-ID datasets, showing significant improvements in handling occlusions and subtle attribute differences while maintaining competitive performance on standard Re-ID scenarios.
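
The two-stage idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify how pseudo-labels are produced or how the two kinds of guidance are fused. The sketch assumes that image and attribute-prompt embeddings come from some pre-trained vision-language model, that a pseudo-label is chosen by cosine similarity, and that guidance is a simple convex combination. The names `pseudo_label`, `dual_guidance`, and `alpha`, and all embedding values, are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def pseudo_label(image_emb, attribute_embs):
    """Stage 1 (assumed): pick the attribute whose text embedding
    is most similar to the image embedding."""
    scores = {name: cosine(image_emb, emb) for name, emb in attribute_embs.items()}
    return max(scores, key=scores.get), scores


def dual_guidance(holistic_emb, attribute_emb, alpha=0.5):
    """Stage 2 (assumed): fuse holistic and attribute guidance
    as a convex combination weighted by alpha."""
    return [alpha * h + (1.0 - alpha) * a for h, a in zip(holistic_emb, attribute_emb)]


# Toy, hand-written embeddings; a real system would obtain these
# from a pre-trained vision-language model.
image_emb = [0.9, 0.1, 0.0]
attribute_embs = {
    "red jacket": [1.0, 0.0, 0.0],
    "backpack": [0.0, 1.0, 0.0],
}

label, scores = pseudo_label(image_emb, attribute_embs)
guidance = dual_guidance([0.5, 0.5, 0.0], attribute_embs[label], alpha=0.5)
print(label, guidance)
```

In this toy setup the attribute with the highest similarity becomes the pseudo-label, and `alpha` controls how much the holistic description contributes relative to the attribute. In a real model the fused vector would condition image feature extraction rather than being printed.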
