Video generalized semantic segmentation via Non-Salient Feature Reasoning and Consistency

Yuhang Zhang; Zhengyu Zhang; Muxin Liao; Shishun Tian; Rong You; Wenbin Zou; Chen Xu

doi:10.1016/j.knosys.2024.111584

Article Dans Une Revue Knowledge-Based Systems Année : 2024

Video generalized semantic segmentation via Non-Salient Feature Reasoning and Consistency

(1) , (2) , (1) , (1) , (1) , (1) , (1)

1
2

Yuhang Zhang

Fonction : Auteur
PersonId : 1315374
ORCID : 0000-0003-3825-0796

Shenzhen University [Shenzhen]

Zhengyu Zhang

Fonction : Auteur
PersonId : 1364106
ORCID : 0000-0001-8066-4226

Institut d'Électronique et des Technologies du numéRique

Muxin Liao

Fonction : Auteur
PersonId : 1294445
ORCID : 0000-0002-8461-1946

Shenzhen University [Shenzhen]

Shishun Tian

Fonction : Auteur
PersonId : 1294407
ORCID : 0000-0002-7616-8382

Shenzhen University [Shenzhen]

Rong You

Fonction : Auteur

Shenzhen University [Shenzhen]

Wenbin Zou

Fonction : Auteur correspondant
PersonId : 1294408
ORCID : 0000-0003-1389-9089

Shenzhen University [Shenzhen]

Chen Xu

Fonction : Auteur
PersonId : 1066142
ORCID : 0000-0001-5041-0532

Shenzhen University [Shenzhen]

Résumé

Video semantic segmentation is beneficial for dynamic scene processing in real-world environments, and achieves superior performance on independent and identically distributed data. However, it suffers from performance degradation in environments with various domain styles, which is known as the distribution shift problem. Although some previous studies on image generalized semantic segmentation considered the distribution shift problem, temporal-frame information could not be used to obtain more accurate prediction. Thus, in this study, we explore a new task, known as the video generalized semantic segmentation (VGSS) task, which establishes a connection between continuous frames and domain generalization. We propose a novel method named Non-Salient Feature Reasoning and Consistency (NSFRC) for this task. Specifically, we first define the class-wise non-salient feature, which describes the features of the class-wise non-salient region that carry more generalized information. We then propose a class-wise non-salient feature reasoning strategy to select and enhance generalized channels adaptively. This strategy adopts a new form to use domain-invariant features by treating the domain-invariant features as prior information to assist domain-invariant model learning. Finally, we propose a non-salient centroid alignment loss to alleviate the temporally inconsistent and negative transfer problems in the VGSS task. We also extend our video-based framework to the image generalized semantic segmentation (IGSS) task. Experiments demonstrate that our NSFRC framework yields significant improvements in both the VGSS and IGSS tasks. To explain the idea of this research in a clear and attractive way, we provide the visual abstract shown in Fig. 1.

Mots clés

Semantic segmentation Video domain generalization Non-salient region Class-wise relationship reasoning Domain-invariant feature

Domaines

Sciences de l'ingénieur [physics]

Fichier sous embargo

0	―	4	―	2
Année		Mois		Jours

Avant la publication
jeudi 29 août 2024

Laurent Jonchère : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04506025

Soumis le : jeudi 28 mars 2024-15:13:46

Dernière modification le : jeudi 28 mars 2024-15:13:46

Dates et versions

hal-04506025 , version 1 (28-03-2024)

Licence

Paternité - Pas d'utilisation commerciale

Identifiants

HAL Id : hal-04506025 , version 1
DOI : 10.1016/j.knosys.2024.111584

Citer

Yuhang Zhang, Zhengyu Zhang, Muxin Liao, Shishun Tian, Rong You, et al.. Video generalized semantic segmentation via Non-Salient Feature Reasoning and Consistency. Knowledge-Based Systems, 2024, Knowledge-Based Systems, 292, pp.111584. ⟨10.1016/j.knosys.2024.111584⟩. ⟨hal-04506025⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INSA-RENNES IETR CENTRALESUPELEC UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM NANTES-UNIVERSITE

0 Consultations

0 Téléchargements

Video generalized semantic segmentation via Non-Salient Feature Reasoning and Consistency

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager