Semantic Segmentation using Foundation Models for Cultural Heritage: an Experimental Study on Notre-Dame de Paris

Kévin Réby; Anaïs Guillem; Livio De Luca

Conference paper, Year: 2023

Kévin Réby

Role: Author
PersonId: 1304946
IdHAL: kevin-reby
ORCID: 0009-0000-6823-280X

Modèles et simulations pour l'Architecture et le Patrimoine

Anaïs Guillem

Role: Author
PersonId: 1259682
IdHAL: anais-guillem
ORCID: 0000-0002-1473-7594

Modèles et simulations pour l'Architecture et le Patrimoine

Livio De Luca

Role: Author
PersonId: 1164075
IdHAL: livio-de-luca
ORCID: 0000-0003-0656-3165
IdRef: 115945512

Modèles et simulations pour l'Architecture et le Patrimoine

Abstract

The zero-shot performance of foundation models has captured a lot of attention. Specifically, the Segment Anything Model (SAM) has gained popularity in computer vision due to its label-free segmentation capabilities. Our study proposes using SAM on cultural heritage data, specifically images of Notre-Dame de Paris, with a controlled vocabulary. SAM can successfully identify objects within the cathedral. To further improve segmentation, we utilized Grounding DINO to detect objects and CLIP to automatically add labels from the segmentation masks generated by SAM. Our study demonstrates the usefulness of foundation models for zero-shot semantic segmentation of cultural heritage data.
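
The labeling step described in the abstract can be illustrated with a minimal sketch. It assumes each segmentation mask and each term of the controlled vocabulary has already been embedded in a shared space, as CLIP does for images and text, and assigns each mask the term with the highest cosine similarity. The vocabulary terms, the three-dimensional toy vectors, and the function names `cosine` and `label_masks` are hypothetical and chosen only for illustration; they are not taken from the paper, and no real SAM, Grounding DINO, or CLIP call is made.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def label_masks(mask_embeddings, vocab_embeddings):
    """Give each mask the vocabulary term whose embedding is most similar."""
    labels = []
    for emb in mask_embeddings:
        best = max(
            vocab_embeddings,
            key=lambda term: cosine(emb, vocab_embeddings[term]),
        )
        labels.append(best)
    return labels


# Toy vectors standing in for text embeddings of a controlled vocabulary
# (hypothetical terms and values, not the paper's vocabulary).
vocab = {
    "rose window": [1.0, 0.0, 0.0],
    "flying buttress": [0.0, 1.0, 0.0],
    "gargoyle": [0.0, 0.0, 1.0],
}

# Toy vectors standing in for image embeddings of two mask crops.
masks = [[0.9, 0.1, 0.0], [0.1, 0.2, 0.8]]

print(label_masks(masks, vocab))  # ['rose window', 'gargoyle']
```

In a real pipeline the toy vectors would be replaced by embeddings of the image regions cropped from each mask and of the vocabulary terms; the nearest-term assignment itself stays the same.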

Domains

Computer Vision and Pattern Recognition [cs.CV]; Machine Learning [stat.ML]

Main file

ICCV workshop paper.pdf (1.38 MB)

Origin: Publisher files allowed on an open archive

Contributor: Ariane Néroulidis

https://hal.science/hal-04275484

Submitted on: Wednesday, 8 November 2023, 15:20:42

Last modified on: Friday, 5 April 2024, 10:25:46

Dates and versions

hal-04275484, version 1 (8 November 2023)

Identifiers

HAL Id: hal-04275484, version 1

Cite

Kévin Réby, Anaïs Guillem, Livio De Luca. Semantic Segmentation using Foundation Models for Cultural Heritage: an Experimental Study on Notre-Dame de Paris. 4th ICCV Workshop on Electronic Cultural Heritage, Computer Vision Foundation, Oct 2023, Paris, France. https://openaccess.thecvf.com/content/ICCV2023W/e-Heritage/html/Reby_Semantic_Segmentation_Using_Foundation_Models_for_Cultural_Heritage_an_Experimental_ICCVW_2023_paper.html. ⟨hal-04275484⟩

Collections

CNRS, MAP, CHANTIER-SCIENTIFIQUE-NDP, UPR-MAP

53 views

30 downloads
