Reconstructing the Unseen: GRIOT for Attributed Graph Imputation with Optimal Transport

Richard Serrano; Charlotte Laclau; Baptiste Jeudy; Christine Largeron

Pré-Publication, Document De Travail (Working Paper) Année : 2024

Reconstructing the Unseen: GRIOT for Attributed Graph Imputation with Optimal Transport

Reconstruire l'invisible: GRIOT pour l'Imputation de Graphes Attribués par Transport Optimal

(1) , (2, 3) , (1) , (1)

1
2
3

Richard Serrano

Fonction : Auteur

Laboratoire Hubert Curien

Charlotte Laclau

Fonction : Auteur
PersonId : 1043377
IdHAL : charlotte-laclau

Signal, Statistique et Apprentissage

Département Images, Données, Signal

Baptiste Jeudy

Fonction : Auteur

Laboratoire Hubert Curien

Christine Largeron

Fonction : Auteur

Laboratoire Hubert Curien

Résumé

In recent years, there has been a significant surge in machine learning techniques, particularly in the domain of deep learning, tailored for handling attributed graphs. Nevertheless, to work, these methods assume that the attribute values are fully known, which is not realistic in numerous real-world applications. This paper explores the potential of Optimal Transport (OT) to impute missing attribute values on graphs. To proceed, we design a novel multi-view OT loss function that can encompass both node feature data and the underlying topological structure of the graph by utilizing multiple graph representations. We then utilize this novel loss to train efficiently a Graph Convolutional Neural Network (GCN) architecture capable of imputing all missing values over the graph at once. We evaluate the interest of our approach with experiments both on synthetic data and real-world graphs, including different missingness mechanisms and a wide range of missing data. These experiments demonstrate that our method is competitive with the state-of-the-art in all cases and of particular interest on weakly homophilic graphs.

Ces dernières années, les techniques d’apprentissage automatique ont connu un essor considérable, en particulier dans le domaine de l’apprentissage profond, notamment adaptées à la gestion des graphes attribués. Néanmoins, pour fonctionner, ces méthodes supposent que les valeurs des attributs sont entièrement connues, ce qui n’est pas réaliste dans de nombreuses applications du monde réel. Cet article explore le potentiel du transport optimal (TO) pour imputer les valeurs d'attributs manquantes sur les graphes. Pour ce faire, nous concevons une nouvelle fonction de perte basée sur du TO multi-vues qui peut englober à la fois les attributs des nœuds et la structure topologique sous-jacente du graphe à travers plusieurs représentations (vues) de ce dernier. Nous utilisons ensuite cette nouvelle fonction de perte pour former efficacement une architecture de réseau de neurones convolutif en graphes (GCN) capable d'imputer simultanément toutes les valeurs manquantes du graphe. Nous évaluons l'intérêt de notre approche avec des expérimentations à la fois sur des données synthétiques et sur des graphiques du monde réel, incluant différents mécanismes et une grande plage de données manquantes. Ces expériences démontrent que notre méthode est compétitive avec l’état de l’art dans tous les cas et particulièrement intéressante sur les graphes faiblement homophiles.

Mots clés

Attributed Graph Missing Data Imputation Optimal Transport Attributed Graph Missing Data Imputation Optimal Transport

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

ECML_PKDD_Richard.pdf (1.15 Mo)

Origine	Fichiers produits par l'(les) auteur(s)

Richard Serrano : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04604650

Soumis le : mardi 11 juin 2024-14:11:33

Dernière modification le : lundi 17 juin 2024-14:44:27

Dates et versions

hal-04604650 , version 1 (11-06-2024)

Licence

Paternité

Identifiants

HAL Id : hal-04604650 , version 1

Citer

Richard Serrano, Charlotte Laclau, Baptiste Jeudy, Christine Largeron. Reconstructing the Unseen: GRIOT for Attributed Graph Imputation with Optimal Transport. 2024. ⟨hal-04604650⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-ST-ETIENNE INSTITUT-TELECOM IOGS CNRS LAHC LTCI IDS S2A IP_PARIS UDL LABORATOIRE-HUBERT-CURIEN

0 Consultations

0 Téléchargements

Reconstructing the Unseen: GRIOT for Attributed Graph Imputation with Optimal Transport

Reconstruire l'invisible: GRIOT pour l'Imputation de Graphes Attribués par Transport Optimal

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Partager