Bipartite Graph Variational Auto-Encoder with Fair Latent Representation to Account for Sampling Bias in Ecological Networks

Citizen Science
Ecological network
Graph Neural Network
Hilbert-Schmidt Independence Criterion
Sampling Effect

Translating the fairness framework commonly considered in sociology in order to address sampling bias in ecology using graph embeddings.

Authors
Affiliations

Emre Anakok

Université Paris-Saclay, AgroParisTech, INRAE, UMR MIA Paris-Saclay, 91120, Palaiseau, France.

Pierre Barbillon

Université Paris-Saclay, AgroParisTech, INRAE, UMR MIA Paris-Saclay, 91120, Palaiseau, France.

Colin Fontaine

Centre d’Ecologie et des Sciences de la Conservation, MNHN, CNRS, SU, 43 rue Buffon, 75005 Paris, France

Elisa Thebault

Sorbonne Université, CNRS, IRD, INRAE, Université Paris Est Créteil, Université Paris Cité, Institute of Ecology and Environmental Sciences (iEES-Paris), 75005 Paris, France

Published

July 15, 2024

Abstract

We propose a method to represent bipartite networks using graph embeddings tailored to tackle the challenges of studying ecological networks, such as the ones linking plants and pollinators, where many covariates need to be accounted for, in particular to control for sampling bias. We adapt the variational graph auto-encoder approach to the bipartite case, which enables us to generate embeddings in a latent space where the two sets of nodes are positioned based on their probability of connection. We translate the fairness framework commonly considered in sociology in order to address sampling bias in ecology. By incorporating the Hilbert-Schmidt independence criterion (HSIC) as an additional penalty term in the loss we optimize, we ensure that the structure of the latent space is independent of continuous variables, which are related to the sampling process. Finally, we show how our approach can change our understanding of ecological networks when applied to the Spipoll data set, a citizen science monitoring program of plant-pollinator interactions to which many observers contribute, making it prone to sampling bias.

Link to preprint

GitHub Repository

Citation

BibTeX citation:
@online{anakok2024,
  author = {Anakok, Emre and Barbillon, Pierre and Fontaine, Colin and
    Thebault, Elisa},
  title = {Bipartite {Graph} {Variational} {Auto-Encoder} with {Fair}
    {Latent} {Representation} to {Account} for {Sampling} {Bias} in
    {Ecological} {Networks}},
  date = {2024-07-15},
  url = {https://arxiv.org/abs/2403.02011},
  langid = {en}
}
For attribution, please cite this work as:
Anakok, Emre, Pierre Barbillon, Colin Fontaine, and Elisa Thebault. 2024. “Bipartite Graph Variational Auto-Encoder with Fair Latent Representation to Account for Sampling Bias in Ecological Networks.” July 15, 2024. https://arxiv.org/abs/2403.02011.