Structural Node Embeddings with Homomorphism Counts
https://arxiv.org/abs/2308.15283

* graph homomorphism counts of interest in graph-based machine learning
* capture local structural information, enabling creation of robust structural embeddings
* enriched w. node labels, node weights, & edge weights, offers interpretable representation of graph data
* allows enhanced explainability of ML models

#GraphTheory #GNN #homomorphism #NeuralNetworks #GraphColoring #GraphEmbeddings #EplainableML

Structural Node Embeddings with Homomorphism Counts

Graph homomorphism counts, first explored by Lovász in 1967, have recently garnered interest as a powerful tool in graph-based machine learning. Grohe (PODS 2020) proposed the theoretical foundations for using homomorphism counts in machine learning on graph level as well as node level tasks. By their very nature, these capture local structural information, which enables the creation of robust structural embeddings. While a first approach for graph level tasks has been made by Nguyen and Maehara (ICML 2020), we experimentally show the effectiveness of homomorphism count based node embeddings. Enriched with node labels, node weights, and edge weights, these offer an interpretable representation of graph data, allowing for enhanced explainability of machine learning models. We propose a theoretical framework for isomorphism-invariant homomorphism count based embeddings which lend themselves to a wide variety of downstream tasks. Our approach capitalises on the efficient computability of graph homomorphism counts for bounded treewidth graph classes, rendering it a practical solution for real-world applications. We demonstrate their expressivity through experiments on benchmark datasets. Although our results do not match the accuracy of state-of-the-art neural architectures, they are comparable to other advanced graph learning models. Remarkably, our approach demarcates itself by ensuring explainability for each individual feature. By integrating interpretable machine learning algorithms like SVMs or Random Forests, we establish a seamless, end-to-end explainable pipeline. Our study contributes to the advancement of graph-based techniques that offer both performance and interpretability.

arXiv.org

cont'd

Graph homomorphism: https://en.wikipedia.org/wiki/Graph_homomorphism

* graph theory
* mapping between 2 graphs that respects their structure
* generalize various notions of graph colorings
* allow expression of an imp. class of constraint satisfaction problems, e.g. scheduling; frequency assignment problems
* computational complexity generally prohibitive
* known about special cases solvable in polynomial time

#GraphTheory #GNN #homomorphism #NeuralNetworks #GraphColoring #GraphEmbeddings

Graph homomorphism - Wikipedia

Addendum 1

Graph coloring (above) appears e.g. in eval. relational databases such as PostgreSQL as graph databases (de novo, v. Agensgraph / Apache AGE ...).

Background

Using PostgreSQL as a graph database
https://tech.ingrid.com/sql-as-graph-database

Improving the Speed & Quality of Parallel Graph Coloring
https://dl.acm.org/doi/10.1145/3543545
https://userweb.cs.txstate.edu/~mb92/papers/topc22.pdf

Graph coloring / RDB

SQLGraph: An Efficient Relational-Based Property Graph Store
https://dl.acm.org/doi/pdf/10.1145/2723372.2723732
* e.g. Fig. 2

#GraphTheory #GraphDatabase #PostgreSQL

Using PostgreSQL as a graph database

Who needs a graph database when you already have PostgreSQL