Research themes and current directions
Geometry and Topology of Large Language Models
We study the internal representations of transformer models using tools from geometry and topological data analysis. By treating token embeddings as point clouds that evolve across layers, we characterize how models organize and process information. This geometric perspective reveals interpretable structure in the latent space and yields practical tools for interpretability and safety, including pre-output monitoring, layer pruning, and the study of uncertainty propagation in neural networks.
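A minimal sketch of the point-cloud view, not our actual pipeline: the function below (the name `layerwise_geometry` and the random "hidden states" are illustrative assumptions) summarizes each layer's token cloud by its mean pairwise cosine distance. In practice, the hidden states would come from a real model, e.g. a HuggingFace transformer run with `output_hidden_states=True`.

```python
import numpy as np

def layerwise_geometry(hidden_states):
    """Summarize each layer's token point cloud by its mean pairwise
    cosine distance, a crude proxy for representational spread."""
    summaries = []
    for layer in hidden_states:  # layer: (num_tokens, dim)
        unit = layer / np.linalg.norm(layer, axis=1, keepdims=True)
        cos = unit @ unit.T
        iu = np.triu_indices(len(unit), k=1)  # upper triangle, no diagonal
        summaries.append(np.mean(1.0 - cos[iu]))
    return np.array(summaries)

# Toy demo with random stand-in hidden states: 12 layers, 16 tokens, dim 64.
rng = np.random.default_rng(0)
hidden = rng.normal(size=(12, 16, 64))
print(layerwise_geometry(hidden))  # one scalar summary per layer
```

Tracking how such summaries change from layer to layer is one simple way to probe where a model reorganizes its representation.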
Topological Analysis of Neural Dynamics
We develop methods to characterize temporal evolution in neural population recordings using persistent homology. By tracking topological features (connected components, loops) in time-varying activity patterns, we extract compact representations of spatiotemporal dynamics. The framework applies broadly to high-dimensional time series from neuroscience experiments.
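As an illustrative sketch of the windowed analysis (the helper `sliding_window_h1` and the toy ring trajectory are assumptions, not our released code), the snippet below uses the ripser library to track the most persistent loop in successive windows of a (time, neurons) activity matrix.

```python
import numpy as np
from ripser import ripser  # pip install ripser

def sliding_window_h1(activity, window=50, step=25):
    """Track the most persistent 1-dimensional feature (loop) across
    sliding windows of a (time, neurons) activity matrix."""
    scores = []
    for start in range(0, activity.shape[0] - window + 1, step):
        cloud = activity[start:start + window]  # points = time bins
        dgm1 = ripser(cloud, maxdim=1)['dgms'][1]
        lifetimes = dgm1[:, 1] - dgm1[:, 0] if len(dgm1) else np.array([0.0])
        scores.append(lifetimes.max())
    return np.array(scores)

# Toy demo: a noisy ring trajectory embedded in a 10-neuron state space,
# the kind of structure a loop score should pick up.
t = np.linspace(0, 4 * np.pi, 200)
ring = np.stack([np.cos(t), np.sin(t)], axis=1)
embed = ring @ np.random.default_rng(1).normal(size=(2, 10))
noisy = embed + 0.05 * np.random.default_rng(2).normal(size=embed.shape)
print(sliding_window_h1(noisy))  # loop persistence per window
```

The resulting persistence scores form exactly the kind of compact, per-window summary of spatiotemporal dynamics described above.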
Learning Optimal Topological Summaries
Applying topological data analysis requires choosing filtrations and vectorization schemes, and these choices strongly shape what the resulting summaries can detect. We introduce methods that learn these components by maximizing the information they retain about the underlying system, enabling automated, task-specific optimization of the topological pipeline for scientific inference problems. The framework is implemented in an open-source library and applies to point clouds, fields, and time series.
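One way a learnable vectorization can look, as a sketch only: a persistence-image-style layer where the per-point weighting is a small trained network rather than a fixed function. The class `LearnablePersistenceImage` and all its parameters are hypothetical stand-ins, not the API of our library.

```python
import torch

class LearnablePersistenceImage(torch.nn.Module):
    """Persistence-image-style vectorization with a learned per-point
    weight, so the summary can be optimized for a downstream task."""
    def __init__(self, resolution=8, sigma=0.1):
        super().__init__()
        self.weight_net = torch.nn.Sequential(
            torch.nn.Linear(2, 16), torch.nn.ReLU(), torch.nn.Linear(16, 1))
        grid = torch.linspace(0, 1, resolution)
        gy, gx = torch.meshgrid(grid, grid, indexing='ij')
        self.register_buffer('centers',
                             torch.stack([gx.ravel(), gy.ravel()], dim=1))
        self.sigma = sigma

    def forward(self, diagram):
        # diagram: (n_points, 2) tensor of (birth, persistence) pairs.
        w = torch.nn.functional.softplus(self.weight_net(diagram))  # (n, 1)
        d2 = torch.cdist(diagram, self.centers) ** 2                # (n, grid)
        img = (w * torch.exp(-d2 / (2 * self.sigma ** 2))).sum(dim=0)
        return img  # flattened image, differentiable in weight_net

# The weighting can then be trained end-to-end against any task loss:
vec = LearnablePersistenceImage()
dgm = torch.rand(20, 2)
loss = ((vec(dgm) - torch.ones(64)) ** 2).mean()
loss.backward()
```

Because the whole map from diagram to vector is differentiable, standard gradient-based training replaces hand-tuning of the vectorization.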
Bayesian Inference for Spectroscopy
We develop Bayesian methods for feature extraction from X-ray absorption and other spectroscopic data. By fitting physically motivated forward models with full uncertainty quantification, we enable robust analysis of operando measurements where traditional baseline subtraction fails. The approach combines objective model selection via information criteria with MCMC posterior sampling to produce interpretable, statistically rigorous spectral decompositions.
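A minimal sketch of the MCMC fitting step, assuming the emcee sampler: a single Gaussian peak on a constant background stands in for the physically motivated line shapes used in practice, and the model, priors, and parameter names here are illustrative assumptions.

```python
import numpy as np
import emcee  # pip install emcee

# Forward model: one Gaussian peak on a constant background
# (a stand-in for a physically motivated spectral line shape).
def model(theta, x):
    amp, mu, sigma, bg = theta
    return amp * np.exp(-0.5 * ((x - mu) / sigma) ** 2) + bg

def log_prob(theta, x, y, yerr):
    amp, mu, sigma, bg = theta
    if amp <= 0 or sigma <= 0:        # flat priors with positivity constraints
        return -np.inf
    resid = (y - model(theta, x)) / yerr
    return -0.5 * np.sum(resid ** 2)  # Gaussian likelihood

# Synthetic "spectrum" with known ground truth.
rng = np.random.default_rng(0)
x = np.linspace(0, 10, 200)
truth = np.array([3.0, 5.0, 0.5, 1.0])
y = model(truth, x) + rng.normal(0, 0.1, x.size)

ndim, nwalkers = 4, 32
p0 = truth + 1e-3 * rng.normal(size=(nwalkers, ndim))
sampler = emcee.EnsembleSampler(nwalkers, ndim, log_prob, args=(x, y, 0.1))
sampler.run_mcmc(p0, 2000, progress=False)
posterior = sampler.get_chain(discard=500, flat=True)
print(posterior.mean(axis=0))  # posterior means near the ground truth
```

The posterior samples give full uncertainty on every spectral parameter, and refitting with alternative forward models lets information criteria arbitrate between them.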