Resume-aware faculty matching

Find professors who actually fit you

Upload your resume. Four AI agents analyze your background, rank the faculty who fit, inspect their recent research, and help you draft outreach — grounded in their actual work, not templates.

Free to startNo credit cardCancel anytime
Top matches Balanced preset
Dr. Sarah Chen
Stanford · Interpretability · NLP
91
Dr. Marcus Holloway
MIT · Robotics · RL
84
Dr. Aisha Okonkwo
CMU · Fairness · HCI
82
Nova · Professor Researcher · re-ranking top 20…
Keaton Hamm

Keaton Hamm

Verified

University of Arizona · Software Engineering

Active 2013–2026

h-index11
Citations362
Papers7335 last 5y
Funding
See your match with Keaton Hamm — sign in to PhdFit.Sign in

Research topics

  • Mathematics
  • Combinatorics
  • Computer science
  • Discrete mathematics
  • Pure mathematics

Selected publications

  • Recovering Wasserstein Distance Matrices from Few Measurements

    2026-04-21

    articleOpen accessSenior author

    This paper proposes two algorithms for estimating square Wasserstein distance matrices from a small number of entries. These matrices are used to compute manifold learning embeddings like multidimensional scaling (MDS) or Isomap, but contrary to Euclidean distance matrices, are extremely costly to compute. We analyze matrix completion from upper triangular samples and Nyström completion in which ${\mathcal{O}}\left({d\log \left(d\right)}\right)$ columns of the distance matrices are computed where d is the desired embedding dimension, prove stability of MDS under Nyström completion, and show that it can outperform matrix completion for a fixed budget of sample distances. Finally, we show that classification of the OrganCMNIST dataset from the MedMNIST benchmark is stable on data embedded from the Nyström estimation of the distance matrix even when only 10% of the columns are computed.

  • Empowering Clients: Self-Adaptive Federated Learning for Data Quality Challenges

    2025-07-07 · 1 citations

    article

    Federated Learning (FL) enables collaborative model training across distributed clients, but the global model’s performance often degrades due to variable data quality and reliability at the local level. Previous approaches mitigate this by restricting or excluding contributions from certain clients, leading to wasted computation and communication resources for those disregarded. In this paper, we introduce FedSRC: Federated Learning with Self-Regulating Clients, a novel framework that optimizes resource use while safeguarding client anonymity. With FedSRC, clients autonomously assess their local training’s benefit to the global model using a lightweight checkpoint based on local test loss and a Refined Heterogeneity Index (RHI), deciding their participation in each FL round accordingly. Extensive evaluations across four datasets demonstrate that FedSRC achieves up to 30% savings in communication costs and 55% in computation costs, all while maintaining privacy and enhancing efficiency.

  • Persistent Classification: Understanding Adversarial Attacks by Studying Decision Boundary Dynamics

    Statistical Analysis and Data Mining The ASA Data Science Journal · 2025-01-21 · 2 citations

    articleOpen access

    ABSTRACT There are a number of hypotheses underlying the existence of adversarial examples for classification problems. These include the high‐dimensionality of the data, the high codimension in the ambient space of the data manifolds of interest, and that the structure of machine learning models may encourage classifiers to develop decision boundaries close to data points. This article proposes a new framework for studying adversarial examples that does not depend directly on the distance to the decision boundary. Similarly to the smoothed classifier literature, we define a (natural or adversarial) data point to be ( γ , σ)‐stable if the probability of the same classification is at least for points sampled in a Gaussian neighborhood of the point with a given standard deviation . We focus on studying the differences between persistence metrics along interpolants of natural and adversarial points. We show that adversarial examples have significantly lower persistence than natural examples for large neural networks in the context of the MNIST and ImageNet datasets. We connect this lack of persistence with decision boundary geometry by measuring angles of interpolants with respect to decision boundaries. Finally, we connect this approach with robustness by developing a manifold alignment gradient metric and demonstrating the increase in robustness that can be achieved when training with the addition of this metric.

  • Manifold Learning in Wasserstein Space

    SIAM Journal on Mathematical Analysis · 2025-06-05 · 4 citations

    articleOpen access1st authorCorresponding
  • On Wasserstein distances for affine transformations of random vectors

    Foundations of Data Science · 2024-01-01 · 2 citations

    articleOpen access1st authorCorresponding

    We expound on some known lower bounds of the quadratic Wasserstein distance between random vectors in $ \mathbb{R}^n $ with an emphasis on affine transformations that have been used in manifold learning of data in Wasserstein space. In particular, we give concrete lower bounds for rotated copies of random vectors in $ \mathbb{R}^2 $ by computing the Bures metric between the covariance matrices. We also derive upper bounds for compositions of affine maps which yield a fruitful variety of diffeomorphisms applied to an initial data measure. We apply these bounds to various distributions including those lying on a 1-dimensional manifold in $ \mathbb{R}^2 $ and illustrate the quality of the bounds. Finally, we give a framework for mimicking handwritten digit or alphabet datasets that can be applied in a manifold learning framework.

  • Persistent Classification: A New Approach to Stability of Data and Adversarial Examples

    arXiv (Cornell University) · 2024-04-11

    preprintOpen access

    There are a number of hypotheses underlying the existence of adversarial examples for classification problems. These include the high-dimensionality of the data, high codimension in the ambient space of the data manifolds of interest, and that the structure of machine learning models may encourage classifiers to develop decision boundaries close to data points. This article proposes a new framework for studying adversarial examples that does not depend directly on the distance to the decision boundary. Similarly to the smoothed classifier literature, we define a (natural or adversarial) data point to be $(γ,σ)$-stable if the probability of the same classification is at least $γ$ for points sampled in a Gaussian neighborhood of the point with a given standard deviation $σ$. We focus on studying the differences between persistence metrics along interpolants of natural and adversarial points. We show that adversarial examples have significantly lower persistence than natural examples for large neural networks in the context of the MNIST and ImageNet datasets. We connect this lack of persistence with decision boundary geometry by measuring angles of interpolants with respect to decision boundaries. Finally, we connect this approach with robustness by developing a manifold alignment gradient metric and demonstrating the increase in robustness that can be achieved when training with the addition of this metric.

  • Linearized Wasserstein dimensionality reduction with approximation guarantees

    Applied and Computational Harmonic Analysis · 2024-10-15 · 5 citations

    articleCorresponding
  • Manifold learning in Wasserstein space

    arXiv (Cornell University) · 2023-11-14 · 1 citations

    preprintOpen access1st authorCorresponding

    This paper aims at building the theoretical foundations for manifold learning algorithms in the space of absolutely continuous probability measures $\mathcal{P}_{\mathrm{a.c.}}(Ω)$ with $Ω$ a compact and convex subset of $\mathbb{R}^d$, metrized with the Wasserstein-2 distance $\mathbb{W}$. We begin by introducing a construction of submanifolds $Λ$ in $\mathcal{P}_{\mathrm{a.c.}}(Ω)$ equipped with metric $\mathbb{W}_Λ$, the geodesic restriction of $\mathbb{W}$ to $Λ$. In contrast to other constructions, these submanifolds are not necessarily flat, but still allow for local linearizations in a similar fashion to Riemannian submanifolds of $\mathbb{R}^d$. We then show how the latent manifold structure of $(Λ,\mathbb{W}_Λ)$ can be learned from samples $\{λ_i\}_{i=1}^N$ of $Λ$ and pairwise extrinsic Wasserstein distances $\mathbb{W}$ on $\mathcal{P}_{\mathrm{a.c.}}(Ω)$ only. In particular, we show that the metric space $(Λ,\mathbb{W}_Λ)$ can be asymptotically recovered in the sense of Gromov--Wasserstein from a graph with nodes $\{λ_i\}_{i=1}^N$ and edge weights $W(λ_i,λ_j)$. In addition, we demonstrate how the tangent space at a sample $λ$ can be asymptotically recovered via spectral analysis of a suitable ``covariance operator'' using optimal transport maps from $λ$ to sufficiently close and diverse samples $\{λ_i\}_{i=1}^N$. The paper closes with some explicit constructions of submanifolds $Λ$ and numerical examples on the recovery of tangent spaces through spectral analysis.

  • Multi-Priority Graph Sparsification

    arXiv (Cornell University) · 2023-01-29

    preprintOpen access

    A \emph{sparsification} of a given graph $G$ is a sparser graph (typically a subgraph) which aims to approximate or preserve some property of $G$. Examples of sparsifications include but are not limited to spanning trees, Steiner trees, spanners, emulators, and distance preservers. Each vertex has the same priority in all of these problems. However, real-world graphs typically assign different ``priorities'' or ``levels'' to different vertices, in which higher-priority vertices require higher-quality connectivity between them. Multi-priority variants of the Steiner tree problem have been studied in prior literature but this generalization is much less studied for other sparsification problems. In this paper, we define a generalized multi-priority problem and present a rounding-up approach that can be used for a variety of graph sparsifications. Our analysis provides a systematic way to compute approximate solutions to multi-priority variants of a wide range of graph sparsification problems given access to a single-priority subroutine.

  • Linearized Wasserstein dimensionality reduction with approximation guarantees

    arXiv (Cornell University) · 2023-02-14

    preprintOpen access

    We introduce LOT Wassmap, a computationally feasible algorithm to uncover low-dimensional structures in the Wasserstein space. The algorithm is motivated by the observation that many datasets are naturally interpreted as probability measures rather than points in $\mathbb{R}^n$, and that finding low-dimensional descriptions of such datasets requires manifold learning algorithms in the Wasserstein space. Most available algorithms are based on computing the pairwise Wasserstein distance matrix, which can be computationally challenging for large datasets in high dimensions. Our algorithm leverages approximation schemes such as Sinkhorn distances and linearized optimal transport to speed-up computations, and in particular, avoids computing a pairwise distance matrix. We provide guarantees on the embedding quality under such approximations, including when explicit descriptions of the probability measures are not available and one must deal with finite samples instead. Experiments demonstrate that LOT Wassmap attains correct embeddings and that the quality improves with increased sample size. We also show how LOT Wassmap significantly reduces the computational cost when compared to algorithms that depend on pairwise distance computations.

Frequent coauthors

Education

  • PhD Mathematics, Mathematics

    Texas A&M University

  • BS Chemical Engineering, Chemical Engineering

    Texas A&M University

    2010
  • BS Mathematics (Honors), Mathematics

    Texas A&M University

    2010
  • Resume-aware match score
  • Save to shortlist
  • AI-drafted outreach

See your match with Keaton Hamm

PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.

  • Free to start
  • No credit card
  • 30-second signup