Carnegie Mellon University

CMU Mathematics Undergraduate Research, 2024

surf-sems-2024


In early 2020, as the global coronavirus pandemic brought the world to a screeching halt, many of our students found that their summer programs had been cancelled. Many of these students were very interested in using this suddenly free summer to engage with open ended-projects in mathematics. In response to this demand, the Department of Mathematical Sciences curated a brand new summer program: Summer Experiences in Mathematical Sciences (SEMS). This program is offered in parallel with the traditional SURF program, which provides stipends to students who stay at CMU in the summer months to work on research projects. Find below details of the projects conducted in the summer of 2024 under the auspices of the SURF and SEMS program.

▼ SURF projects

► ImProver
Riyaz Ahuja

Advisors: Jeremy Avigad, Prasad Tetali, and Sean Welleck
Abstract: We present ImProver, a general-purpose agent-based framework and tool for Lean 4 proof optimization that enables templated and nontemplated neural theorem proving on arbitrary, user-defined metrics. Additionally, we define and analyze a set of metrics that are of general interest, including proof length, modularity, and readability, as well as a completion metric that enables nontemplated automated theorem proving. ImProver is powered by error-correction, tactic prediction, and example selection retrieval-augmented generation (RAG), as well as composite 15-sample refinement and best-of-n that promises output correctness. Additionally, we utilize Lean metaprogramming to extract proof state annotation information, as well as proof tree structure.

With this framework, ImProver outperforms SoTA language models (gpt-4o) on proof optimization tasks on undergraduate (MIL), competition (USAMO, IMO), and research-level (Polynomial Freiman-Ruzsa Conjecture) datasets with 57% mean metric improvement, 100% accuracy, and 54% nonzero accuracy -- compared to the baseline 19% improvement, 35.77% accuracy, and 15.33% nonzero accuracy.

ImProver project

► Divergent Sums
Soren Dupont

Advisor: Giovanni Leoni
Abstract: In this project, we investigate instances where a divergent sum is assigned a (finite) value. For instance, this can occur if we switch the order of integration and summation, when the hypothesis of the Lebesgue dominated convergence theorem is not met, or by applying the formula for a geometric series when convergence is not met. We looked at instances where these incorrect steps are used to get results that turn out to be correct, and modify them to be rigorous (not relying on an incorrect assumption that divergent sums can actually be treated as values), typically making use of results from complex analysis and Ramanujan Summation.

We explore the mathematical toolset which equips us to deal with divergent sums, thereby putting the resulting identities on solid foundation.

divergent_sums_leoni.gif

$e^{-x}$ times the Taylor expansion of $\cos(1.5x) (e^{-x}\sum_{n\ge0}\frac{(-1)^{n}(1.5)^{2n}x^{2n}}{(2n)!})$ (green) gives an increasingly better approximations for $e^{-x}\cos(1.5x)$ (red) (i.e., if this processes is taken to infinity, we have that the green and red curves coincide everywhere; we can write $e^{-x}cos(1.5x)$ as an infinite sum for all x), but the area under the green curve diverges rather than approaching the area under the red curve. As a result, when we interchange integration and summation in this case, it changes the behavior of our expression, from convergence to divergence (a convergent integral of the red function to a divergent sum). Nonetheless, when we apply the Geometric series formula, we get the correct value of this integral by using this divergent sum.

► Mathematical Modeling of Micromagnetics
Nelitha Kulasiri, Henry Siegel, Michael Zhou

Advisor: Giovanni Leoni
Abstract: This research project investigates micromagnetics -- a theoretical framework developed by W.F.Brown, to describe the macroscopic behaviour of magnetic materials. A central concept in micromagnetics is the minimization of the magnetic energy within a material.

In this project we looked at the one dimensional variant of the energy equation: $E_{\varepsilon}(f) = \int_0^1 \varepsilon(f')^2 + \frac{1}{\varepsilon}|f^2-1|^{\alpha} dx$ . We separately analyzed the cases where $\alpha=1$, $\alpha=2$, and $1<\alpha<2$, and in all cases we found that the sequence of functions with minimized $E_{\varepsilon}$ was one which satisfies $f' = \frac{1}{\varepsilon}(1-f^2)^{\frac{1}{2}}$ up to an error factor, and explicitly found what the functions gamma converged to. For the $\alpha=1$ and $\alpha=2$ cases we found an explicit sequence.

We analyze the second order gamma of the function $G_\varepsilon(f) = \frac{E_{\varepsilon} - m_0}{h(\varepsilon)}$, where $h(\varepsilon)$ needed to be such that $G_{\varepsilon}$ had a nontrivial limit. For $\alpha=1$ and $\alpha=2$ we explicitly constructed an $h(\varepsilon)$ by finding the derivative of $E_{\varepsilon}$ with respect to $\varepsilon$ and using L'Hospital's theorem.

► Explicit reductions between the Problem of Equivalence in dimension 2 and the universal CBER
Yoojung Christina Kwon

Advisor: Aristotelis Panagiotopoulos
Abstract: The Problem of Equivalence, which dates back to Gauss and Cartan, has been a subject of study for centuries. The problem asks for simple, necessary and sufficient conditions for deciding when two Riemannian manifolds represent the same geometry, albeit in a different coordinate system. Earlier findings suggested that the intrinsic complexity of the Problem of Equivalence for dimensional 2 manifolds is Borel bireducible to the universal CBER.

In this project we confirmed that this is indeed true, by providing explicit reductions between the Bernoulli shift of free group in two generators and the Problem of Equivalence in dimension 2.

the Problem of Equivalence project

► Multivariate Gaussian Mixtures
Ishin Shah

Advisor: Tomasz Tkocz
Abstract: We study extremal-volume planar sections of unit balls in $\ell p$. Minimal-volume sections have been only recently found for the cross-polytope ($p=1$) via a geometric-analytic argument relying on convexity of certain functions arising from the radial function of a natural planar embedding of the sections. We have analysed several low-dimensional examples illustrating difficulties in extending this argument to the other values of $p$, beyond $p=1$. This analysis has enabled us to formulate a plausible conjecture for extremisers of cube slices (the case $p=\infty$).

► Competitively Constructed Planar Graphs
Eric Wang

Advisor: Wes Pegden
Abstract: An intriguing aspect of combinatorial games is the frequency with which the behavior of competitive players under optimum strategies mimics the outcomes expected by chance in random play. Motivated by this connection, we studied combinatorial games producing planar graphs, for which results about uniformly random instances remains a significant theoeretical challenge.

We established conditions under which optimum play in such games could force or prevent the creation of desired structures and substructures.

wes.png

► Representability of Polish groups on reflexive Banach spaces
Francesca Zhehan Yu

Advisor: Aristotelis Panagiotopoulos
Abstract: Given a group of symmetries G one often wants to know whether G can be understood through the ways in which it acts on a Hilbert space. In contrast to locally-compact topological groups, there are several Polish groups which admit no non-trivial (continuous) representation on a Hilbert space. Christensen and Herer provided one of the first examples of such an "exotic" group: the group of all real-valued measurable functions from the interval endowed with a pathological submeasure, endowed with the convergence in (sub)measure topology.

Through this project we explored whether the aforementioned topological group admits any interesting actions on more general classes of Banach spaces. Our preliminary findings suggest that it does not admit any non-trivial action by linear isometries on any reflexive Banach space either.

Representability of Polish groups project

► Determinacy of Infinite Games and Large Cardinals
Rui Zhou

Advisor: Ernest Schimmerling
Abstract: For simplicity, consider a game with two players who take turns making moves. Let us assume that there are no ties, so that one of the players wins each round. In the familiar games people play, each round is finite, but we do not impose this requirement. A game is said to be determined if one of the players has a winning strategy. Many key properties in topology and analysis can be expressed in terms of certain games being determined; Lebesgue measurability is an example.

Using the Axiom of Choice, one easily sees that not all games are determined. It is more interesting and more difficult to prove that the games which arise in practice are determined. Games which are open in a certain topological sense were proved to be determined by Gale and Stewart in 1953; this includes all games in which every round is finite. In 1975, Martin proved that Borel games are determined. Earlier, in 1970, Martin proved the stronger conclusion that analytic games are determined but, for this, he assumed the existence of a measurable cardinal. That was the first in a series of deep discoveries connecting determinacy and large cardinals. Another important milestone is the 1989 theorem of Martin and Steel that, if there are infinitely many Woodin cardinals, then every projective game is determined.

▼ SEMS projects

► Borel Complexity in Topology
Erica Assang, Thomas Kim, Daniel Mai

Advisor: Riley Thornton
Abstract: We can measure the complexity of a mathematical formula by counting alternating existential and universal quantifiers, and The Borel complexity of a set is the minimum quantifier complexity of a formula describing a set using only open predicates and quantifiers over countable sets. For instance the set of equivalence relations on $\mathbb{N}$ has complexity $\Pi^0_1$ since it can defined with only universal quantifiers, while the set of linear orders with a minimum is $\Sigma^0_2$ since it requires an ``exists" followed by a ``for all".

robust distance graphs project

In this project, we analyzed the Borel complexity of several notions in topology.

We studied the set of Stone spaces with a given Cantor-Bendixson rank (a fundamental topological invariant measuring how densely packed a space's points are). We showed that the set of Stone spaces with Cantor--Bendixson rank $\alpha$ has complexity at most $\Pi_{2\alpha+4}$. And, we showed $\Pi_{2\alpha+2}$ is an upper bound on complexity if we restrict to countable spaces. We also prove that for finite $\alpha,$ $\Pi_{2\alpha+2}$ is a lower bound, and thus the Borel complexity of the set of countable spaces of finite Cantor--Bendixson rank is exactly $\Pi_{2\alpha+2}$. This project is ongoing and we expect to be able to extend these results to all $\alpha$ and establish a lower bound for the complexity of ranks for general Stone spaces.

We also analyzed various properties of subshifts-- topological spaces of functions equipped with a translation operation. We investigated topological entropy (a fundamental dynamical invariant measuring the information content in a subshift). And, we determined upper and lower bounds for the complexity of some fundamental subshifts.

► Mathematics of Deep Learning Architectures
Iven Bai, Justin Fung, Vishant Raajkumar, Daniel Wang, Leo Xu

Advisor: Junichi Koganemaru
Abstract: Deep learning is an umbrella term for a popular class of algorithms that have seen major applications in recent years. Aside from its many practical applications in data science, they have also been used as a useful tool for helping researchers prove rigorous mathematical theorems. For example, Physics-informed neural networks (PINNs) (first proposed by Raissi, Perdikaris, and Karniadakis) were recently employed by teams of researchers to identify potential blow-up solutions to fluid equations. In this project our goal is to investigate the mathematics that underly prototypical deep learning algorithms and understand how to apply these algorithms to analyze real-world data.

We show numerically that deep learning algorithms can outperform classical autoregressive moving average models in time series forecasting. We also train a Physics-informed neural network to find approximate solutions to the incompressible free boundary Navier-Stokes equations.

Mathematics of Deep Learning Architectures project

► Anti-Ramsey Numbers for Regular Bipartite Graphs
Justin Blanco, Anna Deng, Joseph Wang

Advisor: Michael Young
Abstract: The anti-Ramsey number $AR(G, H)$ is the maximum value of $k$ such that there exists a $k$-coloring of edges in $G$ that does not contain any rainbow subgraph isomorphic to $H$, i.e. any copy of $H$ in $G$ contains at least 2 edges of the same color.

We explored anti-Ramsey problems in different types of graphs and proved the formula $$AR(B(n, d), nK_2) = d(n-2)+1$$ for $$n < \frac{d-2d^2+d^{2}n}{2-3d+2d^2},$$

where $B(n, d)$ is the $d$-regular bipartite graph with $n$ vertices in each part, and $nK_2$ is a perfect matching with $n$ edges.

 

Anti-Ramsey Numbers project

► One-dimensional Calculus of Variations
Yidai Cen, Brian Park

Advisor: Reinaldo Resende
Abstract: We used the Euler-Lagrange equation to analyze the behavior of one-dimensional minimizers in variational problems, such as the brachistochrone, minimal surfaces of revolution, and the cycloid (see .gif). In certain cases, this approach allowed us to derive the exact formula for the minimizer. However, establishing the existence of minimizers requires introducing Sobolev spaces, specifically $W^{1,2}$. These spaces include functions that, while not a priori continuous, admit a weak notion of derivative. We examined examples of Sobolev functions, along with strong and weak convergence properties, and proofs of theorems related to Sobolev spaces, such as the Poincaré inequality and Sobolev embedding theorems.

We reviewed the proof of the existence of minimizers within $W^{1,2}$ and then carefully studied the proof of the regularity theorem, which asserts that, under suitable conditions, a minimizer $u \in W^{1,2}$ will exhibit classical smoothness properties, i.e., $u \in C^{\infty}$.

One-dimensional Calculus of Variations project

► Accelerating NCE Convergence with Adaptive Normalizing Constant Computation
Sebastian Kumar, Wenyuan Shen

Advisor: Maria Chikina
Abstract: Noise Contrastive Estimation (NCE) is a widely used method for training generative models, typically used as an alternative to Maximum Likelihood Estimation (MLE) when exact computations of probability are hard. NCE trains generative models by discriminating between data and appropriately chosen noise distributions.

Although NCE is statistically consistent, it suffers from slow convergence and high variance when there is small overlap between the noise and data distributions. Both these problems are related to the flatness of the NCE loss landscape.

Accelerating NCE Convergence project

We propose an innovative approach to circumvent slow convergence rates by quick inference of the optimal normalizing constant at every gradient step. This allows the rest of the parameters to have more freedom during NCE optimization. We analyze the Bennett Acceptance Ratio (BAR) for quick computation of the normalizing constant and show improved performance for both methods on convex and non-convex settings. We prove that BAR converges to the right partition function under mild conditions of overlap. Using this we show an improvement in performance of BAR based on NCE over the vanilla baseline on the MNIST dataset.

► On the number of robust distance graphs in a Euclidean space
Lufei Yang, Colin Yip, Madison Zhao

Advisor: Konstantin Tikhomirov
Abstract: We say that a graph $G$ with vertices in a $k$-dimensional Euclidean space is a $t$-robust distance graph if any two vertices $v,w$ are adjacent if and only if the Euclidean distance between $v$ and $w$ is less than $1-t$ and are not adjacent if and only if the Euclidean distance is greater than $1+t$. Robust distance graphs naturally appear in the context of high-dimensional data analysis. In particular, the ''richness'' of the set of robust distance graphs is related to efficiency of dimension reduction for similarity networks. In this project, we estimate the number of isomorphism classes of robust $d$-regular distance graphs in $R^k$ using methods from high-dimensional convex geometry.

On the number of robust distance graphs project

2024 SURF team

Students: Riyaz Ahuja, Soren Dupont, Nelitha Kulasiri, Henry Siegel, Yoojung Christina Kwon, Ishin Shah, Eric Wang, Francesca Zhehan Yu, Michael Zhou, Rui Zhou

Faculty: Jeremy Avigad, Giovanni Leoni, Aristotelis Panagiotopoulos, Wes Pegden, Ernest Schimmerling, Prasad Tetali, Tomasz Tkocz, Sean Welleck

2024 SEMS research team

Students: Erica Assang, Iven Bai, Justin Blanco, Yidai Cen, Anna Deng, Justin Fung, Thomas Kim, Sebastian Kumar, Daniel Mai, Brian Park, Vishant Raajkumar, Wenyuan Shen, Daniel Wang, Joseph Wang, Leo Xu, Lufei Yang, Colin Yip, Madison Zhao

Faculty: Maria Chikina, Junichi Koganemaru, Reinaldo Resende, Riley Thornton, Konstantin Tikhomirov, Michael Young