Projects

Gradient Optimizing Chaogates

Overview:

Chaogates represent a class of reconfigurable logic gates that leverage nonlinear maps for computation. While traditionally difficult to configure due to their high-dimensional parameter space and the non-differentiability of Boolean logic, this work introduces a differentiable formulation that enables gradient-based optimization.

Through this approach, each chaogate functions akin to a small neural network, achieving:

This opens avenues for automating parameter discovery in nonlinear computational devices and integrating chaogates into modern computation frameworks.

Diagram illustrating the process of optimizing Chaogates as though it were a neural network

Chaogate Optimization Schematic

Technical Implementation:

Highlights:

Growing Neural Networks

Current Status:

Overview:

This work explores a novel concept in artificial neural networks: Multi-Layer Perceptrons (MLPs) that dynamically grow their structure during training. Unlike conventional neural networks with fixed architectures, this approach enables adaptable networks that optimize both structure and performance as they learn.

Diagram showing the Controller-Mask method for growing neural networks dynamically

Controller-Mask Algorithm Schematic

Technical Implementation:

Highlights:

Metalearning Neural Activation Functions

Overview:

This project explores an approach where neural networks learn their own activation functions dynamically during training. Instead of relying on fixed or predefined activations, activation functions are represented as neural subnetworks that adaptively optimize 1-1 nonlinear mappings in situ while the main network simultaneously learns the task at hand.

Diagram illustrating metalearning activation functions, where activations are subnetworks that are optimized dynamically during training

Metalearning Activations Schematic

Technical Implementation:

Highlights:

Neural Network Control of Chaos

Current Status:

Overview:

This project takes on the problem of controlling chaotic systems using neural networks. Realizing that full simulation of chaotic systems is computationally expensive and often infeasible, we develop neural network controllers based on neural ordinary differential equations that learn control signals that steer the dynamical systems towards the desired target state. Although these problems are theoretically formulable as optimal control problems within variational calculus the solutions are often intractable. We demonstrate that neural networks can learn to control chaotic systems with high accuracy and efficiency.

Diagram illustrating the use of neural networks for controlling dynamical systems in a model predictive control setting

Neural Network Control Schematic

Technical Implementation:

Highlights:

Other things I’ve worked on

High Energy Physics

Leptons are elementary particles with half integer spins (like electrons) that are crucial in the study of high energy physics. Many times it is important to differentiate between leptons that are produced in the initial collision and those that are produced in the decay of other particles. We worked to develop a reccurent neural network-based model that can identify and isolate leptons from other particles in collision data from the ATLAS detector at CERN. All neural network models were built in Python using PyTorch.

Quantum Computing

I have experience with Qiskit, a quantum computing software development kit developed by IBM. I have also built classical quantum computer simulators in Julia and Python and have worked with quantum algorithms like Grover’s and Shor’s algorithms. I have also built quantum machine learning models like variational eigensolvers.

Condensed Matter Physics

I have worked with ultrafast laser systems to study magnetization dynamics in thin films. I have experience with lithography and thin film deposition techniques along with various characterization techniques like X-ray diffraction and atomic force microscopy.

I have worked on developing an optical second harmonic detection system to study antiferromagnetic thin films.

I have also characterized superconducting RF cavity dirty layers and their effect on cavity performance. To do so effectively, I developed simulations of Bardeen-Cooper-Schrieffer (BCS) theory and London theory to study the effect of dirty layers on superconducting properties, improving data analysis efficiency by 48 times due to reduced human intervention compared to previous methods.

Monte Carlo simulations

I have used classical and quantum Monte Carlo to create atomic scale simulations with the Lennard-Jones potential.