
Rehan Kapadia
· Colleen and Roberto Padovani Early Career Chair in Electrical and Computer Engineering, and Associate Professor of Electrical and Computer EngineeringVerifiedUniversity of Southern California · Ming Hsieh Department of Electrical and Computer Engineering
Active 2009–2026
About
Professor Rehan Kapadia joined the faculty of the University of Southern California in the Ming Hsieh Department of Electrical and Computer Engineering in July 2014. He earned his bachelor's degree in electrical engineering from the University of Texas at Austin in 2008 and completed his Ph.D. in electrical engineering at the University of California, Berkeley in 2013. During his doctoral studies, he was recognized as a National Science Foundation Graduate Research Fellow and received the David J. Sakrison Memorial Prize for outstanding research. Since joining USC, Professor Kapadia has been honored with several prestigious awards, including the Air Force Young Investigator Program award, the Office of Naval Research Young Investigator Program award, and the American Vacuum Society Peter Mark Memorial Award for young scientists. Professor Kapadia's research interests lie at the intersection of material science and electrical engineering, with a focus on non-equilibrium electron devices and materials growth technologies for next-generation electronic and photonic devices. His recent research highlights include the experimental demonstration of hot-electron emission processes in graphene, the demonstration of high quantum efficiency hot-electron driven hydrogen reduction, and the development of low-temperature growth techniques for high-mobility III-V semiconductors on non-epitaxial substrates. Additionally, he serves as the co-director of the Center for Integrated Electronics and Biological Organisms (CIEBOrg) at USC, which reflects his commitment to advancing interdisciplinary research in electronics and biological systems.
Research topics
- Computer Science
- Nanotechnology
- Materials science
- Artificial Intelligence
- Physics
- Optoelectronics
- Neuroscience
- Organic chemistry
- Atomic physics
- Chemical physics
- Engineering
- Optics
- Electrical engineering
- Chemistry
Selected publications
Interpretable Physics Extraction from Data for Linear Dynamical Systems using Lie Generator Networks
arXiv (Cornell University) · 2026-03-28
preprintOpen accessSenior authorWhen the system is linear, why should learning be nonlinear? Linear dynamical systems, the analytical backbone of control theory, signal processing and circuit analysis, have exact closed-form solutions via the state transition matrix. Yet when system parameters must be inferred from data, recent neural approaches offer flexibility at the cost of physical guarantees: Neural ODEs provide flexible trajectory approximation but may violate physical invariants, while energy preserving architectures do not natively represent dissipation essential to real-world systems. We introduce Lie Generator Networks (LGN), which learn a structured generator A and compute trajectories directly via matrix exponentiation. This shift from integration to exponentiation preserves structure by construction. By parameterizing A = S - D (skew-symmetric minus positive diagonal), stability and dissipation emerge from the underlying architecture and are not introduced during training via the loss function. LGN provides a unified framework for linear conservative, dissipative, and time-varying systems. On a 100-dimensional stable RLC ladder, standard derivative-based least-squares system identification can yield unstable eigenvalues. The unconstrained LGN yields stable but physically incorrect spectra, whereas LGN-SD recovers all 100 eigenvalues with over two orders of magnitude lower mean eigenvalue error than unconstrained alternatives. Critically, these eigenvalues reveal poles, natural frequencies, and damping ratios which are interpretable physics that black-box networks do not provide.
The Routing and Filtering Structure of Attention
arXiv (Cornell University) · 2026-05-12
preprintOpen accessSenior authorThe attention interaction matrix $QK^{\top}$ contains two entangled computations: a skew-symmetric component that redistributes information between positions (routing) and a symmetric component that scales mutual relevance (filtering). We decompose 1776 heads across five pretrained transformers and find routing operating at low rank, well below the routing capacity allocated by the weight kernel. We introduce $S$-$D$ attention as a diagnostic parameterization that disentangles routing from filtering by construction with guaranteed stability ($\mathrm{Re}(λ) \le 0$) and trains stably without layer normalization. When disentangled and unnormalized, routing self-organizes into a spectral cascade, effective rank $2$ at the first layer, expanding with depth across six scales from 7M to 355M parameters. The cascade predicts where attention can be simplified: linearizing the first seven layers of 125M $S$-$D$ attention costs ${<}5\%$ perplexity, whereas standard attention collapses under the same intervention. The linearizable region widens with depth. Replacing the first four layers with ELU+1 linear attention reaches within $1.4\%$ of baseline at full head dimension. Cascade-allocated architectures trade attention parameters for perplexity ($47\%-65\%$ fewer attention parameters at $+3.9\%$ to $+8.4\%$ PPL). The routing-filtering decomposition makes the spectral budget legible; the cascade makes it actionable.
Lie Generator Networks Extract EIS-Grade Battery Diagnostics from Pulse Relaxation Data
arXiv (Cornell University) · 2026-05-14
preprintOpen accessSenior authorElectrochemical impedance spectroscopy (EIS) is the most informative diagnostic for lithium-ion batteries: its frequency-resolved spectra decompose cell behavior into distinct electrochemical processes, revealing mechanism-specific degradation invisible to voltage and resistance measurements. Yet EIS requires dedicated hardware and minutes-long acquisitions incompatible with field deployment. Here we show that Lie Generator Networks (LGN), a structure-preserving identification framework, extract electrochemical time constants from 60 seconds of post-pulse voltage relaxation, data that battery management systems already collect, that encode the same diagnostic and prognostic information as impedance spectra. LGN learns the generator matrix of the relaxation dynamics with stability guaranteed by architecture, yielding time constants precise enough to resolve electrochemical variation that conventional curve fitting cannot detect from identical data. Across five datasets totaling over 850 cells, four institutions, and multiple chemistries, LGN tracks degradation with near-perfect rank correlation ($|ρ_s| = 0.999$), enables cross-validated reconstruction of full Nyquist spectra at 2% median error across 227 cells, predicts which capacity-matched cells fail first from three early diagnostics, and recovers Arrhenius activation energies with zero physics priors without retraining or cell-specific tuning. LGN requires no training data, no impedance hardware, and no chemistry-specific calibration, converting any existing relaxation pulse into an impedance-grade diagnostic. This enables real-time health monitoring, rapid second-life grading, production-line quality control, and physics-informed prognosis from minutes of measurement.
Lie Generator Networks for Nonlinear Partial Differential Equations
arXiv (Cornell University) · 2026-03-31
preprintOpen accessSenior authorLinear dynamical systems are fully characterized by their eigenspectra, accessible directly from the generator of the dynamics. For nonlinear systems governed by partial differential equations, no equivalent theory exists. We introduce Lie Generator Network-Koopman (LGN-KM), a neural operator that lifts nonlinear dynamics into a linear latent space and learns the continuous-time Koopman generator ($L_k$) through a decomposition $L_k = S - D_k$, where $S$ is skew-symmetric representing conservative inter-modal coupling, and $D_k$ is a positive-definite diagonal encoding modal dissipation. This architectural decomposition enforces stability and enables interpretability through direct spectral access to the learned dynamics. On two-dimensional Navier--Stokes turbulence, the generator recovers the known dissipation scaling and a complete multi-branch dispersion relation from trajectory data alone with no physics supervision. Independently trained models at different flow regimes recover matched gauge-invariant spectral structure, exposing a gauge freedom in the Koopman lifting. Because the generator is provably stable, it enables guaranteed long-horizon stability, continuous-time evaluation at arbitrary time, and physics-informed cross-viscosity model transfer.
The Routing and Filtering Structure of Attention
ArXiv.org · 2026-05-12
articleOpen accessSenior authorThe attention interaction matrix $QK^{\top}$ contains two entangled computations: a skew-symmetric component that redistributes information between positions (routing) and a symmetric component that scales mutual relevance (filtering). We decompose 1776 heads across five pretrained transformers and find routing operating at low rank, well below the routing capacity allocated by the weight kernel. We introduce $S$-$D$ attention as a diagnostic parameterization that disentangles routing from filtering by construction with guaranteed stability ($\mathrm{Re}(λ) \le 0$) and trains stably without layer normalization. When disentangled and unnormalized, routing self-organizes into a spectral cascade, effective rank $2$ at the first layer, expanding with depth across six scales from 7M to 355M parameters. The cascade predicts where attention can be simplified: linearizing the first seven layers of 125M $S$-$D$ attention costs ${<}5\%$ perplexity, whereas standard attention collapses under the same intervention. The linearizable region widens with depth. Replacing the first four layers with ELU+1 linear attention reaches within $1.4\%$ of baseline at full head dimension. Cascade-allocated architectures trade attention parameters for perplexity ($47\%-65\%$ fewer attention parameters at $+3.9\%$ to $+8.4\%$ PPL). The routing-filtering decomposition makes the spectral budget legible; the cascade makes it actionable.
SSRN Electronic Journal · 2026-01-01
preprintOpen accessLie Generator Networks for Nonlinear Partial Differential Equations
ArXiv.org · 2026-03-31
articleOpen accessSenior authorLinear dynamical systems are fully characterized by their eigenspectra, accessible directly from the generator of the dynamics. For nonlinear systems governed by partial differential equations, no equivalent theory exists. We introduce Lie Generator Network-Koopman (LGN-KM), a neural operator that lifts nonlinear dynamics into a linear latent space and learns the continuous-time Koopman generator ($L_k$) through a decomposition $L_k = S - D_k$, where $S$ is skew-symmetric representing conservative inter-modal coupling, and $D_k$ is a positive-definite diagonal encoding modal dissipation. This architectural decomposition enforces stability and enables interpretability through direct spectral access to the learned dynamics. On two-dimensional Navier--Stokes turbulence, the generator recovers the known dissipation scaling and a complete multi-branch dispersion relation from trajectory data alone with no physics supervision. Independently trained models at different flow regimes recover matched gauge-invariant spectral structure, exposing a gauge freedom in the Koopman lifting. Because the generator is provably stable, it enables guaranteed long-horizon stability, continuous-time evaluation at arbitrary time, and physics-informed cross-viscosity model transfer.
Lie Generator Networks Extract EIS-Grade Battery Diagnostics from Pulse Relaxation Data
ArXiv.org · 2026-05-14
articleOpen accessSenior authorElectrochemical impedance spectroscopy (EIS) is the most informative diagnostic for lithium-ion batteries: its frequency-resolved spectra decompose cell behavior into distinct electrochemical processes, revealing mechanism-specific degradation invisible to voltage and resistance measurements. Yet EIS requires dedicated hardware and minutes-long acquisitions incompatible with field deployment. Here we show that Lie Generator Networks (LGN), a structure-preserving identification framework, extract electrochemical time constants from 60 seconds of post-pulse voltage relaxation, data that battery management systems already collect, that encode the same diagnostic and prognostic information as impedance spectra. LGN learns the generator matrix of the relaxation dynamics with stability guaranteed by architecture, yielding time constants precise enough to resolve electrochemical variation that conventional curve fitting cannot detect from identical data. Across five datasets totaling over 850 cells, four institutions, and multiple chemistries, LGN tracks degradation with near-perfect rank correlation ($|ρ_s| = 0.999$), enables cross-validated reconstruction of full Nyquist spectra at 2% median error across 227 cells, predicts which capacity-matched cells fail first from three early diagnostics, and recovers Arrhenius activation energies with zero physics priors without retraining or cell-specific tuning. LGN requires no training data, no impedance hardware, and no chemistry-specific calibration, converting any existing relaxation pulse into an impedance-grade diagnostic. This enables real-time health monitoring, rapid second-life grading, production-line quality control, and physics-informed prognosis from minutes of measurement.
Interpretable Physics Extraction from Data for Linear Dynamical Systems using Lie Generator Networks
arXiv (Cornell University) · 2026-03-28
articleOpen accessSenior authorWhen the system is linear, why should learning be nonlinear? Linear dynamical systems, the analytical backbone of control theory, signal processing and circuit analysis, have exact closed-form solutions via the state transition matrix. Yet when system parameters must be inferred from data, recent neural approaches offer flexibility at the cost of physical guarantees: Neural ODEs provide flexible trajectory approximation but may violate physical invariants, while energy preserving architectures do not natively represent dissipation essential to real-world systems. We introduce Lie Generator Networks (LGN), which learn a structured generator A and compute trajectories directly via matrix exponentiation. This shift from integration to exponentiation preserves structure by construction. By parameterizing A = S - D (skew-symmetric minus positive diagonal), stability and dissipation emerge from the underlying architecture and are not introduced during training via the loss function. LGN provides a unified framework for linear conservative, dissipative, and time-varying systems. On a 100-dimensional stable RLC ladder, standard derivative-based least-squares system identification can yield unstable eigenvalues. The unconstrained LGN yields stable but physically incorrect spectra, whereas LGN-SD recovers all 100 eigenvalues with over two orders of magnitude lower mean eigenvalue error than unconstrained alternatives. Critically, these eigenvalues reveal poles, natural frequencies, and damping ratios which are interpretable physics that black-box networks do not provide.
Thermally-Enhanced Quantum Efficiency of a Hot Electron Laser Assisted Cathode
2025-04-14
articleSenior authorWe experimentally investigated the impact on electron emission of adding thermal energy to a Hot Electron Laser Assisted Cathode (HELAC). We observed an increase in photo-gated emission with temperature up to a certain point where the photo-gated emission mechanism was destroyed due to insulator breakdown in the device. We observed that this breakdown is imminent after a sudden increase in device current. We hypothesize this results from ohmic heating thermal runaway in the insulator, permanently damaging the device's response to absorbed photons. We aim to more precisely predict and avoid this breakdown, while operating the HELAC in a temperature range where sustainable improvement in quantum efficiency is feasible.
Recent grants
NSF · $350k · 2020–2024
Heterogeneous III-V CMOS on Si via Direct Growth
NSF · $300k · 2016–2021
Frequent coauthors
- 169 shared
Ali Javey
University of California, Berkeley
- 103 shared
Kuniharu Takei
- 60 shared
Maxwell Zheng
- 47 shared
Ragib Ahsan
University of Southern California
- 45 shared
Hyun Uk Chae
University of Southern California
- 41 shared
Debarghya Sarkar
Indian Institute of Astrophysics
- 39 shared
Min Hyung Lee
- 39 shared
Joel W. Ager
Lawrence Berkeley National Laboratory
Labs
Education
- 2005
Ph.D., Electrical Engineering
University of Southern California
- 2001
M.S., Electrical Engineering
University of Southern California
- 1999
B.S., Electrical Engineering
University of Southern California
Awards & honors
- David J. Sakrison Memorial Prize (2013)
- National Science Foundation Graduate Research Fellow (2008-2…
- UC Berkeley Venture Lab Competition Winner (2009)
- R. Earle Wright Endowed Presidential Scholar (2007)
- Distinguished College Scholar (2006, 2007, 2008)
- Resume-aware match score
- Save to shortlist
- AI-drafted outreach
See your match with Rehan Kapadia
PhdFit ranks faculty by your research interests, methods, and publications — grounded in their actual work, not templates.
- Free to start
- No credit card
- 30-second signup