Nandan Kumar Jha
Nandan Kumar Jha
Home
Research
Publications
Highlights
Talks
Media
Contact
Private Inference
Characterizing and Optimizing End-to-End Systems for Private Inference
Studies private inference as an end-to-end systems problem, identifying bottlenecks across the full stack and optimizing performance beyond isolated model changes.
Karthik Garimella
,
Zahra Ghodsi
,
Nandan Kumar Jha
,
Siddharth Garg
,
Brandon Reagen
PDF
Cite
Code
Poster
Entropy and Private Language Models
Invited seminar on entropy dynamics and efficient private language-model inference.
Apr 1, 2025 2:00 PM — 3:00 PM
New York University
Slides
Video
DeepReDuce: ReLU Reduction for Fast Private Inference
ICML spotlight talk on criticality-based ReLU reduction for fast private inference.
Jul 1, 2021
Virtual Conference
Slides
Video
Cite
×