Nandan Kumar Jha
Privacy-Preserving Machine Learning
AERO: Entropy-Guided Attention for Private LLM Inference
Develops entropy-guided attention and hierarchical entropy regularization for efficient private LLM inference with reduced nonlinearities.
Nandan Kumar Jha, Brandon Reagen