Conference spotlight on DeepReDuce, a set of optimizations that selectively removes ReLU operations to lower the cost of cryptographically secure private inference while preserving model quality.
ICML 2021 spotlight talk on criticality-based ReLU reduction for fast private inference.