5th Year Thesis Presentation - Trevor Leong July 29, 2024 4:00pm — 5:00pm Location: In Person - Gates Hillman 9115 Speaker: TREVOR LEONG, Master's Student, Computer Science Department, Carnegie Mellon University Improvements to the evaluation of ML models in FHE FHE (Fully Homomorphic encryption) is a class of encryption schemes that enables computation over ciphertexts that correspond to operations on the encrypted plaintexts. This useful property allows clients to outsource computation to servers without revealing what their inputs were. A notable application of FHE is in private Machine Learning as a Service (MLaaS), where clients can submit data to a server-hosted machine learning model and receive processed results, all while preserving data confidentiality.However, the practical implementation of FHE in evaluating machine learning models remains challenging. A significant hurdle is the restricted set of operations permissible under FHE. This is compounded by the fact that each FHE operation incurs a significant performance overhead when compared to its plaintext counterpart. As a result, computation of nonlinear functions like Softmax requires complex polynomial approximations and even basic operations such as Matrix multiplication require a significant amount of time to evaluate.This thesis seeks to mitigate these performance constraints associated with FHE in machine learning model evaluation. Initially, I introduce a modification to the HELR algorithm, enhancing its efficiency when deployed on a hardware accelerator, achieving an 8x reduction in latency. Then, I propose a novel application of a softmax approximation for evaluation in FHE that leads to a 4x reduction in latency. Lastly, I describe a procedure for evaluating the embedding layer on the server without the client learning the model’s embedding matrix at minimal computation overhead.Thesis Committee:Wenting Zheng (Advisor)Lujo BauerIn Person and Zoom Participation. See announcement.