Über den Autor
Dr. Manas A. Pathak received the BTech degree in computer science from Visvesvaraya National Institute of Technology, Nagpur, India, in 2006, and the MS and PhD degrees from the Language Technologies Institute at Carnegie Mellon University (CMU) in 2009 and 2012 respectively. He is currently working as a research scientist at Adchemy, Inc. His research interests include intersection of data privacy, machine learning, speech processing.
Thesis Overview.- Speech Processing Background.- Privacy Background.- Overview of Speaker Verification with Privacy.- Privacy-Preserving Speaker Verification Using Gaussian Mixture Models.- Privacy-Preserving Speaker Verification as String Comparison.- Overview of Speaker Indentification with Privacy.- Privacy-Preserving Speaker Identification Using Gausian Mixture Models.- Privacy-Preserving Speaker Identification as String Comparison.- Overview of Speech Recognition with Privacy.- Privacy-Preserving Isolated-Word Recognition.- Thesis Conclusion.- Future Work.- Differentially Private Gaussian Mixture Models.
This thesis discusses the privacy issues in speech-based applications such as biometric authentication, surveillance, and external speech processing services. Author Manas A. Pathak presents solutions for privacy-preserving speech processing applications such as speaker verification, speaker identification and speech recognition.
The author also introduces some of the tools from cryptography and machine learning and current techniques for improving the efficiency and scalability of the presented solutions. Experiments with prototype implementations of the solutions for execution time and accuracy on standardized speech datasets are also included in the text. Using the framework proposed may now make it possible for a surveillance agency to listen for a known terrorist without being able to hear conversation from non-targeted, innocent civilians.
Nominated as outstanding PhD thesis from Carnegie Mellon University Develops an efficient computational framework, making it possible to create speech processing applications such as voice biometrics, mining and speech recognition that are privacy-preserving Presents a technology solution, which would allow a user to utilize an IVR system without fear that the system could learn undesired information, such as gender or nationality, or be able to record and edit voice