Policymakers may still be catching up to deepfakes, but biometric liveness detection developers are much further ahead, according to a recent online presentation from Pindrop. VP of Research Elie Khoury explains that the company looks for both perceivable and imperceivable signs of synthetic speech.
An example of the former is the struggle some text-to-speech AI models have with fricatives, consonant sounds such as “s” and “f.” The text analysis, acoustic model and vocoder components that make up text-to-speech systems also tend to leave behind telltale signs that the human ear cannot hear but that give away their output as the product of generative AI. Khoury and Pindrop VP of Product, Research and Engineering Amit Gupta discussed the advantages of this kind of algorithmic analysis over relying on watermarks.