The New Fintech That Knows Whether Your Voice Matches Your Face
“We are like a musical instrument,” he says. “Our voice comes through our lungs and throat, and it’s affected by the shape of our face, the mouth cavity, the nasal canals, the lips, all this area. Basically, we train our model to distinguish between all those differences.”
The AI learns, by processing countless examples, which faces match which voice. And which don’t.
“If you do it multiple times, thousands and tens of thousands of times, you can understand what a voice looks like,” says Halido.
“Our goal basically is to sketch a person’s face, but I will be very honest with you. We still need a lot, a lot of data to do so.”
Moving to Commercial Mode
The technology is based on 2019 research at MIT (Massachusetts Institute of Technology) into the correlation between voice and face.
Researchers found that a quick snatch of audio reveals a huge amount of data about a person’s gender, age, ethnicity, skin tone, nose and jaw shape, and more.
Corsound was founded in June 2020, as a subsidiary of Tel Aviv-based AI company Cortica, to develop ways to implement that technology. It currently has 17 staff in Tel Aviv, most of them working in R&D based on over 200 AI patents belonging to its parent company.
Corsound has so far raised $3.5 million, from Canadian venture capital firm Awz Ventures and from the Israel Innovation Authority.
With a few POCs (proofs of concept) ready, according to Halido, the company is now moving from development mode to commercial mode and seeking clients.
“It’s the first time, as far as we know, that there is a technology that can prevent identity theft,” he says.
The threats posed by generative AI are already huge, and they’re getting bigger.
“Banks should do everything to protect customers. And that’s why they need to embrace new technologies to understand the threats that come in with the AI technology.”
Synthetic Voice Alert
Although banks traditionally are quite conservative about embracing technology, he says, bank and finance companies are Corsound’s biggest target market.
The company demonstrated its technology to potential bank customers in February at the Finovate 2024 conference in London, with simulated calls from customers.
In the first, an Australian woman calls to transfer $2,000 to her husband. The voice is a fake, but it fools existing call technology which, according to Corsound, currently has a 90 percent market share.
The same recording is then played to Corsound’s technology and is immediately flagged as “Fraud: Synthetic Voice Alert” because it knows the voice has been digitally manipulated.
Halido then simulates the onboarding process for a new bank customer using his company’s technology.
“My identity is secure because my voice is my password,” he tells the bank’s computer. “Please verify me.” Corsound instantly verifies his voice as genuine – because it is.
He then uploads a “stolen picture” – an image of company CEO Gal Haselkorn – rather than a picture of himself.
The AI immediately rejects his application because it knows that the face he’s provided does not match the voice.
Corsound’s technology will, says Halido, save the finance and banking industry billions of dollars a year in fraudulent transactions.
The voice-to-face technology also has wider applications and could revolutionize law-enforcement investigations, generating facial structures and facial sketches just from voice recordings.
John Jeffay is journalist. This article is published courtesy of Israel21c.