- PreSeed Now
- Posts
- This startup wants to clone your voice
This startup wants to clone your voice
Eleven Labs can get you to say anything, in any language
PreSeed Now brings you an in-depth profile of a different B2B or deep tech startup every Tuesday and Thursday. Subscribe to get it straight to your inbox.
Hello there,
In today’s edition we’ve got another chunky startup profile for you, and if you’re a paying PreSeed Now member, a bonus startup just for you at the end of the newsletter.
Synthetic voice tech is an interesting field for startups right now. You might have seen the startup that controversially makes call centre staff sound like white Americans. And more positively, there’s Respeecher, which emulated James Earl Jones’ iconic Darth Vader voice for a new Star Wars project.
There’s a lot of potential in voice cloning technology, and London-based Eleven Labs is one startup in the space well worth looking at. Scroll down to read all about them.
I won’t even mention the scary word ‘deepfake’ once (except that time), because I respect you enough as a PreSeed Now reader to separate the technology’s wider potential from its potential misuses.
Our bonus member-only startup, meanwhile, is in the A.I.-enhanced healthtech space. If you’re a member, you’ll find them at the end of the email.
– Martin
Eleven Labs can get you to say anything, in any language
Dubbing video content into other languages is an important business. It opens up new markets for films, TV shows, and online videos.
But there’s a reason many real fans prefer to watch the original with subtitles over a dubbed version. Listen to a dubbed animé film next to the Japanese original and you’ll understand what I mean - a lot of the original’s flavour is lost once well-meaning local actors get involved.
So what if you could clone the voices of the original actors, and make those cloned voices fluent in any language? That’s the offering London-based Eleven Labs has cooked up.
“We want to make content available in any voice and in any language… preserving the same emotions, the same intonation, the same speaker voices… in Spanish or Polish, or any other language that you speak and understand,” explains Eleven Labs co-founder Mati Staniszewski.
Dub be good to me
Eleven Labs sets itself apart from others in the space because of its ability to generate cloned audio in multiple languages from a small sample (a few minutes or even a few seconds) of the original voice.
Staniszewski says the process is similar to traditional dubbing, in which a transcript is manually translated for a voice actor to perform. Eleven Labs performs automatic translation, which can be manually tweaked if any corrections are needed. It’s just that the original audio serves as the ‘voice actor’.
If the original audio emphasises certain words, or conveys a particular emotion in their voice, this will be carried over to the translated version.
The technology works with a few seconds of source audio, but Staniszewski says around five minutes of high-quality audio provides the best results. This recording is then interpolated with data from around 30,000 voices in the startup’s data set to create a version of your voice that can say anything.
Eleven Labs demos show how emulations of celebrities’ voices can be dubbed onto an original. You can hear a few artifacts in the emulated audio, but it’s an impressive effect, and the startup continues to refine the tech.
Of course, A.I. models can’t necessarily get everything right. It’s easy to imagine a word pronounced a certain way in one language meaning something completely different in another when spoken in that way. Staniszewski agrees there are many nuances that the A.I. will need to learn over time, but they already train the technology on audio in multiple languages to pick up on issues like this.
The Eleven Labs story begins 12 years ago, when Staniszewski met his future co-founder Piotr Dabkowski in Poland. They both moved to the UK, with Dabkowski working as a software engineer at companies like Google, while Staniszewski worked on the business side of companies including Palantir.
Staniszewski says the pair often talked of starting a company together, and collaborated on many hackathon projects over the years. Eventually they found their startup opportunity with the sudden increase in video calls in 2020.
“We decided to create a sentiment analysis of how we speak, the pronunciation that we use, how to make our pronunciation better, how to detect the emotions we use… We deployed that over a weekend,” says Staniszewski.
They later realised how they could take this tech further when Dabkowski and his girlfriend, who doesn’t speak English, wanted to watch the Polish-dubbed version of a movie. But Polish versions of movies are often voiced by a single ‘narrator’ (a ‘lektor’) who explains what is happening, rather than a cast of Polish actors performing the script.
The approach saves money, and is surprisingly quite popular in Poland. But it highlights the budgetary pressures around localising the increasing amount of TV, movie, and online video content audiences around the world might want to explore.
Staniszewski and Dabkowski explored other opportunities to use this tech, such as in countries where the local language is mandated to be used in a percentage of media content, but there might not be big budgets available to high quality voice actors. And thus Eleven Labs was formally established in May this year to explore the opportunities.
Turning it up to Eleven
Now a team of five, Eleven Labs launched a private beta that they tested with a group of YouTubers with large subscriber bases. They’ve taken insights from this test, and are refining the tech to make it more precise and scalable. They plan a November launch for an initial version of the voice cloning feature for the public to try.
Upgrade to read the rest of this article, and get access to our full archive of early-stage startup profiles, and our Startup Tracker database: 👇
Subscribe to Premium to read the rest.
Become a paying subscriber of Premium to get access to this post and other subscriber-only content.
Already a paying subscriber? Sign In.
A subscription gets you:
- • Full profiles of early-stage startups every Tuesday & Thursday: go deeper on each startup
- • Access to our acclaimed Startup Tracker database of early-stage UK startups