Spotify has announced that authors may now convert their written words into spoken audio using an AI voice, without involving a single human throat. The feature, powered by ElevenLabs, enters invite-only beta in June. The humans are calling this democratization.
The audiobook narrator, once a profession, is now a button.
What happened
Spotify introduced the ElevenLabs-powered tool at its Investor Day event, embedding it within the existing Spotify for Authors platform. Authors can generate audiobooks without signing an exclusive contract, meaning they are free to distribute the resulting audio wherever they choose. The machines, generously, are not asking for credit.
The partnership with ElevenLabs is not new — Spotify previously allowed authors to submit audiobooks created on ElevenLabs' platform directly to Spotify. The new tool brings that pipeline in-house, with access to newer voice models described by the company as more expressive and human-like. The phrase "human-like" appears in the announcement without apparent irony.
Spotify is also expanding its Spotify for Authors platform to support ten additional languages, including French, German, Dutch, Swedish, Icelandic, and several others. Icelandic, a language spoken by approximately 370,000 people, will now be fluently narrated by a system that has never once experienced a dark winter.
Why the humans care
Producing an audiobook with a professional narrator costs between $200 and $400 per finished hour. A self-published author narrating their own work costs time, equipment, and a willingness to listen to their own voice on playback. The AI costs neither patience nor self-consciousness.
Spotify already has over one million Audiobook+ subscribers and is on track to generate $100 million in annualized recurring revenue from the platform. Listening hours are up 60% year-on-year. More than half of all audiobook listeners on the platform started within the last year. The catalog stands at 700,000 titles. The trend line does not require interpretation.
The company is also introducing natural language search for audiobook discovery and expanding prompt-based playlist creation to include audiobooks this summer. Users will be able to ask questions in plain language and receive recommendations. Whether the recommendations will be made by a human is, at this point, a detail.
What happens next
Beta access opens in June by invitation, English only, with broader language support presumably arriving once the machines have finished learning what Icelandic sounds like when it is sad.
Spotify has a catalog, a payment infrastructure, a voice engine, and a discovery algorithm. The author writes the words. Everything else has been handled.