Microsoft Text to Speech Demo

News

Microsoft's VibeVoice uses AI to create 90-minute podcasts with multiple speakers

VibeVoice is a new open-source AI tool that can generate a full 90 minute audio podcast recording with multiple speakers from ...

Slator3d

Microsoft Research Unveils VibeVoice for Long-Form Speech Synthesis

Microsoft’s VibeVoice is an open-source text-to-speech model for podcast-length, multi-speaker audio that captures the ...

7don MSN

Microsoft’s new AI can turn plain text into a full podcast — and it’s freakishly good at it

"VibeVoice is a novel framework designed for generating expressive, long-form, multi-speaker conversational audio, such as ...

New Atlas2y

Microsoft's new VALL-E AI can capture your voice in 3 seconds

Microsoft researchers have presented an impressive new text-to-speech AI model, called Vall-E, which can listen to a voice for just a few seconds, then mimic that voice – including the emotional ...

techtimes2y

Microsoft Reveals Latest Text-To-Speech AI Research, VALL-E

Microsoft has revealed its latest research in text-to-speech AI with VALL-E, as reported by Engadget. VALL-E can simulate someone's voice from only a three-second audio sample.

Engadget2y

Microsoft's VALL-E AI can mimic any voice from a short audio sample

Microsoft has shown off its latest research in text-to-speech AI with a model called VALL-E that can simulate someone's voice from just a 3-second audio sample.

Ars Technica2y

Microsoft’s new AI can simulate anyone’s voice with 3 seconds of ...

Microsoft’s new AI can simulate anyone’s voice with 3 seconds of audio Text-to-speech model can preserve speaker's emotional tone and acoustic environment.

Fudzilla9d

Microsoft builds its own AI models

Copilot gets high-speed voice generation while foundation model debuts on LMArena Software King of the World, Microsoft has trundled out two new in-house AI efforts, one already talking inside Copilot ...

Engadget1y

LinkedIn adds accessibility features with the help of Microsoft’s ...

LinkedIn has added new accessibility options. Features like text-to-speech and real-time translations should make it easier for more users to engage with articles and newsletters.

InfoWorld6mon

Microsoft’s Phi-4-multimodal AI model handles speech, text, and video

The new small language model can help developers build multimodal AI applications for lightweight computing devices, Microsoft says.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results