I wrote this article which covers speech synthesis...
# share-your-work
k
I wrote this article which covers speech synthesis ML, with a focus on the most challenging + interesting problem in speech: prosody generation. Understanding this is key to building multi-modal input/output for LLMs
Teaching computers to talk: the prosody problem
https://www.papercup.com/blog/realistic-synthetic-voices If anyone’s interested in the topic hit me up 🤩