🎯 Working at Netflix as a Research Scientist.
🎯 I got my PhD @ KTH Royal Institute of Technology, Stockholm, Sweden, supervised by Gustav Eje Henter and Jonas Beskow -> My Thesis
🎯 In Fall 2024, I was in Menlo Park, CA as a Research Scientist Intern at Meta.
🎯 In Summer 2024, I was in Seattle, WA as a Research Intern at Microsoft Research.
🔭 I work on speech synthesis with probabilistic generative models.
💬 Ask me about: Python, Deep Learning, Machine Learning, and Generative Modelling
📫 Reach me or read my blog at: https://shivammehta25.github.io/
💬 Open for collaborations and interesting projects!
My recent works:
⚡ MAGI: Multimodal Audio and Gesture, Integrated: https://shivammehta25.github.io/MAGI/
⚡ 🍵 Matcha-TTS: https://shivammehta25.github.io/Matcha-TTS/
⚡ Unified speech and gesture synthesis using flow matching: https://shivammehta25.github.io/Match-TTSG/
⚡ Diff-TTSG: https://shivammehta25.github.io/Diff-TTSG/
⚡ OverFlow: https://shivammehta25.github.io/OverFlow
⚡ Neural HMM TTS: https://shivammehta25.github.io/Neural-HMM





