A Vite + ReactJS application for creating, editing, and sharing AI-powered podcasts and avatars.
PodVerse is an all-in-one podcasting toolkit built with Vite and React. It lets users convert text into podcast episodes, clone voices, remove background noise from audio, and generate lifelike talking avatars. Designed for content creators, educators, and hobbyists, PodVerse streamlines every step of podcast production and AI avatar creation in a single intuitive interface.
PodVerse addresses the growing need for accessible audio content creation tools by integrating four core features—Text-to-Podcast, Voice Cloning, Audio Background Remover, and AI Avatars—into one cohesive web application. Built on a modern stack (Vite, React, Node.js), it offers lightning-fast performance, a sleek UI, and end-to-end workflows from script to final audio or video avatar.
- Barrier to entry: Traditional podcasting requires recording equipment, editing software, and a production pipeline that can intimidate newcomers.
- Resource constraints: Small teams or solo creators often lack time, budget, or expertise to hire audio engineers or voice actors.
- Fragmented toolchain: Users must jump between multiple services (text-to-speech, noise-reduction, voice-cloning, avatar generation), each with its own integration challenges.
PodVerse unifies all steps of modern podcast and avatar production in one platform:
- Text-to-Podcast: Convert any text or PDF into a downloadable podcast episode in seconds.
- Voice Cloning Studio: Upload or record a voice sample (or pick from celebrity samples) and synthesize speech that matches it.
- Audio BG Remover: Clean up noisy recordings with a single click, yielding studio-quality audio.
- AI Avatars: Turn a photo into a talking digital avatar, synced to typed script or uploaded audio.
By centralizing these features, PodVerse minimizes tool-hopping, accelerates iteration, and empowers anyone to produce professional-grade audio and video content.
- Chats Sidebar:
- “New Chat” button (first message becomes chat title).
- Context menu to Share or Delete past chats.
- Conversation View:
- Header shows chat title.
- Audio player with play/pause, ±10s skips, progress bar, timestamps, share & download.
- Copy-text icon to quickly grab your prompt.
- Input Controls:
- Text input with the placeholder “Type your message here…”
- “Inspire” button for 4–5 random prompts.
- “Upload File” (PDF/text).
- Language selector (EN, HI, MR, GU, ES, FR, DE, IT, JP, ZH, RU, PT, KO, TR).
- “Select Duration” (30 s, 1 min, 2 min, 3 min).
- “Select Speaker 1/2” dropdowns (2–6 voices per language) with profile pictures, “Play Sample” buttons.
- “Send” to generate podcast—automatically saved in Chats.
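The input controls above map naturally onto a small request object that can be checked before “Send” triggers generation. The following is a minimal sketch, not PodVerse's actual code: the `PodcastRequest` shape, the `validatePodcastRequest` and `skipSeconds` names, and the rule that both speakers are required are all assumptions made for illustration. Only the language codes, the duration options, and the ±10 s skip come from the list above.

```typescript
// Language codes and durations copied from the input controls listed above.
const LANGUAGES: string[] = [
  "EN", "HI", "MR", "GU", "ES", "FR", "DE", "IT", "JP", "ZH", "RU", "PT", "KO", "TR",
];
const DURATIONS_SECONDS: number[] = [30, 60, 120, 180];

// Hypothetical request shape; the real payload sent by PodVerse may differ.
interface PodcastRequest {
  text: string;
  language: string;
  durationSeconds: number;
  speaker1: string;
  speaker2: string;
}

// Returns a list of problems; an empty list means the request passed every check.
function validatePodcastRequest(req: PodcastRequest): string[] {
  const problems: string[] = [];
  if (req.text.trim().length === 0) {
    problems.push("Text is empty.");
  }
  if (LANGUAGES.indexOf(req.language) === -1) {
    problems.push("Unsupported language: " + req.language);
  }
  if (DURATIONS_SECONDS.indexOf(req.durationSeconds) === -1) {
    problems.push("Unsupported duration: " + req.durationSeconds + " s");
  }
  // Assumption: both speaker dropdowns must have a selection.
  if (req.speaker1 === "" || req.speaker2 === "") {
    problems.push("Both speakers must be selected.");
  }
  return problems;
}

// Audio player skip: move by deltaSeconds and clamp the result to [0, duration].
function skipSeconds(currentTime: number, deltaSeconds: number, duration: number): number {
  return Math.min(Math.max(currentTime + deltaSeconds, 0), duration);
}
```

In a React component, `skipSeconds(audio.currentTime, -10, audio.duration)` could feed the rewind button, and the generate action could stay disabled while `validatePodcastRequest` returns any problems.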
- Tabs:
- Upload Voice: Upload WAV/MP3 or record directly in-browser.
- Sample Voices: Pre-cloned celebrities (e.g., Shahrukh Khan, Elon Musk).
- Workflow:
- Choose or upload a voice sample.
- Enter script text.
- Click Generate Voice.
- Download or play the cloned-voice audio.
- Upload/Record: Drag & drop or in-app recording.
- One-Click Cleanup: AI-powered noise reduction.
- Output: Studio-quality WAV download.
- Photo Upload: JPEG/PNG; auto-detects gender.
- Input Modes:
- Text → Speech: Type up to 500 characters.
- Audio → Lip-Sync: Upload WAV/MP3/M4A for exact tone mimicry.
- Generate Avatar:
- Outputs a video-format avatar in a built-in player.
- Saved automatically in sidebar for sharing.
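The two avatar input modes and their limits can be expressed as a discriminated union with a client-side check. This is a sketch under stated assumptions: the `AvatarInput` type, the `extensionOf` and `validateAvatarInput` helpers, and the choice to check file types by extension are hypothetical and not taken from the PodVerse source. The 500-character limit and the accepted formats come from the list above.

```typescript
const MAX_SCRIPT_CHARS = 500;
const PHOTO_EXTENSIONS: string[] = ["jpg", "jpeg", "png"];
const AUDIO_EXTENSIONS: string[] = ["wav", "mp3", "m4a"];

// Hypothetical input shape: one variant per input mode.
type AvatarInput =
  | { mode: "text"; photoName: string; script: string }
  | { mode: "audio"; photoName: string; audioName: string };

// Lower-cased extension without the dot, or "" when the name has none.
function extensionOf(fileName: string): string {
  const dot = fileName.lastIndexOf(".");
  return dot === -1 ? "" : fileName.slice(dot + 1).toLowerCase();
}

// Returns a list of problems; an empty list means the input passed every check.
function validateAvatarInput(input: AvatarInput): string[] {
  const problems: string[] = [];
  if (PHOTO_EXTENSIONS.indexOf(extensionOf(input.photoName)) === -1) {
    problems.push("Photo must be JPEG or PNG.");
  }
  if (input.mode === "text") {
    if (input.script.length === 0) {
      problems.push("Script is empty.");
    } else if (input.script.length > MAX_SCRIPT_CHARS) {
      problems.push("Script exceeds " + MAX_SCRIPT_CHARS + " characters.");
    }
  } else if (AUDIO_EXTENSIONS.indexOf(extensionOf(input.audioName)) === -1) {
    problems.push("Audio must be WAV, MP3, or M4A.");
  }
  return problems;
}
```

Extension checks are only a convenience for early feedback in the browser; a backend would still need to inspect the uploaded file itself.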
📁 mypodcast
│ ├── 📁 .git
│ ├── 📁 api
│ ├── 📁 backend
│ ├── 📁 local_images
│ ├── 📁 node_modules
│ ├── 📁 public
│ ├── 📁 results
│ ├── 📁 src
│ ├── 📁 uploads
│ ├── 📄 .gitignore
│ ├── 📄 eslint.config.js
│ ├── 📄 index.html
│ ├── 📄 package.json
│ ├── 📄 package-lock.json
│ ├── 📄 PodVerse.png
│ ├── 📄 README.md
│ ├── 📄 tailwind.config.js
│ └── 📄 vite.config.js
📁 uploads
📄 .hintrc
Note: We include `.venv/` in version control to ensure `requirements.txt` is always available for reproducible backend builds.
- Independent Podcasters: Rapid prototyping of episodes without recording gear.
- Educational Content: Generate narration and avatars for e-learning modules.
- Marketing & Agencies: Quickly produce brand-voice podcasts and animated spokes-avatars.
- Accessibility: Text-based content can be turned into spoken form in multiple languages.
- Project Code License: All source code and assets of PodVerse are released under the MIT License, granting full reuse, modification, and distribution rights with attribution.
- Voice Samples License: Pre-cloned celebrity voices (e.g., Shahrukh Khan, Elon Musk) are provided under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. Users must give appropriate credit when using or redistributing these voice models.
- User-Provided Content:
- Text & PDFs: Ensure you hold the necessary copyrights or permissions for any uploaded text.
- Audio Files: You must own or have rights to any recorded or uploaded audio.
- Images: Uploaded photos for avatar generation should be your own or licensed appropriately.
- Privacy & Data Handling: User uploads are processed securely and are not stored beyond the session unless explicitly saved.
- Jash Thakkar (@JashT14)
- Neel Shah
- Ganesh Palav
Based on early-access feedback, upcoming features include:
- Favorite Voices: Mark and quickly access your top voices.
- Emotional Expressions: Add natural emotions (happy, sad, excited) to cloned voices.
- Multi-Avatar Podcasting: Host real-time conversations among multiple AI avatars.
- Podcast Creation with Cloned Voices: Script multi-voice episodes using cloned personas.
- Expanded Language & Speaker Library: Support for more languages and regional accents.
- Multilingual Voice Cloning: Clone voices outside EN-US (e.g., Hindi, Spanish).
License: MIT













