Speech Recognition - Deploy It Using Gradio

This project implements a speech recognition system using OpenAI's Whisper model and a dataset from Hugging Face. The model is deployed using Gradio, providing an interactive web-based interface for real-time speech-to-text conversion.

Introduction

This project leverages Whisper, a state-of-the-art speech recognition model, to transcribe audio into text. The model is fine-tuned using a dataset from Hugging Face and is deployed with Gradio, allowing users to easily test and interact with the system.

Features

✅ Uses Whisper model for high-accuracy speech recognition
✅ Pretrained dataset from Hugging Face
✅ Real-time transcription with Gradio web interface
✅ Supports multiple audio formats (WAV, MP3, etc.)
✅ Easy deployment and integration

Dataset & Model

Model Used: OpenAI Whisper
Dataset Source: Hugging Face Speech Datasets
Training: Fine-tuned on diverse speech samples to enhance accuracy

Installation & Setup

1. Clone the Repository

git clone https://github.com/your-username/Speech-Recognition-Deploy-It-Using-Gradio.git
cd Speech-Recognition-Deploy-It-Using-Gradio

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
model-inference.ipynb		model-inference.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech Recognition - Deploy It Using Gradio

Introduction

Features

Dataset & Model

Installation & Setup

1. Clone the Repository

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Speech Recognition - Deploy It Using Gradio

Introduction

Features

Dataset & Model

Installation & Setup

1. Clone the Repository

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages