CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Project Overview

VoiceLinkVR Server is a Python-based voice transcription and translation service that provides:

Speech recognition using Whisper and SenseVoice
Multi-language translation (LibreTranslate + online services)
User management and authentication (JWT-based)
Rate limiting and request logging
Web management interface

Architecture: Modern FastAPI application (migrated from Flask) with async support, dependency injection, and automatic API documentation.

Development Commands

Local Development

# Install dependencies
pip install -r src/requirements.txt

# Run development server with hot reload
python run.py

# Run with custom workers (production mode)
UVICORN_WORKERS=4 python run.py

# Run with specific reload setting
UVICORN_RELOAD=true python run.py

Testing

# Test translation functionality
python test_translate.py

Docker Deployment

# CPU-only deployment
docker-compose -f docker-compose-cpu.yml up -d

# CUDA deployment (requires CUDA 12.2+)
docker-compose -f docker-compose-cuda.yml up -d

# China mirror deployments
docker-compose -f docker-compose-cpu-cn.yml up -d
docker-compose -f docker-compose-cuda-cn.yml up -d

Production Server

# Direct uvicorn command
uvicorn src.main:app --host 0.0.0.0 --port 8980 --workers 4

Architecture & Code Structure

Core Architecture Pattern

Framework: FastAPI with async/await support
Database: SQLAlchemy ORM with SQLite/MySQL support
Authentication: JWT tokens using python-jose
Password Hashing: Werkzeug (bcrypt)
Rate Limiting: slowapi with Redis backend
Template Engine: Jinja2
Server: Uvicorn ASGI server

Directory Structure

src/
├── core/           # Core functionality
│   ├── config.py   # Pydantic settings management
│   ├── dependencies.py # FastAPI dependency injection
│   ├── services.py # Business logic (translation, audio processing)
│   ├── rate_limiter.py  # Rate limiting with Redis/memory backend
│   ├── logging_config.py # Logging configuration
│   ├── text_compressor.py # Text compression utilities
│   └── translation_service.py # Translation service abstraction
├── db/             # Database layer
│   ├── base.py     # SQLAlchemy base configuration
│   └── models.py   # ORM models (User, RequestLog)
├── routers/        # API routes
│   ├── api.py      # Main API endpoints (/api/*)
│   ├── manage_api.py # Admin API endpoints (/manageapi/*)
│   └── ui.py       # Web interface routes (/ui/*)
├── schemas/        # Pydantic data models
├── templates/      # HTML templates
├── static/         # Static files
├── main.py         # FastAPI app entry point
└── requirements.txt # Python dependencies

Key Dependencies

fastapi: Modern web framework
uvicorn[standard]: ASGI server
sqlalchemy: ORM and database toolkit
python-jose[cryptography]: JWT implementation
slowapi: Rate limiting
httpx: Async HTTP client
translators: Translation services integration
openai: OpenAI client for Whisper API
apscheduler: Background task scheduling

External Service Dependencies

Whisper Service: Speech recognition (default: localhost:8000)
SenseVoice Service: Alternative speech recognition (default: localhost:8800)
LibreTranslate: Local translation service (default: localhost:5000)
Redis: Rate limiting backend (optional)

Key Implementation Details

Configuration Management

All configuration is managed through src/core/config.py using Pydantic Settings:

Environment variables take precedence over defaults
.env file support for local development
Service URLs, API keys, database paths, rate limiting settings

Authentication Flow

Users login via /api/login with username/password
Server returns JWT access token (7-day expiry)
Subsequent API calls include token in Authorization header
Admin users have is_admin=True flag in database

Rate Limiting

Global rate limiting via slowapi middleware
Per-user rate limiting based on limit_rule field
Redis backend for distributed rate limiting
Default limits: "10000/day;1000/hour"

Request Logging

All API requests automatically logged to request_log table
Includes username, IP, endpoint, duration, status
Middleware-based implementation in src/main.py

Translation Services

Local LibreTranslate: Primary translation service
Online Translators: Fallback via translators library
Service Priority: Configurable via TRANSLATOR_SERVICES_LIST
Timeout Handling: 1.5s timeout with failover

Audio Processing

Supports WAV and OPUS audio formats
OPUS decoding via opuslib
Automatic format detection and conversion
Content filtering for error results
Text compression for repeated characters (configurable)

Scheduled Tasks

User Expiration Check: Daily at midnight UTC, disables expired users
Filter Config Update: Weekly on Monday at 3:00 AM UTC, updates filter rules from web

Common Development Tasks

Adding New API Endpoints

Add route to appropriate router file (api.py or manage_api.py)
Use Pydantic models for request/response validation
Implement authentication with Depends(get_current_user)
Add rate limiting decorator if needed

Database Schema Changes

Update models in src/db/models.py
Add migration logic to check_and_update_db() in main.py
Test with existing database to ensure compatibility

Modifying Business Logic

Core functions go in src/core/services.py
Use async/await for I/O operations
Add proper error handling and logging
Test with test_translate.py for translation functions

Template Modifications

HTML templates in src/templates/
Use request.url_for() for URL generation
Access session data via request.session
Pass messages through template context

Environment Variables

Key environment variables for configuration:

WHISPER_HOST, WHISPER_PORT: Whisper service location
LIBRETRANSLATE_HOST, LIBRETRANSLATE_PORT: Translation service
JWT_SECRET_KEY: JWT signing key (change in production)
SQL_PATH: Database connection string
LIMITER_REDIS_URL: Redis for rate limiting
UVICORN_WORKERS: Number of worker processes
UVICORN_RELOAD: Enable hot reload (development only)

Important Notes

Security: Change default JWT_SECRET_KEY in production
Database: SQLite default, MySQL supported via SQLALCHEMY_DATABASE_URL
Rate Limiting: Requires Redis for production deployments
Service Dependencies: Ensure Whisper and LibreTranslate services are running
First Admin: First login creates admin account automatically
User Expiration: Daily cron job disables expired users at midnight UTC
这个项目不要本地验证，让我手动验证
请不需要执行测试，我会手动测试并将测试结果发给你

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLAUDE.md

Project Overview

Development Commands

Local Development

Testing

Docker Deployment

Production Server

Architecture & Code Structure

Core Architecture Pattern

Directory Structure

Key Dependencies

External Service Dependencies

Key Implementation Details

Configuration Management

Authentication Flow

Rate Limiting

Request Logging

Translation Services

Audio Processing

Scheduled Tasks

Common Development Tasks

Adding New API Endpoints

Database Schema Changes

Modifying Business Logic

Template Modifications

Environment Variables

Important Notes

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

CLAUDE.md

Project Overview

Development Commands

Local Development

Testing

Docker Deployment

Production Server

Architecture & Code Structure

Core Architecture Pattern

Directory Structure

Key Dependencies

External Service Dependencies

Key Implementation Details

Configuration Management

Authentication Flow

Rate Limiting

Request Logging

Translation Services

Audio Processing

Scheduled Tasks

Common Development Tasks

Adding New API Endpoints

Database Schema Changes

Modifying Business Logic

Template Modifications

Environment Variables

Important Notes