A modern, responsive UI for interacting with Large Language Models (LLMs). Built with React, TypeScript, Vite, and shadcn/ui (Radix primitives + Tailwind CSS), this application provides a beautiful interface for chatting with AI models through multiple providers.
- Modern UI: Built with shadcn/ui components (Radix UI + Tailwind CSS) for a clean, accessible interface
- Multiple AI Providers: Support for Ollama, LM Studio, Amazon Bedrock, Bedrock Mantle, Anthropic, Groq, and Cerebras
- MCP Server Integration: Connect external tools via Model Context Protocol — stdio, HTTP, and SSE transports
- Web Search: Built-in Tavily web search tool for grounding responses with real-time data
- Real-time Streaming: Stream responses from AI models in real-time with smooth animations
- Reasoning/Thinking Support: Display thinking process for reasoning models (MiniMax, DeepSeek-R1, NemoTron, etc.)
- Unified Model Selector: Searchable model picker in chat input, grouped by provider with unavailable services clearly flagged
- Inference Settings: Adjust temperature, top-p, and max tokens via popover in chat input
- Context Window Indicator: Track token usage against model context limits
- Prompt Optimizer: Optimize prompts for Claude 4.5 models using best practices (Bedrock only)
- Document Upload: Upload documents (PDF, TXT, HTML, MD, CSV, DOC, DOCX, XLS, XLSX) with Bedrock models
- Usage Metrics: View token usage and latency displayed after each AI response
- Chat History: Manage multiple chat sessions with automatic history tracking via IndexedDB
- Saved Prompts: Save and reuse frequently used prompts
- User Preferences: Persistent settings for preferred AI provider and custom avatar initials
- Dark/Light Mode: Toggle between visual modes
- Responsive Design: Works seamlessly across desktop and mobile devices
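The real-time streaming listed above typically arrives as newline-delimited JSON chunks that the client accumulates into the visible response. A minimal parser sketch; the `delta`/`done` chunk shape is an illustrative assumption, not this app's exact wire format:

```typescript
// Sketch: accumulate text from newline-delimited JSON (NDJSON) stream chunks.
// The { delta, done } shape is assumed for illustration only.
interface StreamChunk {
  delta?: string;
  done?: boolean;
}

function parseNdjsonBuffer(buffer: string): { text: string; done: boolean } {
  let text = "";
  let done = false;
  for (const line of buffer.split("\n")) {
    const trimmed = line.trim();
    if (!trimmed) continue; // skip blank keep-alive lines
    const chunk = JSON.parse(trimmed) as StreamChunk;
    if (chunk.delta) text += chunk.delta;
    if (chunk.done) done = true;
  }
  return { text, done };
}

const sample = '{"delta":"Hello"}\n{"delta":" world"}\n{"done":true}\n';
console.log(parseNdjsonBuffer(sample)); // accumulates "Hello world", done = true
```

In practice the UI would feed each network chunk through a parser like this and re-render the message as `text` grows.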
Connect external tools to your LLM conversations via the Model Context Protocol. Configure servers in Preferences → MCP Servers with support for local (stdio) and remote (HTTP/SSE) transports.
| MCP Tools Status | MCP Server Configuration |
|---|---|
| (screenshot) | (screenshot) |
- Toolbar indicator shows connected servers and available tools at a glance
- Security hardened — command allowlists, SSRF protection, env var isolation (details)
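An MCP server entry generally pairs a transport with either a launch command (stdio) or a URL (HTTP/SSE). The shape below is an illustrative sketch only; field names are assumptions, not this app's actual preferences schema:

```typescript
// Illustrative sketch of MCP server entries; field names are assumptions,
// not this app's actual configuration schema.
type McpTransport = "stdio" | "http" | "sse";

interface McpServerEntry {
  name: string;
  transport: McpTransport;
  command?: string; // stdio: executable to spawn (subject to the command allowlist)
  args?: string[];  // stdio: arguments passed to the command
  url?: string;     // http/sse: remote endpoint (subject to SSRF checks)
  env?: Record<string, string>; // isolated environment variables
}

const servers: McpServerEntry[] = [
  {
    name: "filesystem",
    transport: "stdio",
    command: "npx",
    args: ["-y", "@modelcontextprotocol/server-filesystem", "/tmp"],
  },
  { name: "remote-tools", transport: "sse", url: "https://example.com/mcp" },
];

console.log(servers.map((s) => `${s.name} (${s.transport})`).join(", "));
```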
Before running this application, ensure you have the following installed:
- Node.js: Version 18.x or higher
- npm: Version 9.x or higher (comes with Node.js)
- AI Provider (at least one):
- Ollama - Local AI models
- LM Studio - Local AI platform with model management
- Anthropic - Claude models (requires API key)
- Amazon Bedrock - AWS cloud AI service (requires AWS credentials)
- Bedrock Mantle - OpenAI-compatible Bedrock endpoint
- Groq - Fast cloud inference (requires API key)
- Cerebras - Cloud inference (requires API key)
1. Clone the repository:

   ```bash
   git clone https://github.com/praveenc/local-llm-ui.git
   cd local-llm-ui
   ```

2. Install dependencies:

   ```bash
   npm install
   ```
1. Download and install Ollama from ollama.com

2. Pull a model (e.g., qwen3-8b):

   ```bash
   ollama pull qwen3-8b-8k:latest
   ```

3. Verify Ollama is running:

   ```bash
   ollama list
   ```

   Ollama runs on http://localhost:11434 by default.
1. Download and install LM Studio from lmstudio.ai

2. Download a model through the LM Studio interface

3. Start the local server:
   - Open LM Studio
   - Go to the "Developer" or "Server" tab
   - Click "Start Server"
   - Ensure it's running on port 1234

4. (Optional) Enable JIT loading:
   - Go to Developer → Server Settings
   - Enable "JIT Loading" to load models on-demand
1. Set up AWS credentials using one of these methods:

   Option A: Environment variables

   ```bash
   export AWS_ACCESS_KEY_ID=your_access_key_id
   export AWS_SECRET_ACCESS_KEY=your_secret_access_key
   export AWS_REGION=us-west-2  # or your preferred region
   ```

   Option B: AWS CLI

   ```bash
   aws configure
   ```

   Option C: AWS credentials file

   Create `~/.aws/credentials`:

   ```ini
   [default]
   aws_access_key_id = your_access_key_id
   aws_secret_access_key = your_secret_access_key
   ```

2. Ensure IAM permissions: your AWS user/role needs these permissions:
   - `bedrock:ListInferenceProfiles`
   - `bedrock:InvokeModel` or `bedrock:InvokeModelWithResponseStream`

3. Request model access (if needed):
   - Go to the AWS Bedrock console
   - Navigate to "Model access"
   - Request access to desired models (e.g., Claude, Llama)
1. Get an API key from console.anthropic.com

2. Configure in the app's Preferences:
   - Enter your Anthropic API key
Bedrock Mantle provides an OpenAI-compatible endpoint for Bedrock models.
- Configure in the app's Preferences:
- Set your Mantle API key
- Set your Mantle region
1. Get an API key from console.groq.com

2. Configure in the app's Preferences:
   - Enter your Groq API key
1. Get an API key from cloud.cerebras.ai

2. Configure in the app's Preferences:
   - Enter your Cerebras API key
Start the development server with hot module replacement:

```bash
npm run dev
```

The application will be available at http://localhost:5173

Build the application for production:

```bash
npm run build
```

Preview the production build:

```bash
npm run preview
```

1. Start the application: run `npm run dev`

2. Select a model:
   - Click the model selector button in the chat input
   - Browse models grouped by provider
   - Use the search to filter models
   - If a provider is unavailable, the selector shows a brief setup or connection hint

3. Start chatting:
   - Type your message in the input field
   - Press Enter or click the send button
   - Watch the AI response stream in real-time
Adjust model parameters via the settings icon in the chat input:
- Temperature (0.0 - 1.0): Controls randomness
  - Lower values (0.1-0.3): More focused and deterministic
  - Higher values (0.7-1.0): More creative and varied
- Top P (0.0 - 1.0): Controls diversity via nucleus sampling
- Max Tokens: Maximum length of the response
Note for Claude 4.x Models: Claude 4.x models don't support both temperature and topP simultaneously. The UI provides a toggle to choose which parameter to use.
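One way to honor that exclusivity is to drop whichever parameter the user hasn't selected before the request is sent. A sketch under that assumption; the function and field names are illustrative, not this app's internals:

```typescript
// Sketch: build sampling parameters, sending only one of temperature/topP
// for Claude 4.x models. All names are illustrative.
interface InferenceSettings {
  temperature: number;
  topP: number;
  maxTokens: number;
  /** which knob the user chose for models that allow only one */
  activeSampler: "temperature" | "topP";
}

function buildParams(settings: InferenceSettings, isClaude4x: boolean) {
  const params: Record<string, number> = { maxTokens: settings.maxTokens };
  if (!isClaude4x) {
    // Most models accept both parameters together.
    params.temperature = settings.temperature;
    params.topP = settings.topP;
  } else if (settings.activeSampler === "temperature") {
    params.temperature = settings.temperature;
  } else {
    params.topP = settings.topP;
  }
  return params;
}

const s: InferenceSettings = { temperature: 0.7, topP: 0.9, maxTokens: 1024, activeSampler: "temperature" };
console.log(buildParams(s, true)); // only maxTokens and temperature are sent
```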
When using reasoning/thinking models (MiniMax, DeepSeek-R1, NemoTron, etc.):
- A collapsible "Thinking" section appears showing the model's reasoning process
- Click to expand/collapse the reasoning content
- Reasoning is persisted with the conversation
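Many reasoning models wrap their chain of thought in `<think>…</think>` tags before the visible answer. A minimal extraction sketch assuming that convention; actual delimiters vary by model:

```typescript
// Sketch: split "<think>...</think>" reasoning from the visible answer.
// Assumes the common <think> tag convention; other models use different markers.
function splitReasoning(raw: string): { reasoning: string | null; answer: string } {
  const match = raw.match(/<think>([\s\S]*?)<\/think>/);
  if (!match) return { reasoning: null, answer: raw.trim() };
  return {
    reasoning: match[1].trim(),               // goes in the collapsible section
    answer: raw.replace(match[0], "").trim(), // rendered as the reply
  };
}

const out = splitReasoning("<think>2+2 is 4</think>The answer is 4.");
console.log(out.reasoning); // "2+2 is 4"
console.log(out.answer);    // "The answer is 4."
```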
When using Amazon Bedrock models:
- Click the + button in the chat input
- Select "Add photos or files"
- Choose files (max 4.5 MB each)
- Supported formats: PDF, TXT, HTML, MD, CSV, DOC, DOCX, XLS, XLSX, images
- Send your message with the attached documents
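Client-side, the 4.5 MB cap and format list above amount to a simple pre-flight check. A sketch using the limits documented in this section; the helper name and the specific image extensions are illustrative assumptions:

```typescript
// Sketch: validate an attachment against the documented upload rules
// (4.5 MB per file, listed formats). Helper name is illustrative.
const MAX_FILE_BYTES = 4.5 * 1024 * 1024;
const ALLOWED_EXTENSIONS = new Set([
  "pdf", "txt", "html", "md", "csv", "doc", "docx", "xls", "xlsx",
  "png", "jpg", "jpeg", "gif", "webp", // common image types (assumed)
]);

// Returns an error message, or null when the file is acceptable.
function validateUpload(fileName: string, sizeBytes: number): string | null {
  const ext = fileName.split(".").pop()?.toLowerCase() ?? "";
  if (!ALLOWED_EXTENSIONS.has(ext)) return `Unsupported format: .${ext}`;
  if (sizeBytes > MAX_FILE_BYTES) return "File exceeds the 4.5 MB limit";
  return null;
}

console.log(validateUpload("report.pdf", 1024 * 1024)); // null (valid)
console.log(validateUpload("notes.exe", 10));           // "Unsupported format: .exe"
```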
The context indicator in the chat input shows:
- Current token usage vs model's context limit
- Hover for detailed breakdown (input/output tokens)
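The indicator reduces to a usage ratio over the model's context limit; a trivial sketch:

```typescript
// Sketch: percentage of a model's context window consumed so far,
// clamped to 100 for display.
function contextUsagePercent(inputTokens: number, outputTokens: number, contextLimit: number): number {
  if (contextLimit <= 0) return 0;
  const used = inputTokens + outputTokens;
  return Math.min(100, Math.round((used / contextLimit) * 100));
}

console.log(contextUsagePercent(6_000, 2_000, 32_000)); // 25
```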
Save frequently used prompts:
- Open the Saved Prompts panel from the sidebar
- Create new prompts with categories
- Click to insert saved prompts into the chat input
```text
local-llm-ui/
├── src/
│   ├── components/
│   │   ├── ai-elements/    # AI UI components (conversation, message, prompt-input, reasoning, context)
│   │   ├── chat/           # Chat container and related components
│   │   ├── layout/         # Layout components
│   │   ├── prompts/        # Saved prompts components
│   │   ├── sidebar/        # Sidebar components
│   │   ├── shared/         # Shared components
│   │   └── ui/             # shadcn/ui components
│   ├── db/                 # Dexie.js database schema
│   ├── hooks/              # React hooks
│   ├── layout/             # App shell and layout
│   ├── services/           # API services for each provider
│   ├── types/              # TypeScript types
│   ├── utils/              # Utility functions
│   └── main.tsx            # Application entry point
├── server/
│   ├── aisdk-proxy.ts            # Groq/Cerebras proxy
│   ├── anthropic-aisdk-proxy.ts  # Anthropic proxy
│   ├── bedrock-aisdk-proxy.ts    # Bedrock AI SDK proxy
│   ├── bedrock-proxy.ts          # Bedrock models proxy
│   ├── lmstudio-aisdk-proxy.ts   # LM Studio chat proxy
│   ├── lmstudio-proxy.ts         # LM Studio SDK proxy
│   ├── mantle-proxy.ts           # Bedrock Mantle proxy
│   ├── mcp-manager.ts            # MCP client lifecycle and tools
│   ├── ollama-aisdk-proxy.ts     # Ollama proxy
│   └── security.ts               # Shared security utilities
├── public/                 # Static assets
├── vite.config.ts          # Vite configuration with proxy middleware
└── package.json            # Dependencies and scripts
```
Create a `.env` file for custom configuration (optional):

```bash
# AWS Configuration (if not using AWS CLI or credentials file)
AWS_REGION=us-west-2
# AWS_ACCESS_KEY_ID=your_access_key_id
# AWS_SECRET_ACCESS_KEY=your_secret_access_key
```

The Vite development server proxies requests to AI services:

- `/api/ollama` → `http://localhost:11434`
- `/api/lmstudio` → `http://localhost:1234`
- `/api/bedrock`, `/api/bedrock-aisdk` → Server-side AWS SDK proxy
- `/api/anthropic` → Anthropic Claude proxy
- `/api/mantle` → Bedrock Mantle proxy
- `/api/aisdk` → Groq/Cerebras proxy
- `/api/lmstudio-sdk`, `/api/lmstudio-aisdk` → LM Studio proxies
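The route table above boils down to prefix matching plus a target rewrite. A pure sketch of that mapping for the two local providers; the rewrite behavior is an assumption about this app's exact setup, which lives in `vite.config.ts`:

```typescript
// Sketch of dev-server proxy routing for the local providers.
// Targets match the README; the path rewrite is an assumption.
const proxyTable: Record<string, { target: string }> = {
  "/api/ollama": { target: "http://localhost:11434" },
  "/api/lmstudio": { target: "http://localhost:1234" },
};

// Map an incoming dev-server path to the upstream URL it would be proxied to.
function resolveProxy(path: string): string | null {
  for (const [route, { target }] of Object.entries(proxyTable)) {
    if (path.startsWith(route)) return target + path.slice(route.length);
  }
  return null;
}

console.log(resolveProxy("/api/ollama/api/tags")); // "http://localhost:11434/api/tags"
```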
Problem: The model selector shows no models
Solutions:
- Check the unavailable provider messages shown in the model selector for the exact issue
- Ollama: Ensure Ollama is running and you've pulled at least one model
- LM Studio: Start LM Studio, enable its local server, and ensure it's running on port 1234
- Bedrock: Configure AWS credentials in your environment and verify model access/IAM permissions
- Groq/Cerebras/Anthropic/Bedrock Mantle: Check API keys and provider settings in Preferences
Problem: "Cannot connect" error messages
Solutions:
- Verify the AI service is running on the correct port
- Check firewall settings
- For cloud providers, verify API keys are configured correctly
Problem: Thinking/reasoning content not displayed
Solutions:
- Ensure you're using a reasoning model (MiniMax, DeepSeek-R1, NemoTron, etc.)
- Check that the model outputs reasoning in a supported format
- `npm run dev` - Start development server
- `npm run build` - Build for production
- `npm run preview` - Preview production build
- `npm run lint` - Run ESLint
- `npm run lint:fix` - Fix ESLint issues
- `npm run format` - Format with Prettier
- `npm run test` - Run tests in watch mode
- `npm run test:run` - Run tests once
This project uses:
- ESLint: For code linting
- Prettier: For code formatting
- TypeScript: For type safety
- Husky: Pre-commit hooks for lint-staged
- React 19: UI framework
- TypeScript 5.9: Type safety
- Vite 7: Build tool and dev server
- shadcn/ui: UI component library (Radix UI + Tailwind CSS)
- Tailwind CSS 4: Utility-first CSS
- Dexie.js: IndexedDB wrapper for persistence
- TanStack Query: Data fetching and caching for provider-backed UI state
- AI SDK: Vercel AI SDK for streaming (@ai-sdk/amazon-bedrock, @ai-sdk/anthropic, @ai-sdk/groq, @ai-sdk/cerebras, @ai-sdk/openai-compatible, @ai-sdk/mcp)
- MCP SDK: @modelcontextprotocol/sdk for MCP server connectivity
- Streamdown: Markdown rendering in chat
- Lucide React: Icons
This project is licensed under the MIT License - see the LICENSE file for details.
For issues or questions:
- Check the troubleshooting section
- Verify your AI provider is properly configured
- Check the browser console for error messages
- Open an issue on GitHub


