GoodQ4All: Local-First Multimodal AI Memory & Video Intelligence Stack

Offline Video Search, Scene Segmentation, Speech Transcription (Whisper), Speaker Diarization, and SQLite + Qdrant Semantic Search on Windows 11

_{(Note: The optional Ask GoodQ voice agent is a hosted extension using ElevenLabs APIs. The core GoodQ4All memory system itself is 100% local and offline.)}

⚡ The Local-First Multimodal Memory Stack

GoodQ4All is a 100% private, offline alternative to cloud-based media intelligence services. It ingests video, audio, and text files into queryable, structured scene-level memories, persisting the knowledge graph and vector representations locally on your computer.

Following a strict "proof-backed" system doctrine, GoodQ4All documents every perception step, tracks evidence manifests, and logs a comprehensive audit trail so that every memory claim can be verified.

🎬 From Media to Memory

Get This Level of Local Control (Unified Operator UI):

Click the preview above to watch the high-fidelity onboarding video.
From Video Quality as Low as This: (Raw Media Inputs):

Apollo 11 Moon Walk (nasa_descent.gif)
Saturn V Launch (nasa_launch.gif)
Using This All-in-One Installer (Unified Windows Installer):

🚀 Download GoodQ4All Setup v2.4.0.exe
[!IMPORTANT] System Requirement: Windows 11 only. GoodQ4All is built for Windows-first local execution. It requires at least 25 GB of free space to store local database structures, models, and cache files.
- SmartScreen Workaround: Since the setup installer is currently self-signed, Windows SmartScreen may show an "Unknown Publisher" dialog. Click More info and select Run anyway to proceed.
- Integrity Checksum: Verify your download authenticity by running the following command in PowerShell:
```
Get-FileHash GoodQ4All_Setup_2.4.0.exe
```
  Expected SHA256 hash: Refer to the GitHub Releases page for the latest signed executable checksum.

🛡️ Core Capabilities & Architecture

1. Proof-Backed Ingestion & Hardened API Facade

Machine memory should earn every claim it makes. GoodQ4All generates step-by-step logs (step_runs.jsonl), scene manifests, and intermediate features for every ingested file.

Single-Use Confirmation Tokens: Ingestion submission routes are protected by a server-generated token handshake with single-use nonce validation, preventing unauthenticated/out-of-bounds execution.
Epistemic Verification: Ingestion is tracked using verifiable manifests and SQLite-backed relational schemas.
No Silent Failures: The Control Agent and Watchdog processes bubble errors directly to the operator consoles, providing absolute visibility into the execution stack.

2. Local Model Governance & VRAM Budgeting

To run large-parameter local models safely on consumer hardware (e.g. RTX 4070 Ti SUPER 16GB) without Out-of-Memory (OOM) crashes, GoodQ4All implements strict VRAM and execution controls:

Model Lifecycle Manager: A specialized context manager (lib/model_lifecycle.py) that audits free VRAM using PyTorch and nvidia-smi before loading models, dynamically evicting idle networks from GPU memory.
Local Agent Stack (MiniAgentClient): Gated LLM reasoning and local tool execution through zero-dependency policy enforcement middleware, loading schemas, policies, and contracts dynamically from the version-controlled agents/stack/ directory.
Endpoint Fallback Orchestration: Automatically falls back from the primary local vLLM server (prefer_speed, running Qwen2.5) to a local Ollama service (prefer_quality, running Phi-4) or a CPU-safe model variant when VRAM thresholds are breached.

3. TurboQuant Hybrid Vector Caching

High-precision 32-bit floating point embeddings are persisted in Qdrant and FAISS. For rapid candidate filtering, GoodQ4All uses TurboQuant—an SQLite sidecar caching technology employing Lloyd-Max Polar Quantization and Johnson-Lindenstrauss residual projections.

Performance: Achieves sub-millisecond candidate pre-filtering.
Accuracy: 100% search accuracy is maintained by performing the final rank scoring on the uncompressed raw float32 vectors.

Note

Hybrid Precision Caching Model: GoodQ4All uses an additive sidecar vector cache architecture. High-precision 32-bit floating point (float32) embeddings remain the authoritative truth of the system, stored in Qdrant and FAISS. Performance-oriented query pre-filtering is handled via lightweight TurboQuant fields (Lloyd-Max Polar Quantization + Johnson–Lindenstrauss residual corrections) stored in SQLite. This ensures zero data loss, guarantees rollback capability, and cuts memory usage.

4. Adaptive Hardware Profiles

The pipeline dynamically adjusts its computational needs to match your system specs:

BASELINE (CPU-safe): Fully operational, offline-ready execution on standard CPU hardware. Bypasses GPU requirements gracefully.
GPU_ENHANCED: Activates local NVIDIA GPU (CUDA 12.1) and WSL2 accelerated audio processing paths for fast, high-volume ingestion.

⚙️ Setup Paths

Route A: Standalone User Installation (Recommended)

GoodQ4All compiles the isolated Python environment, the Qdrant database, and perception libraries into a single executable wrapper:

Download and run GoodQ4All_Setup_1.0.0.exe.
Launch GoodQ4All from the desktop shortcut.
Open the local Retro Memory Explorer dashboard at http://127.0.0.1:30000/ui/retro_console_v1/.
Drag-and-drop video/audio files onto the yellow-dotted Upload Pad to begin automatic ingestion.

Route B: Developer Source Setup (Advanced)

If you are developing, customizing the pipeline, or running from source:

Developer Source Setup Steps (Advanced)

1. Developer Onboarding Video

2. Step-by-Step Developer Installation

Step	Type or do this	Demo frame
1	Clone the official source: `git clone https://github.com/GoodQ02/goodq4all.git`
2	Enter the project cabin: `cd goodq4all`
3	Run the bootstrap installer: `python scripts/bootstrap_install.py` _{CPU-safe first-run variant: python scripts/bootstrap_install.py --disable-gpu --disable-wsl-audio --skip-model-prefetch.}
4	Customize local config: edit the bootstrap-created `.env.local` when using local model, cache, or provider settings.
5	Validate the bootstrap: `.\scripts\bootstrap_validate.bat`
6	Run the launcher/readiness check: `.\LAUNCH_GOODQ.ps1`
7	Start Watchdog, then copy one small media file into the import inbox zone (defaults to %USERPROFILE%\GoodQ_Data\import_inbox): `conda run --no-capture-output -n goodq_core python -m cli.watchdog`
8	Start the API and inspect proof: `conda run --no-capture-output -n goodq_core python -m api.server`

📺 User Interfaces

GoodQ4All ships with two local operator console variants:

Classic Operator Console (served at /ui/operator_console_v1/): Exposes the current scope strip, flight deck, proof/evidence status, recurrence reports, and video inventories.
Retro Memory Explorer (v1.4.7) (served at /ui/retro_console_v1/): A premium cyber-CRT dashboard featuring a four-panel resizable/collapsible layout with floating restore tabs, an entity co-occurrence graph with dynamic zoom and flight transitions, an Inspector panel containing keyframe image/transcript views, and bidirectional timeline checklists.

📖 Authoritative Documentation

Start Here

Guided demo: docs/guides/DEMO.md
First run: docs/guides/FIRST_RUN.md
Install: docs/guides/install/INSTALL.md
Quickstart: docs/guides/install/QUICKSTART.md
Clean memory start: docs/guides/CLEAN_MEMORY_START.md
Data Privacy: docs/guides/general/PRIVACY.md

Technical Details

Architecture: docs/architecture/SYSTEM_ARCHITECTURE.md
Memory Storage: docs/architecture/MEMORY_STORAGE.md
Hybrid Caching: docs/architecture/TURBOQUANT_HYBRID_CACHING.md
Current Agent State: docs/agent/CURRENT_STATE.md

📄 License

MIT. See LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 1,007 Commits
.agents		.agents
.github		.github
agents		agents
api		api
archive		archive
branding		branding
cli		cli
common		common
configs		configs
docs		docs
envs		envs
lib		lib
pipelines		pipelines
processing_onboarding		processing_onboarding
reports		reports
retrieval		retrieval
samples		samples
scripts		scripts
steps		steps
tests		tests
ui		ui
vendor		vendor
vllm_wsl		vllm_wsl
wsl2_audio		wsl2_audio
.env.local.template		.env.local.template
.env.model_cache		.env.model_cache
.env.template		.env.template
.gitattributes		.gitattributes
.gitignore		.gitignore
.ignore		.ignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LAUNCH_GOODQ.bat		LAUNCH_GOODQ.bat
LAUNCH_GOODQ.ps1		LAUNCH_GOODQ.ps1
LICENSE		LICENSE
PLAN.md		PLAN.md
PROJECT.md		PROJECT.md
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
TEST_INFRA.md		TEST_INFRA.md
TEST_READY.md		TEST_READY.md
THIRD_PARTY_NOTICES.md		THIRD_PARTY_NOTICES.md
__init__.py		__init__.py
context7.json		context7.json
dev_off.bat		dev_off.bat
dev_off.sh		dev_off.sh
dev_on.bat		dev_on.bat
dev_on.sh		dev_on.sh
environment-baseline-lock.yml		environment-baseline-lock.yml
environment.gpu.yml		environment.gpu.yml
environment.yml		environment.yml
gemini.md		gemini.md
goodq_version.py		goodq_version.py
pytest.ini		pytest.ini
requirements-baseline-lock.txt		requirements-baseline-lock.txt
setup.py		setup.py
start_goodq_dev.ps1		start_goodq_dev.ps1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GoodQ4All: Local-First Multimodal AI Memory & Video Intelligence Stack

⚡ The Local-First Multimodal Memory Stack

🎬 From Media to Memory

🛡️ Core Capabilities & Architecture

1. Proof-Backed Ingestion & Hardened API Facade

2. Local Model Governance & VRAM Budgeting

3. TurboQuant Hybrid Vector Caching

4. Adaptive Hardware Profiles

⚙️ Setup Paths

Route A: Standalone User Installation (Recommended)

Route B: Developer Source Setup (Advanced)

1. Developer Onboarding Video

2. Step-by-Step Developer Installation

📺 User Interfaces

📖 Authoritative Documentation

Start Here

Technical Details

📄 License

About

Uh oh!

Releases 2

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GoodQ4All: Local-First Multimodal AI Memory & Video Intelligence Stack

⚡ The Local-First Multimodal Memory Stack

🎬 From Media to Memory

🛡️ Core Capabilities & Architecture

1. Proof-Backed Ingestion & Hardened API Facade

2. Local Model Governance & VRAM Budgeting

3. TurboQuant Hybrid Vector Caching

4. Adaptive Hardware Profiles

⚙️ Setup Paths

Route A: Standalone User Installation (Recommended)

Route B: Developer Source Setup (Advanced)

1. Developer Onboarding Video

2. Step-by-Step Developer Installation

📺 User Interfaces

📖 Authoritative Documentation

Start Here

Technical Details

📄 License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages