small-models

Here are 22 public repositories matching this topic...

PrismML-Eng / Bonsai-demo

Bonsai Demo

bonsai mlx llm small-models llamacpp prism-ml

Updated Apr 24, 2026
Shell

SqueezeAILab / SqueezeLLM

[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

natural-language-processing text-generation transformer llama quantization model-compression efficient-inference post-training-quantization large-language-models llm small-models localllm

Updated Aug 13, 2024
Python

SqueezeAILab / KVQuant

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

natural-language-processing compression text-generation transformer llama quantization mistral model-compression efficient-inference efficient-model large-language-models llm small-models localllm localllama

Updated Aug 13, 2024
Python

aitomatic / openssa

OpenSSA: Small Specialist Agents based on Domain-Aware Neurosymbolic Agent (DANA) architecture for industrial problem-solving

domain-knowledge industrial-ai small-models specialist-agents

Updated Aug 14, 2025
Python

markendo / downscaling_intelligence

Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models

computer-vision small-models instruction-tuning mllm multimodal-large-language-models

Updated Mar 21, 2026
Python

MCG-NJU / AMD

[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models

action-recognition video-understanding distillation self-supervised-learning temporal-action-detection foundation-models small-models cvpr2024

Updated Jan 11, 2026
Python

logic-OT / Decoder-Only-LLM

This repository features a custom-built decoder-only language model (LLM) with 37 million parameters 🔥. The model is trained to ask questions from a given context.

nlp computer-vision deep-learning inference transformer attention-mechanism decoder-model large-language-models llm small-models

Updated Aug 27, 2024
Jupyter Notebook

logic-OT / BobVLM

BobVLM – A 1.5B multimodal model built from scratch and pre-trained on a single P100 GPU capable of image descriptions and moderate question answering. 🤗🎉

nlp experiment library deep-learning gpu multimodal huggingface huggingface-transformers vision-transformer llm llms small-models vlms

Updated Feb 17, 2025
Python

zhangyifei01 / Awesome-Self-supervised-Learning-of-Tiny-Models

Overview of self-supervised learning of tiny models, including distillation-based methods (a.k.a. self-supervised distillation) and non-distillation methods.

knowledge-distillation self-supervised binary-neural-networks self-supervised-distillation lightweight-models tiny-models small-models

Updated Nov 13, 2022

sfarhat / dapt

Code for "On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models"

synthetic-data distillation pre-training contrastive-learning small-models

Updated Apr 5, 2024
Python

ENSTA-U2IS-AI / optuMNIST

Help us define the Pareto front of small models for MNIST classification. Frugal AI.

deep-neural-networks deep-learning mnist-classification frugality small-models

Updated Jul 13, 2023
Python

KapitalSP / VOID

If OpenAI builds engines, VOID builds the chassis.

python modular cross-platform api-gateway self-hosted termux ai-framework embedded-ai edge-ai small-models llama-cpp local-llm gguf offline-ai experimental-ai lightweight-ai rule-based-ai

Updated Feb 19, 2026
Python

NoaiRox / specialist-agent

Provide specialized AI agents that develop, review, debug, and deploy production-ready code efficiently across various programming tasks.

Updated Apr 26, 2026
JavaScript

watsonsage-co / github

Portable Knowledge — Prepared For AI

metadata information-retrieval archives fieldops structured-data enterprise-services packaging-system small-models local-ai agentic-workflow ai-ready-data self-describing-data self-contained-systems

Updated Apr 19, 2026

utdevnp / LlamaTalks

LlamaTalks is a modern web application designed to facilitate seamless conversations with powerful language models,

basic ai chatbot starter-kit quick-start multimodal large-language-models quick-setup small-models ollama ollama-api

Updated Jun 29, 2025
TypeScript

MauricioPerera / ctt-shell

Universal AI agent framework — 1B models compose multi-step plans like 12B with structured context (CTT). 6 domains, 8 MCP tools, 167 tests, zero runtime deps.

git shell wordpress typescript mcp rbac tfidf browser-automation few-shot-learning n8n ai-agent llm small-models context-time-training guard-rails domain-adapters

Updated Mar 18, 2026
TypeScript

Sourav-Tripathy / the-pensieve

A DeepSeek-R1 1.5B model is given a task and left to reason through it autonomously — no code, no guessing. It commits its own progress to GitHub. Currently attempting a Sudoku puzzle. Struggling, honestly.

sudoku-puzzle reasoning solving-problems llm small-models

Updated Apr 5, 2026
Python

elemein / agent-stoat

Local-first coding agent built with small models in mind and tight VRAM budgets.

python cli llm small-models local-llm agentic ollama coding-agent

Updated Mar 26, 2026
Python

dane-codes / TellMeWhy-Context-Injection

Fine-tunes a T5-small model on the TellMeWhy dataset using context injection from a large language model (Gemini) to improve causal reasoning for “why” questions in narratives. Combines efficient training with human and automated evaluations to assess impact.

nlp transformers gemini question-answering language-model fine-tuning huggingface t5 context-injection commensense human-evaluation ai-evaluation small-models bleurt t5-small

Updated May 18, 2025
Jupyter Notebook

sammcf / hermes-model-discipline

Hermes Agent plugin: bounded runbook-driven workflows for small local models. Deterministic scripts, verifier-first discipline, guarded publication.

runbook small-models llm-agents deterministic-workflows hermes-agent hermes-plugin model-discipline

Updated Apr 6, 2026
Python
