small-models
Here are 22 public repositories matching this topic...
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
-
Updated
Aug 13, 2024 - Python
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
-
Updated
Aug 13, 2024 - Python
OpenSSA: Small Specialist Agents based on Domain-Aware Neurosymbolic Agent (DANA) architecture for industrial problem-solving
-
Updated
Aug 14, 2025 - Python
Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models
-
Updated
Mar 21, 2026 - Python
[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models
-
Updated
Jan 11, 2026 - Python
This repository features a custom-built decoder-only language model (LLM) with a total of 37 million parameters 🔥. I train the model to be able to ask question from a given context
-
Updated
Aug 27, 2024 - Jupyter Notebook
BobVLM – A 1.5B multimodal model built from scratch and pre-trained on a single P100 GPU capable of image descriptions and moderate question answering. 🤗🎉
-
Updated
Feb 17, 2025 - Python
Overview of self-supervised learning of tiny models, including distillation-based methods (aks. self-supervised distillation) and non-distillation methods.
-
Updated
Nov 13, 2022
Code for "On the Surprising Efficacy of Distillation as an Alternative to Pre-Training Small Models"
-
Updated
Apr 5, 2024 - Python
Help us define the Pareto front of small models for MNIST classification. Frugal AI.
-
Updated
Jul 13, 2023 - Python
If OpenAI builds engines, VOID builds the chassis.
-
Updated
Feb 19, 2026 - Python
Provide specialized AI agents that develop, review, debug, and deploy production-ready code efficiently across various programming tasks.
-
Updated
Apr 26, 2026 - JavaScript
Portable Knowledge — Prepared For AI
-
Updated
Apr 19, 2026
LlamaTalks is a modern web application designed to facilitate seamless conversations with powerful language models,
-
Updated
Jun 29, 2025 - TypeScript
Universal AI agent framework — 1B models compose multi-step plans like 12B with structured context (CTT). 6 domains, 8 MCP tools, 167 tests, zero runtime deps.
-
Updated
Mar 18, 2026 - TypeScript
A DeepSeek-R1 1.5B model is given a task and left to reason through it autonomously — no code, no guessing. It commits its own progress to GitHub. Currently attempting a Sudoku puzzle. Struggling, honestly.
-
Updated
Apr 5, 2026 - Python
Local-first coding agent built with small models in mind and tight VRAM budgets.
-
Updated
Mar 26, 2026 - Python
Fine-tunes a T5-small model on the TellMeWhy dataset using context injection from a large language model (Gemini) to improve causal reasoning for “why” questions in narratives. Combines efficient training with human and automated evaluations to assess impact.
-
Updated
May 18, 2025 - Jupyter Notebook
Hermes Agent plugin: bounded runbook-driven workflows for small local models. Deterministic scripts, verifier-first discipline, guarded publication.
-
Updated
Apr 6, 2026 - Python
Improve this page
Add a description, image, and links to the small-models topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the small-models topic, visit your repo's landing page and select "manage topics."