tok

Cut LLM token costs by 60–90%. A Go library for prompt compression, output filtering, token estimation, and secrets scanning — built for AI coding agents.

What tok Is

tok is a library, not a CLI. It exposes token-efficiency primitives as a clean Go API:

Prompt compression — tok.PromptCompress / tok.Compress shrink verbose prompts 20–70% (six modes).
Output filtering — a 31-layer pipeline (entropy pruning, perplexity filtering, AST-aware compression, H2O heavy-hitter, …) that strips noise from command output before it re-enters an LLM context.
Token estimation — tok.EstimateTokens* and tok.EstimateCost give model-aware token counts and pricing.
Secrets scanning — tok.SecretDetector and tok.IsSensitiveFilename catch credentials before they leak into prompts.
Rate limiting & tracking — persistent SQLite-backed gain tracking (tok.NewTracker).

It is consumed directly as a Go module, and it powers the tok commands inside Hawk (hawk tok ...), which imports it as a library.

Install

go get github.com/GrayCodeAI/tok

import "github.com/GrayCodeAI/tok"

tok ships no standalone tok CLI binary. Its CLI surface is exposed through Hawk, which embeds this library: hawk tok compress, hawk tok estimate, and hawk tok scan.

Quick Start

Compress a prompt

out := tok.PromptCompress("Please implement authentication", tok.IntensityUltra)
// → "Implement auth."

Filter command output

out, _ := tok.Compress(verboseOutput, tok.Aggressive)
// 200 lines → a few lines: pass/fail + failures

Estimate tokens & cost

n    := tok.EstimateTokensForModel(text, "gpt-4o")
cost := tok.EstimateCost(text, "gpt-4o")

Scan for secrets

d := tok.NewSecretDetector()
findings := d.DetectSecrets(text)
redacted := d.RedactSecrets(text)

Library API Highlights

tok.PromptCompress(text, intensity) — prompt compression (Lite / Full / Ultra). ~150 phrase substitutions, drop-lists for articles / filler / pleasantries, and auto-clarity (security/destructive segments pass through verbatim). intensity is monotonic: len(ultra) <= len(full) <= len(lite).
tok.Compress(text, opts...) — the full output pipeline with options: WithCustomFilters, WithCodeAware(lang), WithPerplexityGuided(scorer, ratio), profile options, and more.
tok.IsSensitiveFilename(path) — 3-layer filename detection (exact basename, sensitive directory, name token). Companion to the content-based SecretDetector. Catches .env, id_rsa, ~/.ssh/..., test_credentials.json, etc.
tok.SmartTruncate(content, maxLines, lang) — code truncation that preserves function signatures and always reports the exact drop count (kept + dropped == total).
tok.ExtractJSON / ExtractJSONArray / ExtractAllJSON — brace-balanced JSON extraction from LLM output with surrounding prose, markdown fences, and unterminated objects.
tok.NewTracker(ctx) — persistent gain tracker (SQLite + WAL, 90-day retention, pure-Go via modernc.org/sqlite). Aggregate, Recent, Prune queries.
tok.EstimateTokensFast / WithEncoding / ForModel — model-aware token estimation.
filter.CompressWithRetry — validate-fix-retry loop: caller supplies a Validator and AdjustFunc; the loop escalates mode/intensity and retries up to N times.
filter.NewTOMLFilter / LoadTOMLFilterFile — full 8-stage TOML pipeline as a pluggable Filter.

Full reference: pkg.go.dev/github.com/GrayCodeAI/tok.

Compression Modes

Mode	Style	Savings
`lite`	Drop filler, keep grammar	~20%
`full`	Drop articles, fragments OK	~40% (default)
`ultra`	Telegraphic, abbreviations	~60%
`wenyan-lite`	Classical Chinese light	~30%
`wenyan`	Classical Chinese standard	~50%
`wenyan-ultra`	Classical Chinese max	~70%

Output Filtering (31-Layer Pipeline)

Research-backed algorithms: entropy pruning, perplexity filtering, AST-aware compression, H2O heavy-hitter, attention sink preservation, semantic chunking, and 25+ more.

Custom Filter DSL

Define regex find/replace rules in a TOML file and plug them into the pipeline. Opt-in — no rules, no change.

# filters.toml
[[rule]]
name        = "collapse-uuids"
pattern     = "[0-9a-f]{8}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{4}-[0-9a-f]{12}"
replacement = "<uuid>"
priority    = 10

rules, _ := tok.LoadFilterRules("filters.toml")
out, _ := tok.Compress(text, tok.WithCustomFilters(rules))

Team / Shared Compression Profiles

Bundle mode + tier + budget into a named, versioned TOML profile teams can share. Built-ins: default, aggressive, code-safe.

p, _ := tok.LoadProfile(".tok/profiles/code-safe.toml")
out, _ := tok.Compress(text, p.Options()...)

Code-Aware (Symbol-Preserving) Compression

tok.WithCodeAware(lang) marks input as source code and guards function/type/export signatures so compression can never strip them. Defaults to a dependency-light regex symbol provider; swap in an LSP-backed one via WithSymbolProvider.

Perplexity-Guided Token Dropping

tok.WithPerplexityGuided(scorer, ratio) (LLMLingua-style) drops the lowest-importance tokens first. Default HeuristicPerplexityScorer is zero-dependency; plug in your own PerplexityScorer, or tok's experimental OllamaScorer when built with -tags experimental_ollama. Opt-in.

Benchmarks

Measured on this repo via benchmarks/run.sh (raw vs tok.Compress(..., tok.Aggressive)):

fixture	raw bytes	raw tokens	tok bytes	tok tokens	saved
git log	2,873	718	298	74	89 %
git diff	385,051	96,262	1,117	279	99 %
ls -la	66,341	16,585	148	37	99 %
find .go	19,145	4,786	147	36	99 %

Profile the hot paths with ./scripts/profile.sh [compress|tokens|filter|secrets|all].

Architecture

tok
├── tok.go, *.go         Public library API (top-level package)
├── internal/
│   ├── compress/        Input compression engine (6 modes)
│   ├── filter/          Output pipeline (31 layers)
│   ├── secrets/         Secret detection + redaction
│   ├── tracking/        SQLite token-usage database
│   ├── fastops/         Hot-path primitives (entropy, etc.)
│   └── core/            Token estimation & cost
├── benchmarks/          Token-savings benchmarks
└── evals/               Eval harness

Pure-Go, zero CGO, no runtime dependencies.

Contributing

git clone https://github.com/GrayCodeAI/tok.git && cd tok
make test && make lint
./scripts/build.sh        # verifies the library compiles (go build ./... + go vet)

See CONTRIBUTING.md.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 654 Commits
.githooks		.githooks
.github		.github
.pi/plans		.pi/plans
api		api
benchmarks		benchmarks
commands		commands
config		config
docs		docs
evals		evals
examples		examples
filters		filters
internal		internal
mcp		mcp
plans		plans
plugins/tok/skills		plugins/tok/skills
rules		rules
scripts		scripts
skills		skills
types		types
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.golangci.yml		.golangci.yml
.goreleaser.yml		.goreleaser.yml
AGENTS.md		AGENTS.md
ARCHITECTURE.md		ARCHITECTURE.md
AUTHORS.md		AUTHORS.md
CHANGELOG.md		CHANGELOG.md
CITATION.cff		CITATION.cff
CODEOWNERS		CODEOWNERS
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
DEVELOPER_GUIDE.md		DEVELOPER_GUIDE.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
README_es.md		README_es.md
README_fr.md		README_fr.md
README_ja.md		README_ja.md
README_ko.md		README_ko.md
README_zh.md		README_zh.md
SECURITY.md		SECURITY.md
VERSION		VERSION
advisor.go		advisor.go
advisor_test.go		advisor_test.go
bench_test.go		bench_test.go
budget_debug_test.go		budget_debug_test.go
chunker.go		chunker.go
chunker_test.go		chunker_test.go
codeaware.go		codeaware.go
codeaware_test.go		codeaware_test.go
codecov.yml		codecov.yml
compaction.go		compaction.go
compress.go		compress.go
compress_test.go		compress_test.go
compressor.go		compressor.go
cost.go		cost.go
cost_test.go		cost_test.go
entropy.go		entropy.go
errors.go		errors.go
estimate_test.go		estimate_test.go
example_test.go		example_test.go
extract.go		extract.go
extract_test.go		extract_test.go
filters.go		filters.go
filters_test.go		filters_test.go
format.go		format.go
go.mod		go.mod
go.sum		go.sum
integration_test.go		integration_test.go
jsoncrunch.go		jsoncrunch.go
jsoncrunch_test.go		jsoncrunch_test.go
lefthook.yml		lefthook.yml
logcrunch.go		logcrunch.go
logcrunch_test.go		logcrunch_test.go
optimizer.go		optimizer.go
optimizer_test.go		optimizer_test.go
options.go		options.go
perplexity.go		perplexity.go
perplexity_ollama.go		perplexity_ollama.go
perplexity_test.go		perplexity_test.go
profile.go		profile.go
profile_test.go		profile_test.go
race_off_test.go		race_off_test.go
race_on_test.go		race_on_test.go
ratelimit.go		ratelimit.go
ratelimit_test.go		ratelimit_test.go
restoration.go		restoration.go
restoration_test.go		restoration_test.go
secrets.go		secrets.go
secrets_filename_test.go		secrets_filename_test.go
secrets_fuzz_test.go		secrets_fuzz_test.go
secrets_test.go		secrets_test.go
sgconfig.yaml		sgconfig.yaml
staticcheck.conf		staticcheck.conf
stats.go		stats.go
stream.go		stream.go
stream_test.go		stream_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tok

What tok Is

Install

Quick Start

Compress a prompt

Filter command output

Estimate tokens & cost

Scan for secrets

Library API Highlights

Compression Modes

Output Filtering (31-Layer Pipeline)

Custom Filter DSL

Team / Shared Compression Profiles

Code-Aware (Symbol-Preserving) Compression

Perplexity-Guided Token Dropping

Benchmarks

Architecture

Contributing

License

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

tok

What tok Is

Install

Quick Start

Compress a prompt

Filter command output

Estimate tokens & cost

Scan for secrets

Library API Highlights

Compression Modes

Output Filtering (31-Layer Pipeline)

Custom Filter DSL

Team / Shared Compression Profiles

Code-Aware (Symbol-Preserving) Compression

Perplexity-Guided Token Dropping

Benchmarks

Architecture

Contributing

License

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages