big-code-analysis

big-code-analysis is a hard fork of the rust-code-analysis project. This project is an unapologetic vibe-coded fork that seeks to add as many features and functions as fast as possible.

Nonetheless, it is still a Rust library to analyze and extract information from source code written in many different programming languages. It is based on a parser generator tool and an incremental parsing library called Tree Sitter.

A command line tool called bca is provided to interact with the API of the library in an easy way.

This tool can be used to:

Call big-code-analysis API
Print nodes and metrics information
Export metrics in different formats
Generate a Markdown or HTML quality-metrics report (bca report markdown / bca report html)

In addition, we provide a bca-web tool to use the library through a REST API.

Live example reports

bca runs against its own source on every push to main and publishes the result alongside the documentation:

HTML hotspot report: https://dekobon.github.io/big-code-analysis/reports/index.html
Markdown PR/MR comment: https://dekobon.github.io/big-code-analysis/reports/report.md

The wiring lives in .github/workflows/pages.yml. For downstream projects, the CI integration recipe is the canonical adoption guide — it documents the recommended pinned-release install path (with BCA_VERSION + sha256 pin) plus a cargo install alternative. The in-tree pages.yml workflow builds bca from the current checkout because main may carry CLI artifact schemas that no released bca supports yet — see the schema-compatibility note in the recipe before copying that pattern.

Usage

big-code-analysis supports many types of programming languages and computes a great variety of metrics. You can find up to date documentation at Documentation.

On the Commands page, there is a list of commands that can be run to get information about metrics, nodes, and other general data provided by this software.

Using as a library

big-code-analysis is published on crates.io and can be embedded directly. The crate is on the 1.x line and ships under a written stability contract: the public API surface is held stable across patch and minor bumps, and breaking shape changes are reserved for the next major bump. Metric values may still drift across minor bumps when a grammar pin moves or a metric definition is fixed — see STABILITY.md for the full versioning contract, MSRV policy, escape hatches, and exactly what we do and do not promise within 1.x.

For task-oriented walkthroughs — quick start, in-memory analysis, walking FuncSpace results, and error handling — see the Using as a Library section of the book.

Python bindings (PyO3) live in big-code-analysis-py/ and ship the same metric pipeline as a Python package. See the book's Python Bindings section for the install matrix, batch / async / SARIF recipes, and the full error taxonomy.

Per-language Cargo features

Every tree-sitter grammar is gated behind a per-language Cargo feature. The default feature set is all-languages, so a bare

big-code-analysis = "1.1.0"

pulls every grammar in (matching the library's historical behaviour and what the bca / bca-web binaries ship). Library consumers that only need a subset of languages can opt out of the defaults and re-enable just the grammars they want:

big-code-analysis = { version = "1.1.0", default-features = false, features = ["rust", "typescript"] }

Supported language features: bash, cpp, csharp, elixir, go, groovy, irules, java, javascript, kotlin, lua, mozjs, perl, php, python, ruby, rust, tcl, typescript. The irules feature adds F5 iRules (a Tcl dialect; extensions .irule / .irules). The cpp feature covers the Cpp, Ccomment, and Preproc LANG variants and pulls in bca-tree-sitter-mozcpp, bca-tree-sitter-ccomment, and bca-tree-sitter-preproc together (published forks of the matching Mozilla grammars — see the publish strategy notes in RELEASING.md).

The LANG enum keeps every variant defined regardless of the active feature set; selecting a [LANG] variant whose feature is off returns Err(MetricsError::LanguageDisabled(LANG)) from every dispatch entry point (analyze, metrics_from_tree, action, get_ops, the deprecated get_function_spaces* shims, and LANG::get_tree_sitter_language). The set of compiled-in variants is queryable via LANG::is_enabled.

Building

The repository ships a Makefile that wraps every common build, test, lint, and docs task. Run make help for the full list, and make check-tools to verify the optional tools are installed.

make build           # debug build of the entire workspace
make build-release   # optimised release build

If you prefer to run cargo directly, or want to build a single crate:

cargo build                              # library only
cargo build -p big-code-analysis-cli     # CLI only
cargo build -p big-code-analysis-web     # web server only
cargo build --workspace                  # everything in one shot

Testing

make test           # cargo test --workspace --all-features --lib --bins --tests
make test-doc      # cargo test --workspace --all-features --doc
make pre-commit    # full local gate: fmt-check, clippy, tests, udeps, lint families

make pre-commit is the recommended gate before committing — it is equivalent to what CI runs. If GNU Make 4 or any of the optional tools are unavailable, the raw cargo invocation still works:

cargo test --workspace --all-features --verbose

Updating insta tests

We use insta, to update the snapshot tests you should install cargo insta

make insta-review   # cargo insta test --review

Will run the tests, generate the new snapshot references and let you review them.

Updating grammars

Have a look at Update grammars guide to learn how to update languages grammars.

Contributing

If you want to contribute to the development of this software, have a look at the guidelines contained in our Developers Guide.

Licenses

Mozilla-defined grammars are released under the MIT license.
big-code-analysis, big-code-analysis-cli and big-code-analysis-web are released under the Mozilla Public License v2.0.

Name		Name	Last commit message	Last commit date
Latest commit History 1,661 Commits
.cargo		.cargo
.claude		.claude
.config		.config
.github		.github
big-code-analysis-book		big-code-analysis-book
big-code-analysis-cli		big-code-analysis-cli
big-code-analysis-py		big-code-analysis-py
big-code-analysis-web		big-code-analysis-web
docs		docs
enums		enums
generate-grammars		generate-grammars
man		man
packaging		packaging
src		src
tests		tests
tree-sitter-ccomment		tree-sitter-ccomment
tree-sitter-mozcpp		tree-sitter-mozcpp
tree-sitter-mozjs		tree-sitter-mozjs
tree-sitter-preproc		tree-sitter-preproc
tree-sitter-tcl		tree-sitter-tcl
utils		utils
xtask		xtask
.bca-baseline.toml		.bca-baseline.toml
.bcaignore		.bcaignore
.checkmake.ini		.checkmake.ini
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.grammar-marker-baseline.toml		.grammar-marker-baseline.toml
.pre-commit-audit-config.yaml		.pre-commit-audit-config.yaml
.pre-commit-config.yaml		.pre-commit-config.yaml
.rumdl.toml		.rumdl.toml
.snapshot-anchor-baseline.txt		.snapshot-anchor-baseline.txt
.taplo.toml		.taplo.toml
.taskcluster.yml		.taskcluster.yml
.typos.toml		.typos.toml
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
RELEASING.md		RELEASING.md
SECURITY.md		SECURITY.md
STABILITY.md		STABILITY.md
about.hbs		about.hbs
about.toml		about.toml
bca-thresholds.toml		bca-thresholds.toml
bca.toml		bca.toml
check-enums-codegen-drift-test.py		check-enums-codegen-drift-test.py
check-enums-codegen-drift.sh		check-enums-codegen-drift.sh
check-grammar-crate.py		check-grammar-crate.py
check-grammar-marker-sync-test.py		check-grammar-marker-sync-test.py
check-grammar-marker-sync.py		check-grammar-marker-sync.py
check-grammars-crates.sh		check-grammars-crates.sh
check-manpage-assets.py		check-manpage-assets.py
check-snapshot-anchors.py		check-snapshot-anchors.py
check-versions.py		check-versions.py
codecov.yml		codecov.yml
deny.toml		deny.toml
minisign.pub		minisign.pub
mise.toml		mise.toml
recreate-grammars.sh		recreate-grammars.sh
split-minimal-tests.py		split-minimal-tests.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

big-code-analysis

Live example reports

Usage

Using as a library

Per-language Cargo features

Building

Testing

Updating insta tests

Updating grammars

Contributing

Licenses

About

Uh oh!

Releases 2

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

big-code-analysis

Live example reports

Usage

Using as a library

Per-language Cargo features

Building

Testing

Updating insta tests

Updating grammars

Contributing

Licenses

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 2

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages