BirdNET+ V3.0 Developer Preview

Analyze audio with BirdNET+ V3.0 developer preview models. This repository provides multiple ways to run the model:

Tool	Description	Best For
`analyze.py`	Command-line batch processing	Processing many files, scripting
`app.py`	Streamlit interactive UI	Quick experimentation, visualization
`web-demo/`	Browser-only demo (ONNX)	Sharing, no Python needed

About This Release

Current version: Developer Preview 3 - 11K Species (Jan 2026)

Key changes vs 2.X:

Variable-length input (removed fixed 3s constraint)
32 kHz audio input (was 48 kHz)
Improved architecture and training
Larger, more diverse training dataset
Expanded non-bird species

Known limitations:

No human voice detection yet
Limited non-target sound handling (rain, wind, engines)
Species list needs cleanup

⚠️ Developer Preview Notice: Models, labels, and code will change before final release.

You can download the latest models and labels from Zenodo or run the tools which will download them automatically on first use.

Quick Start

# 1. Clone and setup
git clone https://github.com/birdnet-team/birdnet-V3.0-dev.git
cd birdnet-V3.0-dev

# 2. Create virtual environment
python3 -m venv .venv
source .venv/bin/activate        # Mac/Linux
# .venv\Scripts\activate         # Windows

# 3. Install dependencies  
pip install -r requirements.txt

# 4. Run analysis (model downloads automatically on first run)
python analyze.py example/soundscape.wav

Tools Overview

Option A: Command-Line Analysis (`analyze.py`)

Best for batch processing and scripting.

python analyze.py /path/to/audio.wav

Options:

Flag	Default	Description
`--model`	PyTorch FP32	Path to model (.pt or .onnx)
`--chunk_length`	3.0	Chunk length in seconds
`--overlap`	0.0	Chunk overlap in seconds
`--min-conf`	0.15	Minimum confidence threshold
`--device`	auto	`cpu` or `cuda`
`--out-csv`	`<audio>.results.csv`	Output CSV path
`--export-embeddings`	false	Include embeddings in output

Examples:

# Basic usage (PyTorch model, downloads automatically)
python analyze.py example/soundscape.wav

# Use FP16 ONNX model (recommended: smaller, same accuracy)
python analyze.py example/soundscape.wav --model models/BirdNET+_V3.0-preview3_Global_11K_FP16.onnx

# Custom settings
python analyze.py example/soundscape.wav --chunk_length 2.0 --min-conf 0.2 --out-csv results.csv

# Use GPU
python analyze.py example/soundscape.wav --device cuda

Output: CSV with columns name, start_sec, end_sec, confidence, label (+ embeddings if requested)

Option B: Streamlit Web App (`app.py`)

Best for interactive exploration with visual feedback.

# Start the app
streamlit run app.py

# Or with larger file upload limit (e.g., 2GB)
streamlit run app.py --server.maxUploadSize 2048

Opens at http://localhost:8501

Features:

Upload audio (wav, mp3, ogg, flac, m4a)
View mel spectrogram
Select model format (FP32/FP16/INT8 ONNX)
Adjust chunk length, overlap, confidence threshold
Download results as CSV

Headless/server mode:

streamlit run app.py --server.address 0.0.0.0 --server.port 8501

Option C: Browser Demo (`web-demo/`)

Runs entirely in the browser using ONNX Runtime Web. No Python required after build.

cd web-demo

# Download model and labels (one-time setup)
./scripts/download-model.sh     # Mac/Linux
# .\scripts\download-model.ps1  # Windows

# Install and run
npm install
npm run dev

Opens at http://localhost:5173

Using FP16 for smaller download: The default download is FP32 (516 MB). To use FP16 (259 MB), convert locally with python convert.py ... --fp16 and copy to web-demo/public/assets/.

Build for deployment:

npm run build
npm run preview

Model Conversion (Optional)

Convert the FP32 ONNX model to smaller formats for faster loading:

Format	Size	Accuracy	Recommendation
FP32	516 MB	Baseline	Development/reference
FP16	259 MB	Identical to FP32	Recommended for production
INT8-head	245 MB	Identical to FP32	Moderate size reduction
INT8 (full)	131 MB	⚠️ Unreliable	Not recommended

⚠️ Full INT8 Warning: Full INT8 quantization produces many false positives due to error accumulation in the backbone layers. Use --int8-head for reliable INT8 or --fp16 for best results.

# Convert to FP16 (recommended)
python convert.py models/BirdNET+_V3.0-preview3_Global_11K_FP32.onnx --fp16

# Convert to INT8 head-only (reliable, 52% size reduction)
python convert.py models/BirdNET+_V3.0-preview3_Global_11K_FP32.onnx --int8-head

# Show model info
python convert.py models/BirdNET+_V3.0-preview3_Global_11K_FP32.onnx --info

# Convert with validation
python convert.py models/BirdNET+_V3.0-preview3_Global_11K_FP32.onnx --fp16 --validate

Note on --fp16-io: The --fp16-io flag converts everything (weights, compute, I/O) to float16. This is not recommended — it does not work with ONNX Runtime Web (WASM backend has no float16 compute support). The default --fp16 only stores weights as float16 while keeping all computation in float32, which works with both desktop and web ONNX Runtime.

Output files use the same naming pattern: ..._FP16.onnx

Species Filtering (Optional)

Create a smaller, specialized model by filtering to only the species you need:

# Create a species list file (one species per line)
cat > my_species.txt << EOF
Cyanocitta cristata_Blue Jay
Poecile atricapillus_Black-capped Chickadee
Turdus migratorius_American Robin
Junco hyemalis_Dark-eyed Junco
EOF

# Filter model to only include these species
python convert.py models/BirdNET+_V3.0-preview3_Global_11K_FP32.onnx \
    --species-list my_species.txt \
    --labels models/BirdNET+_V3.0-preview3_Global_11K_Labels.csv

# Combine with FP16 for maximum compression
python convert.py models/BirdNET+_V3.0-preview3_Global_11K_FP32.onnx \
    --species-list my_species.txt \
    --labels models/BirdNET+_V3.0-preview3_Global_11K_Labels.csv \
    --fp16

Output files:

..._<species_list_name>.onnx - Filtered model
..._<species_list_name>_Labels.csv - Corresponding labels file
..._<species_list_name>_FP16.onnx - Filtered + FP16 (if requested)

Species list format:

One species per line
Use SciName_CommonName format (e.g., Cyanocitta cristata_Blue Jay)
Also accepts just scientific name or common name
Lines starting with # are comments

Size reduction examples:

Species Count	FP32 Size	FP16 Size	vs Full FP32
11,560 (full)	516 MB	259 MB	baseline
9,834 (birds only)	454 MB	228 MB	-56%
5,000	279 MB	140 MB	-73%
1,000	134 MB	68 MB	-87%
500	116 MB	59 MB	-89%
100	102 MB	52 MB	-90%
10	99 MB	50 MB	-90%

Note: The base model (backbone) is ~98 MB. Filtering removes only the classification head weights, so even a 10-species model is still ~99 MB in FP32. Combine with FP16 for maximum compression.

Installation Details

Windows with CUDA (Optional)

If you have an NVIDIA GPU, enable CUDA support:

# Check CUDA availability
nvidia-smi

# Install CUDA-enabled PyTorch
pip install -U torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118

Model Download

Models are downloaded automatically on first run. To download manually:

Download from Zenodo
Place in models/ directory

License

Source Code: The source code for this project is licensed under the MIT License.
Models: The models used in this project are licensed under the Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0).

Please ensure you review and adhere to the specific license terms provided with each model.

Terms of Use

Please refer to the TERMS OF USE file for detailed terms and conditions regarding the use of the BirdNET+ V3.0 developer preview models.

Citation

Lasseck, M., Eibl, M., Klinck, H., & Kahl, S. (2025). BirdNET+ V3.0 model developer preview (Preview 2). Zenodo. https://doi.org/10.5281/zenodo.18247420

@dataset{lasseck2025birdnet,
  title     = {BirdNET+ V3.0 model developer preview (Preview 3)},
  author    = {Lasseck, M. and Eibl, M. and Klinck, H. and Kahl, S.},
  year      = {2026},
  publisher = {Zenodo},
  doi       = {10.5281/zenodo.18247420},
  url       = {https://doi.org/10.5281/zenodo.18247420}
}

Funding

Our work in the K. Lisa Yang Center for Conservation Bioacoustics is made possible by the generosity of K. Lisa Yang to advance innovative conservation technologies to inspire and inform the conservation of wildlife and habitats.

The development of BirdNET is supported by the German Federal Ministry of Research, Technology and Space (FKZ 01|S22072), the German Federal Ministry for the Environment, Climate Action, Nature Conservation and Nuclear Safety (FKZ 67KI31040E), the German Federal Ministry of Economic Affairs and Energy (FKZ 16KN095550), the Deutsche Bundesstiftung Umwelt (project 39263/01) and the European Social Fund.

Partners

BirdNET is a joint effort of partners from academia and industry. Without these partnerships, this project would not have been possible. Thank you!

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
example		example
img		img
models		models
web-demo		web-demo
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TERMS_OF_USE.md		TERMS_OF_USE.md
TERMS_OF_USE.txt		TERMS_OF_USE.txt
analyze.py		analyze.py
app.py		app.py
convert.py		convert.py
image.png		image.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BirdNET+ V3.0 Developer Preview

About This Release

Table of Contents

Quick Start

Tools Overview

Option A: Command-Line Analysis (`analyze.py`)

Option B: Streamlit Web App (`app.py`)

Option C: Browser Demo (`web-demo/`)

Model Conversion (Optional)

Species Filtering (Optional)

Installation Details

Windows with CUDA (Optional)

Model Download

License

Terms of Use

Citation

Funding

Partners

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

BirdNET+ V3.0 Developer Preview

About This Release

Table of Contents

Quick Start

Tools Overview

Option A: Command-Line Analysis (analyze.py)

Option B: Streamlit Web App (app.py)

Option C: Browser Demo (web-demo/)

Model Conversion (Optional)

Species Filtering (Optional)

Installation Details

Windows with CUDA (Optional)

Model Download

License

Terms of Use

Citation

Funding

Partners

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Option A: Command-Line Analysis (`analyze.py`)

Option B: Streamlit Web App (`app.py`)

Option C: Browser Demo (`web-demo/`)

Packages