fineweb
Here are 8 public repositories matching this topic...
Train a SmolLM-style LLM on FineWeb-Edu in JAX/Flax with an assortment of optimizers.
Updated Jul 24, 2025 - Python
Training GPT-2 on FineWeb-Edu in JAX/Flax
Updated Dec 28, 2024 - Python
A 66M parameter decoder-only transformer language model implemented from scratch in PyTorch. Features a custom SentencePiece tokenizer, RoPE positional embeddings, SwiGLU feed-forward network, per-layer KV cache for efficient autoregressive inference, and a Svelte-based streaming chat interface.
Updated Apr 5, 2026 - Python
Decoder-only LLM from scratch with reproducible data pipelines, tokenizer/sharding workflows, and GPU training.
Updated Apr 13, 2026 - Python
FineWeb-Edu dataset analysis using Apache Spark - DSC 232R group project
Updated Mar 24, 2026 - Jupyter Notebook