Generate fine-tuning notebooks for Small Language Models in minutes.
Visit localhost:3000/signup to create an account or login with GitHub.
- Click Get Started on the dashboard
- Drag and drop your JSONL file or click to browse (supports live client-side chat preview for first 10KB)
- Required format: ChatML with
messagesarray
{"messages": [{"role": "user", "content": "..."}, {"role": "assistant", "content": "..."}]}After upload, you'll see:
- Total Examples - Number of training examples
- Total Tokens - Approximate token count
- Quality Score - Dataset quality rating (0-100%)
Choose your fine-tuning goal:
- Instruction Following - General task completion
- Chat/Conversation - Multi-turn dialogue
- Code Generation - Programming tasks
- Summarization - Text condensation
- Classification - Category assignment
Where will your model run?
- Cloud (A100/H100) - Maximum performance
- Cloud (T4/L4) - Budget-friendly GPU
- Edge/Mobile - Optimized for devices
- CPU Only - No GPU required
SLMGEN analyzes your dataset and requirements to recommend the best models:
- Score - Overall fit percentage
- VRAM - GPU memory required
- Training Time - Estimated duration
Select a preset:
- Fast (~15 min) - Quick experimentation
- Balanced (~45 min) - Best for most cases
- Quality (~2 hours) - Maximum quality
Or customize:
- LoRA Rank, Learning Rate, Epochs, Batch Size, etc.
Click Generate Notebook to create your Colab notebook.
During generation, you'll see a Training Terminal Simulator that visualizes the Unsloth training commands in real-time. Once complete, a Success Confetti celebration confirms your notebook is ready.
Options:
- Open in Colab - Start training immediately
- Download - Save locally
{
"messages": [
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "What is 2+2?"},
{"role": "assistant", "content": "2+2 equals 4."}
]
}- Minimum 100 examples for basic fine-tuning
- 1,000+ examples recommended for best results
- Consistent formatting across all examples
- Include system prompts for better control
- Mix easy and hard examples for robustness
| Model | Size | Best For | VRAM |
|---|---|---|---|
| Phi-4 Mini | 3.8B | Instruction, Reasoning | 8GB |
| Llama 3.2 | 1-3B | General, Chat | 4-8GB |
| Gemma 2 | 2B | Efficient fine-tuning | 4GB |
| Qwen 2.5 | 3B | Multilingual | 8GB |
| DeepSeek Coder | 1.3B | Code generation | 4GB |
- LoRA Rank: 8
- Learning Rate: 5e-4
- Epochs: 1
- Time: ~15 minutes
- LoRA Rank: 16
- Learning Rate: 2e-4
- Epochs: 3
- Time: ~45 minutes
- LoRA Rank: 32
- Learning Rate: 1e-4
- Epochs: 5
- Time: ~2 hours
After training, export your model to:
- HuggingFace Hub - Share publicly or privately
- ONNX - For edge deployment
- Ollama - Run locally with Ollama
- GGUF - Quantized for llama.cpp
- Check for duplicate examples
- Ensure consistent message formatting
- Add more diverse examples
- Reduce batch size
- Lower LoRA rank
- Use a smaller model
- Enable gradient checkpointing
- Increase epochs
- Reduce learning rate
- Check data quality
- GitHub: github.com/eshanized/slmgen
- Email: eshanized@proton.me