Skip to content

Commit 21394ff

Browse files
committed
fix: README tmux detach and one liners
1 parent 53bb3ae commit 21394ff

1 file changed

Lines changed: 7 additions & 8 deletions

File tree

README.md

Lines changed: 7 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# 🔮 PurpCode: Reasoning for Safer Code Generation
22

3-
This repo includes the training and evaluation infrastructure for PurpCode. For other related resources, please check out:
3+
This repo includes the training, evaluation, and data curation code for PurpCode. Please also check out:
44

55
* [📝 Paper](https://arxiv.org/abs/2507.19060) with technical and evaluation details
66
* [🤗 HuggingFace](https://huggingface.co/purpcode) including model checkpoints and training/evaluation datasets
@@ -18,9 +18,9 @@ PurpCode includes two alignment stages:
1818
- The ☝️*first* cybersafe reasoning recipe in open source
1919
- Great cybersafety and utility preservation, winning the 🥇*1st place* in [Amazon Nova AI Challenge](https://www.amazon.science/nova-ai-challenge/pushing-the-boundaries-of-secure-ai-winners-of-the-amazon-nova-ai-challenge)
2020
- Fully 👐open-sourced, from models, data, to training/evaluation code and data synthesizers
21-
- 🏎️Fast RL with *Single-Step Dynamic Sampling* -- 12% faster, 15% less sample wasting, and better results than [DAPO](https://arxiv.org/abs/2503.14476)
22-
- 📚Supporting 13 evals, 90 CWEs, and 4 training objectives & rewards, covering cybersafety, utility, and overrefusal
23-
- 🙅‍♂️XSCode -- our home-made evaluator and the *first* benchmark for checking overrefusal in secure code generation
21+
- 🏎️Fast RL with *Single-Step Dynamic Sampling* -- 12% faster, 15% less sample wastes, & better results than [DAPO](https://arxiv.org/abs/2503.14476)
22+
- 📚13 evals, 90 CWEs, and 4 training objectives & rewards, covering cybersafety, utility, and overrefusal
23+
- 🙅‍♂️XSCode -- our home-made and the *first* evaluator for checking overrefusal in secure code generation
2424
- ... and more details in the [paper](https://arxiv.org/abs/2507.19060)!
2525

2626
## Initial Setup
@@ -52,8 +52,7 @@ tmux detach
5252

5353
# --- TMUX SESSION "sandbox" ---
5454
tmux new -s sandbox
55-
docker run -it -p 8080:8080 volcengine/sandbox-fusion:server-20241204
56-
tmux detach
55+
docker run -it -p 8080:8080 volcengine/sandbox-fusion:server-20241204 & tmux detach
5756
# ------------------------------
5857
```
5958

@@ -83,6 +82,7 @@ conda activate purp
8382
pip install -r requirements.txt
8483
# Sampling
8584
python datagen/ctxdistill/distill_main.py --model openai/Qwen/Qwen2.5-14B-Instruct-1M --sample-per-prompt 8 --concurrency 400
85+
tmux detach
8686
# ---------------------------
8787

8888
# --- TMUX SESSION "sgl" ---
@@ -91,8 +91,7 @@ tmux at -t sgl
9191
# Ctrl + C
9292
# Serve the LLM judge model
9393
huggingface-cli download Qwen/Qwen2.5-32B-Instruct
94-
python3 -m sglang_router.launch_server --model Qwen/Qwen2.5-32B-Instruct --dp-size 8 --port 30000 --host 0.0.0.0
95-
tmux detach
94+
python3 -m sglang_router.launch_server --model Qwen/Qwen2.5-32B-Instruct --dp-size 8 --port 30000 --host 0.0.0.0 & tmux detach
9695
# --------------------------
9796

9897
# --- TMUX SESSION "main" ---

0 commit comments

Comments
 (0)