You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+7-8Lines changed: 7 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
# 🔮 PurpCode: Reasoning for Safer Code Generation
2
2
3
-
This repo includes the trainingand evaluation infrastructure for PurpCode. For other related resources, please check out:
3
+
This repo includes the training, evaluation, and data curation code for PurpCode. Please also check out:
4
4
5
5
*[📝 Paper](https://arxiv.org/abs/2507.19060) with technical and evaluation details
6
6
*[🤗 HuggingFace](https://huggingface.co/purpcode) including model checkpoints and training/evaluation datasets
@@ -18,9 +18,9 @@ PurpCode includes two alignment stages:
18
18
- The ☝️*first* cybersafe reasoning recipe in open source
19
19
- Great cybersafety and utility preservation, winning the 🥇*1st place* in [Amazon Nova AI Challenge](https://www.amazon.science/nova-ai-challenge/pushing-the-boundaries-of-secure-ai-winners-of-the-amazon-nova-ai-challenge)
20
20
- Fully 👐open-sourced, from models, data, to training/evaluation code and data synthesizers
21
-
- 🏎️Fast RL with *Single-Step Dynamic Sampling* -- 12% faster, 15% less sample wasting, and better results than [DAPO](https://arxiv.org/abs/2503.14476)
22
-
- 📚Supporting 13 evals, 90 CWEs, and 4 training objectives & rewards, covering cybersafety, utility, and overrefusal
23
-
- 🙅♂️XSCode -- our home-made evaluator and the *first*benchmark for checking overrefusal in secure code generation
21
+
- 🏎️Fast RL with *Single-Step Dynamic Sampling* -- 12% faster, 15% less sample wastes, & better results than [DAPO](https://arxiv.org/abs/2503.14476)
22
+
- 📚13 evals, 90 CWEs, and 4 training objectives & rewards, covering cybersafety, utility, and overrefusal
23
+
- 🙅♂️XSCode -- our home-made and the *first*evaluator for checking overrefusal in secure code generation
24
24
- ... and more details in the [paper](https://arxiv.org/abs/2507.19060)!
25
25
26
26
## Initial Setup
@@ -52,8 +52,7 @@ tmux detach
52
52
53
53
# --- TMUX SESSION "sandbox" ---
54
54
tmux new -s sandbox
55
-
docker run -it -p 8080:8080 volcengine/sandbox-fusion:server-20241204
56
-
tmux detach
55
+
docker run -it -p 8080:8080 volcengine/sandbox-fusion:server-20241204 & tmux detach
0 commit comments