GitHub - KongLongGeFDU/PFDial: The code repository of paper "PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts"

PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts

Note: For the Chinese version of this README, please refer to README_zh.md.

🔔 News

🏆 [2025-05] Our paper has been accepted at ACL 2025 Findings.
🤗 [2025-03] Dataset released on Hugging Face.
🎉 [2025-03] Our paper is released on arXiv (arXiv:2503.06706).

📚 Overview

PFDial (Process Flow Dialogue) addresses a core challenge for process-driven dialogue systems used in customer service and equipment maintenance: even strong LLMs frequently fail when asked to follow strictly predefined process constraints.

We construct a dataset of 12,705 high-quality Chinese dialogue instructions, derived from 440 UML flowcharts containing 5,055 process nodes. Following the PlantUML specification, each UML flowchart is decomposed into atomic dialogue units (structured five-tuples), which are then transformed into instruction-tuning data.

Each UML flowchart → five-tuple atomic dialogue units → instruction-tuning data
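
The pipeline above can be illustrated with a minimal sketch. It is not the repository's conversion code: the five field names, the toy flowchart, the placeholder response text, and the prompt template are all assumptions made for illustration. The real five-tuple layout is defined by the files in PFDial-Raw_Data/.

```python
from typing import NamedTuple


class DialogueUnit(NamedTuple):
    """One atomic dialogue unit; all five field names are illustrative assumptions."""
    flowchart_id: str
    current_state: str
    user_input: str
    next_state: str
    response: str


# Toy flowchart as (source node, branch condition, target node) edges.
# An empty condition is a sequential step; a non-empty one is a decision branch.
TOY_EDGES = [
    ("start", "", "ask_order_id"),
    ("ask_order_id", "", "check_order"),
    ("check_order", "order found", "show_status"),
    ("check_order", "order not found", "ask_order_id"),  # backward transition
    ("show_status", "", "end"),
]


def decompose(flowchart_id, edges):
    """Turn every edge of the flowchart into one five-tuple."""
    return [
        DialogueUnit(flowchart_id, source, condition, target, f"Moving to '{target}'.")
        for source, condition, target in edges
    ]


def to_instruction(unit):
    """Render a five-tuple as a prompt/target pair for instruction tuning."""
    prompt = (
        f"Flowchart: {unit.flowchart_id}\n"
        f"Current state: {unit.current_state}\n"
        f"User input: {unit.user_input or '(none)'}\n"
        "What is the next state?"
    )
    return {"input": prompt, "output": unit.next_state}


units = decompose("toy-order-lookup", TOY_EDGES)
samples = [to_instruction(u) for u in units]
```

In this toy example, the two edges leaving `check_order` become decision samples, and the edge back to `ask_order_id` is a backward transition of the kind analyzed in the paper.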

✨ Highlights

🧩 12,705 high-quality Chinese dialogue samples derived from 440 UML flowcharts
📈 A 7B model trained on only 800 samples and a 0.5B model trained on the full data both surpass 90% accuracy
🥇 An 8B model outperforms GPT-4o by up to 43.88% (avg. +11.00%) on challenging tasks
🔄 In-depth analysis of backward transitions, decision branching, and the impact of different dataset formats

📂 Project Structure

PFDial/
├── PFDial-Raw_Data/         # 🔹 Raw data (structured as five-tuples with IDs)
├── PFDial-H-Raw_Data/       # 🔸 Hard-bench raw data
├── PFDial-SFT_Data/         # ✅ Supervised fine-tuning data
└── SFT_Script/              # 🛠️ Reference training scripts (OpenRLHF)

📊 Dataset Statistics

| 📌 Statistics | Train | ID Test | OOD Test |
| --- | ---: | ---: | ---: |
| 🧩 Flowcharts | 440 | 80 | 80 |
| 🔄 State Nodes | 5,055 | 902 | 1,262 |
| 🔁 Sequential Samples | 9,029 | 1,628 | 2,265 |
| 🔀 Decision Samples | 3,676 | 645 | 698 |
| 💬 Dialogue Samples | 12,705 | 2,273 | 2,963 |
| 📏 Avg. Length | 277.16 | 270.57 | 326.10 |

Table: Key statistics of the PFDial dataset.
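
As a quick consistency check on the table, the sketch below confirms that sequential and decision samples add up to the dialogue-sample total in every split, and derives two ratios. The numbers are copied from the table. The dictionary layout and the `decision_share` helper are illustrative names, not part of the repository.

```python
# Counts copied from the statistics table above.
STATS = {
    "train": {"flowcharts": 440, "state_nodes": 5055,
              "sequential": 9029, "decision": 3676, "dialogue": 12705},
    "id_test": {"flowcharts": 80, "state_nodes": 902,
                "sequential": 1628, "decision": 645, "dialogue": 2273},
    "ood_test": {"flowcharts": 80, "state_nodes": 1262,
                 "sequential": 2265, "decision": 698, "dialogue": 2963},
}


def decision_share(split):
    """Fraction of dialogue samples in a split that are decision samples."""
    row = STATS[split]
    return row["decision"] / row["dialogue"]


for name, row in STATS.items():
    # Every dialogue sample is either sequential or decision.
    assert row["sequential"] + row["decision"] == row["dialogue"]
    per_chart = row["dialogue"] / row["flowcharts"]
    print(f"{name}: {decision_share(name):.1%} decision, "
          f"{per_chart:.1f} samples per flowchart")
```

The OOD test flowcharts are larger on average (about 37 samples per flowchart, against roughly 28 to 29 for the train and ID test splits), which matches their higher average length.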

🛠️ Usage Guide

1. Prepare Data

All raw data are provided in PFDial-Raw_Data/ as five-tuples with IDs.
The hard benchmark for stress-testing models is in PFDial-H-Raw_Data/.
Ready-to-use supervised fine-tuning data is in PFDial-SFT_Data/.

You can also load the SFT split directly from Hugging Face:

from datasets import load_dataset

dataset = load_dataset("konglongge/PFDial")
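
If you work from a local clone instead, the sketch below is one way to inspect the files before training. It assumes the data is stored as JSON arrays or JSON Lines. The file extensions and the record schema are not stated here, so `load_records` and `summarize` are hypothetical helpers that only report what they find.

```python
import json
from pathlib import Path


def load_records(path):
    """Load records from a .json file (array or single object) or a .jsonl file."""
    path = Path(path)
    text = path.read_text(encoding="utf-8")
    if path.suffix == ".jsonl":
        # One JSON object per non-empty line.
        return [json.loads(line) for line in text.splitlines() if line.strip()]
    data = json.loads(text)
    return data if isinstance(data, list) else [data]


def summarize(records):
    """Return the record count and the sorted set of keys seen across records."""
    keys = sorted({key for rec in records if isinstance(rec, dict) for key in rec})
    return {"count": len(records), "keys": keys}


if __name__ == "__main__":
    data_dir = Path("PFDial-SFT_Data")  # directory name from the project structure
    if data_dir.is_dir():
        for file in sorted(data_dir.glob("*.json*")):
            print(file.name, summarize(load_records(file)))
```

Printing the keys first avoids hard-coding field names that may differ between the raw, hard-bench, and SFT directories.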

2. Supervised Fine-tuning

We provide reference training scripts based on OpenRLHF:

bash SFT_Script/sft.sh

Adjust the model path, data path, and hyperparameters in the script before running.
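
To try the low-data setting from the Highlights (a 7B model trained on only 800 samples), you need a reduced training file. The sketch below draws a reproducible uniform random subset. How the 800 samples were selected for the paper is not described here, so uniform sampling, the seed value, and the `subsample` helper are assumptions.

```python
import random


def subsample(records, k=800, seed=42):
    """Return k records drawn without replacement, reproducibly for a given seed."""
    records = list(records)
    if k >= len(records):
        return records
    rng = random.Random(seed)  # local generator, leaves global random state untouched
    return rng.sample(records, k)


# Stand-in for the 12,705 training samples; replace with the loaded SFT records.
full_train = [{"id": i} for i in range(12705)]
small_train = subsample(full_train, k=800)
```

Write `small_train` back out in the same format as the original SFT file and point the data path in the script at it.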

3. Evaluation

Evaluate models on the ID and OOD test sets to reproduce the numbers reported in the paper. We recommend reporting dialogue-level accuracy along with separate accuracies on sequential and decision branches.
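
The recommended breakdown can be computed as in the sketch below. It assumes exact string match between prediction and reference as the correctness criterion, and it uses hypothetical record keys (`type`, `prediction`, `reference`). Check the paper and the test files for the official metric and field names.

```python
from collections import defaultdict


def accuracy_report(examples):
    """Compute overall accuracy plus one accuracy per sample type.

    Each example is a dict with the hypothetical keys 'type'
    ('sequential' or 'decision'), 'prediction', and 'reference'.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for ex in examples:
        hit = ex["prediction"].strip() == ex["reference"].strip()
        for bucket in ("overall", ex["type"]):
            total[bucket] += 1
            correct[bucket] += int(hit)
    return {bucket: correct[bucket] / total[bucket] for bucket in total}


demo = [
    {"type": "sequential", "prediction": "ask_order_id", "reference": "ask_order_id"},
    {"type": "sequential", "prediction": "end", "reference": "show_status"},
    {"type": "sequential", "prediction": "end ", "reference": "end"},
    {"type": "decision", "prediction": "show_status", "reference": "show_status"},
]
report = accuracy_report(demo)
```

Run the same function separately on the ID and OOD test sets so that the two splits are never pooled into one number.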

📊 Key Findings

| Setting | Result |
| --- | --- |
| 7B model + 800 SFT samples | > 90% dialogue accuracy |
| 0.5B model + full SFT data | > 90% dialogue accuracy |
| 8B model vs. GPT-4o (hard) | up to +43.88% absolute improvement |
| Backward transitions | Largest remaining gap; see paper for analysis |

For more experimental details and ablations, please refer to our paper.

📝 Citation

If you find this project useful in your research, please cite us:

@inproceedings{zhang-etal-2025-pfdial,
    title     = "{PFD}ial: A Structured Dialogue Instruction Fine-tuning Method Based on {UML} Flowcharts",
    author    = "Zhang, Ming and Wang, Yuhui and Shen, Yujiong and Yang, Tingyi and
                 Jiang, Changhao and Wu, Yilong and Dou, Shihan and Chen, Qinhao and
                 Xi, Zhiheng and Zhang, Zhihao and Dong, Yi and Wang, Zhen and
                 Fei, Zhihui and Wan, Mingyang and Liang, Tao and Ma, Guojun and
                 Zhang, Qi and Gui, Tao and Huang, Xuanjing",
    editor    = "Che, Wanxiang and Nabende, Joyce and Shutova, Ekaterina and
                 Pilehvar, Mohammad Taher",
    booktitle = "Findings of the Association for Computational Linguistics: ACL 2025",
    month     = jul,
    year      = "2025",
    address   = "Vienna, Austria",
    publisher = "Association for Computational Linguistics",
    url       = "https://aclanthology.org/2025.findings-acl.134/",
    doi       = "10.18653/v1/2025.findings-acl.134",
    pages     = "2626--2649",
    ISBN      = "979-8-89176-256-5"
}

🔗 Related Projects

| Project | Description | Link |
| --- | --- | --- |
| TransferTOD (EMNLP 2024) | Generalizable Chinese multi-domain TOD with transfer capabilities | GitHub |
| LLMEval-Med (EMNLP 2025) | Real-world clinical benchmark for medical LLMs | GitHub |
| LLMEval-Fair (ACL 2026) | Robust & fair evaluation, 200K+ questions | GitHub |

📞 Contact Us

For questions or collaboration, please:

- Open an issue on GitHub
- Contact the project maintainers:
  - Ming Zhang: mingzhang23@m.fudan.edu.cn
  - Yuhui Wang: yuhuiwang22@m.fudan.edu.cn

PFDial | Fudan NLP Lab
