Write a complete transformer model line by line, and use it to generate the passage title (Seq2Seq)
- We utilize a seq2seq model based on the Transformer, located in the `transformer` folder; the model follows exactly the architecture introduced in the paper "Attention Is All You Need".
- The model is fine-tuned on a `<Title, Content>` dataset.
- The final result: if you give the model a piece of content, it returns a well-written, summarized title for that content.
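The core building block of the paper's architecture is scaled dot-product attention, `Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V`. Below is a minimal pure-Python sketch of that formula for illustration only; it is not the project's actual implementation.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def scaled_dot_product_attention(Q, K, V):
    """Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V.

    Q, K, V are lists of vectors (lists of floats); K and V have equal length.
    Returns one output vector per query.
    """
    d_k = len(K[0])
    out = []
    for q in Q:
        # Dot-product similarity between the query and every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in K]
        weights = softmax(scores)
        # Weighted sum of the value vectors.
        out.append([sum(w * v[j] for w, v in zip(weights, V)) for j in range(len(V[0]))])
    return out

# One query attending over two key/value pairs.
Q = [[1.0, 0.0]]
K = [[1.0, 0.0], [0.0, 1.0]]
V = [[1.0, 2.0], [3.0, 4.0]]
print(scaled_dot_product_attention(Q, K, V))
```

Because the query aligns with the first key, the output is pulled toward the first value row but remains a convex combination of both.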
- This project is still in progress and will be finished in a few days.
- The transformer model is finished. Install the dependencies with `pip install -r requirements`, then run it with `python main.py`.
- All the transformer code is in the `transformer` folder, which includes:
  - `Constants.py`
  - `Layers.py`
  - `Models.py`
  - `Modules.py`
  - `Optim.py`
  - `SubLayers.py`
  - `Translator.py`
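One piece such a Transformer implementation always contains is the paper's sinusoidal positional encoding. The sketch below is illustrative only and independent of the files above:

```python
import math

def positional_encoding(n_positions, d_model):
    """Sinusoidal positional encoding from "Attention Is All You Need":
    PE(pos, 2i)   = sin(pos / 10000^(2i / d_model))
    PE(pos, 2i+1) = cos(pos / 10000^(2i / d_model))
    Returns n_positions vectors of length d_model.
    """
    table = []
    for pos in range(n_positions):
        row = []
        for i in range(d_model):
            # Even and odd dimensions share the same frequency exponent.
            angle = pos / (10000 ** ((i // 2 * 2) / d_model))
            row.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
        table.append(row)
    return table

pe = positional_encoding(4, 8)
print(pe[0][:4])  # position 0 alternates sin(0)=0.0 and cos(0)=1.0
```

These vectors are added to the token embeddings so the model can distinguish token positions without recurrence.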
- The main training loop is in `main.py`.
- The training data for the transformer is in `sample_data.json`.
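Assuming `sample_data.json` holds a list of records with `title` and `content` fields (a hypothetical layout; check the actual file), loading it into `(content, title)` training pairs might look like this:

```python
import json
import os
import tempfile

# Write a tiny demo file standing in for sample_data.json (hypothetical schema).
records = [{"title": "A Sample Title", "content": "Some long passage of content ..."}]
path = os.path.join(tempfile.gettempdir(), "demo_sample_data.json")
with open(path, "w", encoding="utf-8") as f:
    json.dump(records, f)

def load_pairs(path):
    """Load (content, title) pairs: content is the source, title the target."""
    with open(path, encoding="utf-8") as f:
        data = json.load(f)
    return [(ex["content"], ex["title"]) for ex in data]

pairs = load_pairs(path)
print(pairs[0][1])  # -> A Sample Title
```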
- All T5-related code is in the `T5` folder.
- For the T5 model's training data (`imdb`), you should pre-download it to the `data` folder and use the `HFDataset` object in `data_preprocess.py` to handle it.
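The actual `HFDataset` class lives in `data_preprocess.py`; purely as a generic illustration (not that class), a map-style dataset wrapper over pre-downloaded examples usually follows this pattern:

```python
class ReviewDataset:
    """Illustrative map-style dataset over raw text examples.

    NOT the project's HFDataset; it only shows the common __len__/__getitem__
    pattern that a training loop iterates over.
    """

    def __init__(self, texts, labels, max_len=256):
        self.texts = texts
        self.labels = labels
        self.max_len = max_len

    def __len__(self):
        return len(self.texts)

    def __getitem__(self, idx):
        # Crude whitespace tokenization, truncated to max_len tokens.
        tokens = self.texts[idx].split()[: self.max_len]
        return tokens, self.labels[idx]

ds = ReviewDataset(["a great movie", "a terrible movie"], [1, 0], max_len=2)
print(ds[0])  # -> (['a', 'great'], 1)
```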
- The `tokenization` folder will include a BPE tokenizer and a WordPiece tokenizer in the near future.
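For reference, the core of BPE training is repeatedly merging the most frequent adjacent symbol pair across the corpus. A toy sketch of one merge step (illustrative only):

```python
from collections import Counter

def most_frequent_pair(words):
    """Count adjacent symbol pairs over a corpus of {word-as-tuple: frequency}."""
    pairs = Counter()
    for symbols, freq in words.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return max(pairs, key=pairs.get)

def merge_pair(words, pair):
    """Apply one BPE merge: replace every occurrence of `pair` with a joined symbol."""
    merged = {}
    for symbols, freq in words.items():
        out, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
                out.append(symbols[i] + symbols[i + 1])
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        merged[tuple(out)] = freq
    return merged

# Toy corpus: each word is pre-split into characters, mapped to its frequency.
words = {("l", "o", "w"): 5, ("l", "o", "t"): 3, ("o", "w", "l"): 2}
pair = most_frequent_pair(words)  # ("l", "o") is the most frequent pair (8 occurrences)
words = merge_pair(words, pair)
print(pair, words)
```

Repeating this loop a fixed number of times yields the merge table a BPE tokenizer applies at encoding time.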
- We have not added evaluation metrics such as perplexity, BLEU, or ROUGE yet.
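Such metrics are straightforward to bolt on later; for example, perplexity falls directly out of the model's per-token log-probabilities on a reference sequence:

```python
import math

def perplexity(token_log_probs):
    """Perplexity = exp(-mean log-probability of the reference tokens)."""
    n = len(token_log_probs)
    return math.exp(-sum(token_log_probs) / n)

# If the model assigns probability 0.25 to each of 4 reference tokens,
# perplexity is 4: the model is as uncertain as a uniform choice over 4 options.
print(perplexity([math.log(0.25)] * 4))
```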
