Skip to content

Bili-Sakura/Visual-Generative-Foundation-Model-Collection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

62 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Visual-Generative-Foundation-Model-Collection

πŸ€— Collection

Important

We only hold the core publicly available ones pre-trained on ImageNet.

A selected collection of CORE visual generative foundation model including code, paper, checkpoint etc.

TODO: @src/diffusers Models

  • PixelDiT
  • JLT
  • JiT
  • NiT
  • PixNerd
  • PixelFlow
  • SiT
  • ADM
  • DDT
  • DeCo
  • DiM
  • Diffusion-RWKV
  • DiT
  • DiT-MoE
  • EPG
  • EDM2
  • FD-Loss
  • FiT
  • FiTv2
  • iMF
  • LightningDiT
  • LiT
  • RiT
  • MDT
  • MDTv2
  • PAE
  • ProMoE
  • pMF
  • RAE
  • RAEv2
  • PixelREPA
  • REPA
  • REPA-E
  • Self-Flow
  • USP

Update the checklist as new models are added or completed.

Benchmarks

Note

† DiT-MoE uses additional synthetic training data generated by FLUX and SD3.

FID and IS are evaluated on 50k samples, reported with CFG if applicable. Γ—2 in NFEs indicates that CFG doubles NFEs at inference time.

ImageNet-256

Model NFE #Param GFLOPs FID IS Precision Recall Code Paper Model
Pixel modeling
ADM-G 250Γ—2 4.59 0.82 0.52 Official Code Paper πŸ€— Model
PixelFlow 677M 1.98 282.1 0.81 0.60 Official Code Paper πŸ€— Model
JiT-H/16 100Γ—2 953M 182 1.86 303.4 Official Code Paper πŸ€— Model
PixelREPA-H/16 953M 182 1.81 317.2 Official Code Paper πŸ€— Model
EPG 1 1.58 Official Code Paper πŸ€— Model
PixNerd-XL/16 700M 134 1.93 297 Official Code Paper πŸ€— Model
DeCo-XL/16 682M 1.62 301 0.80 0.62 Official Code Paper πŸ€— Model
pMF-H/16 1 956M 271 2.22 268.8 Official Code Paper πŸ€— Model
Latent modeling
DiT-XL/2 250Γ—2 675M 119 2.27 278.24 0.83 0.57 Official Code Paper πŸ€— Model
DiT-MoE-XL/2-8E2A† 4.1B 323.74 1.72 315.73 0.83 0.64 Official Code Paper πŸ€— Model
DiffuSSM-XL-G 673M 2.28 259.13 0.86 0.56 Official Code Paper πŸ€— Model
MDT-XL/2 676M 119 1.79 283.01 0.81 0.61 Official Code Paper πŸ€— Model
MDTv2-XL/2 676M 119 1.58 314.73 0.79 0.65 Official Code Paper πŸ€— Model
FiT-XL/2 824M 153 4.21 254.87 0.84 0.51 Official Code Paper πŸ€— Model
SiT-XL/2 250Γ—2 675M 119 2.06 277.50 0.83 0.59 Official Code Paper πŸ€— Model
SiT-XL/2 + REPA 250Γ—2 675M 119 1.42 305.7 0.80 0.65 Official Code Paper πŸ€— Model
SiT-XL/2 + USP 675M 119 7.35 128.50 Official Code Paper πŸ€— Model
Self-Flow-XL/2 675M 119 5.70 151.40 0.72 0.67 Official Code Paper πŸ€— Model
FiTv2-XL/2 671M 147 2.26 260.95 0.81 0.59 Official Code Paper πŸ€— Model
LightningDiT-XL/2 724M 119 1.35 295.3 Official Code Paper πŸ€— Model
iMF-XL/2 1 610M 175 1.72 282.0 Official Code Paper πŸ€— Model
LiT-XL/2-G 675M 2.32 265.20 0.82 0.57 Official Code Paper πŸ€— Model
RiT Paper πŸ€— Model
SiT-XL/2 + REG 677M 119 1.36 299.4 0.77 0.66 Official Code Paper πŸ€— Model
DDT-XL/2 724M 119 1.26 310.6 Official Code Paper πŸ€— Model
DRWKV-H/2 779M 34.95 2.16 275.36 0.83 0.58 Official Code Paper πŸ€— Model
NiT-XL 675M 119 2.03 265.26 Official Code Paper πŸ€— Model
ProMoE-XL-Flow 1.568B 2.59 265.62 Official Code Paper πŸ€— Model
RAE, DiT-DH-XL/2 50Γ—2 1254M 146 1.13 262.6 Official Code Paper πŸ€— Model

ImageNet-512

Model NFE #Param GFLOPs FID IS Precision Recall Code Paper Model
Pixel modeling
ADM-G 250Γ—2 7.72 0.87 0.42 Official Code Paper πŸ€— Model
JiT-H/32 100Γ—2 956M 183 1.94 309.1 Official Code Paper πŸ€— Model
EPG 2.35 Official Code Paper πŸ€— Model
PixNerd-XL/16 700M 583 2.84 245.6 Official Code Paper πŸ€— Model
DeCo-XL/16 682M 2.22 290.0 0.80 0.60 Official Code Paper πŸ€— Model
pMF-H/32 1 959M 272 2.48 284.9 Official Code Paper πŸ€— Model
Latent modeling
DiT-XL/2 250Γ—2 675M 525 3.04 240.82 0.84 0.54 Official Code Paper πŸ€— Model
DiT-MoE-XL/2-8E2A† 4.1B 2.30 298.35 0.85 0.57 Official Code Paper πŸ€— Model
DiffuSSM-XL-G 673M 3.41 255.06 0.85 0.49 Official Code Paper πŸ€— Model
EDM2-XXL 1523M 552 1.81 Official Code Paper πŸ€— Model
SiT-XL/2 250Γ—2 675M 525 2.62 252.21 0.84 0.57 Official Code Paper πŸ€— Model
FiTv2-XL/2 671M 525 2.90 263.11 0.83 0.53 Official Code Paper πŸ€— Model
LiT-XL/2-G 675M 3.69 207.97 0.85 0.53 Official Code Paper πŸ€— Model
DDT-XL/2 724M 525 1.28 305.1 Official Code Paper πŸ€— Model
DRWKV-H/2 779M 2.95 265.20 0.84 0.54 Official Code Paper πŸ€— Model
NiT-XL 675M 525 1.45 272.77 Official Code Paper πŸ€— Model
RAE, DiT-DH-XL/2 50Γ—2 1254M 642 1.13 259.6 Official Code Paper πŸ€— Model

About

A collection of visual generative foundation model including code, paper, checkpoint etc.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages