Releases · PKU-YuanGroup/Open-Sora-Plan

25 Jul 06:28

LinB203

v1.2.0

adb2a20

Release v1.2.0 Latest

Latest

v1.2.0 is here! Utilizing a 3D full attention architecture instead of 2+1D. We released a true 3D video diffusion model trained on 4s 720p.

Architecture shift from 2+1D model to 3D full attention architecture and no longer supports 2+1D.
Instead of joint image-video training, the image weights are trained first as the initialization for the video.
Release all data annotations, the data are filtered by aesthetic and motion.
Improve CasualVideoVAE performance and report performance on validation set of WebVid and Panda70M.

Although the 3D attention architecture excels in spatio-temporal consistency, it is so expensive to train that it is difficult to scale up. We hope to collaborate with the open-source community to optimize the 3D DiT architecture. For further details, please refer to our report.

Assets 2

27 May 10:02

LinB203

v1.1.0

2a8b232

Release v1.1.0

Support for longer videos, dynamic resolution training and inference.
Support for Ascend training and inferencing
Release all training data and annotations.
Improve CasualVideoVAE performance.

In this version, we employ ShareGPT4Video for video annotation, followed by training the model on 3k hours of video data. The resulting model exhibited advancements in both video quality and duration. For further details, please refer to our report.

Assets 2

09 Apr 06:43

LinB203

v1.0.0

a737503

Release v1.0.0

Added text conditional control to generate videos.
Support HUAWEI NPU in hw branch.
Released all training data and annotations.
Add training, sampling scripts.
Add CausalVideoVAE training details.

We trained all models to use 40K videos crawled from the web, most of which are landscape related content. The complete training process takes about 2048 GPU hours. More detailed changes can be found in our report.

We hope this release further benefits the community and makes text-to-video models more accessible.

Assets 3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: PKU-YuanGroup/Open-Sora-Plan

Release v1.2.0

Release v1.1.0

Release v1.0.0