‹ Reports
The Dispatch

Open-Sora Plan Sees Surge in Activity with New Features and Persistent Technical Challenges

Open-Sora Plan, an open-source initiative by PKU-YuanGroup, aims to replicate OpenAI's Sora model for text-to-video generation. The project supports Huawei Ascend AI systems and is designed to produce industry-standard video quality.

In the past 30 days, the project has seen significant activity, highlighted by the release of version 1.2.0 on August 13, 2024. This update introduced an image-to-video generation model and marked a transition to a true 3D video diffusion model. However, the project also faces challenges with technical issues such as CUDA errors and GPU resource management, as evidenced by numerous open issues.

Recent Activity

Recent issues and pull requests reveal a focus on expanding features while addressing technical difficulties. Key issues include CUDA-related errors (#386) and inquiries about model parameters (#390), indicating ongoing struggles with GPU configurations and documentation clarity.

Development Team Activities

Recent Commits (Reverse Chronological)

  1. LinB203 - Documentation updates (README.md), bug fixes.
  2. LinB203 & yunyangge - v1.2.0 release collaboration.
  3. cxh0519 - Minor documentation updates.
  4. apprivoiser - NPU compatibility improvements.

Of Note

Quantified Reports

Quantify commits



Quantified Commit Activity Over 30 Days

Developer Avatar Branches PRs Commits Files Changes
lb203 3 0/0/0 45 287 99241
yunyang Ge 1 2/1/1 1 22 5130
apprivoiser 1 0/0/0 3 32 1489
Xinhua Cheng 1 1/1/0 1 1 6

PRs: created by that dev and opened/merged/closed-unmerged during the period

Quantify Issues



Recent GitHub Issues Activity

Timespan Opened Closed Comments Labeled Milestones
7 Days 8 6 4 8 1
30 Days 45 20 81 45 1
90 Days 98 28 200 98 1
All Time 252 72 - - -

Like all software activity quantification, these numbers are imperfect but sometimes useful. Comments, Labels, and Milestones refer to those issues opened in the timespan in question.

Detailed Reports

Report On: Fetch issues



Recent Activity Analysis

The Open-Sora-Plan repository has recently experienced a surge in activity, with 180 open issues currently documented. Notably, there are several urgent requests for assistance and clarification regarding model training and inference, particularly concerning CUDA errors and memory management. A recurring theme among the issues is the challenge of managing GPU resources effectively, especially when using multiple GPUs or specific configurations like A100 or H100 cards. Additionally, there are multiple inquiries about missing files and modules, indicating potential gaps in documentation or setup instructions.

Several issues highlight critical errors that users face during training and inference, such as out-of-memory (OOM) errors and problems related to the integration of various components of the model. The community appears engaged, with many users seeking support for technical challenges, suggesting a robust interaction between developers and users.

Issue Details

Recent Issues

  1. Issue #393: 求开源29x480p的ckpt

    • Priority: Low
    • Status: Open
    • Created: 0 days ago
  2. Issue #392: inpainting train

    • Priority: Medium
    • Status: Open
    • Created: 1 day ago
  3. Issue #390: Question: Why the compression ratio for VAE is set to be 4X8X8

    • Priority: Medium
    • Status: Open
    • Created: 3 days ago
  4. Issue #386: NCCL Error 1: unhandled cuda error

    • Priority: High
    • Status: Open
    • Created: 4 days ago
  5. Issue #384: killed after the model is saved

    • Priority: Medium
    • Status: Open
    • Created: 5 days ago
  6. Issue #383: A request

    • Priority: Low
    • Status: Open
    • Created: 8 days ago
  7. Issue #382: Issues with Training CausalVideoVAE v1.2.0

    • Priority: High
    • Status: Open
    • Created: 11 days ago
  8. Issue #380: Can you tell me the typical inference time? Would it take 19 hours for 17 frame video after stage 1 training?

    • Priority: Low
    • Status: Open
    • Created: 12 days ago
  9. Issue #375: get_3d_sincos_pos_embed

    • Priority: Medium
    • Status: Open
    • Created: 16 days ago
  10. Issue #366: about sp seed config. Why seed in each device is different?

    • Priority: Medium
    • Status: Open
    • Created: 18 days ago

Notable Trends and Themes

  • Many issues revolve around CUDA-related errors, particularly with configurations involving multiple GPUs (e.g., A100 and H100), indicating a need for better documentation on GPU resource management.
  • There are frequent requests for clarification on model parameters and configurations, suggesting that users may struggle to understand how to effectively utilize the provided models.
  • Users are actively engaging with each other to troubleshoot problems, demonstrating a collaborative community spirit but also highlighting the need for clearer guidance from maintainers.
  • The presence of several unresolved issues related to model training hints at potential stability or usability concerns that could affect user adoption.

This analysis underscores the importance of addressing both technical challenges and user experience improvements to enhance the overall effectiveness of the Open-Sora-Plan project.

Report On: Fetch pull requests



Overview

The repository PKU-YuanGroup/Open-Sora-Plan has a total of 8 open pull requests and 115 closed pull requests, reflecting ongoing development and community engagement in enhancing the project's capabilities. The recent pull requests focus on various aspects such as feature enhancements, bug fixes, documentation updates, and refactoring.

Summary of Pull Requests

Open Pull Requests

  • PR #325: [refactor]: add type checking to sample image and video functions
    This PR introduces type checking for functions related to sampling images and videos, improving code reliability. It was created 53 days ago and edited recently.

  • PR #322: [fix]: Fix wrong file name in readme
    A minor fix that corrects a command line example in the README file by replacing a non-existent script name with an existing one. Created 57 days ago.

  • PR #321: [feat]: support navit
    This substantial PR adds support for the NaViT model, including numerous files and significant code changes. It was created 57 days ago.

  • PR #279: add support snr_gamma for rebalancing loss
    Introduces a new parameter for rebalancing loss in the model, referencing an academic paper for context. Created 88 days ago.

  • PR #247: [docs]: fix the bug in cal_fvd.py
    A documentation-related fix that addresses issues in the evaluation script. Created 123 days ago.

  • PR #241: Fix grammatical errors in documentation
    This PR improves the readability of documentation by fixing grammatical issues. It was created 124 days ago.

  • PR #227: Changes for sample T2V
    Refactors the sample_t2v.py script to simplify deployment and adds type hints. Created 127 days ago.

  • PR #208: :recycle: [Refactor] path handling for clarity using os.path.dirname
    Refactors path handling in the codebase to improve clarity. Created 129 days ago.

Closed Pull Requests

  • PR #389: release Open-Sora Plan v1.2.0 i2v
    This PR marks the release of version 1.2.0 of Open-Sora Plan, merging multiple updates and improvements. Closed recently.

  • PR #388: release Open-Sora Plan v1.2.0 i2v (not merged)
    A duplicate attempt at releasing version 1.2.0 that was not merged.

  • PR #352: Update Report-v1.2.0.md
    Minor updates to the release report document, correcting typos and enhancing clarity. Closed recently.

  • PR #334: [feat]: Inpaint model and video ip adapter
    Introduces new models for inpainting and video processing, showcasing significant additions to the project’s capabilities. Closed recently.

  • PR #290: fix the bug
    A straightforward bug fix addressing issues in super-resolution scripts, merged successfully.

  • PR #289: Open-Sora-Plan v1.1.0 NPU
    Support for training and inference on NPU, enhancing hardware compatibility with specific implementations closed successfully.

Analysis of Pull Requests

The pull requests submitted to the Open-Sora Plan repository reveal several key themes and trends that reflect both ongoing development efforts and community engagement:

Focus on Feature Enhancements

A significant number of recent pull requests are dedicated to adding new features or improving existing functionalities within the project. For instance, PR #321 introduces support for the NaViT model, which is a substantial addition that likely enhances the project's capabilities in video generation tasks. This trend indicates a proactive approach by contributors to expand the project's scope beyond its initial design, potentially attracting more users and contributors interested in advanced modeling techniques.

Documentation Improvements

Several pull requests focus on enhancing documentation quality (e.g., PR #322, PR #241). These updates are crucial as they help new users understand how to use the software effectively while also ensuring that existing users can leverage new features without confusion. The emphasis on correcting errors and improving clarity suggests a commitment to maintaining high-quality resources that facilitate user engagement with the project.

Bug Fixes and Refactoring

The presence of multiple bug fixes (e.g., PR #290) alongside refactoring efforts (e.g., PR #325) highlights an ongoing commitment to code quality and reliability within the repository. By addressing bugs promptly and refactoring code for better readability or performance, contributors demonstrate their dedication to creating a robust software environment that minimizes user frustration and maximizes functionality.

Community Engagement

The variety of contributors involved in these pull requests reflects an active community surrounding Open-Sora Plan. The diversity of contributions—from feature additions to minor fixes—indicates a collaborative spirit where users feel empowered to participate in the development process actively. This engagement is essential for sustaining long-term project growth and ensuring that it remains relevant in a rapidly evolving field like AI-driven video generation.

Anomalies

One notable anomaly is PR #388, which attempted to release version 1.2.0 but was not merged despite being similar to PR #389, which successfully released it shortly after. This raises questions about workflow management within the repository—specifically how duplicate efforts are handled and communicated among contributors.

In conclusion, the pull request activity within PKU-YuanGroup/Open-Sora-Plan showcases a vibrant development environment characterized by feature enhancements, diligent documentation practices, proactive bug fixing, and strong community involvement—all of which are vital components for fostering a successful open-source project in today's competitive landscape.

Report On: Fetch commits



Repo Commits Analysis

Development Team and Recent Activity

Team Members

  • LinB203

    • Recent Activity:
    • Released version v1.2.0 of Open-Sora Plan, which included significant updates to inpainting and various scripts. Collaborated with yunyangge on this release.
    • Made numerous updates to documentation (README.md, Report-v1.2.0.md) and fixed bugs across multiple files.
    • Active in merging pull requests and addressing issues related to training scripts and model performance.
  • yunyangge

    • Recent Activity:
    • Contributed to the release of v1.2.0, focusing on inpainting features and dataset management.
    • Engaged in collaborative efforts with LinB203, particularly in enhancing the model's capabilities for video generation.
  • cxh0519

    • Recent Activity:
    • Made minor updates to documentation (Report-v1.2.0.md) and merged a pull request related to documentation improvements.
  • apprivoiser

    • Recent Activity:
    • Contributed to the project by making updates related to model training and inference, specifically targeting improvements for NPU compatibility.

Summary of Recent Activities

  • The team has been actively working on the Open-Sora Plan v1.2.0, focusing on enhancing the image-to-video generation capabilities, improving documentation, and fixing bugs.
  • LinB203 has been the most active contributor, with a significant number of commits (45) reflecting ongoing development efforts.
  • Collaboration is evident between LinB203 and yunyangge, particularly in major releases and feature enhancements.
  • Documentation updates are a recurring theme, indicating a commitment to maintaining clear guidelines for users and contributors.
  • The project is evolving rapidly with new features being added while also addressing existing issues through bug fixes.

Patterns and Themes

  • Collaboration: Strong collaboration between team members, particularly LinB203 and yunyangge, suggests a cohesive development environment focused on shared goals.
  • Documentation Focus: Continuous updates to documentation indicate an emphasis on user engagement and clarity in project usage.
  • Feature Expansion: The recent release highlights a trend towards expanding the project's capabilities, particularly in video generation technologies.

Conclusions

The development team is actively engaged in enhancing the Open-Sora Plan project through collaborative efforts, significant feature releases, and ongoing documentation improvements. The focus on community contributions and robust feature sets positions the project as a notable player in AI-driven video generation within the open-source landscape.