‹ Reports
The Dispatch

OSS Report: THUDM/ChatGLM3


ChatGLM3 Development Stagnates Amid Dependency Conflicts and Performance Issues

ChatGLM3, an open-source bilingual conversational language model by THUDM and Tsinghua University's KEG Lab, has seen limited progress recently, with ongoing issues related to dependency conflicts and model performance hindering development.

Recent Activity

Recent issues and pull requests (PRs) in the ChatGLM3 project primarily revolve around dependency conflicts and performance problems. Notable issues include #1304, which addresses conflicts in requirements.txt, and #1300, which reports a ValueError during model execution. These issues suggest that users face challenges integrating the model due to unclear documentation or support for specific configurations. The development team has focused on resolving these issues through collaborative efforts, as seen in PR #1283, which fixes checkpoint resumption performance. However, the pace of resolving these challenges appears slow, with many issues remaining open.

Development Team and Recent Activity

  1. zR (zRzRzRzRzRzRzR)

    • Merged PRs related to bug fixes and performance improvements.
    • Collaborated on fixing tool invocation bugs and enhancing demo functionalities.
  2. pandong (pandong2011)

    • Addressed performance issues with checkpoint resumption.
    • Made minor bug fixes and codebase clarifications.
  3. Lilong·BaiYang (lilongxian)

    • Contributed to OpenAI agent chat enhancements.
  4. Jin Hai (JinHai-CN)

    • Updated README for RAGFlow integration.
  5. tim

    • Fixed tool invocation bugs.
  6. sunyuan

    • Implemented automatic breakpoint relay training features.
  7. Ekain (dodobaba)

    • Created Docker files for local models.

Of Note

  1. Dependency Conflicts: Multiple issues highlight conflicts between library versions, suggesting a need for clearer documentation or version management strategies.

  2. Centralized Contributions: A single contributor, zR, merges most PRs, indicating potential bottlenecks in the development process.

  3. Performance Hurdles: Ongoing performance-related issues suggest significant challenges in model deployment and usage.

  4. Community Engagement: Despite active user engagement through issue reporting, there is limited evidence of diverse contributions to the codebase.

  5. Documentation Focus: Frequent updates to README files indicate an emphasis on improving user guidance, though it may not fully address underlying technical challenges.

Quantified Reports

Quantify Issues



Recent GitHub Issues Activity

Timespan Opened Closed Comments Labeled Milestones
7 Days 1 0 0 1 1
30 Days 14 3 11 14 1
90 Days 51 38 71 51 1
All Time 768 748 - - -

Like all software activity quantification, these numbers are imperfect but sometimes useful. Comments, Labels, and Milestones refer to those issues opened in the timespan in question.

Detailed Reports

Report On: Fetch issues



Recent Activity Analysis

The THUDM/ChatGLM3 repository currently has 20 open issues, indicating ongoing user engagement and potential challenges with the model. Recent activity shows a mix of bug reports, feature requests, and discussions about compatibility issues with dependencies, particularly around the transformers library. Notably, several issues highlight problems with model performance, such as errors during fine-tuning and inference, suggesting that users are actively trying to implement and adapt the model in various environments.

Common themes among the issues include:

  • Dependency Conflicts: Users frequently report conflicts between versions of transformers, torch, and other libraries.
  • Model Performance Issues: Many users are encountering errors related to model loading, execution, or fine-tuning, indicating potential gaps in documentation or support for specific configurations.
  • Feature Requests: There is a demand for enhanced functionality, such as improved multi-GPU support and better handling of tool invocations.

Issue Details

Recently Created Issues

  1. Issue #1310: The figure in execute function will be valued as None

    • Priority: Medium
    • Status: Open
    • Created: 1 day ago
    • Updated: N/A
  2. Issue #1308: ImportError during fine-tuning of ChatGLM3-6B

    • Priority: High
    • Status: Open
    • Created: 9 days ago
    • Updated: 3 days ago
  3. Issue #1307: ModuleNotFoundError for huggingface_hub.inference._text_generation

    • Priority: Medium
    • Status: Open
    • Created: 10 days ago
    • Updated: N/A
  4. Issue #1304: Conflicts in requirements.txt

    • Priority: High
    • Status: Open
    • Created: 15 days ago
    • Updated: 14 days ago
  5. Issue #1300: ValueError during model forward pass

    • Priority: High
    • Status: Open
    • Created: 19 days ago
    • Updated: 14 days ago

Recently Updated Issues

  1. Issue #1308 (ImportError during fine-tuning) was edited recently to provide additional traceback details.
  2. Issue #1304 (requirements.txt conflicts) was updated with suggestions for resolving version constraints.
  3. Issue #1300 (ValueError during model forward pass) saw user comments suggesting potential fixes related to library versions.

Summary of Notable Issues

  • The presence of multiple dependency-related issues suggests that users may not be aware of the specific version requirements or that these requirements are not clearly documented.
  • The high number of performance-related issues indicates that while the model is powerful, users may face significant hurdles when integrating it into their workflows.
  • Feature requests and discussions around improving multi-GPU support reflect a community eager to enhance the capabilities of ChatGLM3.

This analysis highlights both the active engagement from users and the need for clearer documentation and support mechanisms to address common challenges encountered with the ChatGLM3 model.

Report On: Fetch pull requests



Overview

The analysis of the pull requests (PRs) for the ChatGLM3 repository reveals a total of 168 closed PRs, with a focus on performance improvements, bug fixes, documentation updates, and feature enhancements. The majority of these PRs have been merged by a single contributor, indicating a centralized development process.

Summary of Pull Requests

  1. PR #1287: discord
    Closed 36 days ago; this PR involved minor updates to the README files, with a net reduction in lines. It signifies ongoing maintenance of documentation.

  2. PR #1283: [fix#1272] Fix performance of resume from checkpoint
    Closed 42 days ago; it addressed performance issues related to resuming from checkpoints, indicating a focus on improving user experience during model training.

  3. PR #1274: update
    Closed 56 days ago; another minor update to README files, showcasing continuous documentation refinement.

  4. PR #1269: fix #1268
    Closed 60 days ago; this PR fixed an issue in the tool registration logic, highlighting attention to detail in code functionality.

  5. PR #1260: GLM-4 update
    Closed 67 days ago; it included significant updates to README files, likely reflecting new features or changes in the GLM-4 model.

  6. PR #1251: Update ChatGLM3.py fix index error
    Closed 69 days ago; focused on fixing an index error in the main script, which is critical for ensuring model reliability.

  7. PR #1245: fix finetune req
    Closed 77 days ago; details were not specified but indicates ongoing improvements to fine-tuning requirements.

  8. PR #1235: 更新协议
    Closed 82 days ago; this PR likely involved updating protocols or agreements related to the project.

  9. PR #1229: 多个文档和demo调整
    Closed 86 days ago; it included multiple adjustments across documentation and demos, suggesting a focus on user guidance and examples.

  10. PR #1223: Add RAGFlow in README
    Closed 87 days ago; added information about RAGFlow support in the README, enhancing user awareness of available features.

  11. PR #1212: agent-chat-openai 更新
    Closed 93 days ago; involved updates related to OpenAI's integration with agent chat functionalities.

  12. PR #1201: agent-chat-openai工具调度算法更新完善
    Closed 99 days ago; improved tool invocation logic for OpenAI integration, indicating enhancements in system robustness.

  13. PR #1167: 改进openai agent chat
    Closed 113 days ago; made several improvements to the OpenAI agent chat interface, enhancing its functionality and flexibility.

  14. PR #1102: embeddings 接口支持text
    Closed 129 days ago; added support for text input in the embeddings API, broadening its usability.

  15. PR #1088: fix bug of quantize
    Closed 132 days ago; addressed bugs related to model quantization, crucial for performance optimization.

Analysis of Pull Requests

The pull requests for the ChatGLM3 repository reflect a concerted effort towards continuous improvement and maintenance of the software project. A notable trend is the emphasis on documentation updates alongside code changes—many PRs involve modifications to README files or other documentation components, suggesting that the maintainers are keen on keeping users informed about new features and usage guidelines. This is particularly important in open-source projects where clear documentation can significantly enhance user engagement and adoption.

Performance improvements are another recurring theme observed across several PRs, such as those focused on fixing checkpoint resumption (#1283) and optimizing tool invocation logic (#1167). These enhancements indicate that the development team is actively addressing user feedback and technical challenges that arise during model training and deployment. The presence of multiple fixes related to bugs (#1251, #1088) further underscores a commitment to maintaining high software quality and reliability.

Moreover, there is a clear indication of collaboration within the community through various PRs aimed at integrating new features like RAGFlow support (#1223) and enhancements for OpenAI's capabilities (#1212). However, it's worth noting that many PRs are merged by a single contributor (zR), which could suggest a bottleneck in contributions or a lack of diverse input into the codebase. This centralization may lead to challenges if that contributor becomes unavailable or if their vision diverges from community needs over time.

In conclusion, while the ChatGLM3 project demonstrates robust activity through its closed pull requests—highlighting ongoing improvements in performance and usability—there remains an opportunity for broader community involvement in the development process. Encouraging more contributors could enhance innovation and ensure that diverse perspectives are considered as the project evolves.

Report On: Fetch commits



Repo Commits Analysis

Development Team and Recent Activity

Team Members:

  1. zR (zRzRzRzRzRzRzR)

    • Recent Activity:
    • Merged multiple pull requests related to bug fixes, updates, and feature enhancements.
    • Worked on performance improvements, particularly around checkpoint resumption and model updates.
    • Collaborated with other developers on various issues, including fixing bugs related to tool invocation and enhancing demo functionalities.
    • Active in addressing issues related to the integration of OpenAI APIs and Langchain compatibility.
  2. pandong (pandong2011)

    • Recent Activity:
    • Focused on fixing performance issues related to resuming from checkpoints.
    • Made several commits addressing minor bugs and adding comments for clarity in the codebase.
    • Collaborated with zR on the fix for performance issues and checkpoint functionality.
  3. Lilong·BaiYang (lilongxian)

    • Recent Activity:
    • Contributed to updates regarding agent-chat-openai functionalities and improvements in scheduling algorithms.
    • Collaborated with zR on merging updates related to OpenAI agent chat enhancements.
  4. Jin Hai (JinHai-CN)

    • Recent Activity:
    • Added documentation updates to the README, specifically regarding RAGFlow integration.
  5. tim

    • Recent Activity:
    • Contributed bug fixes related to tool invocation and other minor issues.
  6. sunyuan

    • Recent Activity:
    • Worked on implementing automatic breakpoint relay training features and contributed to various bug fixes.
  7. Ekain (dodobaba)

    • Recent Activity:
    • Involved in creating Docker files for local models and enhancing API functionalities.

Summary of Activities:

  • The team has been actively merging pull requests focused on bug fixes, feature enhancements, and documentation improvements.
  • Collaboration is evident among team members, particularly between zR and others like pandong and Lilong, indicating a cohesive effort towards improving the ChatGLM3 project.
  • The recent activities highlight a focus on enhancing model performance, fixing bugs related to tool invocations, and improving user documentation.
  • There are ongoing efforts to integrate new features such as automatic checkpoint recovery and updates for compatibility with external APIs like OpenAI.

Patterns and Themes:

  • A strong emphasis on collaborative problem-solving is apparent, with multiple developers working together on similar issues.
  • Continuous integration of feedback into the codebase suggests an agile development approach, responding quickly to community needs and bug reports.
  • The focus on both backend performance improvements (like checkpoint handling) and frontend usability (like documentation updates) indicates a balanced approach to development.

Conclusions:

The development team is actively engaged in enhancing the ChatGLM3 project through collaborative efforts aimed at both fixing existing issues and introducing new features. Their recent activities reflect a commitment to quality improvement and community engagement within the open-source framework.