ChatGLM3, an open-source bilingual conversational language model by THUDM and Tsinghua University's KEG Lab, has seen limited progress recently, with ongoing issues related to dependency conflicts and model performance hindering development.
Recent issues and pull requests (PRs) in the ChatGLM3 project primarily revolve around dependency conflicts and performance problems. Notable issues include #1304, which addresses conflicts in requirements.txt
, and #1300, which reports a ValueError during model execution. These issues suggest that users face challenges integrating the model due to unclear documentation or support for specific configurations. The development team has focused on resolving these issues through collaborative efforts, as seen in PR #1283, which fixes checkpoint resumption performance. However, the pace of resolving these challenges appears slow, with many issues remaining open.
zR (zRzRzRzRzRzRzR)
pandong (pandong2011)
Lilong·BaiYang (lilongxian)
Jin Hai (JinHai-CN)
tim
sunyuan
Ekain (dodobaba)
Dependency Conflicts: Multiple issues highlight conflicts between library versions, suggesting a need for clearer documentation or version management strategies.
Centralized Contributions: A single contributor, zR, merges most PRs, indicating potential bottlenecks in the development process.
Performance Hurdles: Ongoing performance-related issues suggest significant challenges in model deployment and usage.
Community Engagement: Despite active user engagement through issue reporting, there is limited evidence of diverse contributions to the codebase.
Documentation Focus: Frequent updates to README files indicate an emphasis on improving user guidance, though it may not fully address underlying technical challenges.
Timespan | Opened | Closed | Comments | Labeled | Milestones |
---|---|---|---|---|---|
7 Days | 1 | 0 | 0 | 1 | 1 |
30 Days | 14 | 3 | 11 | 14 | 1 |
90 Days | 51 | 38 | 71 | 51 | 1 |
All Time | 768 | 748 | - | - | - |
Like all software activity quantification, these numbers are imperfect but sometimes useful. Comments, Labels, and Milestones refer to those issues opened in the timespan in question.
The THUDM/ChatGLM3 repository currently has 20 open issues, indicating ongoing user engagement and potential challenges with the model. Recent activity shows a mix of bug reports, feature requests, and discussions about compatibility issues with dependencies, particularly around the transformers
library. Notably, several issues highlight problems with model performance, such as errors during fine-tuning and inference, suggesting that users are actively trying to implement and adapt the model in various environments.
Common themes among the issues include:
transformers
, torch
, and other libraries.Issue #1310: The figure in execute function will be valued as None
Issue #1308: ImportError during fine-tuning of ChatGLM3-6B
Issue #1307: ModuleNotFoundError for huggingface_hub.inference._text_generation
Issue #1304: Conflicts in requirements.txt
Issue #1300: ValueError during model forward pass
This analysis highlights both the active engagement from users and the need for clearer documentation and support mechanisms to address common challenges encountered with the ChatGLM3 model.
The analysis of the pull requests (PRs) for the ChatGLM3 repository reveals a total of 168 closed PRs, with a focus on performance improvements, bug fixes, documentation updates, and feature enhancements. The majority of these PRs have been merged by a single contributor, indicating a centralized development process.
PR #1287: discord
Closed 36 days ago; this PR involved minor updates to the README files, with a net reduction in lines. It signifies ongoing maintenance of documentation.
PR #1283: [fix#1272] Fix performance of resume from checkpoint
Closed 42 days ago; it addressed performance issues related to resuming from checkpoints, indicating a focus on improving user experience during model training.
PR #1274: update
Closed 56 days ago; another minor update to README files, showcasing continuous documentation refinement.
PR #1269: fix #1268
Closed 60 days ago; this PR fixed an issue in the tool registration logic, highlighting attention to detail in code functionality.
PR #1260: GLM-4 update
Closed 67 days ago; it included significant updates to README files, likely reflecting new features or changes in the GLM-4 model.
PR #1251: Update ChatGLM3.py fix index error
Closed 69 days ago; focused on fixing an index error in the main script, which is critical for ensuring model reliability.
PR #1245: fix finetune req
Closed 77 days ago; details were not specified but indicates ongoing improvements to fine-tuning requirements.
PR #1235: 更新协议
Closed 82 days ago; this PR likely involved updating protocols or agreements related to the project.
PR #1229: 多个文档和demo调整
Closed 86 days ago; it included multiple adjustments across documentation and demos, suggesting a focus on user guidance and examples.
PR #1223: Add RAGFlow in README
Closed 87 days ago; added information about RAGFlow support in the README, enhancing user awareness of available features.
PR #1212: agent-chat-openai 更新
Closed 93 days ago; involved updates related to OpenAI's integration with agent chat functionalities.
PR #1201: agent-chat-openai工具调度算法更新完善
Closed 99 days ago; improved tool invocation logic for OpenAI integration, indicating enhancements in system robustness.
PR #1167: 改进openai agent chat
Closed 113 days ago; made several improvements to the OpenAI agent chat interface, enhancing its functionality and flexibility.
PR #1102: embeddings 接口支持text
Closed 129 days ago; added support for text input in the embeddings API, broadening its usability.
PR #1088: fix bug of quantize
Closed 132 days ago; addressed bugs related to model quantization, crucial for performance optimization.
The pull requests for the ChatGLM3 repository reflect a concerted effort towards continuous improvement and maintenance of the software project. A notable trend is the emphasis on documentation updates alongside code changes—many PRs involve modifications to README files or other documentation components, suggesting that the maintainers are keen on keeping users informed about new features and usage guidelines. This is particularly important in open-source projects where clear documentation can significantly enhance user engagement and adoption.
Performance improvements are another recurring theme observed across several PRs, such as those focused on fixing checkpoint resumption (#1283) and optimizing tool invocation logic (#1167). These enhancements indicate that the development team is actively addressing user feedback and technical challenges that arise during model training and deployment. The presence of multiple fixes related to bugs (#1251, #1088) further underscores a commitment to maintaining high software quality and reliability.
Moreover, there is a clear indication of collaboration within the community through various PRs aimed at integrating new features like RAGFlow support (#1223) and enhancements for OpenAI's capabilities (#1212). However, it's worth noting that many PRs are merged by a single contributor (zR), which could suggest a bottleneck in contributions or a lack of diverse input into the codebase. This centralization may lead to challenges if that contributor becomes unavailable or if their vision diverges from community needs over time.
In conclusion, while the ChatGLM3 project demonstrates robust activity through its closed pull requests—highlighting ongoing improvements in performance and usability—there remains an opportunity for broader community involvement in the development process. Encouraging more contributors could enhance innovation and ensure that diverse perspectives are considered as the project evolves.
zR (zRzRzRzRzRzRzR)
pandong (pandong2011)
Lilong·BaiYang (lilongxian)
Jin Hai (JinHai-CN)
tim
sunyuan
Ekain (dodobaba)
The development team is actively engaged in enhancing the ChatGLM3 project through collaborative efforts aimed at both fixing existing issues and introducing new features. Their recent activities reflect a commitment to quality improvement and community engagement within the open-source framework.