Cinnamon/kotaemon is an open-source project aimed at providing a customizable interface for Retrieval-Augmented Generation (RAG), allowing users to interact with documents via a chat-based system. The project supports various large language models and offers advanced features like multi-modal QA support.
The last 30 days have seen significant development activity, particularly with the integration of the nano-graphrag feature, which enhances the project's graph-based retrieval capabilities. Tuan Anh Nguyen Dang has been the most active contributor, with 22 commits focused on feature development and bug fixes. The team has also made strides in improving user interface elements and documentation, indicating a strong commitment to enhancing user experience and maintaining clarity as new features are added.
Recent issues and pull requests indicate a focus on stabilizing GraphRAG features and expanding functionality. The project currently has 106 open issues, many of them bug reports about GraphRAG integration and model configuration. Notable examples are #451, a KeyError in NanoGraphRag, and #449, where the nano graph is not written. Both point to instability in the recently integrated nano-graphrag feature.
Timespan | Opened | Closed | Comments | Labeled | Milestones |
---|---|---|---|---|---|
7 Days | 15 | 8 | 18 | 0 | 1 |
30 Days | 68 | 52 | 129 | 0 | 1 |
90 Days | 227 | 145 | 720 | 10 | 6 |
All Time | 261 | 155 | - | - | - |
Like all software activity quantification, these numbers are imperfect but sometimes useful. Comments, Labels, and Milestones refer to those issues opened in the timespan in question.
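As a quick sanity check on the issue table, the net change in open issues for each timespan is opened minus closed. The sketch below hard-codes the figures from the table; the names `ISSUE_STATS`, `net_open`, and `close_ratio` are illustrative assumptions and not part of any kotaemon tooling. The close ratio is only a rough indicator, since issues closed in a window may have been opened earlier.

```python
# Issue counts copied from the issue table: timespan -> (opened, closed).
ISSUE_STATS = {
    "7 Days": (15, 8),
    "30 Days": (68, 52),
    "90 Days": (227, 145),
    "All Time": (261, 155),
}


def net_open(opened: int, closed: int) -> int:
    """Issues opened in the window minus issues closed in the window."""
    return opened - closed


def close_ratio(opened: int, closed: int) -> float:
    """Closed issues per opened issue; 0.0 when nothing was opened."""
    return closed / opened if opened else 0.0


for span, (opened, closed) in ISSUE_STATS.items():
    print(
        f"{span:>8}: net {net_open(opened, closed):+d}, "
        f"close ratio {close_ratio(opened, closed):.2f}"
    )
```

The all-time row gives 261 - 155 = 106, which matches the open-issue count quoted in this report.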
Developer | Branches | PRs | Commits | Files | Changes |
---|---|---|---|---|---|
Tuan Anh Nguyen Dang (Tadashi_Cin) | 4 | 9/9/0 | 22 | 170 | 22034 |
cin-klein | 1 | 1/1/0 | 1 | 7 | 478 |
trducng | 1 | 0/0/0 | 7 | 10 | 390 |
cin-jimmy | 1 | 1/1/1 | 1 | 2 | 339 |
KennyWu | 1 | 0/1/0 | 1 | 5 | 303 |
Khoi-Nguyen Nguyen-Ngoc | 1 | 2/2/0 | 2 | 5 | 248 |
ronchengang | 1 | 3/2/1 | 2 | 3 | 204 |
a652 | 1 | 1/1/0 | 1 | 1 | 3 |
Frank Liu (fzliu) | 0 | 1/0/0 | 0 | 0 | 0 |
kan_cin (phv2312) | 0 | 2/0/1 | 0 | 0 | 0 |
None (ly0303521) | 0 | 0/0/1 | 0 | 0 | 0 |
PRs: created by that dev and opened/merged/closed-unmerged during the period
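To make the opened/merged/closed-unmerged notation concrete, the following sketch parses each PRs cell from the developer table and sums the three counts across contributors. The `PRCounts` type and `parse_pr_cell` helper are hypothetical names introduced only for this illustration, and the cell values are copied from the table.

```python
from typing import NamedTuple


class PRCounts(NamedTuple):
    opened: int
    merged: int
    closed_unmerged: int


def parse_pr_cell(cell: str) -> PRCounts:
    """Parse a cell such as '9/9/0' into opened/merged/closed-unmerged counts."""
    parts = cell.strip().split("/")
    if len(parts) != 3:
        raise ValueError(f"expected 'opened/merged/closed', got {cell!r}")
    return PRCounts(*(int(p) for p in parts))


# PRs column copied from the developer table, keyed by developer name.
PR_CELLS = {
    "Tuan Anh Nguyen Dang (Tadashi_Cin)": "9/9/0",
    "cin-klein": "1/1/0",
    "trducng": "0/0/0",
    "cin-jimmy": "1/1/1",
    "KennyWu": "0/1/0",
    "Khoi-Nguyen Nguyen-Ngoc": "2/2/0",
    "ronchengang": "3/2/1",
    "a652": "1/1/0",
    "Frank Liu (fzliu)": "1/0/0",
    "kan_cin (phv2312)": "2/0/1",
    "None (ly0303521)": "0/0/1",
}

parsed = [parse_pr_cell(cell) for cell in PR_CELLS.values()]
# zip(*parsed) groups the three counts column-wise so each can be summed.
totals = PRCounts(*(sum(column) for column in zip(*parsed)))
print(totals)
```

A merged count can exceed the opened count for a developer (as with KennyWu's 0/1/0) when a PR opened before the period was merged during it.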
The recent activity on the Cinnamon/kotaemon GitHub repository indicates a high volume of issues, with 106 open issues currently logged. The majority of these issues are categorized as bugs, with users reporting various problems related to GraphRAG integration, model configuration, and installation errors. Notably, there is a recurring theme of difficulties in setting up local models and ensuring compatibility with various dependencies, particularly in Docker environments.
Several issues highlight critical failures in functionality, such as the inability to retrieve or index documents correctly when using specific models or configurations. The presence of multiple reports regarding GraphRAG suggests that this feature may require further stabilization and clearer documentation for users.
Here are some of the most recently created and updated issues:
Issue #451: [BUG] NanoGraphRag / KeyError: '7'
Issue #450: [REQUEST] Project function suggested
Issue #449: [BUG] nano graph not writing the graph
Issue #448: [REQUEST] Settings Tab
Issue #447: [BUG] Why do some files run successfully and others don't
Issue #446: [BUG] 为什么有的文件可以,有的文件会报错 (Why do some files work while others raise errors)
Issue #445: [BUG]# unstructured_loader: Partition Process Hangs with Korean PNG, jpg, File
Issue #444: [BUG] Changes to Login Page are not being made
Issue #438: [BUG] Multiple issues when using an external MILVUS DB
Issue #437: [BUG] the usage issue of the graphrag feature
This ongoing pattern of issues suggests that while the project has robust functionality, it may benefit from improved documentation and stability testing, particularly around its more advanced features like GraphRAG and external integrations.
The analysis of the pull requests (PRs) for the Cinnamon/kotaemon project reveals a dynamic and active development environment. The project is focused on enhancing its capabilities in document interaction through a chat-based interface, integrating various large language models (LLMs), and providing advanced features like multi-modal QA support and hybrid retrieval pipelines. The PRs reflect ongoing efforts to improve functionality, fix bugs, and enhance user experience.
... MP3Reader class for processing MP3 files, expanding the project's ability to handle audio inputs.
... python-multipart version to avoid issues when building Docker images, showcasing attention to dependency management.
The PRs indicate several key themes in the development of the Cinnamon/kotaemon project:
Feature Enhancements: There is a strong focus on adding new features that expand the project's capabilities. For instance, PRs like #408 (Voyage embeddings) and #433 (nano-graphrag integration) highlight efforts to incorporate advanced technologies into the platform.
Community Engagement and Responsiveness: The quick turnaround on minor bug fixes (e.g., PR #441) and active discussions around feature implementations (e.g., PR #355) suggest a high level of community engagement and responsiveness from the maintainers.
Dependency Management and Technical Improvements: PRs addressing dependency issues (e.g., PR #436) and those that improve technical aspects of the project (e.g., PR #432, which allows usage without the Gradio app) reflect an ongoing effort to maintain a robust and reliable software foundation.
Documentation and Usability Enhancements: Efforts to improve documentation (e.g., PR #432 with usage examples) and usability features (e.g., a dark-mode toggle button added in PR #423) indicate a commitment to making the tool more accessible and user-friendly.
Integration with External Tools and Services: Several PRs involve integration with external services or tools, such as OCR services in PR #355 and embedding services in PR #408. This highlights the project's aim to be versatile and adaptable to various use cases.
In conclusion, the pull requests for Cinnamon/kotaemon demonstrate vibrant development activity focused on feature expansion, community engagement, technical robustness, usability improvements, and integration with external technologies. This aligns well with the project's goal of providing a powerful yet user-friendly tool for document interaction through advanced AI capabilities.
Tuan Anh Nguyen Dang (Tadashi_Cin) (taprosoft)
... nano-graphrag import and updated the pymupdf version in requirements.
... nano-graphrag feature, including LLM and embedding integration.
Khoi-Nguyen Nguyen-Ngoc (cin-niko)
Mikhail Khludnev (mkhludnev)
... graphrag input paths.
Kenny Wu (KKenny0)
Trung Duc Nguyen (trducng)
Ron Chengang (ronchengang)
Albert Quang (cin-albert)
... feat/docker_nanographrag, indicating that Docker-related enhancements are still being developed.