Dataherald, an open-source natural language-to-SQL engine, aims to democratize data access by allowing non-technical users to query SQL databases using natural language. Despite its promising utility for enterprise-level question answering, recent development activity has been minimal, with the last significant contributions occurring over a month ago.
Recent issues and pull requests (PRs) highlight ongoing challenges and improvements in database connectivity and error handling. Issues such as #518 and #505 indicate persistent problems with database connections, while PRs like #513 address specific bugs in the API. The development team has focused on dependency updates to enhance security and performance, as seen in PRs #521 and #517. However, the lack of recent commits suggests a need for renewed focus and contributions.
dependabot[bot]
Ashvin (ashvin-a)
Amir A. Zohrenejad (aazo11)
.env.example
file and fixed formatting issues.Daniel Martin (daniel309)
tecz
.env
file.Dennis Paul (dnnspaul)
Ikko Eltociear Ashimine (eltociear)
agent_prompts.py
and README.md
.Dishen (DishenWang2023)
Ryan Watts (rwatts3)
Juan Valacco (valakJS)
Mohammadreza Pourreza (MohammadrezaPourreza)
Juan Carlos José Camacho (jcjc712)
Ainesh Pandey (dh-datateam-ainesh)
Developer | Avatar | Branches | PRs | Commits | Files | Changes |
---|---|---|---|---|---|---|
None (dependabot[bot]) | 1 | 1/0/1 | 1 | 1 | 2 |
PRs: created by that dev and opened/merged/closed-unmerged during the period
Timespan | Opened | Closed | Comments | Labeled | Milestones |
---|---|---|---|---|---|
7 Days | 0 | 0 | 0 | 0 | 0 |
30 Days | 0 | 0 | 0 | 0 | 0 |
90 Days | 3 | 2 | 3 | 3 | 1 |
All Time | 41 | 38 | - | - | - |
Like all software activity quantification, these numbers are imperfect but sometimes useful. Comments, Labels, and Milestones refer to those issues opened in the timespan in question.
Recent GitHub issue activity for the Dataherald project shows a mix of open and closed issues, with a focus on bug fixes, feature requests, and user support. Notably, there are several issues related to database connectivity and error handling, indicating ongoing challenges in these areas. The project also sees active community engagement, with users contributing to discussions and solutions.
Database Connectivity Issues: Several issues (#518, #505, #407) highlight problems with database connections, particularly with specific characters in connection URIs or compatibility with certain database versions. This suggests a recurring theme of connectivity challenges that may require more robust error handling or documentation improvements.
Error Handling and Documentation: Issues like #471 and #443 point to gaps in error handling and documentation. Users have reported unclear error messages and missing documentation for certain features, indicating a need for better guidance and more informative error reporting.
Feature Requests for LLM Integration: There is significant interest in integrating custom and open-source LLMs (#439, #259), reflecting a broader trend towards customizable AI solutions. This aligns with the project's goal of supporting diverse AI models for SQL generation.
Community Contributions: The project benefits from active community involvement, as seen in discussions around feature enhancements (#449) and support for additional databases (#231). This engagement is crucial for the project's evolution and responsiveness to user needs.
The issues reflect ongoing efforts to improve database connectivity, enhance error handling, and expand LLM integration capabilities. The project's active maintenance and community engagement are evident in the resolution of various technical challenges and feature requests.
The provided data includes a list of open and closed pull requests (PRs) for the Dataherald project, an open-source natural language-to-SQL engine. The PRs cover various updates, bug fixes, and feature enhancements across multiple components of the project.
#521: Bumps langchain-community
from 0.0.25 to 0.2.9 in the services/engine
. This update includes several minor and patch changes, such as adding new features, fixing bugs, and improving existing functionalities.
#520: Updates next
from 13.4.10 to 14.1.1 in the services/admin-console
. This upgrade involves numerous changes and improvements to the Next.js framework.
#517: Upgrades braces
from 3.0.2 to 3.0.3 in the services/slackbot
, addressing a vulnerability issue.
#514: Updates ws
from 7.5.9 to 7.5.10 in the services/slackbot
, which includes a bug fix for a crash issue.
#513: Fixes a broken PUT request for updating database connections in the API by correcting an attribute error.
#501: Upgrades express
from 4.18.2 to 4.19.2 in the services/slackbot
, including security improvements and bug fixes.
#500: Updates follow-redirects
from 1.15.2 to 1.15.6 in the services/slackbot
, enhancing security by dropping proxy authorization across hosts.
#499: Bumps requests
from 2.31.0 to 2.32.2 in the services/enterprise
, addressing security vulnerabilities and improving performance.
#498: Updates langchain
from 0.0.230 to 0.1.0 in the services/enterprise
, introducing new features and enhancements.
#490: Upgrades pymysql
from 1.1.0 to 1.1.1 in the services/engine
, fixing a vulnerability related to SQL injection.
#519: Added documentation for environment variables of the engine.
#516: Introduced a new environment variable for specifying the embedding model in the engine.
#515: Fixed sorting of table relevance scores in output.
#511: Corrected a typo in the enterprise service's example .env
file.
#510: (Not merged) Attempted to bump langchain-community
version but was superseded by #521.
#509: (Not merged) Work-in-progress on adding a new semantic layer agent.
#508: Updated .env.example
.
#507: Fixed regression in s3.py
.
#506: Made S3 parameters more dynamic for compatibility with alternatives like MinIO.
#502: Fixed disabled functions for organization creation with Stripe integration.
The pull requests reflect ongoing efforts to maintain and enhance the Dataherald project, focusing on dependency updates, bug fixes, and feature additions across its various components.
A significant portion of the PRs involves updating dependencies to newer versions, primarily addressing security vulnerabilities and performance improvements (e.g., #521, #520, #517). These updates are crucial for maintaining software integrity and ensuring compatibility with other libraries and frameworks used within the project.
Another recurring theme is enhancing functionality through new features or improvements, such as adding support for different embedding models (#516) or improving sorting mechanisms (#515). These changes aim to refine user experience and expand the project's capabilities.
Some PRs were not merged due to being superseded by newer updates or requiring further development (e.g., #510, #509). This indicates active project management where priorities are reassessed based on evolving requirements or better solutions emerging during development cycles.
The introduction of new environment variables (#516) and documentation updates (#519) highlights efforts to improve configurability and user guidance, making it easier for users to deploy and customize Dataherald according to their needs.
Several PRs address security concerns directly by updating vulnerable dependencies (#517, #501) or fixing potential issues within the codebase (#490). This focus on security ensures that Dataherald remains a reliable tool for enterprise-level applications where data integrity is paramount.
Overall, these pull requests demonstrate a well-managed open-source project with active contributions aimed at enhancing functionality, maintaining security standards, and improving user experience through thoughtful updates and documentation enhancements.
Ashvin (ashvin-a)
Amir A. Zohrenejad (aazo11)
.env.example
file, fixed formatting issues, and updated formatter to use the official black formatter.s3.py
.Daniel Martin (daniel309)
tecz
.env
file.Dennis Paul (dnnspaul)
Ikko Eltociear Ashimine (eltociear)
agent_prompts.py
.README.md
for authentication typo.Dishen (DishenWang2023)
env.example
files.Ryan Watts (rwatts3)
Juan Valacco (valakJS)
Mohammadreza Pourreza (MohammadrezaPourreza)
Juan Carlos José Camacho (jcjc712)
Ainesh Pandey (dh-datateam-ainesh)
dependabot[bot]
langchain-community
, next
, braces
, and ws
.