Demostracion.1.1.mp4
- Introduction
- Objective
- Features
- Architecture
- Requirements
- Application
- Usage Instructions
- Contribution
- License
- Authors
The need for systems that allow users without technical knowledge to generate accurate SQL queries is becoming increasingly critical in the data era. While SQL is a powerful and versatile language, its use requires a deep understanding of data structures and relationships between tables in a database. This technical requirement has been a barrier for many users who need access to the information stored in databases.
This project addresses this problem by developing a system that translates natural language queries into SQL using an intelligent agent-based architecture. These agents work in coordination to identify relevant tables, generate SQL queries, execute the queries, and ultimately provide additional insights based on the obtained results.
Figure 1: Web Interface - DataWhisper
The primary goal of this project is to transform how users interact with relational databases, facilitating the generation, evaluation, and execution of SQL queries from natural language inputs. This system is designed to reduce technical barriers, improve accessibility for non-technical users, and provide an efficient solution for data management and analysis.
- Facilitates interaction with databases: Allows users to generate precise and optimized SQL queries using natural language, without needing advanced technical knowledge of SQL.
- Implements a multi-agent system: An architecture of intelligent agents that collaborate to perform specific tasks within the SQL query generation process, such as table selection, query generation, error iteration and correction, and the generation of insights.
- Accessible and user-friendly web interface: Allows users to interact with the system intuitively, including generating SQL queries, viewing results, and configuring personalized preferences.
- Optimization and cost reduction: Allows users to select models to optimize resources based on the specific use case.
- Ensures database security and integrity: Implements effective security measures against attacks like command or prompt injection, protecting the database's integrity.
The system's architecture is composed of several intelligent agents, each designed to perform specific tasks in the SQL query generation and execution process. These agents work collaboratively under the supervision of an orchestrator that ensures the correct sequence and coordination of tasks.
Figure 2: Architecture
The Proxy Agent is the first system component that interacts with the user's natural language queries (NLQ). Its primary function is to evaluate the relevance of these queries to determine if they are suitable for translation into SQL. This agent uses an advanced language model (GPT-3.5) to classify queries on a scale of 1 to 5, where 1 indicates total irrelevance and 5 indicates maximum relevance.
The Proxy Agent implementation has demonstrated a 95% accuracy rate.
Once the Proxy Agent has identified a relevant query, the next step in the process is table selection, which is performed by the Selector Agent. This agent is responsible for identifying the database tables most relevant to the SQL query being generated.
The Selector Agent employs two distinct approaches to accomplish this task:
- Language model-based approach (Option 1): This approach uses a language model, like GPT-3.5, to identify the most relevant tables based on the user's input. This method is highly accurate but also more costly in terms of processing and resource usage.
- Semantic approach with cosine similarity (Option 2): This approach combines semantic processing techniques with cosine similarity to identify relevant tables. Although it is less costly and faster than the language model-based approach, it may not be as accurate in all cases since it will always return
k
relevant tables, and there may be cases where more thank
tables are needed.
Both approaches have proven effective in table identification, with comparable results in tests. The choice between these two approaches will depend on the user's specific needs, including factors like cost, execution time, accuracy, query complexity, or the number of tables in the database.
Once the relevant tables have been selected, the next step is SQL query generation and execution. This process is carried out by a team of analyst agents, which include:
- Data Engineer Agent (SQL Coder): This agent is responsible for the initial SQL query generation based on the user's specifications and the selected tables. The agent ensures that the generated query is correctly structured and free of basic syntax errors or incorrect references.
- Sr Data Analyst Agent: This agent executes the SQL query generated by the Data Engineer Agent and verifies its correctness. If errors are detected, such as unexpected or empty results, the Sr Data Analyst Agent initiates an iterative correction process, sending feedback to the Data Engineer Agent to adjust the query until a correct result is obtained.
The iterative correction process has proven highly effective, reaching 95% accuracy after two iterations in tests conducted.
The Data Engineer Agent (SQL Coder) model can be changed from the GUI.
It supports small models hosted on HuggingFace.
By default, if no model is applied, it will use GPT 3.5.
In addition to generating and executing SQL queries, the system also includes an Explorer Agent, whose goal is to generate additional insights from the obtained results. This agent uses an advanced language model (GPT-4 in tests) to identify patterns and trends in the data that may not be evident in the original query.
The Explorer Agent generates new SQL queries based on the original query and provides insights that can be of great value to the user.
While using the Explorer Agent may significantly increase operational costs (especially when using advanced models like GPT-4), it also offers considerable added value by providing additional information. To reduce costs, it is recommended to use less expensive models like GPT-3 for this agent, depending on the project's specific needs.
Insight generation can be disabled from the GUI.
To access the language models used in the system, an OpenAI API key is required. This API key is necessary to authenticate requests to the OpenAI API and enable the use of models like GPT-3.5 and GPT-4.
For other models, you'll need the HuggingFace API key (optional), but you'll need it for Option 2 of table selection.
The user interface is designed with a focus on usability and accessibility, allowing users to interact with the system smoothly and efficiently. The web application has been developed using Django for the backend and Vue.js with Vite for the frontend, providing a modern and reactive architecture.
The backend, developed in Django, handles interactions with the database and language models. It uses Django REST Framework to expose the necessary APIs that interact with the agents and process user queries.
The frontend has been developed using Vue.js, a progressive JavaScript framework that facilitates building interactive user interfaces. Vite, a fast build tool, has been used to optimize frontend development, providing a more agile and efficient development experience.
The interface includes several key pages and components:
- Chat Page: Allows users to input natural language queries and view generated responses.
- Settings Page: Offers options to customize the system's behavior and database connection.
To use the system, follow the steps below:
-
Clone the repository
git clone https://github.com/pablosierrafernandez/DataWhisper-Arquitectura-basada-en-Agentes-para-text-to-sql.git
-
Set up the backend
-
Virtual Environment
cd DataWhisper-Arquitectura-basada-en-Agentes-para-text-to-sql cd back python -m venv env . # create virtual environment source env/bin/activate # Linux/MacOS source env\Scripts\activate # Windows pip install -r requirements.txt
-
Migrations
# in back/ python manage.py migrate
-
-
Set up the frontend
You need to have Node.js installed beforehand
# in front npm install
-
Start the services
# in back python manage.py runserver
# in front npm run dev
-
Access the application
Open your browser and go to
http://localhost:5173
to access the user interface.
-
Clone the repository
git clone https://github.com/pablosierrafernandez/DataWhisper-Arquitectura-basada-en-Agentes-para-text-to-sql.git
-
Start the containers
You should have Docker and Docker Compose installed.
docker-compose up --build
-
Access the application:
Open your browser and go to
http://localhost:5173
to access the user interface.
Contributions are welcome. If you have suggestions for improvements, new features, or find any issues, please open an issue or submit a pull request.
This project is licensed under the MIT License.
Check the LICENSE
file for more details.
- @pablosierrafernandez: Project researcher and developer.
In collaboration with:
Logo | Entity | Description |
---|---|---|
Universitat Rovira i Virgili | A higher education institution located in Tarragona, Spain, known for its academic excellence and multidisciplinary research. |