Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]: Agent finishes before tasks are fully complete #6204

Open
1 task done
aditya-choudhary599 opened this issue Jan 11, 2025 · 3 comments
Open
1 task done

[Bug]: Agent finishes before tasks are fully complete #6204

aditya-choudhary599 opened this issue Jan 11, 2025 · 3 comments
Assignees
Labels
bug Something isn't working

Comments

@aditya-choudhary599
Copy link

Is there an existing issue for the same bug?

  • I have checked the existing issues.

Describe the bug and reproduction steps

Description:

When the prompt to build a Conversational CRM Application using the MERN stack, augmented with Python for NLP capabilities was provided, the generated code failed to meet expectations. The output was incomplete, lacked functionality, and did not adhere to the requirements specified in the prompt.

Prompt Used:

Build a Conversational CRM Application using the MERN stack, augmented with Python for NLP capabilities, to provide a modern, user-friendly interface for managing sales and customer interactions.
Requirements:

  • Chatbot Interface: Develop a chatbot or virtual assistant as the core feature, enabling users to interact conversationally. It should support tasks like asking questions, scheduling meetings, and managing sales activities through chat.
  • Lead and Customer Management: Ensure the chatbot can update customer records, add new leads, and manage tasks based on natural conversational inputs.
  • Automated Follow-ups: Implement automated reminders and follow-up emails triggered by user interactions, improving workflow efficiency.
  • Reporting and Analytics: Create a system to generate and present sales reports and analytics through conversational commands, simplifying data access for users.
  • Third-Party Tool Integration: Enable integration with tools like email, calendars, and external CRMs to provide a seamless experience.
    Tech Stack:
  • MongoDB: Use MongoDB to store customer data, chat history, and related information.
  • Express.js: Develop the backend for handling API requests and bridging the front-end with chatbot services.
  • React.js: Build a dynamic and engaging user interface for the chatbot and CRM functionalities.
  • Node.js: Implement the server-side logic, manage APIs, and integrate the chatbot engine.
  • Python: Use Python for NLP tasks, powering the chatbot's natural language understanding and conversational flow.
    Objective:
    The application should deliver a conversational approach to CRM, transforming the way sales teams and customers interact while boosting productivity and improving the user experience.

Expected Behavior:

The generated code should have included:

  1. A fully functional and structured MERN stack application.
  2. A chatbot interface integrated with Python for NLP tasks, capable of handling conversational inputs and performing CRM tasks.
  3. Business logic for lead and customer management, automated follow-ups, reporting and analytics, and third-party tool integration.
  4. All necessary files, with non-empty, functional implementations for both backend and frontend components.
  5. A clear setup process to ensure the code is runnable without requiring significant manual intervention.

Observed Behavior:

  1. Many files were empty or contained only placeholder comments.
  2. Core business logic, including chatbot functionality, customer management, and analytics, was absent.
  3. The code was not runnable due to missing dependencies, configurations, and incomplete implementations.
  4. Generated files lacked proper folder structure or adherence to MERN stack conventions.
  5. Python NLP integration was either missing or implemented superficially, without demonstrating any meaningful capabilities.

OpenHands Installation

Development workflow

OpenHands Version

No response

Operating System

Linux

Logs, Errors, Screenshots, and Additional Context

sales-crm.tar.gz

@enyst
Copy link
Collaborator

enyst commented Jan 11, 2025

@aditya-choudhary599 you can give the agent this prompt, full with expectations and feedback, and tell it to fix it.

The LLM decides when to stop, in normal behavior, not openhands: the LLM may think it was finished before it should.

@neubig neubig changed the title [Bug]: Incomplete code generation [Bug]: Agent finishes before tasks are fully complete Jan 13, 2025
@neubig
Copy link
Contributor

neubig commented Jan 13, 2025

I agree that the LLM decides when to stop, but I think we need to have a better mechanism to double-check that the user's task has actually been fully completed.

@neubig
Copy link
Contributor

neubig commented Jan 13, 2025

Also highly related to #2221

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

4 participants