Skip to content

Agent Generates Hallucinated Code and Deviates from Provided File Data #16

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
Muhammad-Ahsan-Rasheed opened this issue Oct 20, 2024 · 0 comments

Comments

@Muhammad-Ahsan-Rasheed
Copy link

The project is set up using the Ollama framework with the search and vector database functionalities disabled. The goal is to get responses strictly from the provided file (Large Dataset). However, after assigning the file to the Data Analysis agent, the agent generates code with made-up dataframes and deviates from the expected goal of analyzing the provided data file.

Even after providing clear and precise prompts, the agent continues to hallucinate, making up variables and data structures that do not exist in the context of the provided file.

Steps to Reproduce:

  1. Set up the project using Ollama, disabling search and vector database.
  2. Assign a data file to the Data Analysis agent.
  3. Provide a clear prompt expecting the analysis of the file's content.
  4. Observe the agent's behavior as it generates code involving made-up dataframes and unrelated data structures.

Expected Behavior:
The Data Analysis agent should strictly adhere to the provided file's data and perform analysis only on the file without hallucinating or creating made-up variables.

Actual Behavior:
The agent generates analysis code that includes made-up dataframes and variables that deviate from the provided file, despite clear prompts.

Suggested Fix:
Improve the prompt-to-code parsing so that the agent strictly focuses on the provided file’s content and refrains from creating any hallucinated or irrelevant dataframes/variables.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant