
Task-Centric Memory #5227

Open · rickyloynd-microsoft wants to merge 132 commits into main from agentic_memory

Conversation

rickyloynd-microsoft (Contributor) commented on Jan 28, 2025

(EXPERIMENTAL RESEARCH IN PROGRESS)

In 2023, AutoGen introduced Teachable Agents that users could teach new facts, preferences, and skills. But teachable agents were limited in several ways: they could only be ConversableAgent subclasses, they couldn't learn a new skill unless the user stated (in a single turn) both the task and how to solve it, and they couldn't learn on their own. Task-Centric Memory overcomes these limitations, allowing users to teach arbitrary agents (or teams) more flexibly and reliably, and enabling agents to learn from their own trial-and-error experiences.

This PR is large and complex. All of the files are new, and most of the added components depend on each other just to run. But the review process can be accelerated if approached in the following order.

  1. Start with the Task-Centric Memory README.
    1. Install the memory extension locally, since it won't be on PyPI until this PR is merged. From the agentic_memory branch, in the python/packages directory (these commands are also collected into a shell block after this list):
      • pip install -e autogen-agentchat
      • pip install -e autogen-ext[openai]
      • pip install -e autogen-ext[task-centric-memory]
    2. Run the Quickstart sample code, then immediately open the ~/pagelogs/quick/0 Call Tree.html file in a browser to view the work in progress.
    3. Click through the web page links to see the details.
  2. Continue through the rest of the main README to get a high-level overview of the architecture.
  3. Read through the code samples README, running each of the 4 code samples while viewing their page logs.
  4. Skim through the 4 code samples, along with their corresponding yaml config files:
    1. eval_retrieval.py
    2. eval_teachability.py
    3. eval_learning_from_demonstration.py
    4. eval_self_teaching.py
  5. Read task_centric_memory_controller.py, referring back to the previously generated page logs as needed. This is the most important and complex file in the PR.
  6. Read the remaining core files.
    1. _task_centric_memory_bank.py
    2. _string_similarity_map.py
    3. _prompter.py
  7. Read the supporting files in the utils dir.
    1. apprentice.py
    2. grader.py
    3. page_logger.py
    4. _functions.py
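
For convenience, here are the step 1.1 install commands gathered into a single copy-pasteable shell block (run from the python/packages directory on the agentic_memory branch; the extras are quoted so shells like zsh don't try to expand the brackets):

```bash
pip install -e autogen-agentchat
pip install -e "autogen-ext[openai]"
pip install -e "autogen-ext[task-centric-memory]"
```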

Make memory optional.
Filter out insights with negative scores.
Refactor memory paths.
Enrich page logging.
Seed messages with random int for variability.
Save sessions as yaml for readability.
Eval simplifications.
rickyloynd-microsoft (Contributor Author) commented:

> Thanks for the PR, Ricky. I left a few comments.
>
> This PR is very big, and it's hard to review this many file changes. Is it possible to make this PR incremental?
>
> I think that would help everyone give you feedback more carefully and on the most important bits first. But I also realize that it may not be feasible to break this PR up.

Followed your suggestion to put a recommended list of steps in the PR intro to help reviewers.

rickyloynd-microsoft changed the title from Agentic memory to Task-Centric Memory on Feb 14, 2025
print("- " + memo.insight)

asyncio.run(main())
```
ekzhu (Collaborator) commented on Feb 14, 2025

Perhaps another example like this. It can just be another code block in this file for now. In a future PR, when we write a user guide for this memory module, we can move this example there.

```python
import asyncio
from dataclasses import dataclass
from typing import List

from autogen_core import AgentId, MessageContext, RoutedAgent, SingleThreadedAgentRuntime, message_handler
from autogen_core.models import ChatCompletionClient, LLMMessage, SystemMessage, UserMessage
from autogen_ext.models.openai import OpenAIChatCompletionClient
from autogen_ext.task_centric_memory import PageLogger, TaskCentricMemoryController


@dataclass
class Message:
    content: str


class MemoryEnabledAgent(RoutedAgent):
    def __init__(
        self, description: str, model_client: ChatCompletionClient, task_memory_controller: TaskCentricMemoryController
    ) -> None:
        super().__init__(description)
        self._model_client = model_client
        self._task_memory_controller = task_memory_controller

    @message_handler
    async def handle_message(self, message: Message, context: MessageContext) -> Message:
        # Retrieve relevant memories for the task.
        memos = await self._task_memory_controller.retrieve_relevant_memos(task=message.content)
        # Format the memories for the model.
        formatted_memos = "Relevant context about the user: \n\n" + "\n".join([memo.insight for memo in memos])
        print(f"{'-'*38}Memos{'-'*38}:\n{formatted_memos}\n{'-'*80}")
        # Create the messages for the model with the retrieved memories.
        messages: List[LLMMessage] = [
            SystemMessage(content="You are a helpful assistant."),
            UserMessage(content=formatted_memos, source="user"),
            UserMessage(content=message.content, source="user"),
        ]
        # Call the model with the messages.
        model_result = await self._model_client.create(messages=messages)
        assert isinstance(model_result.content, str)
        # Send the model's response to the user.
        return Message(content=model_result.content)


async def main() -> None:
    client = OpenAIChatCompletionClient(model="gpt-4o")
    page_logger = PageLogger(config={"level": "DEBUG", "path": "~/pagelogs/quickstart"})  # Optional, but very useful.
    memory_controller = TaskCentricMemoryController(reset=True, client=client, logger=page_logger)

    # Add a few task-insight pairs as memories, where an insight can be any string that may help solve the task.
    await memory_controller.add_memo(task="What color do I like?", insight="Deep blue is my favorite color")
    await memory_controller.add_memo(task="What's another color I like?", insight="I really like cyan")
    await memory_controller.add_memo(task="What's my favorite food?", insight="Halibut is my favorite")

    # Create an agent runtime.
    runtime = SingleThreadedAgentRuntime()

    # Start the agent runtime.
    runtime.start()

    # Register the agent type.
    await MemoryEnabledAgent.register(
        runtime,
        "memory_enabled_agent",
        lambda: MemoryEnabledAgent(
            "An agent with memory", model_client=client, task_memory_controller=memory_controller
        ),
    )

    # Send a direct message to the agent.
    response = await runtime.send_message(
        Message(content="What colors do I like most"), AgentId("memory_enabled_agent", "default")
    )

    print("Agent response: " + response.content)

    # Stop the agent runtime.
    await runtime.stop()


asyncio.run(main())
```

Example output:

```
--------------------------------------Memos--------------------------------------:
Relevant context about the user:

Deep blue is my favorite color
I really like cyan
--------------------------------------------------------------------------------
Agent response: Based on the information you've provided, you really like deep blue and cyan.
```

This is to show how to use it as part of an agent implementation in the Core API.

Collaborator commented:

@victordibia could you comment on how we might leverage this module as an implementation of the autogen_core.memory.Memory interface?

Not for this PR, but for a future one.

Collaborator commented:

Good idea.
It seems to me that the example above could indeed be implemented (to some extent) using the Memory interface:

  • add maps to memory_controller.add_memo
  • update_context maps to the formatted_memos = ... step

A rough sketch of such an adapter is shown below.
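
The following is a minimal sketch (not part of this PR) of what that adapter could look like. It assumes the autogen_core.memory.Memory interface (add, query, update_context, clear, close) and uses only the controller methods already shown in this PR (add_memo, retrieve_relevant_memos); the TaskCentricMemory class name and the task-in-metadata convention are illustrative assumptions.

```python
from typing import Any

from autogen_core import CancellationToken
from autogen_core.memory import Memory, MemoryContent, MemoryMimeType, MemoryQueryResult, UpdateContextResult
from autogen_core.model_context import ChatCompletionContext
from autogen_core.models import SystemMessage
from autogen_ext.task_centric_memory import TaskCentricMemoryController


class TaskCentricMemory(Memory):  # Hypothetical adapter, not part of this PR.
    def __init__(self, controller: TaskCentricMemoryController) -> None:
        self._controller = controller

    async def add(self, content: MemoryContent, cancellation_token: CancellationToken | None = None) -> None:
        # `add` maps to add_memo; reading the task from metadata is an assumed convention.
        task = (content.metadata or {}).get("task", str(content.content))
        await self._controller.add_memo(task=task, insight=str(content.content))

    async def query(
        self, query: str | MemoryContent, cancellation_token: CancellationToken | None = None, **kwargs: Any
    ) -> MemoryQueryResult:
        task = query if isinstance(query, str) else str(query.content)
        memos = await self._controller.retrieve_relevant_memos(task=task)
        return MemoryQueryResult(
            results=[MemoryContent(content=memo.insight, mime_type=MemoryMimeType.TEXT) for memo in memos]
        )

    async def update_context(self, model_context: ChatCompletionContext) -> UpdateContextResult:
        # `update_context` maps to the formatted_memos step: retrieve memos relevant to the
        # latest message and inject them into the model context.
        messages = await model_context.get_messages()
        if not messages or not isinstance(messages[-1].content, str):
            return UpdateContextResult(memories=MemoryQueryResult(results=[]))
        query_result = await self.query(messages[-1].content)
        if query_result.results:
            formatted = "Relevant context about the user:\n\n" + "\n".join(
                str(r.content) for r in query_result.results
            )
            await model_context.add_message(SystemMessage(content=formatted))
        return UpdateContextResult(memories=query_result)

    async def clear(self) -> None:
        # The controller in this PR is reset via its constructor (reset=True); nothing is done here.
        pass

    async def close(self) -> None:
        pass
```

With an adapter like this, the memory could be attached wherever the Memory interface is accepted, while the controller's learning features (e.g. train_on_task) would remain available on the controller itself.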

ekzhu (Collaborator) commented on Feb 15, 2025

Yeah, I do think we can implement the autogen_core.memory.Memory interface using the controller. I can give it a try.

If that's the case, we should move this module to autogen_ext.memory.task_centric_memory.

rickyloynd-microsoft (Contributor Author) commented on Feb 15, 2025

Without losing any of the current functionality of TaskCentricMemoryController? Like learning from its own experience? (That's TaskCentricMemoryController.train_on_task(self, task: str, expected_answer: str), which requires the expected answer.)

TaskCentricMemoryController is still active research, so the interface is highly likely to change, as we discussed before. Why do you now want it in Core?

ekzhu (Collaborator) commented on Feb 15, 2025

> Why do you now want it in Core?

We are not moving it to core; it's staying in the extension. My earlier message refers to the autogen_ext module.

I am just trying to play with it and see what it takes.

rickyloynd-microsoft (Contributor Author) commented:

New code example from @ekzhu added to the README.

rickyloynd-microsoft (Contributor Author) commented:

> Yeah, I do think we can implement the autogen_core.memory.Memory interface using the controller. I can give it a try.
>
> If that's the case, we should move this module to autogen_ext.memory.task_centric_memory.

@ekzhu, should we wait for the results of your attempt to implement the autogen_core.memory.Memory interface with TaskCentricMemoryController before we try to clarify the relationship between TaskCentricMemory and that interface?

add_task_solution_pair_to_memory: Adds a task-solution pair to the memory bank, to be retrieved together later as a combined insight.
retrieve_relevant_memos: Retrieves any memos from memory that seem relevant to the task.
assign_task: Assigns a task to the agent, along with any relevant insights/memories.
handle_user_message: Handles a user message, extracting any advice and assigning a task to the agent.
Collaborator commented:

Add an example code block here for the simplest example that you have in the README.md and then add links to the sample directory for more advanced samples.

See https://github.com/microsoft/autogen/blob/agentic_memory/python/packages/autogen-ext/src/autogen_ext/models/openai/_openai_client.py#L1258 for an example of how to add doc-string code block.

rickyloynd-microsoft (Contributor Author) commented:

Added
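
For reference, a minimal docstring-style example along the lines requested, assembled from the Quickstart code shown earlier in this PR (the exact snippet added to the docstring may differ):

```python
import asyncio

from autogen_ext.models.openai import OpenAIChatCompletionClient
from autogen_ext.task_centric_memory import PageLogger, TaskCentricMemoryController


async def main() -> None:
    client = OpenAIChatCompletionClient(model="gpt-4o")
    logger = PageLogger(config={"level": "DEBUG", "path": "~/pagelogs/quickstart"})  # Optional, but very useful.
    memory_controller = TaskCentricMemoryController(reset=True, client=client, logger=logger)

    # Store a few task-insight pairs, then retrieve the memos relevant to a new task.
    await memory_controller.add_memo(task="What color do I like?", insight="Deep blue is my favorite color")
    await memory_controller.add_memo(task="What's my favorite food?", insight="Halibut is my favorite")
    memos = await memory_controller.retrieve_relevant_memos(task="What colors do I like most?")
    for memo in memos:
        print("- " + memo.insight)


asyncio.run(main())
```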

rickyloynd-microsoft added the "memory" label (Covering all aspects of fast learning for agents) on Feb 16, 2025
gagb self-requested a review on Feb 18, 2025
self.logger.info(task)

# Get a list of topics from the generalized task.
generalized_task = await self.prompter.generalize_task(task)
Contributor commented:

As discussed, there is an option to combine these two steps into a single API call. It would potentially sacrifice some accuracy, but it would go from a minimum of 2 LLM calls down to 1. (An illustrative sketch follows.)
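
As an illustration only (this is not the prompter's actual API; the prompt wording, JSON schema, and function name below are made up), the generalization and topic-extraction steps could be folded into one structured call:

```python
import json
from typing import List, Tuple

from autogen_core.models import ChatCompletionClient, SystemMessage, UserMessage

# Hypothetical combined prompt; the real prompts live in _prompter.py and may differ.
COMBINED_PROMPT = (
    "Rewrite the task below in a generalized form, stripped of incidental details, "
    'then list the topics it involves. Respond with JSON: {"generalized_task": str, "topics": [str, ...]}.'
)


async def generalize_and_extract_topics(client: ChatCompletionClient, task: str) -> Tuple[str, List[str]]:
    # One LLM call instead of two: generalization and topic extraction together.
    result = await client.create(
        messages=[SystemMessage(content=COMBINED_PROMPT), UserMessage(content=task, source="user")],
        json_output=True,
    )
    assert isinstance(result.content, str)
    parsed = json.loads(result.content)
    return parsed["generalized_task"], parsed["topics"]
```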

)
user_message = [
"Now put yourself in the mind of the students. What misconception led them to their incorrect answer?"
]
Contributor commented:

There is potential to combine these three LLM calls into a single call. It depends on the LLM, but GPT-4o should be able to do this in a single shot with a CoT prompt; less capable models might need extra reflection. (A rough illustration follows.)
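
Roughly what such a combined chain-of-thought prompt might look like, purely as an illustration; only the middle question is taken from the diff above, and the other two parts are assumed stand-ins for the surrounding calls:

```python
# Hypothetical single prompt replacing separate calls; the wording is illustrative, not from _prompter.py.
COMBINED_COT_PROMPT = """Think step by step, then answer in three labeled parts:
1. Compare the incorrect answer to the expected answer and describe where they diverge.
2. Now put yourself in the mind of the students. What misconception led them to their incorrect answer?
3. State one general insight that would help avoid this mistake on similar tasks in the future.
"""
```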

Labels: memory (Covering all aspects of fast learning for agents)

7 participants