diff --git a/docs/plugins/index.md b/docs/plugins/index.md
index 9bb23080b..856a2076d 100644
--- a/docs/plugins/index.md
+++ b/docs/plugins/index.md
@@ -1,512 +1,30 @@
 # Plugins
 
-## What is a Plugin?
+Plugins provide a structured way to intercept and modify agent, tool, and model behavior across your entire application. While callbacks apply to a particular agent, plugins apply globally to all agents in the runner.
 
-A Plugin in Agent Development Kit (ADK) is a custom code module that can be
-executed at various stages of an agent workflow lifecycle using callback hooks.
-You use Plugins for functionality that is applicable across your agent workflow.
-Some typical applications of Plugins are as follows:
+Plugins are best used for adding custom behaviors such as:
 
--   **Logging and tracing**: Create detailed logs of agent, tool, and
-    generative AI model activity for debugging and performance analysis.
--   **Policy enforcement**: Implement security guardrails, such as a
-    function that checks if users are authorized to use a specific tool and
-    prevent its execution if they do not have permission.
--   **Monitoring and metrics**: Collect and export metrics on token usage,
-    execution times, and invocation counts to monitoring systems such as
-    Prometheus or 
-    [Google Cloud Observability](https://cloud.google.com/stackdriver/docs) 
-    (formerly Stackdriver).
--   **Response caching**: Check if a request has been made before, so you
-    can return a cached response, skipping expensive or time consuming AI model
-    or tool calls.
--   **Request or response modification**: Dynamically add information to AI
-    model prompts or standardize tool output responses.
+*   **Logging:** Track important information at each callback point.
+*   **Context Management:** Filter the LLM context to reduce its size.
+*   **Error Handling:** Implement custom logic to handle tool failures.
 
-!!! tip
-    When implementing security guardrails and policies, use ADK Plugins for
-    better modularity and flexibility than Callbacks. For more details, see 
-    [Callbacks and Plugins for Security Guardrails](/adk-docs/safety/#callbacks-and-plugins-for-security-guardrails).
+## How to Use
 
-!!! warning "Caution"
-    Plugins are not supported by the 
-    [ADK web interface](../evaluate/#1-adk-web-run-evaluations-via-the-web-ui). 
-    If your ADK workflow uses Plugins, you must run your workflow without the 
-    web interface.
+To use a plugin, you add it to the `plugins` list of your `App`.
 
-Tip: When implementing security guardrails and policies, use ADK Plugins for better modularity and flexibility than Callbacks. For more details, see [Callbacks and Plugins for Security Guardrails](../safety/index.md#callbacks-and-plugins-for-security-guardrails).
+```python
+from google.adk.apps import App
+from my_plugins import MyCustomPlugin
 
-## How do Plugins work?
-
-An ADK Plugin extends the `BasePlugin` class and contains one or more
-`callback` methods, indicating where in the agent lifecycle the Plugin should be
-executed. You integrate Plugins into an agent by registering them in your
-agent's `Runner` class. For more information on how and where you can trigger
-Plugins in your agent application, see
-[Plugin callback hooks](#plugin-callback-hooks).
-
-Plugin functionality builds on
-[Callbacks](../callbacks/), which is a key design
-element of the ADK's extensible architecture. While a typical Agent Callback is
-configured on a *single agent, a single tool* for a *specific task*, a Plugin is
-registered *once* on the `Runner` and its callbacks apply *globally* to every
-agent, tool, and LLM call managed by that runner. Plugins let you package
-related callback functions together to be used across a workflow. This makes
-Plugins an ideal solution for implementing features that cut across your entire
-agent application.
-
-## Define and register Plugins
-
-This section explains how to define Plugin classes and register them as part of
-your agent workflow. For a complete code example, see
-[Plugin Basic](https://github.com/google/adk-python/tree/main/contributing/samples/plugin_basic)
-in the repository.
-
-### Create Plugin class
-
-Start by extending the `BasePlugin` class and add one or more `callback`
-methods, as shown in the following code example:
-
-```py title="count_plugin.py"
-from google.adk.agents.base_agent import BaseAgent
-from google.adk.agents.callback_context import CallbackContext
-from google.adk.models.llm_request import LlmRequest
-from google.adk.plugins.base_plugin import BasePlugin
-
-class CountInvocationPlugin(BasePlugin):
-  """A custom plugin that counts agent and tool invocations."""
-
-  def __init__(self) -> None:
-    """Initialize the plugin with counters."""
-    super().__init__(name="count_invocation")
-    self.agent_count: int = 0
-    self.tool_count: int = 0
-    self.llm_request_count: int = 0
-
-  async def before_agent_callback(
-      self, *, agent: BaseAgent, callback_context: CallbackContext
-  ) -> None:
-    """Count agent runs."""
-    self.agent_count += 1
-    print(f"[Plugin] Agent run count: {self.agent_count}")
-
-  async def before_model_callback(
-      self, *, callback_context: CallbackContext, llm_request: LlmRequest
-  ) -> None:
-    """Count LLM requests."""
-    self.llm_request_count += 1
-    print(f"[Plugin] LLM request count: {self.llm_request_count}")
-```
-
-This example code implements callbacks for `before_agent_callback` and
-`before_model_callback` to count execution of these tasks during the lifecycle
-of the agent.
-
-### Register Plugin class
-
-Integrate your Plugin class by registering it during your agent initialization
-as part of your `Runner` class, using the `plugins` parameter. You can specify
-multiple Plugins with this parameter. The following code example shows how to
-register the `CountInvocationPlugin` plugin defined in the previous section with
-a simple ADK agent.
-
-```py
-from google.adk.runners import InMemoryRunner
-from google.adk import Agent
-from google.adk.tools.tool_context import ToolContext
-from google.genai import types
-import asyncio
-
-# Import the plugin.
-from .count_plugin import CountInvocationPlugin
-
-async def hello_world(tool_context: ToolContext, query: str):
-  print(f'Hello world: query is [{query}]')
-
-root_agent = Agent(
-    model='gemini-2.0-flash',
-    name='hello_world',
-    description='Prints hello world with user query.',
-    instruction="""Use hello_world tool to print hello world and user query.
-    """,
-    tools=[hello_world],
+my_app = App(
+    agent=my_agent,
+    plugins=[MyCustomPlugin()]
 )
-
-async def main():
-  """Main entry point for the agent."""
-  prompt = 'hello world'
-  runner = InMemoryRunner(
-      agent=root_agent,
-      app_name='test_app_with_plugin',
-
-      # Add your plugin here. You can add multiple plugins.
-      plugins=[CountInvocationPlugin()],
-  )
-
-  # The rest is the same as starting a regular ADK runner.
-  session = await runner.session_service.create_session(
-      user_id='user',
-      app_name='test_app_with_plugin',
-  )
-
-  async for event in runner.run_async(
-      user_id='user',
-      session_id=session.id,
-      new_message=types.Content(
-        role='user', parts=[types.Part.from_text(text=prompt)]
-      )
-  ):
-    print(f'** Got event from {event.author}')
-
-if __name__ == "__main__":
-  asyncio.run(main())
 ```
 
-### Run the agent with the Plugin
-
-Run the plugin as you typically would. The following shows how to run the
-command line:
-
-```sh
-python3 -m path.to.main
-```
-
-Plugins are not supported by the
-[ADK web interface](../evaluate/#1-adk-web-run-evaluations-via-the-web-ui).
-If your ADK workflow uses Plugins, you must run your workflow without the web
-interface.
-
-The output of this previously described agent should look similar to the
-following:
-
-```log
-[Plugin] Agent run count: 1
-[Plugin] LLM request count: 1
-** Got event from hello_world
-Hello world: query is [hello world]
-** Got event from hello_world
-[Plugin] LLM request count: 2
-** Got event from hello_world
-```
-
-
-For more information on running ADK agents, see the
-[Quickstart](/get-started/quickstart/#run-your-agent)
-guide.
-
-## Build workflows with Plugins
-
-Plugin callback hooks are a mechanism for implementing logic that intercepts,
-modifies, and even controls the agent's execution lifecycle. Each hook is a
-specific method in your Plugin class that you can implement to run code at a key
-moment. You have a choice between two modes of operation based on your hook's
-return value:
-
--   **To Observe:** Implement a hook with no return value (`None`). This
-    approach is for tasks such as logging or collecting metrics, as it allows
-    the agent's workflow to proceed to the next step without interruption. For
-    example, you could use `after_tool_callback` in a Plugin to log every
-    tool's result for debugging.
--   **To Intervene:** Implement a hook and return a value. This approach
-    short-circuits the workflow. The `Runner` halts processing, skips any
-    subsequent plugins and the original intended action, like a Model call, and
-    use a Plugin callback's return value as the result. A common use case is
-    implementing `before_model_callback` to return a cached `LlmResponse`,
-    preventing a redundant and costly API call.
--   **To Amend:** Implement a hook and modify the Context object. This
-    approach allows you to modify the context data for the module to be
-    executed without otherwise interrupting the execution of that module. For
-    example, adding additional, standardized prompt text for Model object execution.
-
-**Caution:** Plugin callback functions have precedence over callbacks
-implemented at the object level. This behavior means that Any Plugin callbacks
-code is executed *before* any Agent, Model, or Tool objects callbacks are
-executed. Furthermore, if a Plugin-level agent callback returns any value, and
-not an empty (`None`) response, the Agent, Model, or Tool-level callback is *not
-executed* (skipped).
-
-The Plugin design establishes a hierarchy of code execution and separates
-global concerns from local agent logic. A Plugin is the stateful *module* you
-build, such as `PerformanceMonitoringPlugin`, while the callback hooks are the
-specific *functions* within that module that get executed. This architecture
-differs fundamentally from standard Agent Callbacks in these critical ways:
-
--   **Scope:** Plugin hooks are *global*. You register a Plugin once on the
-    `Runner`, and its hooks apply universally to every Agent, Model, and Tool
-    it manages. In contrast, Agent Callbacks are *local*, configured
-    individually on a specific agent instance.
--   **Execution Order:** Plugins have *precedence*. For any given event, the
-    Plugin hooks always run before any corresponding Agent Callback. This
-    system behavior makes Plugins the correct architectural choice for
-    implementing cross-cutting features like security policies, universal
-    caching, and consistent logging across your entire application.
-
-### Agent Callbacks and Plugins
-
-As mentioned in the previous section, there are some functional similarities
-between Plugins and Agent Callbacks. The following table compares the
-differences between Plugins and Agent Callbacks in more detail.
-
-<table>
-  <thead>
-    <tr>
-      <th></th>
-      <th><strong>Plugins</strong></th>
-      <th><strong>Agent Callbacks</strong></th>
-    </tr>
-  </thead>
-  <tbody>
-    <tr>
-      <td><strong>Scope</strong></td>
-      <td><strong>Global</strong>: Apply to all agents/tools/LLMs in the
-<code>Runner</code>.</td>
-      <td><strong>Local</strong>: Apply only to the specific agent instance
-they are configured on.</td>
-    </tr>
-    <tr>
-      <td><strong>Primary Use Case</strong></td>
-      <td><strong>Horizontal Features</strong>: Logging, policy, monitoring,
-global caching.</td>
-      <td><strong>Specific Agent Logic</strong>: Modifying the behavior or
-state of a single agent.</td>
-    </tr>
-    <tr>
-      <td><strong>Configuration</strong></td>
-      <td>Configure once on the <code>Runner</code>.</td>
-      <td>Configure individually on each <code>BaseAgent</code> instance.</td>
-    </tr>
-    <tr>
-      <td><strong>Execution Order</strong></td>
-      <td>Plugin callbacks run <strong>before</strong> Agent Callbacks.</td>
-      <td>Agent callbacks run <strong>after</strong> Plugin callbacks.</td>
-    </tr>
-  </tbody>
-</table>
-
-## Plugin callback hooks
-
-You define when a Plugin is called with the callback functions to define in
-your Plugin class. Callbacks are available when a user message is received,
-before and after an `Runner`, `Agent`, `Model`, or `Tool` is called, for
-`Events`, and when a `Model`, or `Tool` error occurs. These callbacks include,
-and take precedence over, the any callbacks defined within your Agent, Model,
-and Tool classes.
-
-The following diagram illustrates callback points where you can attach and run
-Plugin functionality during your agents workflow:
-
-![ADK Plugin callback hooks](../assets/workflow-plugin-hooks.svg)
-**Figure 1.** Diagram of ADK agent workflow with Plugin callback hook
-locations.
-
-The following sections describe the available callback hooks for Plugins in
-more detail.
-
--   [User Message callbacks](#user-message-callbacks)
--   [Runner start callbacks](#runner-start-callbacks)
--   [Agent execution callbacks](#agent-execution-callbacks)
--   [Model callbacks](#model-callbacks)
--   [Tool callbacks](#tool-callbacks)
--   [Runner end callbacks](#runner-end-callbacks)
-
-### User Message callbacks
+## Available Plugins
 
-*A User Message c*allback (`on_user_message_callback`) happens when a user
-sends a message. The `on_user_message_callback` is the very first hook to run,
-giving you a chance to inspect or modify the initial input.\
-
--   **When It Runs:** This callback happens immediately after
-    `runner.run()`, before any other processing.
--   **Purpose:** The first opportunity to inspect or modify the user's raw
-    input.
--   **Flow Control:** Returns a `types.Content` object to **replace** the
-    user's original message.
-
-The following code example shows the basic syntax of this callback:
-
-```py
-async def on_user_message_callback(
-    self,
-    *,
-    invocation_context: InvocationContext,
-    user_message: types.Content,
-) -> Optional[types.Content]:
-```
-
-### Runner start callbacks
-
-A *Runner start* callback (`before_run_callback`) happens when the `Runner`
-object takes the potentially modified user message and prepares for execution.
-The `before_run_callback` fires here, allowing for global setup before any agent
-logic begins.
-
--   **When It Runs:** Immediately after `runner.run()` is called, before
-    any other processing.
--   **Purpose:** The first opportunity to inspect or modify the user's raw
-    input.
--   **Flow Control:** Return a `types.Content` object to **replace** the
-    user's original message.
-
-The following code example shows the basic syntax of this callback:
-
-```py
-async def before_run_callback(
-    self, *, invocation_context: InvocationContext
-) -> Optional[types.Content]:
-```
-
-### Agent execution callbacks
-
-*Agent execution* callbacks (`before_agent`, `after_agent`) happen when a
-`Runner` object invokes an agent. The `before_agent_callback` runs immediately
-before the agent's main work begins. The main work encompasses the agent's
-entire process for handling the request, which could involve calling models or
-tools. After the agent has finished all its steps and prepared a result, the
-`after_agent_callback` runs.
-
-**Caution:** Plugins that implement these callbacks are executed *before* the
-Agent-level callbacks are executed. Furthermore, if a Plugin-level agent
-callback returns anything other than a `None` or null response, the Agent-level
-callback is *not executed* (skipped).
-
-For more information about Agent callbacks defined as part of an Agent object,
-see
-[Types of Callbacks](../callbacks/types-of-callbacks/#agent-lifecycle-callbacks).
-
-### Model callbacks
-
-Model callbacks **(`before_model`, `after_model`, `on_model_error`)** happen
-before and after a Model object executes. The Plugins feature also supports a
-callback in the event of an error, as detailed below:
-
--   If an agent needs to call an AI model, `before_model_callback` runs first.
--   If the model call is successful, `after_model_callback` runs next.
--   If the model call fails with an exception, the `on_model_error_callback`
-    is triggered instead, allowing for graceful recovery.
-
-**Caution:** Plugins that implement the **`before_model`** and  `**after_model`
-**callback methods are executed *before* the Model-level callbacks are executed.
-Furthermore, if a Plugin-level model callback returns anything other than a
-`None` or null response, the Model-level callback is *not executed* (skipped).
-
-#### Model on error callback details
-
-The on error callback for Model objects is only supported by the Plugins
-feature works as follows:
-
--   **When It Runs:** When an exception is raised during the model call.
--   **Common Use Cases:** Graceful error handling, logging the specific
-    error, or returning a fallback response, such as "The AI service is
-    currently unavailable."
--   **Flow Control:**
-    -   Returns an `LlmResponse` object to **suppress the exception**
-        and provide a fallback result.
-    -   Returns `None` to allow the original exception to be raised.
-
-**Note**: If the execution of the Model object returns a `LlmResponse`, the
-system resumes the execution flow, and `after_model_callback` will be triggered
-normally.****
-
-The following code example shows the basic syntax of this callback:
-
-```py
-async def on_model_error_callback(
-    self,
-    *,
-    callback_context: CallbackContext,
-    llm_request: LlmRequest,
-    error: Exception,
-) -> Optional[LlmResponse]:
-```
-
-### Tool callbacks
-
-Tool callbacks **(`before_tool`, `after_tool`, `on_tool_error`)** for Plugins
-happen before or after the execution of a tool, or when an error occurs. The
-Plugins feature also supports a callback in the event of an error, as detailed
-below:\
-
--   When an agent executes a Tool, `before_tool_callback` runs first.
--   If the tool executes successfully, `after_tool_callback` runs next.
--   If the tool raises an exception, the `on_tool_error_callback` is
-    triggered instead, giving you a chance to handle the failure. If
-    `on_tool_error_callback` returns a dict, `after_tool_callback` will be
-    triggered normally.
-
-**Caution:** Plugins that implement these callbacks are executed *before* the
-Tool-level callbacks are executed. Furthermore, if a Plugin-level tool callback
-returns anything other than a `None` or null response, the Tool-level callback
-is *not executed* (skipped).
-
-#### Tool on error callback details
-
-The on error callback for Tool objects is only supported by the Plugins feature
-works as follows:
-
--   **When It Runs:** When an exception is raised during the execution of a
-    tool's `run` method.
--   **Purpose:** Catching specific tool exceptions (like `APIError`),
-    logging the failure, and providing a user-friendly error message back to
-    the LLM.
--   **Flow Control:** Return a `dict` to **suppress the exception**, provide
-    a fallback result. Return `None` to allow the original exception to be raised.
-
-**Note**: By returning a `dict`, this resumes the execution flow, and
-`after_tool_callback` will be triggered normally.
-
-The following code example shows the basic syntax of this callback:
-
-```py
-async def on_tool_error_callback(
-    self,
-    *,
-    tool: BaseTool,
-    tool_args: dict[str, Any],
-    tool_context: ToolContext,
-    error: Exception,
-) -> Optional[dict]:
-```
-
-### Event callbacks
-
-An *Event callback* (`on_event_callback`) happens when an agent produces
-outputs such as a text response or a tool call result, it yields them as `Event`
-objects. The `on_event_callback` fires for each event, allowing you to modify it
-before it's streamed to the client.
-
--   **When It Runs:** After an agent yields an `Event` but before it's sent
-    to the user. An agent's run may produce multiple events.
--   **Purpose:** Useful for modifying or enriching events (e.g., adding
-    metadata) or for triggering side effects based on specific events.
--   **Flow Control:** Return an `Event` object to **replace** the original
-    event.
-
-The following code example shows the basic syntax of this callback:
-
-```py
-async def on_event_callback(
-    self, *, invocation_context: InvocationContext, event: Event
-) -> Optional[Event]:
-```
-
-### Runner end callbacks
-
-The *Runner end* callback **(`after_run_callback`)** happens when the agent has
-finished its entire process and all events have been handled, the `Runner`
-completes its run. The `after_run_callback` is the final hook, perfect for
-cleanup and final reporting.
-
--   **When It Runs:** After the `Runner` fully completes the execution of a
-    request.
--   **Purpose:** Ideal for global cleanup tasks, such as closing connections
-    or finalizing logs and metrics data.
--   **Flow Control:** This callback is for teardown only and cannot alter
-    the final result.
-
-The following code example shows the basic syntax of this callback:
-
-```py
-async def after_run_callback(
-    self, *, invocation_context: InvocationContext
-) -> Optional[None]:
-```
+*   [Logging Plugin](logging_plugin.md)
+*   [Context Filter Plugin](context_filter_plugin.md)
+*   [Save Files As Artifacts Plugin](save_files_as_artifacts_plugin.md)
+*   [Reflect and Retry Tool Plugin](reflect_and_retry_tool_plugin.md)
diff --git a/docs/plugins/reflect_and_retry_tool_plugin.md b/docs/plugins/reflect_and_retry_tool_plugin.md
new file mode 100644
index 000000000..5a65d1557
--- /dev/null
+++ b/docs/plugins/reflect_and_retry_tool_plugin.md
@@ -0,0 +1,75 @@
+# Reflect and Retry Tool Plugin
+
+The `ReflectAndRetryToolPlugin` is a powerful plugin that enhances the resilience of your agents by providing a mechanism to handle tool failures gracefully. When a tool fails, this plugin intercepts the error, provides structured guidance to the LLM for reflection and correction, and retries the operation up to a configurable number of times.
+
+## How it Works
+
+The `ReflectAndRetryToolPlugin` works by implementing the `on_tool_error_callback` method. When a tool execution results in an error, this callback is triggered. The plugin then constructs a detailed reflection message for the LLM, which includes:
+
+*   The name of the tool that failed.
+*   The arguments that were passed to the tool.
+*   The error message that was returned.
+*   The number of retry attempts remaining.
+
+This reflection message guides the LLM to understand the cause of the failure and to adjust its subsequent actions to correct the error.
+
+## Features
+
+*   **Automatic Retry**: Automatically retries a failed tool execution up to a specified number of times.
+*   **LLM Reflection**: Provides detailed error information to the LLM to help it correct its mistakes.
+*   **Configurable**: You can configure the maximum number of retries and whether to throw an exception when the retry limit is exceeded.
+*   **Customizable**: You can extend the plugin to implement custom error handling logic.
+
+## How to Use
+
+To use the `ReflectAndRetryToolPlugin`, you need to add it to the `plugins` list of your `App`.
+
+```python
+from google.adk.apps import App
+from google.adk.agents import LlmAgent
+from google.adk.plugins import ReflectAndRetryToolPlugin
+
+# Define your agent
+my_agent = LlmAgent(
+    # ... your agent configuration
+)
+
+# Initialize the plugin
+error_handling_plugin = ReflectAndRetryToolPlugin(max_retries=3)
+
+# Create an App with the agent and plugin
+my_app = App(
+    agent=my_agent,
+    plugins=[error_handling_plugin]
+)
+```
+
+### Configuration
+
+The `ReflectAndRetryToolPlugin` can be configured with the following parameters:
+
+*   **`max_retries`** (int, optional): The maximum number of times to retry a failed tool execution. Defaults to `3`.
+*   **`throw_exception_if_retry_exceeded`** (bool, optional): If `True`, the plugin will raise the final exception when the retry limit is reached. If `False`, it will return guidance to the LLM instead. Defaults to `True`.
+
+### Customizing the Plugin
+
+You can also customize the behavior of the plugin by subclassing it and overriding its methods. For example, you can override the `extract_error_from_result` method to implement custom logic for detecting errors in tool responses.
+
+```python
+from google.adk.plugins import ReflectAndRetryToolPlugin
+
+class CustomRetryPlugin(ReflectAndRetryToolPlugin):
+    def extract_error_from_result(self, result: dict) -> str | None:
+        # Implement custom error detection logic here
+        if "custom_error_key" in result:
+            return result["custom_error_key"]
+        return super().extract_error_from_result(result)
+
+# Use the custom plugin
+custom_error_handling_plugin = CustomRetryPlugin(max_retries=5)
+
+my_app = App(
+    agent=my_agent,
+    plugins=[custom_error_handling_plugin]
+)
+```