Adaptive Intelligent Medical Multi-Agents (AIM²)
Foundation models have shown strong potential in clinical applications, yet their integration into real-world medical workflows remains limited by rigid prompting strategies and a lack of collaborative reasoning. We introduce AIM², a multi-agent framework designed to emulate dynamic clinical decision-making through structured, context-aware collaboration among large language models (LLMs). The superscript “2” reflects the system’s dual foundation in multi-modal understanding and multi-agent coordination. AIM² first interprets the task complexity and clinical modality, then automatically assigns either solo or team-based agents with specialized roles and reasoning scopes. These agents engage in multi-round deliberation when appropriate, simulating multidisciplinary team (MDT) workflows common in hospitals. We evaluate AIM² on a diverse set of multi-modal medical questions involving radiological images and free-text prompts. The results illustrate AIM²’s capacity to adaptively balance efficiency and depth of reasoning while maintaining transparent, role-grounded interactions. This framework bridges the gap between powerful foundation models and practical, adaptive medical reasoning systems.
- Features
- Repository Structure
- Installation
- Usage
- Configuration
- Examples
- Testing
- Contributing
- License
- Contact
- Vision Expert: Extracts structured imaging findings and generates a professional report.
- Difficulty Triage: Classifies case complexity into low / moderate / high.
- Adaptive Recruitment: Builds a GP, an Expert Team, or a full MDT + Challenger based on complexity.
- Multi-Round Deliberation: Experts and a Challenger agent debate under a Moderator's guidance.
- Transparent Audit Trail: Outputs include the final answer, chain-of-thought traces, and timestamped logs.
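Taken together, these features form a short pipeline: image to report, report to triage, triage to recruitment, recruitment to deliberation. Below is a minimal sketch of how the pieces might be wired together, using the class names under src/agents/ and src/engine/; the constructor arguments and the recruit()/run() signatures are illustrative assumptions, not the exact API.

```python
# Hypothetical end-to-end flow. Class names follow src/agents/ and src/engine/;
# the recruit() and run() signatures below are assumptions for illustration.
from src.agents.vision_expert import VisionExpert
from src.agents.difficulty_agent import DifficultyAgent
from src.agents.recruiter import Recruiter
from src.engine.discussion_engine import DiscussionEngine

def answer_case(image_path: str, question: str) -> str:
    report = VisionExpert().analyze_image(image_path)           # structured imaging report
    complexity = DifficultyAgent().classify(report, question)   # "low" / "moderate" / "high"
    team = Recruiter().recruit(complexity)                      # GP, expert team, or MDT + Challenger
    return DiscussionEngine(team).run(report, question)         # multi-round deliberation -> final answer
```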
E6895-AIM/
├── README.md
├── LICENSE
├── requirements.txt
├── .gitignore
│
├── src/
│ ├── __init__.py
│ ├── utils.py # helpers (HF login, model loading)
│ ├── app.py # Gradio interface
│ │
│ ├── agents/
│ │ ├── __init__.py
│ │ ├── vision_expert.py # Define VisionExpert
│ │ ├── difficulty_agent.py # Define DifficultyAgent
│ │ ├── recruiter.py # Define Recruiter
│ │ ├── expert_agent.py # Define ExpertAgent
│ │ ├── challenger_agent.py # Define ChallengerAgent
│ │ └── moderator_agent.py # Define ModeratorAgent
│ │
│ └── engine/
│ ├── __init__.py
│ └── discussion_engine.py # Define DiscussionEngine
│
└── tests/
└── test_agents.py # pytest unit tests
1. Clone the repository
git clone https://github.com/tianshuai-gao/E6895-AIM.git
cd E6895-AIM
2. Create a virtual environment
python3 -m venv .venv
source .venv/bin/activate # Linux/macOS
.venv\Scripts\activate # Windows
3. Install dependencies
pip install -r requirements.txt
4. Login to Hugging Face
huggingface-cli login
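If you prefer to authenticate from code rather than the CLI prompt, a programmatic login is sketched below, assuming huggingface_hub is pulled in via requirements.txt and your token lives in an HF_TOKEN environment variable (both assumptions).

```python
import os
from huggingface_hub import login

# Reads a personal access token from the environment instead of prompting interactively.
# HF_TOKEN is an assumed variable name; set it to your Hugging Face token beforehand.
login(token=os.environ["HF_TOKEN"])
```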
- Command-Line Demo
python src/app.py \
--image-path ./data/sample_mri.jpg \
--question "Is there evidence of hemorrhage?"
This will print:
- Imaging report
- Complexity label
- Recruited team
- Multi-round discussion logs
- Final answer
- Interactive Gradio Demo
python -m src.app
Then open your browser at http://localhost:7860 to:
- Upload an image
- Enter a clinical question
- Step through each discussion round interactively
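src/app.py defines the full interface; for orientation, a stripped-down sketch of a comparable Gradio app is shown below. The answer_case stub and the component choices are illustrative, not the repository's actual code.

```python
import gradio as gr

def answer_case(image_path, question):
    # Placeholder for the AIM² pipeline: report -> triage -> recruitment -> deliberation.
    return f"(final answer for: {question})"

demo = gr.Interface(
    fn=answer_case,
    inputs=[gr.Image(type="filepath", label="Medical image"),
            gr.Textbox(label="Clinical question")],
    outputs=gr.Textbox(label="Final answer"),
    title="AIM² demo",
)

if __name__ == "__main__":
    demo.launch(server_port=7860)  # matches the http://localhost:7860 URL above
```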
All key settings live in src/utils.py or can be overridden via environment variables:
# src/utils.py (example)
MODEL_LLMT = "ContactDoctor/Bio-Medical-Llama-3-8B"
MODEL_VLM = "ContactDoctor/Bio-Medical-MultiModal-Llama-3-8B-V1"
QUANTIZATION:
  load_in_4bit: true
  compute_dtype: float16
# You can also set:
export AIM_LLMT_MODEL="gpt-4o-mini"
export AIM_VLM_MODEL="openai/gpt-4o-vision-preview"
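One way utils.py can honor those overrides is an environment-first lookup. A sketch follows; the default model IDs match the example above, while the fallback logic itself is an assumption.

```python
import os

# Environment variables take precedence over the defaults baked into src/utils.py.
MODEL_LLMT = os.environ.get("AIM_LLMT_MODEL", "ContactDoctor/Bio-Medical-Llama-3-8B")
MODEL_VLM = os.environ.get("AIM_VLM_MODEL", "ContactDoctor/Bio-Medical-MultiModal-Llama-3-8B-V1")
```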
Adjust max_rounds per complexity:
| Complexity | Max Rounds | Allow Escalation? |
|------------|------------|-------------------|
| low        | 2          | No                |
| moderate   | 2 (→4)     | Yes               |
| high       | 4          | No                |
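The table translates directly into configuration; a sketch, where the dictionary name and shape are illustrative:

```python
# Per-complexity deliberation budget, mirroring the table above.
ROUNDS_POLICY = {
    "low":      {"max_rounds": 2, "allow_escalation": False},
    "moderate": {"max_rounds": 2, "allow_escalation": True, "escalated_max_rounds": 4},
    "high":     {"max_rounds": 4, "allow_escalation": False},
}
```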
- Low complexity: Solo GP agent answers directly.
- Moderate complexity: 2–3 specialist agents discuss, may escalate to deep track.
- High complexity: Full MDT + Challenger, up to 4 rounds of debate.
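The routing described above amounts to a simple branch inside the Recruiter. A sketch follows, with role names taken from the feature list; the function body and specialist labels are assumptions.

```python
def recruit(complexity: str) -> list[str]:
    # Maps the triage label to the team that will deliberate.
    if complexity == "low":
        return ["GP"]  # solo agent answers directly
    if complexity == "moderate":
        return ["Specialist-1", "Specialist-2", "Moderator"]  # small expert team, may escalate
    return ["Specialist-1", "Specialist-2", "Specialist-3",
            "Challenger", "Moderator"]  # full MDT + Challenger
```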
Run the unit tests with pytest:
pytest -q
The existing suite covers:
- VisionExpert: analyze_image returns a non-empty string; query_roi returns the stubbed answer.
- DifficultyAgent: classify correctly maps "low", "moderate", and "high", and falls back to moderate.

Add new tests under tests/ for any additional agents or utilities you implement; a sketch follows below.
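A new test can follow the same pattern as tests/test_agents.py. The file name, the classify signature, and the exact assertion below are illustrative rather than copied from the existing suite.

```python
# tests/test_difficulty_agent.py (illustrative)
from src.agents.difficulty_agent import DifficultyAgent

def test_classify_falls_back_to_moderate():
    agent = DifficultyAgent()
    # An unrecognizable case description should fall back to "moderate", as noted above.
    assert agent.classify("ambiguous report", "unclear question") == "moderate"
```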
- Fork the repository.
- Create a feature branch:
git checkout -b feature/your-feature-name
- Commit your changes:
git commit -m "Add [short description of feature]"
- Push to your fork and open a Pull Request.
Please ensure that:
- Code follows PEP8 style conventions.
- Public APIs are documented with docstrings.
- New features include appropriate unit tests.
This project is licensed under the MIT License. See the LICENSE file for details.
Tianshuai Gao
- Email: [email protected]
- GitHub: tianshuai-gao/E6895-AIM