[IEEE TNNLS 2025] OS-RRG is a two-stage approach that incorporates a State-aware Balancing Diagnosis (SBD) module to alleviate inter- and intra-class imbalances in medical report generation and employs a State-guided Attention Intervention (SAI) technique to dynamically adjusts focus on key diagnostic features through targeted filtering and enhancement mechanisms.
Clone this repository and install the required packages:
git clone https://github.com/xmed-lab/OS_RRG.git
cd OS_RRG
conda create -n osrrg python=3.8
conda activate osrrg
pip install -r requirements.txt
MIMIC-CXR: The images can be downloaded from either physionet or R2Gen. Note that the physionet version requires a license for download. We use the R2Gen version for both training and evaluation. The annotation file can be downloaded from Google Drive. Please place all downloaded files under the data/mimic_cxr/
folder.
IU-Xray: Download the images from R2Gen and the annotation file from Google Drive. Please place both the images and annotation files under the data/iu_xray/
folder.
Moreover, you need to download the chexbert.pth
from here for evaluating clinical efficacy and put it under checkpoints/chexbert/
.
OS_RRG employs a two-stage training approach to align fine-grained visual observations to high-quality medical reports:
This stage focuses on aligning observation states with textual descriptions.
bash train_step1_Align.sh
This stage employs the SBD (State-aware Balancing Diagnosis) module to handle imbalanced observations states.
bash train_step2_SBD.sh
You can directly use our pre-trained models for evaluation:
- Step 1 - State-to-Description Alignment: Weights - Put at ./checkpoints/osrrg/
- Step 2 - Imbalanced Observation and State Mitigation (SBD): Weights - Put at ./checkpoints/osrrg/
# For MIMIC-CXR dataset
bash test_mimic.sh
# For IU-XRay dataset
bash test_iuxray.sh
OS_RRG is built upon the BLIP, PromptMRG, and SADE.
If you use this work in your research, please cite:
@ARTICLE{11095809,
author={Yang, Honglong and Tang, Hui and Song, Shanshan and Li, Xiaomeng},
journal={IEEE Transactions on Neural Networks and Learning Systems},
title={OS-RRG: Observation State-Aware Radiology Report Generation With Balanced Diagnosis and Attention Intervention},
year={2025},
volume={},
number={},
pages={1-15},
keywords={Accuracy;Medical diagnostic imaging;Radiology;Diseases;Telecommunication traffic;MIMICs;Linguistics;Heavily-tailed distribution;Communication switching;Training;Natural language processing;observation state (OS)-aware generation;observation-guided generation (OGG);radiology report generation (RRG)},
doi={10.1109/TNNLS.2025.3589103}}
For questions and issues, please use the GitHub issue tracker or contact [[email protected]].