VAAS: Vision-Attention Anomaly Scoring


What is VAAS?

VAAS is an inference-first, research-driven, dual-module vision library for image integrity analysis. It integrates Vision Transformer attention mechanisms with patch-level self-consistency analysis to enable fine-grained localization and detection of visual inconsistencies across diverse image integrity analysis tasks.

This repository provides the inference-ready implementation of VAAS for research engineers and practitioners.


Read Paper


Architecture

[Figure: VAAS methodology overview]

VAAS integrates two complementary components:

  • Fx — Global Attention Module
    A Vision Transformer capturing semantic/global irregularities from attention patterns.

  • Px — Patch Consistency Module
    A SegFormer-based model capturing local inconsistencies across image patches.

These two modules combine to produce three scores:

  • S_F — global attention fidelity
  • S_P — patch-level plausibility
  • S_H — hybrid anomaly score (final)

S_H is continuous and reflects relative anomaly intensity, not a binary decision.
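The exact combination rule is defined in the paper. Given the alpha parameter exposed by the pipeline (see Usage below), a plausible reading, assuming alpha acts as a convex mixing weight, is:

S_H = alpha * S_F + (1 - alpha) * S_P

where larger alpha weights the global attention score more heavily. Treat this as a sketch of the scoring idea; consult the paper for the authoritative formulation.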


Installation

pip install vaas

Or if you prefer using uv:

uv add vaas

Important: VAAS requires PyTorch and torchvision at inference time.

The library itself can be installed and imported even if PyTorch is not present.

Install the correct PyTorch build for your system (CPU / CUDA / ROCm):

https://pytorch.org/get-started/locally/
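For example, a CPU-only build can typically be installed from PyTorch's official CPU wheel index:

pip install torch torchvision --index-url https://download.pytorch.org/whl/cpu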


Usage

1. Basic inference on local and online images

from vaas.inference.pipeline import VAASPipeline
from PIL import Image
import requests
from io import BytesIO

pipeline = VAASPipeline.from_pretrained(
    "OBA-Research/vaas-v1-df2023",
    device="cpu",
    alpha=0.5
)

# Option A: Using a local image
# image = Image.open("example.jpg").convert("RGB")
# result = pipeline(image)

# Option B: Using an online image
url = "https://raw.githubusercontent.com/OBA-Research/VAAS/main/examples/images/COCO_DF_C110B00000_00539519.jpg"
image = Image.open(BytesIO(requests.get(url, timeout=30).content)).convert("RGB")
result = pipeline(image)

print(result)
anomaly_map = result["anomaly_map"]

Output Format

{
  "S_F": float,
  "S_P": float,
  "S_H": float,
  "anomaly_map": numpy.ndarray  # shape (224, 224)
}
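The anomaly map can be post-processed with standard NumPy operations. A minimal sketch, assuming the map holds scores in [0, 1] with higher values meaning more anomalous (the 0.5 threshold is illustrative, not a calibrated default):

import numpy as np

# Continue from the inference example above
anomaly_map = result["anomaly_map"]  # (224, 224) float array

# Binarize at an illustrative threshold to get a rough localization mask
binary_mask = (anomaly_map >= 0.5).astype(np.uint8)

# Fraction of the map flagged as anomalous at this threshold
print(f"Flagged {binary_mask.mean():.1%} of the map at threshold 0.5")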

2. Inference with visual explanation

VAAS can also generate a qualitative visualization combining:

  • Patch-level anomaly heatmaps (Px)
  • Global attention maps (Fx)
  • Final hybrid anomaly score (S_H)

pipeline.visualize(
    image=image,
    save_path="vaas_visualization.png",
    mode="all",        # options: "all", "px", "binary", "fx"
    threshold=0.5,
)

This will save a figure containing:

  • Original image
  • Patch-level anomaly overlays
  • Global attention overlays
  • A gauge-style visualization of the hybrid anomaly score

The examples below illustrate realistic manipulation scenarios where visual integrity is compromised through structural or semantic inconsistencies.

[Example visualizations: inference with visual explanation]
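Batch inference and a folder-level CLI are still on the roadmap (see below), but folder-level processing already works by looping over the single-image API. A minimal sketch; the images/ directory is hypothetical:

from pathlib import Path
from PIL import Image
from vaas.inference.pipeline import VAASPipeline

pipeline = VAASPipeline.from_pretrained(
    "OBA-Research/vaas-v1-df2023",
    device="cpu",
    alpha=0.5
)

# "images/" is a hypothetical local folder of JPEGs
for path in sorted(Path("images").glob("*.jpg")):
    image = Image.open(path).convert("RGB")
    result = pipeline(image)
    print(f"{path.name}: S_H = {result['S_H']:.3f}")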

Example Notebooks and Colab

A complete set of Google Colab notebooks demonstrating VAAS v0.1.7 is available here:

👉 examples/notebooks/vaas_v017/

Each notebook is inference-only and runnable without local setup.

If you would like to contribute a notebook, see CONTRIBUTING.md for guidelines.


Model Variants (Planned & Released)

Version  Training Data        Description                        Reported Evaluation (Paper)                Hugging Face Model
v1       DF2023 (10%)         Initial public inference release   F1 / IoU reported on DF2023 & CASIA v2.0   vaas-v1-df2023
v2       DF2023 (≈50%)        Planned scale-up experiment        Planned                                    TBD
v3       DF2023 (100%)        Full-dataset training (planned)    Planned                                    TBD
v4       DF2023 + CASIA v2.0  Cross-dataset study (planned)      Cross-dataset eval planned                 TBD
v5       Other datasets       Exploratory generalisation study   TBD                                        TBD

These planned variants aim to study the effect of training scale, dataset diversity, and cross-dataset benchmarking on generalisation and score calibration.

Notes on Model Scope

VAAS models may be trained with emphasis on different classes of visual integrity violations (e.g. splicing, identity manipulation, text editing, structural deformation, or AI-generated artifacts).

These variants share the same inference API and scoring framework, but may differ in training data composition and calibration depending on the target integrity focus.

Reported Quantitative Performance

Quantitative detection and localisation metrics for VAAS are reported in the accompanying paper under a defined evaluation protocol.

Under the experimental setup described in the paper:

  • DF2023 (10% subset)
    F1: 94.9%
    IoU: 91.1%

  • CASIA v2.0
    F1: 94.1%
    IoU: 89.0%

These metrics are dataset- and protocol-specific and should be interpreted in conjunction with the methodology described in the paper.

Roadmap (Inference-Focused)

  • Batch inference and folder-level CLI
  • Richer visualisation modes
  • More efficient backbones
  • Expose rich image embeddings
  • Cross-dataset inferencing
  • Model compression
  • Extended anomaly-map visualisation
  • ONNX / TorchScript export
  • Use cases with Streamlit / Gradio


Contributing

We welcome contributions that improve the usability, robustness, and extensibility of VAAS.

Please see the full guidelines in CONTRIBUTING.md.


Citation

If you use VAAS in your research, please cite both the software and the associated paper as appropriate.

@software{vaas,
  title        = {VAAS: Vision-Attention Anomaly Scoring},
  author       = {Bamigbade, Opeyemi and Scanlon, Mark and Sheppard, John},
  year         = {2025},
  publisher    = {Zenodo},
  doi          = {10.5281/zenodo.18064355},
  url          = {https://doi.org/10.5281/zenodo.18064355}
}

@article{bamigbade2025vaas,
  title={VAAS: Vision-Attention Anomaly Scoring for Image Manipulation Detection in Digital Forensics},
  author={Bamigbade, Opeyemi and Scanlon, Mark and Sheppard, John},
  journal={arXiv preprint arXiv:2512.15512},
  year={2025}
}

License

MIT License.


Maintainers

OBA-Research
https://github.com/OBA-Research
https://huggingface.co/OBA-Research
