Amenity Detection using Zero-Shot Classification with CLIP

Detection of amenities in property images using pre-trained CLIP and ALIGN models.

This is a simple image-text retrieval system built to experiment with the pre-trained CLIP and ALIGN models. Given an image of a room and a pair of complementary labels (e.g. "there is a microwave", "there is no microwave"), the model predicts whether the amenity is present in the image. The prediction can then be cross-checked against the property description to detect possible discrepancies.
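The complementary-label idea can be sketched without loading a model. The snippet below is a minimal illustration, not the code in this repository: it assumes you already have one image-text similarity score per label (for example, the `logits_per_image` values returned by a Hugging Face CLIP model) and turns each pair of scores into a presence probability with a two-way softmax. The function names, the 0.5 threshold, and the cross-check against listed amenities are all hypothetical choices made for this example.

```python
import math
from typing import Dict, Set, Tuple


def build_prompts(amenity: str) -> Tuple[str, str]:
    """Return the (positive, negative) label pair for one amenity.

    The article is hard-coded as "a", so words like "oven" would need
    extra handling in a real prompt builder.
    """
    return (f"there is a {amenity}", f"there is no {amenity}")


def presence_probability(pos_score: float, neg_score: float) -> float:
    """Two-way softmax over the similarity scores of one label pair."""
    # Subtract the larger score before exponentiating for numerical stability.
    top = max(pos_score, neg_score)
    e_pos = math.exp(pos_score - top)
    e_neg = math.exp(neg_score - top)
    return e_pos / (e_pos + e_neg)


def find_discrepancies(
    predicted: Dict[str, float],
    listed: Set[str],
    threshold: float = 0.5,  # assumed cut-off, not taken from the project
) -> Dict[str, str]:
    """Compare predicted presence probabilities with the listed amenities."""
    issues = {}
    for amenity, prob in predicted.items():
        detected = prob >= threshold
        if detected and amenity not in listed:
            issues[amenity] = "detected but not listed"
        elif not detected and amenity in listed:
            issues[amenity] = "listed but not detected"
    return issues


if __name__ == "__main__":
    # Made-up similarity scores standing in for real model output.
    scores = {"microwave": (25.0, 20.0), "tv": (18.0, 21.0)}
    probs = {name: presence_probability(p, n) for name, (p, n) in scores.items()}
    print(build_prompts("microwave"))
    print(find_discrepancies(probs, listed={"tv"}))
```

A single image usually shows only one room, so a "listed but not detected" result is a prompt for manual review rather than proof that the description is wrong.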

There are many other approaches and models that could be used for this task (e.g. object detection with YOLO). The goal of this project is to experiment with the multi-modal models CLIP and ALIGN and see how they perform on it.

Data

The model is tested on a sample of the Kaggle Room street dataset, which contains images of rooms and houses.

App

The app is a simple Streamlit app that lets the user upload an image and select a set of amenities to detect. The model then predicts whether each amenity is present in the image.

Installation

Clone the repository
Install the package
```
pip install poetry
poetry install
```
Run the app
```
streamlit run src/app/app.py
```
