diff --git a/README.md b/README.md index c6a026c..652110e 100644 --- a/README.md +++ b/README.md @@ -449,7 +449,11 @@ The goal of vision-language instruction tuning is to train a model that can effe - InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning [[paper]](https://arxiv.org/abs/2305.06500) [[code]](https://github.com/salesforce/LAVIS/tree/main/projects/instructblip) - `arXiv 2023/05` + `arXiv 2023/05` + +- BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions [[paper]](https://arxiv.org/abs/2308.09936) [[code]](https://github.com/mlpc-ucsd/BLIVA) + + `arXiv 2023/08`