Augmenting and Mixing Transformers with Synthetic Data for Image Captioning Create the environment conda create -y -n "synthcap" python=3.10 conda activate synthcap pip install -r requirements.txt Training TBD Evaluation TBD