Generate image embeddings from Croissant datasets using DenseNet and package results in RO-Crate format.
- Loads CM4AI immunofluorescence images from a Croissant dataset
- Generates 1024-dimensional embeddings using pre-trained DenseNet121
- Saves embeddings and metadata to TSV files
- Creates an RO-Crate package for reproducible research
pip install mlcroissant torch torchvision Pillow pandas numpy tqdm fairscape-cli- Place
cm4ai_if_images_croissant.jsonin the working directory - Run
jupyter notebook demo.ipynb - Execute cells sequentially
./densenet_embeddings/image_embeddings.tsv- 1024-dim embedding vectors./densenet_embeddings/ro-crate-metadata.json- Research object metadata