Skip to content
Change the repository type filter

All

    Repositories list

    • sub

      Public
      0200Updated Jul 17, 2025Jul 17, 2025
    • TiViT

      Public
      Time Vision Transformer
      Python
      0600Updated Jul 2, 2025Jul 2, 2025
    • flair

      Public
      [CVPR 2025] FLAIR: VLM with Fine-grained Language-informed Image Representations
      Python
      48730Updated Jun 25, 2025Jun 25, 2025
    • [ICML 2025 Workshop MUGen] Align-then-Unlearn: Embedding Alignment for LLM Unlearning
      Python
      0100Updated Jun 17, 2025Jun 17, 2025
    • [ICLR 2025] Revealing and Reducing Gender Biases in Vision and Language Assistants (VLAs)
      Python
      0440Updated Jun 5, 2025Jun 5, 2025
    • DeLoRA

      Public
      [ICLR25] Official Implementation of "Decoupling Angles and Strength in Low-rank Adaptation"
      Python
      0920Updated May 3, 2025May 3, 2025
    • Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models
      Python
      22410Updated Apr 15, 2025Apr 15, 2025
    • Project Page for FLAIR [CVPR 2025]
      JavaScript
      0000Updated Apr 11, 2025Apr 11, 2025
    • EgoCVR

      Public
      [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval
      Python
      03830Updated Apr 11, 2025Apr 11, 2025
    • LoFT

      Public
      Synthetic Dataset Generation with Few-shot Guidance: LoFT [Arxiv] & DataDream [ECCV24]
      Python
      0420Updated Mar 31, 2025Mar 31, 2025
    • cosmos

      Public
      [CVPR 2025] COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training
      Python
      22620Updated Mar 27, 2025Mar 27, 2025
    • This is the official repository for the WikiBigEdit benchmark, introduced in the paper Understanding the Limits of Lifelong Knowledge Editing in LLMs.
      Python
      0600Updated Mar 13, 2025Mar 13, 2025
    • ReNO

      Public
      [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization
      Python
      1214350Updated Jan 27, 2025Jan 27, 2025
    • Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
      Python
      35700Updated Dec 10, 2024Dec 10, 2024
    • [NeurIPS 2023 Spotlight] In-Context Impersonation Reveals Large Language Models' Strengths and Biases
      Python
      12220Updated Nov 30, 2024Nov 30, 2024
    • ZerAuCap

      Public
      [NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords
      Python
      11700Updated Nov 30, 2024Nov 30, 2024
    • [ICLR 2024] Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model
      Python
      0100Updated Oct 31, 2024Oct 31, 2024
    • Python
      01700Updated Oct 5, 2024Oct 5, 2024
    • Official PyTorch implementation of "Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models" (ECCV 2024)
      Python
      4800Updated Aug 12, 2024Aug 12, 2024
    • DataDream

      Public
      [ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation"
      Python
      64150Updated Jul 24, 2024Jul 24, 2024
    • This repository contains the code for our DAGM GCPR 2023 paper "Text-to-feature diffusion for audio-visual few-shot learning"
      Python
      1800Updated Jul 23, 2024Jul 23, 2024
    • [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
      Python
      56910Updated Jul 4, 2024Jul 4, 2024
    • uot-fm

      Public
      Official Repository for "Unbalancedness in Neural Monge Maps Improves Unpaired Domain Translation" [ICLR 2024]
      Python
      51510Updated May 15, 2024May 15, 2024
    • Official repository for "Fantastic Gains and Where to Find Them: On the Existence and Prospect of General Knowledge Transfer between Any Pretrained Model" [ICLR 2024 spotlight]
      0700Updated Feb 20, 2024Feb 20, 2024
    • ECCV 2022: Abstracting Sketches through Simple Primitives
      Python
      52620Updated Jan 19, 2024Jan 19, 2024
    • ProbVLM

      Public
      ProbVLM: Probabilistic Adapter for Frozen Vision-Language Models
      Python
      64310Updated Dec 21, 2023Dec 21, 2023
    • Code for the paper "Addressing caveats of neural persistence with deep graph persistence".
      Python
      1400Updated Nov 29, 2023Nov 29, 2023
    • ReGaDa

      Public
      BMVC 2023: Video-adverb retrieval with compositional adverb-action embeddings
      Python
      1610Updated Nov 17, 2023Nov 17, 2023
    • CLEVR-X

      Public
      CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
      Python
      32800Updated Oct 27, 2023Oct 27, 2023
    • DeViL

      Public
      GCPR 2023 - DeViL: Decoding Vision features into Language
      Python
      11200Updated Oct 16, 2023Oct 16, 2023