Change the repository type filter
All
Repositories list
97 repositories
DitHub
Publicfed-mammoth
Public- [CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval
- Democratising RGBA Image Generation With No $$$ (AI4VA@ECCV24)
ReT-2
PublicRecurrence Meets Transformers for Universal Multimodal Retrievalcoldfront
PublicCHAIR-DPO
PublicMLLMs-FlowTracker
PublicMissRAG
Publicmammoth
PublicAn Extendible (General) Continual Learning Framework based on Pytorch - official codebase of Dark Experience for General Continual LearningLLaVA-MORE
Public[ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction TuningScanDiff
Public- Official PyTorch implementation for "Zero-Shot Styled Text Image Generation, but Make It Autoregressive" (CVPR25)
TransFusion
PublicOfficial codebase of "Update Your Transformer to the Latest Release: Re-Basin of Task Vectors" - ICML 2025pacscore
Public[CVPR 2023 & IJCV 2025] Positive-Augmented Contrastive Learning for Image and Video Captioning EvaluationReflectiVA
Public[CVPR 2025] Augmenting Multimodal LLMs with Self-Reflective Tokens for Knowledge-based Visual Question Answering- [TPDL 2025] Generating Synthetic Data with Large Language Models for Low-Resource Sentence Retrieval
MAD
PublicOfficial PyTorch implementation for "Merging and Splitting Diffusion Paths for Semantically Coherent Panoramas", presenting the Merge-Attend-Diffuse operator (ECCV24)DICE
PublicCoDE
Public[ECCV'24] Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local SimilaritiesMaPeT
Publicsynthcap_pp
Publicmammoth-lite
PublicSanctuaria-Gaze
PublicSanctuaria-Gaze is a multimodal dataset of egocentric recordings from visits to four sanctuaries in Northern Italy. Alongside the data, we release an open-source framework for automatic detection and analysis of Areas of Interest (AOIs), enabling gaze-based research in dynamic, real-world settings without manual annotation.FourBi
PublicBinarizing Documents by Leveraging both Space and Frequency. (ICDAR 2024)COGT
PublicHySAC
Public- ITSERR WP8 - Code for Latin embeddings semantic search