EgoTV Egocentric Task Verification from Natural Language Task Descriptions
Repo to reproduce results for Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings
Implementation for the CVPR 2023 paper "Improving Selective Visual Question Answering by Learning from Your Peers" (https://arxiv.org/abs/2306.08751)
DVSR ("Consistent Direct Time-of-Flight Video Depth Super-Resolution"), CVPR 2023
Pinpointing Why Object Recognition Performance Degrades Across Income Levels and Geographies
PyTorch code and models for the DINOv2 self-supervised learning method.
DynamicStereo: Consistent Dynamic Depth from Stereo Videos. CVPR 2023
MTM Masked Trajectory Models for Prediction, Representation, and Control.
SiLK (Simple Learned Keypoint) is a self-supervised deep learning keypoint model.
GliTr Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction
calculate color difference metric for binocular rivalry, delta E bino
The codes reproduce the figures and statistics in the paper, "Cumulative differences between paired samples." The repo also provides the LaTeX and BibTeX sources required for replicating the paper.
Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning
replication code for "Node Attribute Prediction on Multilayer Networks with Weighted and Directed Edges"
MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection