跳转至

计算机视觉

计算机视觉

目标检测

  • YOLO系列目标检测

视频动作识别(Video Action Recognition)

This is a collection of our video understanding work

X-CLIP (@ECCV'22 Oral): Expanding Language-Image Pretrained Models for General Video Recognition

人体姿态估计

综述论文

视觉语言模型

OpenVLA