Computer Vision
Object Detection
YOLO-series object detection
Video Action Recognition
This is a collection of our video understanding work.
X-CLIP (@ECCV'22 Oral): Expanding Language-Image Pretrained Models for General Video Recognition
Human Pose Estimation
Survey papers
Vision-Language Models
OpenVLA