VideoNet: A Large-Scale Dataset for Domain-Specific Action Recognition
VideoNet is a large-scale domain-specific action recognition benchmark and training dataset with 1,000 distinct actions across 37 domains, designed to revitalize action recognition evaluation for modern vision-language models.
Jun 2, 2026