Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets/VideoInstruct

VideoInstruct

Video Instruction Dataset

TextsVideosCreative Commons Attribution 4.0Introduced 2023-06-08

Video Instruction Dataset is used to train Video-ChatGPT. It consists of 100,000 high-quality video instruction pairs. employs a combination of human-assisted and semi-automatic annotation techniques, aiming to produce high-quality video instruction data. These methods create question-answer pairs related to

Video summarization
Description-based question-answers (exploring spatial, temporal, relationships, and reasoning concepts)
Creative/generative question-answers

Benchmarks

Generative Visual Question Answering/mean Generative Visual Question Answering/Correctness of Information Generative Visual Question Answering/Detail Orientation Generative Visual Question Answering/Contextual Understanding Generative Visual Question Answering/Temporal Understanding Generative Visual Question Answering/Consistency Generative Visual Question Answering/gpt-score VCGBench-Diverse/mean VCGBench-Diverse/Correctness of Information VCGBench-Diverse/Detail Orientation VCGBench-Diverse/Contextual Understanding VCGBench-Diverse/Temporal Understanding VCGBench-Diverse/Consistency VCGBench-Diverse/Dense Captioning VCGBench-Diverse/Spatial Understanding VCGBench-Diverse/Reasoning Video-based Generative Performance Benchmarking/mean Video-based Generative Performance Benchmarking/Correctness of Information Video-based Generative Performance Benchmarking/Detail Orientation Video-based Generative Performance Benchmarking/Contextual Understanding Video-based Generative Performance Benchmarking/Temporal Understanding Video-based Generative Performance Benchmarking/Consistency Video-based Generative Performance Benchmarking/gpt-score Video-based Generative Performance Benchmarking (Correctness of Information)/gpt-score Visual Question Answering (VQA)/mean Visual Question Answering (VQA)/Correctness of Information Visual Question Answering (VQA)/Detail Orientation Visual Question Answering (VQA)/Contextual Understanding Visual Question Answering (VQA)/Temporal Understanding Visual Question Answering (VQA)/Consistency Visual Question Answering (VQA)/gpt-score

Statistics

Papers: 30
Benchmarks: 31

Links

Tasks

Generative Visual Question Answering VCGBench-Diverse Video Question Answering Video-based Generative Performance Benchmarking Video-based Generative Performance Benchmarking (Consistency)Video-based Generative Performance Benchmarking (Contextual Understanding)Video-based Generative Performance Benchmarking (Correctness of Information)Video-based Generative Performance Benchmarking (Detail Orientation))Video-based Generative Performance Benchmarking (Temporal Understanding)Visual Question Answering (VQA)