vqa-nle-llava

ImagesTextscc-by-4.0Introduced 2024-09-23

VQA NLE synthetic dataset, made with LLaVA-1.5 using features from GQA dataset. Total number of unique datas: 66684

Languages

eng

Supported Tasks

Question Answering

Dataset Usage

from datasets import load_dataset
dset = datasets.load_dataset("patrickamadeus/vqa-nle-llava", name='<CONFIG_NAME>', trust_remote_code=True)

Dataset Version

Source: 1.0.1. Date: 2024.09.25.

Dataset License

CC-BY 4.0

Citation

If you are using the VQA NLE LLaVA dataloader in your work, please cite the following:

@misc{irawan2024efficientrobustvqanledata,
      title={Towards Efficient and Robust VQA-NLE Data Generation with Large Vision-Language Models}, 
      author={Patrick Amadeus Irawan and Genta Indra Winata and Samuel Cahyawijaya and Ayu Purwarianti},
      year={2024},
      eprint={2409.14785},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2409.14785}, 
}