Metadata for all 622 UCI datasets

Tabular | Introduced 2024-10-23

This dataset is a 2022 extraction of metadata for all 622 datasets that existed at the time in the UCI Machine Learning Repository. Each record contains the following fields: the index; the dataset name; its URL; the instances (number of rows); the number of attributes (columns); the year it was created; the area (such as Life, Social, etc.); the web_hits at the time of extraction; the data folder URL, where the data were hosted on the internet; the dataset_file_url, the URL of the data file; the dataset_file_format (such as data, txt, Z, etc.); the names_file_url, the URL of the file that describes the attributes; the names_file_format, the format of that file; the attribute_info, which describes all the attributes (columns) in the dataset; the source; the data_set_information; the relevant_papers associated with the dataset; the papers_that_cite_this_data_set; and a final column with the number of papers that cite the dataset.
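As a rough illustration of how a metadata table like this might be explored, the sketch below reads rows with Python's standard csv module, counts datasets per area, and lists those above a size threshold. The header spellings (name, instances, area, and so on) are assumptions based on the field description above, and the three rows are synthetic placeholders, not values taken from the dataset.

```python
import csv
import io
from collections import Counter

# Synthetic placeholder rows, NOT real values from the dataset.
# Header spellings are assumptions based on the field description.
SAMPLE_CSV = """name,instances,attributes,year,area,dataset_file_format
toy_a,150,4,1990,Life,data
toy_b,1000,20,2005,Social,txt
toy_c,,8,2012,Life,Z
"""


def load_rows(handle):
    """Read metadata rows from an open CSV handle into a list of dicts."""
    return list(csv.DictReader(handle))


def to_int(value):
    """Convert a cell to int; return None for empty or non-numeric cells."""
    try:
        return int(value)
    except (TypeError, ValueError):
        return None


def count_by(rows, column):
    """Count how many rows share each value of the given column."""
    return Counter(row[column] for row in rows)


def at_least(rows, min_instances):
    """Return names of rows whose instance count is >= min_instances.

    Rows with a missing or unparsable instance count are skipped.
    """
    names = []
    for row in rows:
        n = to_int(row["instances"])
        if n is not None and n >= min_instances:
            names.append(row["name"])
    return names


rows = load_rows(io.StringIO(SAMPLE_CSV))
area_counts = count_by(rows, "area")
big = at_least(rows, 500)
print(area_counts)
print(big)
```

To run this against the real file, replace the io.StringIO handle with an opened file and adjust the column names to match its actual header. Counts are parsed defensively because some metadata cells may be empty.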