Datasets

395 machine learning datasets


ACR Appropriateness Criteria Corpus

This dataset contains chunked guidelines and narratives from the ACR Appropriateness Criteria, a set of societal guidelines from the American College of Radiology (ACR) that help clinicians order appropriate diagnostic imaging studies for patients. The corpus is formatted similarly to the corpora introduced in MedRAG by Xiong et al. (2024), and can therefore be used in the same way for medical Retrieval-Augmented Generation (RAG).

1 paper | 0 benchmarks | Medical, Texts

MediConfusion

MediConfusion is a challenging medical Visual Question Answering (VQA) benchmark dataset that probes the failure modes of medical Multimodal Large Language Models (MLLMs) from a vision perspective. We reveal that state-of-the-art models are easily confused by image pairs that are otherwise visually dissimilar and clearly distinct for medical experts. Our benchmark consists of 176 confusing pairs. A confusing pair is a set of two images that share the same question and corresponding answer options, but for which the correct answer differs between the two images. We evaluate models based on their ability to answer both questions correctly within a confusing pair, which we call set accuracy. This metric indicates how well models can tell the two images apart, as a model that selects the same answer option for both images for all pairs will receive 0% set accuracy. We also report confusion, a metric that describes the proportion of confusing pairs where the model ha

1 paper | 0 benchmarks | Biomedical, Images, Medical, Texts
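The set accuracy metric described in the MediConfusion entry can be sketched as follows. This is a minimal illustration, not the benchmark's official scoring code: the function name `set_accuracy` and the data layout (dictionaries keyed by a hypothetical `(pair_id, image_index)` tuple) are assumptions made for the example.

```python
from collections import defaultdict


def set_accuracy(predictions, answers):
    """Fraction of confusing pairs in which BOTH images are answered correctly.

    predictions, answers: dicts mapping (pair_id, image_index) -> option label.
    The keying scheme is an assumption made for this sketch.
    """
    per_pair = defaultdict(list)
    for (pair_id, image_index), gold in answers.items():
        predicted = predictions.get((pair_id, image_index))
        per_pair[pair_id].append(predicted == gold)

    # A pair counts only if it has two images and both answers are right.
    solved = sum(
        1 for results in per_pair.values()
        if len(results) == 2 and all(results)
    )
    return solved / len(per_pair)


# Two toy confusing pairs: within each pair the correct option differs.
gold = {(0, 0): "A", (0, 1): "B", (1, 0): "B", (1, 1): "A"}

always_a = {key: "A" for key in gold}  # same option for every image
half_right = {(0, 0): "A", (0, 1): "B", (1, 0): "B", (1, 1): "B"}

print(set_accuracy(always_a, gold))    # 0.0
print(set_accuracy(half_right, gold))  # 0.5
```

Because the two images in a pair never share a correct answer, a model that always picks the same option gets at most one image per pair right, which is why its set accuracy is 0.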

SCARED-C (SCARED-Corrupted)

SCARED-C was introduced to assess the robustness of endoscopic depth prediction models. It is part of the EndoDepth benchmark, which is designed to evaluate monocular depth prediction models specifically in endoscopic scenarios. The dataset features 16 types of image corruption, each with five levels of severity, covering challenges typical of endoscopic imaging such as lens distortion, resolution alterations, specular reflection, and color changes. The ground truth is that of the original SCARED test set.

1 paper | 2 benchmarks | Biomedical, Images, Medical
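To make the size of the SCARED-C evaluation grid concrete, the sketch below enumerates every (corruption type, severity) condition. The corruption names are hypothetical placeholders, since the entry names only four of the 16 types, and numbering the severity levels 1 to 5 is likewise an assumption.

```python
from itertools import product

NUM_CORRUPTION_TYPES = 16      # stated in the dataset description
SEVERITY_LEVELS = range(1, 6)  # five levels; the 1..5 numbering is assumed

# Placeholder names: the real corruption identifiers are not given here.
corruption_names = [
    f"corruption_{i:02d}" for i in range(1, NUM_CORRUPTION_TYPES + 1)
]


def evaluation_conditions():
    """Return every (corruption, severity) pair a model would be tested on."""
    return list(product(corruption_names, SEVERITY_LEVELS))


conditions = evaluation_conditions()
print(len(conditions))  # 80
```

Each of the 80 conditions is scored against the same uncorrupted ground truth, so per-condition errors can be compared directly.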

RaTE-NER

The RaTE-NER dataset is a large-scale radiological named entity recognition (NER) dataset comprising 13,235 manually annotated sentences from 1,816 reports in the MIMIC-IV database. It spans 9 imaging modalities and 23 anatomical regions.

1 paper | 0 benchmarks | Medical, Texts

Bengali Social Media Depressive Dataset (BSMDD)

Our dataset, BSMDD, was collected from various open social media platforms and translated and annotated by native Bengali speakers with expertise in both language and mental health. It contains 21,910 cleaned samples, including 10,961 labeled as Depressed and 10,949 as Non-Depressed. The dataset is publicly accessible, providing a valuable resource for further research in depression detection in Bengali social media content. The expert annotation process, conducted by professionals, ensures high validity, making BSMDD particularly important for advancing mental health research through social media analysis. This dataset is also published on Mendeley.

1 paper | 0 benchmarks | Medical, Texts

PASSION dataset (PASSION derm 2024 dataset)

PASSION derm is a pioneering initiative dedicated to closing the diversity gap in dermatology datasets. This project provides a unique dataset of skin condition images from Sub-Saharan Africa, with a focus on richly pigmented skin. The dataset is designed to emulate teledermatology settings and includes images of common pediatric skin conditions, such as eczema, fungal infections, scabies, and impetigo, in diverse quality and resolution. PASSION derm aims to improve access to dermatologic care in regions with limited healthcare resources.

1 paper | 0 benchmarks | Images, Medical

41598_2022_22531_MOESM2_ESM.xlsx

The datasets used and analysed from the glucose clamp study are available in this Excel file. They include pseudonymised information on the participants, somatometric data, biomarkers of lipid metabolism and parameters of insulin-glucose homeostasis, i.e. concentrations of insulin, glucose and c-peptide as well as data from glucose-clamp experiments, HOMA, SPINA Carb parameters (SPINA-GBeta and SPINA-GR), Matsuda index, insulinogenic index, disposition index and McAuley index.

1 paper | 0 benchmarks | Biomedical, Medical, Tabular, Time series

41598_2022_22531_MOESM1_ESM.dif

The datasets used and analysed from the glucose clamp study are available in this DIF file. They include pseudonymised information on the participants, somatometric data, biomarkers of lipid metabolism and parameters of insulin-glucose homeostasis, i.e. concentrations of insulin, glucose and c-peptide as well as data from glucose-clamp experiments, HOMA, SPINA Carb parameters (SPINA-GBeta and SPINA-GR), Matsuda index, insulinogenic index, disposition index and McAuley index.

1 paper | 0 benchmarks | Biomedical, Medical, Tabular, Time series

U-10: United-10 COVID19 CT Dataset

This dataset supports the research detailed in the pre-print "Virtual Imaging Trials Improved the Transparency and Reliability of AI Systems in COVID-19 Imaging." The study employs both clinical and simulated CT data to evaluate AI models for COVID-19 diagnosis. By leveraging the Virtual Imaging Trials (VIT) framework, the research addresses reproducibility and generalizability issues prevalent in medical imaging AI models.

1 paper | 1 benchmark | 3D, Images, Medical

BraTS PEDs 2023 (The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs))


1 paper | 0 benchmarks | 3D, MRI, Medical

EGC-FPHFS (Early Gastric Cancer Data from First People's Hospital of Foshan)

High-resolution early gastric cancer (EGC) detection and analysis. Patient data: the dataset includes images from patients diagnosed with gastric cancer, distinguishing early gastric cancer (EGC) from non-pathogenic gastric cancer (NGC). The study used data from 341 patients, of whom 124 were classified as EGC and 217 as NGC. Image types: high-resolution images obtained from endoscopy. Data volume: 1,120 images for EGC detection and 2,150 images for NGC.

1 paper | 1 benchmark | Images, Medical

Reddit Posts Related To Eating Disorders and Dieting (Topic Annotations on Reddit Posts from Eating Disorders and Dieting Forums by Human and LLMs)

This dataset comprises 77,175 Reddit posts from 115 subreddit forums, annotated for the presence of 15 topics related to eating disorders and dieting. The dataset includes labels and scores on all 77,175 Reddit posts, determined by 5 Large Language Models: GPT-4o, Llama-3.1-8B-Instruct, Qwen2.5-7B-Instruct, Mistral-7B-Instruct-v0.3, Vicuna-7b-v1.5, as well as by the ensemble of the four open-source LLMs. The dataset also includes a subset of 1,080 human-annotated posts for evaluation.

1 paper | 0 benchmarks | Medical, Texts

RAOS (Rethinking Abdominal Organ Segmentation)

Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases.

1 paper | 0 benchmarks | 3D, Biomedical, Images, Medical

ENSeg

This dataset represents an enhanced subset of the ENS dataset. The ENS dataset comprises image samples extracted from the enteric nervous system (ENS) of male adult Wistar rats (Rattus norvegicus, albinus variety), specifically from the jejunum, the second segment of the small intestine.

1 paper | 1 benchmark | Biology, Images, Medical

Liver-US (Liver Ultrasound Dataset for Medical Image Classification)

The Liver-US dataset is a comprehensive collection of high-quality ultrasound images of the liver, including both normal and abnormal cases. This dataset is designed to facilitate research in medical image classification, with a focus on liver-related conditions. It includes a diverse range of ultrasound images acquired from multiple clinical settings, providing a robust foundation for developing and validating machine learning models in medical image analysis.

1 paper | 1 benchmark | Biomedical, Images, Medical

PlainFact

PlainFact is a high-quality human-annotated dataset with fine-grained explanation (i.e., added information) annotations.

1 paper | 0 benchmarks | Medical, Texts

LIRCAD (Inria Liver vessels subbranch anatomical nomenclature labels - "LIRCAD")

The structure for the dataset is as follows:

1 paper | 0 benchmarks | Images, Medical

LLM Health Benchmarks (LLM Health Benchmarks - Yesil Science)

The Health Benchmarks Dataset is a specialized resource for evaluating large language models (LLMs) in different medical specialties. It provides structured question-answer pairs designed to test the performance of AI models in understanding and generating domain-specific knowledge.

1 paper | 0 benchmarks | Medical, Texts

LLaVA-Rad MIMIC-CXR Annotations

LLaVA-Rad MIMIC-CXR features more accurate section extractions from MIMIC-CXR free-text radiology reports. Traditionally, rule-based methods were used to extract sections such as the reason for exam, findings, and impression. However, these approaches often fail due to inconsistencies in report structure and clinical language. In this work, we leverage GPT-4 to extract these sections more reliably, adding 237,073 image-text pairs to the training split and 1,952 pairs to the validation split. This enhancement enabled the development and fine-tuning of LLaVA-Rad, a multimodal large language model (LLM) tailored for radiology applications, which achieves improved performance on report generation tasks.

1 paper | 0 benchmarks | Images, Medical, Texts

MERGE SPCS

This dataset contains pre-processed versions of datasets introduced in prior works. It also contains new data pertinent to the paper.

1 paper | 0 benchmarks | Biology, Biomedical, Images, Medical, Tables, Tabular
