IndianBailJudgments-1200: A Multi-Attribute Dataset for Legal NLP on Indian Bail Orders

Sneha Deshmukh, Prathmesh Kamble

2025-07-03Fairness Attribute Jurisprudence

Abstract

Legal NLP remains underdeveloped in regions like India due to the scarcity of structured datasets. We introduce IndianBailJudgments-1200, a new benchmark dataset comprising 1200 Indian court judgments on bail decisions, annotated across 20+ attributes including bail outcome, IPC sections, crime type, and legal reasoning. Annotations were generated using a prompt-engineered GPT-4o pipeline and verified for consistency. This resource supports a wide range of legal NLP tasks such as outcome prediction, summarization, and fairness analysis, and is the first publicly available dataset focused specifically on Indian bail jurisprudence.

Related Papers

A Reproducibility Study of Product-side Fairness in Bundle Recommendation2025-07-18 FedGA: A Fair Federated Learning Framework Based on the Gini Coefficient2025-07-17 Looking for Fairness in Recommender Systems2025-07-16 FADE: Adversarial Concept Erasure in Flow Models2025-07-16 MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM2025-07-16 Non-Adaptive Adversarial Face Generation2025-07-16 Fairness-Aware Grouping for Continuous Sensitive Variables: Application for Debiasing Face Analysis with respect to Skin Tone2025-07-15 Guiding LLM Decision-Making with Fairness Reward Models2025-07-15