Responsible Data Science Lab at Purdue

We study problems at the intersection of data management and machine learning to build trustworthy and responsible decision-making systems. Our aim is to develop systems that make data-driven decision making explainable, fair, and accountable. We are particularly interested in:

Explaining and debugging fairness violations in machine learning models and data science pipelines:
- How can we determine sources of unexpected errors and bias in machine learning model outcomes?
- How can we decompose unexpected or discriminatory behavior of data science pipelines in terms of the different pipeline stages?
- Can we effectively generate post hoc explanations for the outcomes of machine learning models?

Data integration and data quality:
- How can we leverage expert feedback to improve data cleaning techniques for machine learning?
- Can we use the final outcomes in data science pipelines to inform intermediate pipeline choices?
- How can we intertwine pipeline stages with downstream analytics to improve upon the end goals?

We are always looking for motivated Ph.D. students to collaborate with. If you are interested in data management and/or responsible data analytics, feel free to contact us with your CV/resume and a couple of sentences describing your research interests, and consider applying to Purdue CIT!

sponsors

We are thankful for the generous funding award and gift from our sponsors: NSF, Google, and CASMI.

news

Jun 20, 2025	Our paper on Label Flipping for Group Fairness got accepted to the 14th International Workshop on Quality in Databases (QDB) at the 51st VLDB conference.
May 7, 2025	Our paper on Explanations for Machine Learning Pipelines under Data Drift got accepted to the 2025 ACM International Conference on Management of Data (SIGMOD).
Mar 3, 2025	Dr. Pradhan gave a talk at the Cornell CS Database Seminar.
Feb 5, 2025	Our paper on Explaining Fairness Violations using Machine Unlearning got accepted to the 28th International Conference on Extending Database Technology (EDBT).
Nov 13, 2024	Kevin defends his M.S. thesis. Congrats, Kevin!

selected publications

Explainable AI: Foundations, Applications, Opportunities for Data Management Research

Romila Pradhan, Aditya Lahiri, Sainyam Galhotra, and Babak Salimi

In Proceedings of the 2022 International Conference on Management of Data, 2022

Algorithmic decision-making systems are successfully being adopted in a wide range of domains for diverse tasks. While the potential benefits of algorithmic decision-making are many, the importance of trusting these systems has only recently attracted attention. There is growing concern that these systems are complex, opaque and non-intuitive, and hence are difficult to trust. There has been a recent resurgence of interest in explainable artificial intelligence (XAI) that aims to reduce the opacity of a model by explaining its behavior, its predictions or both, thus allowing humans to scrutinize and trust the model. A host of technical advances have been made and several explanation methods have been proposed in recent years that address the problem of model explainability and transparency. In this tutorial, we will present these novel explanation approaches, characterize their strengths and limitations, position existing work with respect to the database (DB) community, and enumerate opportunities for data management research in the context of XAI.

Interpretable Data-Based Explanations for Fairness Debugging

Romila Pradhan, Jiongli Zhu, Boris Glavic, and Babak Salimi

In Proceedings of the 2022 International Conference on Management of Data, 2022

A wide variety of fairness metrics and eXplainable Artificial Intelligence (XAI) approaches have been proposed in the literature to identify bias in machine learning models that are used in critical real-life contexts. However, merely reporting on a model’s bias or generating explanations using existing XAI techniques is insufficient to locate and eventually mitigate sources of bias. We introduce Gopher, a system that produces compact, interpretable, and causal explanations for bias or unexpected model behavior by identifying coherent subsets of the training data that are root causes for this behavior. Specifically, we introduce the concept of causal responsibility that quantifies the extent to which intervening on training data by removing or updating subsets of it can resolve the bias. Building on this concept, we develop an efficient approach for generating the top-𝑘 patterns that explain model bias by utilizing techniques from the machine learning (ML) community to approximate causal responsibility, and using pruning rules to manage the large search space for patterns. Our experimental evaluation demonstrates the effectiveness of Gopher in generating interpretable explanations for identifying and debugging sources of bias.

Explaining Black-Box Algorithms using Probabilistic Contrastive Counterfactuals

Sainyam Galhotra, Romila Pradhan, and Babak Salimi

In Proceedings of the 2021 International Conference on Management of Data, 2021

There has been a recent resurgence of interest in explainable artificial intelligence (XAI) that aims to reduce the opaqueness of AI-based decision-making systems, allowing humans to scrutinize and trust them. Prior work in this context has focused on the attribution of responsibility for an algorithm’s decisions to its inputs wherein responsibility is typically approached as a purely associational concept. In this paper, we propose a principled causality-based approach for explaining black-box decision-making systems that addresses limitations of existing methods in XAI. At the core of our framework lies probabilistic contrastive counterfactuals, a concept that can be traced back to philosophical, cognitive, and social foundations of theories on how humans generate and select explanations. We show how such counterfactuals can quantify the direct and indirect influences of a variable on decisions made by an algorithm, and provide actionable recourse for individuals negatively affected by the algorithm’s decision. Unlike prior work, our system, Lewis: (1) can compute provably effective explanations and recourse at local, global and contextual levels; (2) is designed to work with users with varying levels of background knowledge of the underlying causal model; and (3) makes no assumptions about the internals of an algorithmic system except for the availability of its input-output data. We empirically evaluate Lewis on four real-world datasets and show that it generates human-understandable explanations that improve upon state-of-the-art approaches in XAI, including the popular LIME and SHAP. Experiments on synthetic data further demonstrate the correctness of Lewis’s explanations and the scalability of its recourse algorithm.

Staging User Feedback toward Rapid Conflict Resolution in Data Fusion

Romila Pradhan, Siarhei Bykau, and Sunil Prabhakar

In Proceedings of the 2017 ACM International Conference on Management of Data, 2017

In domains such as the Web, sensor networks and social media, sources often provide conflicting information for the same data item. Several data fusion techniques have been proposed recently to resolve conflicts and identify correct data. The performance of these fusion systems, while quite accurate, is far from perfect. In this paper, we propose to leverage user feedback for validating data conflicts and rapidly improving the performance of fusion. To present the most beneficial data items for the user to validate, we take advantage of the level of consensus among sources, and the output of fusion to generate an effective ordering of items. We first evaluate data items individually, and then define a novel decision-theoretic framework based on the concept of value of perfect information (VPI) to order items by their ability to boost the performance of fusion. We further derive approximate formulae to scale up the decision-theoretic framework to large-scale data. We empirically evaluate our algorithms on three real-world datasets with different characteristics, and show that the accuracy of fusion can be significantly improved even while requesting feedback on a few data items. We also show that the performance of the proposed methods depends on the characteristics of data, and assess the trade-off between the amount of feedback acquired, and the effectiveness and efficiency of the methods.