Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
Collect, Measure, Repeat: Reliability Factors for Responsible AI Data Collection
by
Draws, Tim
, Aroyo, Lora
, Inel, Oana
in
Annotations
/ Data collection
/ Datasets
/ Iterative methods
/ Machine learning
/ Massive data points
/ Reliability aspects
/ Robustness
2023
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
Collect, Measure, Repeat: Reliability Factors for Responsible AI Data Collection
by
Draws, Tim
, Aroyo, Lora
, Inel, Oana
in
Annotations
/ Data collection
/ Datasets
/ Iterative methods
/ Machine learning
/ Massive data points
/ Reliability aspects
/ Robustness
2023
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
Collect, Measure, Repeat: Reliability Factors for Responsible AI Data Collection
Paper
Collect, Measure, Repeat: Reliability Factors for Responsible AI Data Collection
2023
Request Book From Autostore
and Choose the Collection Method
Overview
The rapid entry of machine learning approaches in our daily activities and high-stakes domains demands transparency and scrutiny of their fairness and reliability. To help gauge machine learning models' robustness, research typically focuses on the massive datasets used for their deployment, e.g., creating and maintaining documentation for understanding their origin, process of development, and ethical considerations. However, data collection for AI is still typically a one-off practice, and oftentimes datasets collected for a certain purpose or application are reused for a different problem. Additionally, dataset annotations may not be representative over time, contain ambiguous or erroneous annotations, or be unable to generalize across issues or domains. Recent research has shown these practices might lead to unfair, biased, or inaccurate outcomes. We argue that data collection for AI should be performed in a responsible manner where the quality of the data is thoroughly scrutinized and measured through a systematic set of appropriate metrics. In this paper, we propose a Responsible AI (RAI) methodology designed to guide the data collection with a set of metrics for an iterative in-depth analysis of the factors influencing the quality and reliability of the generated data. We propose a granular set of measurements to inform on the internal reliability of a dataset and its external stability over time. We validate our approach across nine existing datasets and annotation tasks and four content modalities. This approach impacts the assessment of data robustness used for AI applied in the real world, where diversity of users and content is eminent. Furthermore, it deals with fairness and accountability aspects in data collection by providing systematic and transparent quality analysis for data collections.
Publisher
Cornell University Library, arXiv.org
Subject
This website uses cookies to ensure you get the best experience on our website.