MIRACL : A Multilingual Retrieval Dataset Covering 18 Diverse Languages
by Rezagholizadeh, Mehdi; Zhang, Xinyu; Kamalloo, Ehsan; Liu, Qun; Lin, Jimmy; Thakur, Nandan; Li, Xiaoguang; Ogundepo, Odunayo; Alfonso-Hermelo, David
in Annotations / Computational linguistics / Control data (computers) / Data quality / Datasets / Families & family life / Heuristic / Information sources / Internet / Language diversity / Languages / Multilingualism / Native speakers / Queries / Retrieval / Verification
2023
Journal Article
Overview
MIRACL is a multilingual dataset for retrieval across 18 languages that collectively encompass over three billion native speakers around the world. This resource is designed to support monolingual retrieval tasks, where the queries and the corpora are in the same language. In total, we have gathered over 726k high-quality relevance judgments for 78k queries over Wikipedia in these languages, where all annotations have been performed by native speakers hired by our team. MIRACL covers languages that are both typologically close as well as distant from 10 language families and 13 sub-families, associated with varying amounts of publicly available resources. Extensive automatic heuristic verification and manual assessments were performed during the annotation process to control data quality. In total, MIRACL represents an investment of around five person-years of human annotator effort. Our goal is to spur research on improving retrieval across a continuum of languages, thus enhancing information access capabilities for diverse populations around the world, particularly those that have traditionally been underserved. MIRACL is available at
.
Publisher
MIT Press