Scientific reasoning driven by influential data: resuscitate dfstat
by Schaks, Matthias; Rödiger, Stefan; Spiess, Andrej-Nikolai; Burdukiewicz, Michał; Tellinghuisen, Joel
in Scientific Communication and Education
2024
Paper
Overview
In biomedical literature, linear regression is one of the most widely employed statistical procedures for analyzing and visualizing the association between two variables. Data points that exert influence on the fit and its parameters are routinely, but not as often as required, identified by established influence measures and their corresponding cut-off values. In this work, we are specifically concerned with influential data points that directly impact hypothesis testing of linear regressions, which none of the established measures describe. Interestingly, the highly overlooked influence measure dfstat and its derived leave-one-out p-value exist exactly for this purpose, yet they are unmentioned in most statistical textbooks and absent from all available statistical software packages. Their application for identifying these data points seems pivotal, as scientific reasoning in publications is almost exclusively based on the p-value of the fit, commonly using the α = 0.05 threshold to declare significance or its absence. With this metric, we found that in 29 of 100 digitizable papers published in Science, Nature and PNAS in 2016, a time when the “reproducibility crisis” was a growing concern, the stated significance (or its absence) rests on a single influential data point.
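The leave-one-out p-value idea described in the overview can be illustrated with a minimal sketch. It is not the authors' implementation, and this record does not give the formula for dfstat itself, so the sketch does not reproduce it. It only refits a simple linear regression with each point removed in turn and reports the points whose removal moves the slope p-value across α = 0.05. The function names and the toy data are hypothetical, and the t-distribution tail is approximated by Simpson integration so that only the Python standard library is needed.

```python
import math


def t_two_sided_p(t, df):
    """Approximate two-sided p-value of Student's t (Simpson's rule on the density)."""
    a = abs(t)
    if a == 0.0:
        return 1.0
    # Log of the normalising constant of the t density with df degrees of freedom.
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))

    def pdf(u):
        return math.exp(log_c - (df + 1) / 2 * math.log1p(u * u / df))

    n = 2000  # even number of Simpson intervals
    h = a / n
    s = pdf(0.0) + pdf(a)
    for i in range(1, n):
        s += (4 if i % 2 else 2) * pdf(i * h)
    central = s * h / 3  # P(0 < T < |t|)
    return max(0.0, 1.0 - 2.0 * central)


def slope_p_value(x, y):
    """p-value for H0: slope = 0 in the ordinary least-squares fit y = a + b*x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    rss = sum((yi - intercept - slope * xi) ** 2 for xi, yi in zip(x, y))
    df = n - 2
    se = math.sqrt(rss / df / sxx)  # standard error of the slope
    if se == 0.0:
        return 0.0  # perfect fit: treat as maximally significant
    return t_two_sided_p(slope / se, df)


def leave_one_out_p_values(x, y):
    """Slope p-value of the fit with point i removed, for every i."""
    return [slope_p_value(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
            for i in range(len(x))]


def significance_flips(x, y, alpha=0.05):
    """Indices whose removal changes the significant / not-significant verdict."""
    full_significant = slope_p_value(x, y) < alpha
    return [i for i, p in enumerate(leave_one_out_p_values(x, y))
            if (p < alpha) != full_significant]


# Toy data (hypothetical): six unremarkable points plus one far-out point.
x = [1, 2, 3, 4, 5, 6, 20]
y = [2, 1, 3, 1, 3, 2, 10]
print(significance_flips(x, y))  # only index 6 (x = 20) flips the verdict
```

In this toy data the full fit is significant at α = 0.05 only because of the point at x = 20, which is the situation the overview describes. A production analysis would normally take the t-distribution from a statistics library instead of integrating it by hand.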
Publisher
Cold Spring Harbor Laboratory