Catalogue Search | MBRL

Statistical significance or clinical significance? A researcher's dilemma for appropriate interpretation of research results

by Sharma, Hunny in bias; biostatistics; clinical significance; research design; sample size; statistical significance , Clinical significance , Hypotheses

2021

It is incredibly essential that the current clinicians and researchers remain updated with findings of current biomedical literature for evidence-based medicine. However, they come across many types of research that are nonreproducible and are even difficult to interpret clinically. Statistical and clinical significance is one such difficulty that clinicians and researchers face across many instances. In simpler terms, the P value tests all hypothesis about how the data were produced (model as whole), and not just the targeted hypothesis that it is intended to test (such as a null hypothesis) keeping in mind how reliable are the of the research results. Most of the times it is misinterpreted and misunderstood as a measure to judge the results as clinically significant. Hence this review aims to impart knowledge about \"P\" value and its importance in biostatistics, also highlights the importance of difference between statistical and clinical significance for appropriate interpretation of research results.

Journal Article

Share this book

Add to My Shelf

The origin of Modern Epidemiology, the book

by Rothman, Kenneth J. in Cardiology , Epidemiology , ESSAY

2021

I am sitting at my desk looking at a print copy of the fourth edition of Modern Epidemiology (1), wondering how to respond to the kind invitation from Albert Hofman. He asked me to describe for the EJE how I came to write the first edition, and how the book evolved from there. To me the origin was mundane.

Journal Article

Share this book

Add to My Shelf

S1222 ERCP in Octogenarian and Nonagenarian Population is Safe and Effective

by Badillo, Ricardo , Mahapatra, Swati , Taclob, Jeff Angelo in Statistical significance

2023

Journal Article

Share this book

Add to My Shelf

Is Deep Learning on Tabular Data Enough? An Assessment

by Fayaz, Sheikh Amir , Zaman, Majid , Butt, Muheet Ahmed in Algorithms , Classification , Datasets

2022

It is critical to select the model that best fits the situation while analyzing the data. Many scholars on classification and regression issues have offered ensemble techniques on tabular data, as well as other approaches to classification and regression problems (Like Boosting and Logistic Model tree ensembles). Furthermore, various deep learning algorithms have recently been implemented on tabular data, with the authors claiming that deep models outperform Boosting and Model tree approaches. On a range of datasets including historical geographical data, this study compares the new deep models (TabNet, NODE, and DNF-net) against the boosting model (XGBoost) to see if they should be regarded a preferred choice for tabular data. We look at how much tweaking and computation they require, as well as how well they perform based on the metrics evaluation and statistical significance test. According to our study, XGBoost outperforms these deep models across all datasets, including the datasets used in the journals that presented the deep models. We further show that, when compared to deep models, XGBoost requires considerably less tweaking. In addition, we can also confirm that a combination of deep models with XGBoost outperforms XGBoost alone on almost all datasets.

Journal Article

Share this book

Add to My Shelf

Statistical significance and its critics

by Hand, David , Mayo, Deborah G. in Applications , Education , Epistemology

2022

While the common procedure of statistical significance testing and its accompanying concept of p-values have long been surrounded by controversy, renewed concern has been triggered by the replication crisis in science. Many blame statistical significance tests themselves, and some regard them as sufficiently damaging to scientific practice as to warrant being abandoned. We take a contrary position, arguing that the central criticisms arise from misunderstanding and misusing the statistical tools, and that in fact the purported remedies themselves risk damaging science. We argue that banning the use of p-value thresholds in interpreting data does not diminish but rather exacerbates data-dredging and biasing selection effects. If an account cannot specify outcomes that will not be allowed to count as evidence for a claim—if all thresholds are abandoned—then there is no test of that claim. The contributions of this paper are: To explain the rival statistical philosophies underlying the ongoing controversy; To elucidate and reinterpret statistical significance tests, and explain how this reinterpretation ameliorates common misuses and misinterpretations; To argue why recent recommendations to replace, abandon, or retire statistical significance undermine a central function of statistics in science: to test whether observed patterns in the data are genuine or due to background variability.

Journal Article

Share this book

Add to My Shelf

Putting the P-Value in its Place

by Heck, Patrick R. , Krueger, Joachim I. in Bayes' theorem , Data analysis , Inference

2019

As the debate over best statistical practices continues in academic journals, conferences, and the blogosphere, working researchers (e.g., psychologists) need to figure out how much time and effort to invest in attending to experts' arguments, how to design their next project, and how to craft a sustainable long-term strategy for data analysis and inference. The present special issue of The American Statistician promises help. In this article, we offer a modest proposal for a continued and informed use of the conventional p-value without the pitfalls of statistical rituals. Other statistical indices should complement reporting, and extra-statistical (e.g., theoretical) judgments ought to be made with care and clarity.

Journal Article

Share this book

Add to My Shelf

Tree-Based Classifier Ensembles for PE Malware Analysis: A Performance Revisit

by Tama, Bayu Adhi , Louk, Maya Hilda Lestari in Algorithms , Classification , Classifiers

2022

Given their escalating number and variety, combating malware is becoming increasingly strenuous. Machine learning techniques are often used in the literature to automatically discover the models and patterns behind such challenges and create solutions that can maintain the rapid pace at which malware evolves. This article compares various tree-based ensemble learning methods that have been proposed in the analysis of PE malware. A tree-based ensemble is an unconventional learning paradigm that constructs and combines a collection of base learners (e.g., decision trees), as opposed to the conventional learning paradigm, which aims to construct individual learners from training data. Several tree-based ensemble techniques, such as random forest, XGBoost, CatBoost, GBM, and LightGBM, are taken into consideration and are appraised using different performance measures, such as accuracy, MCC, precision, recall, AUC, and F1. In addition, the experiment includes many public datasets, such as BODMAS, Kaggle, and CIC-MalMem-2022, to demonstrate the generalizability of the classifiers in a variety of contexts. Based on the test findings, all tree-based ensembles performed well, and performance differences between algorithms are not statistically significant, particularly when their respective hyperparameters are appropriately configured. The proposed tree-based ensemble techniques also outperformed other, similar PE malware detectors that have been published in recent years.

Journal Article

Share this book

Add to My Shelf

Statistical Significance, 𝑝-Values, and the Reporting of Uncertainty

by Imbens, Guido W. in Statistical Significance

2021

The use of statistical significance and 𝑝-values has become a matter of substantial controversy in various fields using statistical methods. This has gone as far as some journals banning the use of indicators for statistical significance, or even any reports of 𝑝-values, and, in one case, any mention of confidence intervals. I discuss three of the issues that have led to these often-heated debates. First, I argue that in many cases, 𝑝-values and indicators of statistical significance do not answer the questions of primary interest. Such questions typically involve making (recommendations on) decisions under uncertainty. In that case, point estimates and measures of uncertainty in the form of confidence intervals or even better, Bayesian intervals, are often more informative summary statistics. In fact, in that case, the presence or absence of statistical significance is essentially irrelevant, and including them in the discussion may confuse the matter at hand. Second, I argue that there are also cases where testing null hypotheses is a natural goal and where 𝑝-values are reasonable and appropriate summary statistics. I conclude that banning them in general is counterproductive. Third, I discuss that the overemphasis in empirical work on statistical significance has led to abuse of 𝑝-values in the form of 𝑝-hacking and publication bias. The use of pre-analysis plans and replication studies, in combination with lowering the emphasis on statistical significance may help address these problems.

Journal Article

Share this book

Add to My Shelf

A study of the characteristics of white noise using the empirical mode decomposition method

by Wu, Zhaohua , Huang, Norden E. in Characteristics Of White Noise , Datasets , Degrees of freedom

2004

Based on numerical experiments on white noise using the empirical mode decomposition (EMD) method, we find empirically that the EMD is effectively a dyadic filter, the intrinsic mode function (IMF) components are all normally distributed, and the Fourier spectra of the IMF components are all identical and cover the same area on a semi-logarithmic period scale. Expanding from these empirical findings, we further deduce that the product of the energy density of IMF and its corresponding averaged period is a constant, and that the energy-density function is chi-squared distributed. Furthermore, we derive the energy-density spread function of the IMF components. Through these results, we establish a method of assigning statistical significance of information content for IMF components from any noisy data. Southern Oscillation Index data are used to illustrate the methodology developed here.

Journal Article

Share this book

Add to My Shelf

Your Chi-Square Test Is Statistically Significant: Now What?

by Sharpe, Donald in Comparative Analysis , Computation , Statistical Analysis

2015

Applied researchers have employed chi-square tests for more than one hundred years. This paper addresses the question of how one should follow a statistically significant chi-square test result in order to determine the source of that result. Four approaches were evaluated: calculating residuals, comparing cells, ransacking, and partitioning. Data from two recent journal articles were used to illustrate these approaches. A call is made for greater consideration of foundational techniques such as the chi-square tests.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter