Catalogue Search | MBRL
Explore the vast range of titles available.
308 result(s) for "replicability"
Eye tracking: empirical foundations for a minimal reporting guideline
by Ettinger, Ulrich; Benjamins, Jeroen S; Zemblys, Raimondas
in Empirical Research, Eye Movements, Eye-Tracking Technology
2023
In this paper, we present a review of how the various aspects of any study using an eye tracker (such as the instrument, methodology, environment, participant, etc.) affect the quality of the recorded eye-tracking data and the obtained eye-movement and gaze measures. We take this review to represent the empirical foundation for reporting guidelines of any study involving an eye tracker. We compare this empirical foundation to five existing reporting guidelines and to a database of 207 published eye-tracking studies. We find that reporting guidelines vary substantially and do not match with actual reporting practices. We end by deriving a minimal, flexible reporting guideline based on empirical research (Section "An empirically based minimal reporting guideline").
Journal Article
How to conduct systematic literature reviews in management research: a guide in 6 steps and 14 decisions
2023
Systematic literature reviews (SLRs) have become a standard tool in many fields of management research but are often considerably less stringently presented than other pieces of research. The resulting lack of replicability of the research and conclusions has spurred a vital debate on the SLR process, but related guidance is scattered across a number of core references and is overly centered on the design and conduct of the SLR, while failing to guide researchers in crafting and presenting their findings in an impactful way. This paper offers an integrative review of the widely applied and most recent SLR guidelines in the management domain. The paper adopts a well-established six-step SLR process and refines it by sub-dividing the steps into 14 distinct decisions: (1) from the research question, via (2) characteristics of the primary studies, (3) to retrieving a sample of relevant literature, which is then (4) selected and (5) synthesized so that, finally (6), the results can be reported. Guided by these steps and decisions, prior SLR guidelines are critically reviewed, gaps are identified, and a synthesis is offered. This synthesis elaborates mainly on the gaps while pointing the reader toward the available guidelines. The paper thereby avoids reproducing existing guidance but critically enriches it. The 6 steps and 14 decisions provide methodological, theoretical, and practical guidelines along the SLR process, exemplifying them via best-practice examples and revealing their temporal sequence and main interrelations. The paper guides researchers in the process of designing, executing, and publishing a theory-based and impact-oriented SLR.
Journal Article
Replication across space and time must be weak in the social and environmental sciences
by Goodchild, Michael F.; Li, Wenwen
in Artificial intelligence, Biological Sciences, Earth surface
2021
Replicability takes on special meaning when researching phenomena that are embedded in space and time, including phenomena distributed on the surface and near surface of the Earth. Two principles, spatial dependence and spatial heterogeneity, are generally characteristic of such phenomena. Various practices have evolved in dealing with spatial heterogeneity, including the use of place-based models. We review the rapidly emerging applications of artificial intelligence to phenomena distributed in space and time and speculate on how the principle of spatial heterogeneity might be addressed. We introduce a concept of weak replicability and discuss possible approaches to its measurement.
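As a rough illustration of the spatial-heterogeneity idea (a hypothetical Python sketch, not the authors' analysis), the snippet below fits the same simple model in two simulated regions whose underlying effect differs, so an estimate from one region replicates only weakly in the other; all sample sizes and coefficients are invented.

import numpy as np

rng = np.random.default_rng(5)

# Hypothetical spatial heterogeneity: the same relationship is stronger in
# region A than in region B, so a model calibrated in A transfers only weakly to B.
n = 200
x_a, x_b = rng.normal(size=n), rng.normal(size=n)
y_a = 1.5 * x_a + rng.normal(size=n)   # region A: strong place-specific effect
y_b = 0.3 * x_b + rng.normal(size=n)   # region B: weak place-specific effect

slope_a = np.polyfit(x_a, y_a, 1)[0]   # model estimated in region A
slope_b = np.polyfit(x_b, y_b, 1)[0]   # the "replication" estimated in region B
print(f"slope in region A: {slope_a:.2f}")
print(f"slope in region B: {slope_b:.2f}")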
Journal Article
Lack of group-to-individual generalizability is a threat to human subjects research
by Medaglia, John D.; Fisher, Aaron J.; Jeronimus, Bertus F.
in Anxiety Disorders - therapy, Biological Sciences, Bipolar Disorder - therapy
2018
Only for ergodic processes will inferences based on group-level data generalize to individual experience or behavior. Because human social and psychological processes typically have an individually variable and time-varying nature, they are unlikely to be ergodic. In this paper, six studies with a repeated-measure design were used for symmetric comparisons of interindividual and intraindividual variation. Our results delineate the potential scope and impact of nonergodic data in human subjects research. Analyses across six samples (with 87–94 participants and an equal number of assessments per participant) showed some degree of agreement in central tendency estimates (mean) between groups and individuals across constructs and data collection paradigms. However, the variance around the expected value was two to four times larger within individuals than within groups. This suggests that literatures in social and medical sciences may overestimate the accuracy of aggregated statistical estimates. This observation could have serious consequences for how we understand the consistency between group and individual correlations, and the generalizability of conclusions between domains. Researchers should explicitly test for equivalence of processes at the individual and group level across the social and medical sciences.
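To make the group-versus-individual contrast concrete, the following minimal Python sketch (not the authors' analysis) simulates a nonergodic process in which each person fluctuates around a stable personal mean, then compares the variance of per-person aggregates (the group-level view) with the average variance within individuals over time; all parameters are assumptions chosen only to echo the two-to-four-times pattern described above.

import numpy as np

rng = np.random.default_rng(0)

# Hypothetical nonergodic process: stable personal means, but within-person
# fluctuation that is larger than the between-person differences.
n_people, n_times = 90, 90                       # roughly the sample sizes noted above
person_means = rng.normal(0.0, 1.0, n_people)    # stable between-person differences
within_sd = 2.0                                  # within-person fluctuation (assumption)

# repeated measures: rows are people, columns are measurement occasions
data = person_means[:, None] + rng.normal(0.0, within_sd, (n_people, n_times))

# "group" view: variance across people of their time-averaged (aggregated) scores
group_var = data.mean(axis=1).var(ddof=1)
# "individual" view: average variance within each person over time
within_var = data.var(axis=1, ddof=1).mean()

print(f"variance across aggregated person means: {group_var:.2f}")
print(f"average variance within individuals:     {within_var:.2f}")  # several times larger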
Journal Article
A Meta-Psychological Perspective on the Decade of Replication Failures in Social Psychology
2020
Bem's (2011) article triggered a string of replication failures in social psychology. A major replication project found that only 25% of results in social psychology could be replicated. I examine various explanations for this low replication rate and find most of them lacking in empirical support. I then provide evidence that the use of questionable research practices accounts for this result. Using z-curve and a representative sample of focal hypothesis tests, I find that the expected replication rate for social psychology is between 20% and 45%. I argue that quantifying replicability can provide an incentive to use good research practices and to invest more resources in studies that produce replicable results. The replication crisis in social psychology provides important lessons for other disciplines in psychology that have avoided taking a closer look at their research practices.
Public Significance Statement
This article examines explanations for the low replicability of social psychology. Using a new statistical method, z-curve, it is shown that low statistical power and selective reporting of significant results are the main factors. As a result, original results in social psychology journals report inflated effect sizes that cannot be replicated. To improve the credibility of social psychology, researchers must increase statistical power and disclose nonsignificant results.
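The mechanism invoked here, low statistical power combined with selective reporting of significant results, can be illustrated with a toy simulation. The Python sketch below is not the z-curve method; the true effect size, group size, and number of studies are arbitrary assumptions chosen to show how selection inflates published effect sizes and depresses replication rates.

import numpy as np
from scipy import stats

rng = np.random.default_rng(1)

true_d, n = 0.2, 20      # small true effect and small groups -> low power (assumptions)
n_studies = 5000

def run_study():
    """One two-group study: return the p value and the observed standardized effect."""
    a = rng.normal(0.0, 1.0, n)
    b = rng.normal(true_d, 1.0, n)
    p = stats.ttest_ind(b, a).pvalue
    d_obs = (b.mean() - a.mean()) / np.sqrt((a.var(ddof=1) + b.var(ddof=1)) / 2)
    return p, d_obs

studies = np.array([run_study() for _ in range(n_studies)])
p_vals, d_obs = studies[:, 0], studies[:, 1]
published = p_vals < 0.05                   # selective reporting: only significant results

print(f"power (share of studies reaching significance): {published.mean():.2f}")
print(f"true d = {true_d}, mean published d = {d_obs[published].mean():.2f}")   # inflated

# naive replications of the published findings, run with the same design
replications = np.array([run_study()[0] < 0.05 for _ in range(published.sum())])
print(f"replication rate of published findings: {replications.mean():.2f}")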
Journal Article
Sample size in multistakeholder Delphi surveys: at what minimum sample size does replicability of results stabilize?
2024
The minimum sample size for multistakeholder Delphi surveys remains understudied. Drawing from three large international multistakeholder Delphi surveys, this study aimed to: 1) investigate the effect of increasing sample size on replicability of results; 2) assess whether the level of replicability of results differed with participant characteristics such as gender, age, and profession.
We used data from Delphi surveys to develop guidance for improved reporting of health-care intervention trials: SPIRIT (Standard Protocol Items: Recommendations for Interventional Trials) and CONSORT (Consolidated Standards of Reporting Trials) extension for surrogate end points (n = 175, 22 items rated); CONSORT-SPI (CONSORT extension for Social and Psychological Interventions) (n = 333, 77 items rated); and core outcome set for burn care (n = 553, 88 items rated). Resampling with replacement was used to draw random subsamples from the participant data set in each of the three surveys. For each subsample, the median value of all rated survey items was calculated and compared to the medians from the full participant data set. The median number (and interquartile range) of medians replicated was used to calculate the percentage replicability (and variability). High replicability was defined as ≥80% and moderate as ≥60% and <80%.
The average median replicability (variability) as a percentage of total number of items rated from the three datasets was 81% (10%) at a sample size of 60. In one of the datasets (CONSORT-SPI), a ≥80% replicability was reached at a sample size of 80. On average, increasing the sample size from 80 to 160 increased the replicability of results by a further 3% and reduced variability by 1%. For subgroup analysis based on participant characteristics (eg, gender, age, professional role), resampled subsamples of 20 to 100 participants showed that a sample size of 20 to 30 resulted in moderate replicability levels of 64% to 77%.
We found that a minimum sample size of 60–80 participants in multistakeholder Delphi surveys provides a high level of replicability (≥80%) in the results. For Delphi studies limited to individual stakeholder groups (such as researchers, clinicians, patients), a sample size of 20 to 30 per group may be sufficient.
• A minimum sample size of 60 to 80 participants rating all items in multistakeholder Delphi surveys provides high levels of replicability (≥80%) of results.
• For Delphi surveys including individual stakeholder groups (eg, researchers, clinicians, patients), a sample size of 20 to 30 rating all items per group may be sufficient.
• The choice of the target sample size should further consider the response rates and attrition between Delphi survey rounds; diminishing returns of increasing sample size as estimated in this study; and diversity and expertise of participants.
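The resampling procedure summarized above can be sketched in a few lines of Python. The ratings matrix below is synthetic (its dimensions merely echo the burn-care dataset), so the printed numbers will not reproduce the study's figures; the point is only to show the bootstrap-of-medians logic.

import numpy as np

rng = np.random.default_rng(2)

# synthetic stand-in for a Delphi ratings matrix: participants x items, 1-9 scale
n_participants, n_items = 553, 88
ratings = rng.integers(1, 10, size=(n_participants, n_items))

full_medians = np.median(ratings, axis=0)    # item medians from the full participant set

def replicability(sample_size, n_resamples=1000):
    """Median (and IQR) share of item medians in bootstrap subsamples that match
    the full-sample medians -- the percentage-replicability idea described above."""
    shares = []
    for _ in range(n_resamples):
        idx = rng.integers(0, n_participants, size=sample_size)   # resample with replacement
        shares.append(np.mean(np.median(ratings[idx], axis=0) == full_medians))
    shares = np.array(shares)
    return np.median(shares), np.percentile(shares, [25, 75])

for size in (20, 60, 80, 160):
    med, (q1, q3) = replicability(size)
    print(f"n={size:4d}: median replicability {med:.0%} (IQR {q1:.0%}-{q3:.0%})")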
Journal Article
Taking Parametric Assumptions Seriously: Arguments for the Use of Welch’s F-test instead of the Classical F-test in One-Way ANOVA
2019
Student’s t-test and classical F-test ANOVA rely on the assumptions that two or more samples are independent, and that independent and identically distributed residuals are normal and have equal variances between groups. We focus on the assumptions of normality and equality of variances, and argue that these assumptions are often unrealistic in the field of psychology. We underline the current lack of attention to these assumptions through an analysis of researchers’ practices. Through Monte Carlo simulations, we illustrate the consequences of performing the classic parametric F-test for ANOVA when the test assumptions are not met on the Type I error rate and statistical power. Under realistic deviations from the assumption of equal variances, the classic F-test can yield severely biased results and lead to invalid statistical inferences. We examine two common alternatives to the F-test, namely Welch’s ANOVA (W-test) and the Brown-Forsythe test (F*-test). Our simulations show that under a range of realistic scenarios, the W-test is a better alternative and we therefore recommend using the W-test by default when comparing means. We provide a detailed example explaining how to perform the W-test in SPSS and R. We summarize our conclusions in practical recommendations that researchers can use to improve their statistical practices.
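A minimal Monte Carlo sketch of this comparison is given below, using scipy's classical one-way F-test alongside a hand-rolled W-test following the standard Welch (1951) formulas; the group sizes and standard deviations are illustrative assumptions, with the largest variance deliberately paired with the smallest group to expose the liberal behavior of the classical test.

import numpy as np
from scipy import stats

rng = np.random.default_rng(3)

def welch_anova_pvalue(*groups):
    """Welch's W-test for equality of means (standard Welch 1951 formulas)."""
    k = len(groups)
    n = np.array([len(g) for g in groups], dtype=float)
    m = np.array([g.mean() for g in groups])
    v = np.array([g.var(ddof=1) for g in groups])
    w = n / v                                    # precision weights
    mw = np.sum(w * m) / np.sum(w)               # weighted grand mean
    lam = np.sum((1 - w / w.sum()) ** 2 / (n - 1))
    numerator = np.sum(w * (m - mw) ** 2) / (k - 1)
    denominator = 1 + 2 * (k - 2) / (k ** 2 - 1) * lam
    df2 = (k ** 2 - 1) / (3 * lam)
    return stats.f.sf(numerator / denominator, k - 1, df2)

# Type I error when means are equal but variances and group sizes are not
# (larger variance in the smaller group: a liberal scenario for the classical F-test)
sizes, sds, alpha, n_sims = (10, 20, 40), (4.0, 2.0, 1.0), 0.05, 5000
classic_rejections = welch_rejections = 0
for _ in range(n_sims):
    groups = [rng.normal(0.0, sd, size) for size, sd in zip(sizes, sds)]
    classic_rejections += stats.f_oneway(*groups).pvalue < alpha
    welch_rejections += welch_anova_pvalue(*groups) < alpha

print(f"Type I error, classical F-test: {classic_rejections / n_sims:.3f}")   # well above alpha
print(f"Type I error, Welch's W-test:   {welch_rejections / n_sims:.3f}")     # close to alpha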
Journal Article
The interpretation of statistical power after the data have been gathered
2020
Post-hoc power estimates (power calculated for hypothesis tests after performing them) are sometimes requested by reviewers in an attempt to promote more rigorous designs. However, they should never be requested or reported because they have been shown to be logically invalid and practically misleading. We review the problems associated with post-hoc power, particularly the fact that the resulting calculated power is a monotone function of the p value and therefore contains no additional helpful information. We then discuss some situations that seem at first to call for post-hoc power analysis, such as attempts to decide on the practical implications of a null finding, or attempts to determine whether the sample size of a secondary data analysis is adequate for a proposed analysis, and consider possible approaches to achieving these goals. We make recommendations for practice in situations in which clear recommendations can be made, and point out other situations where further methodological research and discussion are required.
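The monotone relationship between post-hoc power and the p value can be shown directly. The sketch below assumes a two-sided z-test and treats the observed effect as if it were the true effect, which is precisely the move that makes "observed power" uninformative; the numbers are illustrative only.

from scipy import stats

def observed_power(p, alpha=0.05):
    """Post-hoc power of a two-sided z-test, taking the observed effect as the true one.
    The result depends on nothing but the p value (and alpha)."""
    z_obs = stats.norm.isf(p / 2)          # |z| implied by the two-sided p value
    z_crit = stats.norm.isf(alpha / 2)
    return stats.norm.sf(z_crit - z_obs) + stats.norm.cdf(-z_crit - z_obs)

for p in (0.005, 0.05, 0.20, 0.50, 0.80):
    print(f"p = {p:.3f}  ->  post-hoc power = {observed_power(p):.2f}")
# p = 0.05 maps to roughly 50% power; larger p values map to ever lower power.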
Journal Article
Science's reproducibility and replicability crisis: International business is not immune
by Cascio, Wayne F.; Aguinis, Herman; Ramani, Ravi S.
in Business, Business and Management, Business Strategy/Leadership
2017
International business is not immune to science's reproducibility and replicability crisis. We argue that this crisis is not entirely surprising given the methodological practices that enhance systematic capitalization on chance. This occurs when researchers search for a maximally predictive statistical model based on a particular dataset and engage in several trial-and-error steps that are rarely disclosed in published articles. We describe systematic capitalization on chance, distinguish it from unsystematic capitalization on chance, address five common practices that capitalize on chance, and offer actionable strategies to minimize the capitalization on chance and improve the reproducibility and replicability of future IB research.
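A hypothetical Python sketch of systematic capitalization on chance, not taken from the article: correlate many candidate predictors with a pure-noise outcome, keep the single best model, and report only that final result; all dataset sizes are invented for illustration.

import numpy as np
from scipy import stats

rng = np.random.default_rng(4)

n_obs, n_candidates = 100, 50                  # hypothetical sizes
X = rng.normal(size=(n_obs, n_candidates))     # candidate predictors, all pure noise
y = rng.normal(size=n_obs)                     # outcome unrelated to every predictor

# the systematic search: test every candidate and keep the most predictive one
results = [stats.pearsonr(X[:, j], y) for j in range(n_candidates)]
r = np.array([res[0] for res in results])
p = np.array([res[1] for res in results])
best = int(np.argmax(np.abs(r)))

print(f"candidates tried: {n_candidates}, nominally significant by chance: {(p < 0.05).sum()}")
print(f"best model found: r = {r[best]:.2f}, p = {p[best]:.4f}")
# reporting only this final model (and not the search) makes a chance finding look robust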
Journal Article