1,223 results for "contingency data analysis"
From Story Line to Box Office: A New Approach for Green-Lighting Movie Scripts
Movie studios often have to choose among thousands of scripts to decide which ones to turn into movies. Despite the huge amount of money at stake, this process, known as green-lighting in the movie industry, is largely guesswork based on experts' experience and intuition. In this paper, we propose a new approach to help studios evaluate scripts, leading to more profitable green-lighting decisions. Our approach combines screenwriting domain knowledge, natural-language processing techniques, and statistical learning methods to forecast a movie's return on investment (ROI) based only on the textual information available in movie scripts. We test our model in a holdout decision task and show that it significantly improves a studio's gross ROI.
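The abstract does not reproduce the authors' model, but the pipeline it describes (textual features extracted from scripts feeding a statistical learner that forecasts ROI) can be sketched generically. Below is a minimal, hypothetical Python illustration using TF-IDF features and ridge regression; the scripts, ROI values, and feature choices are invented, and the actual approach additionally encodes screenwriting domain knowledge that this sketch omits.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import Ridge
from sklearn.pipeline import make_pipeline

# Toy stand-ins for script texts and realized ROI; the real model also
# encodes screenwriting domain knowledge, which this sketch omits.
scripts = [
    "a heist crew plans one last job before retirement",
    "two strangers fall in love during a summer in paris",
    "a detective hunts a serial killer through the city",
]
roi = [1.8, 0.6, 2.4]  # invented returns on investment

model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), Ridge(alpha=1.0))
model.fit(scripts, roi)
print(model.predict(["a rookie cop goes undercover in a heist crew"]))
```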
Sampling contingency tables given sets of marginals and/or conditionals in the context of statistical disclosure limitation
Federal agencies and other organizations often publish data summarized in arrays of non-negative integers, called contingency tables. When such data are released, it is necessary to prevent sensitive information pertaining to individuals from being disclosed. In statistical disclosure limitation, we must maintain a balance between disclosure risk and the data utility needed to make valid statistical inferences. One method for achieving this balance is to release partial information about the original data. In practice, many agencies release data summarized in the form of marginal sums or conditional probabilities. Sampling methods for multi-way contingency tables given a set of observed marginal sums have been studied in diverse ways; yet there is almost no literature on sampling tables given a set of observed conditional probabilities. In this thesis, we focus on sets of conditional probabilities instead of marginal sums. We propose MCMC simulation schemes, coupled with tools from algebraic statistics, to sample tables from the sets of possible tables given observed conditional values. We also propose a simple extension to the case of a combination of observed marginal totals and conditional values. These algorithms can be used to compute posterior distributions and to assess data utility and disclosure risk in the context of statistical disclosure limitation. We demonstrate the proposed algorithms with simple examples and discuss their advantages and disadvantages. In addition, the proposed sampling algorithms can be used to release synthetic contingency tables. We study both the disclosure risk and the data utility associated with the proposed synthetic tabular data releases.
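The thesis's samplers condition on observed conditionals and rely on algebraic-statistics machinery beyond the scope of a snippet, but the classical building block they extend, the Markov-basis (swap-move) sampler for two-way tables with fixed marginal sums, is straightforward to illustrate. A minimal Python sketch on an arbitrary example table (not data from the thesis):

```python
import numpy as np

def swap_move_sampler(table, n_steps=10_000, rng=None):
    """Sample non-negative integer tables with the same row and column
    sums as `table`, via +/-1 swap moves on 2x2 subrectangles (the
    Markov basis for two-way tables under fixed margins)."""
    rng = np.random.default_rng(rng)
    T = table.copy()
    rows, cols = T.shape
    samples = []
    for _ in range(n_steps):
        i, j = rng.choice(rows, size=2, replace=False)
        k, l = rng.choice(cols, size=2, replace=False)
        sign = rng.choice([-1, 1])
        move = np.zeros_like(T)
        move[i, k] = move[j, l] = sign
        move[i, l] = move[j, k] = -sign
        if (T + move).min() >= 0:  # reject moves that leave the fibre
            T = T + move
        samples.append(T.copy())
    return samples

start = np.array([[3, 1, 2],
                  [0, 4, 1],
                  [2, 2, 5]])
chain = swap_move_sampler(start, n_steps=5_000, rng=42)
# Every sampled table shares the starting table's margins.
assert all((s.sum(axis=0) == start.sum(axis=0)).all() for s in chain)
```

Because the proposal is symmetric and invalid moves leave the chain in place, the stationary distribution is uniform over the set of tables sharing the starting margins; conditioning on conditionals rather than marginals, as in the thesis, requires different, algebraically derived moves.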
Classical tests, linear models and their extensions for the analysis of 2 × 2 contingency tables
Ecologists and evolutionary biologists are regularly tasked with the comparison of binary data across groups. There is, however, some discussion in the biostatistics literature about the best methodology for the analysis of data comprising binary explanatory and response variables forming a 2 × 2 contingency table. We assess several methodologies for the analysis of 2 × 2 contingency tables using a simulation scheme of different sample sizes with outcomes evenly or unevenly distributed between groups. Specifically, we assess the commonly recommended logistic (generalised linear model [GLM]) regression analysis, the classical Pearson chi-squared test and four conventional alternatives (Yates' correction, Fisher's exact, exact unconditional and mid-p), as well as the widely discouraged linear model (LM) regression. We found that both LM and GLM analyses provided unbiased estimates of the difference in proportions between groups. LM and GLM analyses also provided accurate standard errors and confidence intervals when the experimental design was balanced. When the experimental design was unbalanced, sample size was small, and one of the two groups had a probability close to 1 or 0, LM analysis could substantially over- or under-represent statistical uncertainty. For null hypothesis significance testing, the performance of the chi-squared test and LM analysis was almost identical: across all scenarios, both had high power to detect non-null effects while controlling false positives. By contrast, the GLM analysis was underpowered when using z-based p-values, in particular when one of the two groups had a probability near 1 or 0; the GLM using the likelihood ratio test (LRT) had better power to detect non-null results. Our simulation results suggest that, wherever a chi-squared test would be recommended, a linear regression is a suitable alternative for the analysis of 2 × 2 contingency table data. When researchers opt for more sophisticated procedures, we provide R functions to calculate the standard error of a difference between two probabilities from a Bernoulli GLM output using the delta method. We also explore approaches to complement GLM analysis of 2 × 2 contingency tables with credible intervals on the probability scale. These additional operations should support researchers in making valid assessments of both statistical and practical significance.
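The paper provides R functions for these calculations; as a rough Python analogue on invented counts, the sketch below runs the Pearson chi-squared test and computes the difference in proportions with the Wald standard error, which is what the delta method yields when a Bernoulli GLM's effect is re-expressed on the probability scale:

```python
import numpy as np
from scipy.stats import chi2_contingency, norm

# Invented 2x2 contingency table: rows = groups, columns = outcome (yes/no).
table = np.array([[18, 42],   # group A: 18 of 60 with the outcome
                  [31, 29]])  # group B: 31 of 60 with the outcome

# Classical Pearson chi-squared test (Yates' correction disabled).
chi2, p, dof, expected = chi2_contingency(table, correction=False)
print(f"chi-squared = {chi2:.3f}, p = {p:.4f}")

# Difference in proportions with a Wald standard error; the delta method
# applied to a Bernoulli GLM gives this same SE on the probability scale.
n1, n2 = table.sum(axis=1)
p1, p2 = table[:, 0] / table.sum(axis=1)
diff = p2 - p1
se = np.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)
lo, hi = diff + np.array([-1, 1]) * norm.ppf(0.975) * se
print(f"difference = {diff:.3f}, 95% CI = [{lo:.3f}, {hi:.3f}]")
```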
Big data analytics capability in supply chain agility
Purpose: The purpose of this paper is to examine when and how organizations build big data analytics capability (BDAC) to improve supply chain agility (SCA) and gain competitive advantage.
Design/methodology/approach: The authors grounded the theoretical framework in two perspectives: the dynamic capabilities view and contingency theory. To test the research hypotheses, the authors gathered 173 usable responses using a pre-tested questionnaire.
Findings: The results suggest that BDAC has a positive and significant effect on SCA and competitive advantage. Further, the results support the hypothesis that organizational flexibility (OF) has a positive and significant moderating effect on the path joining BDAC and SCA. However, contrary to expectations, the authors found no support for a moderating effect of OF on the path joining BDAC and competitive advantage.
Originality/value: The study makes useful contributions to the literature on BDAC, SCA, OF, and competitive advantage. The results may further motivate future scholars to replicate the findings using longitudinal data.
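The moderation hypotheses here were presumably tested with survey-based structural models; the snippet below is only a generic sketch of what a moderation test looks like as a regression interaction, on synthetic data with invented variable names (bdac, of_, sca) echoing the constructs and the paper's sample size of 173:

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Synthetic stand-ins: bdac = big data analytics capability, of_ =
# organizational flexibility (moderator), sca = supply chain agility.
rng = np.random.default_rng(7)
n = 173  # echoing the paper's sample size
df = pd.DataFrame({"bdac": rng.normal(size=n), "of_": rng.normal(size=n)})
df["sca"] = (0.5 * df.bdac + 0.3 * df.of_
             + 0.25 * df.bdac * df.of_          # built-in moderation effect
             + rng.normal(scale=0.8, size=n))

# Moderation shows up as a significant bdac:of_ interaction coefficient.
model = smf.ols("sca ~ bdac * of_", data=df).fit()
print(model.summary().tables[1])
```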
Diagnosis of knee meniscal injuries using artificial intelligence: A systematic review and meta-analysis of diagnostic performance
The aim was to systematically review the literature and perform a meta-analysis to estimate the performance of artificial intelligence (AI) algorithms in detecting meniscal injuries. A systematic search was performed in the Scopus, PubMed, EBSCO, Cinahl, Web of Science, IEEE Xplore, and Cochrane Central databases in July 2024. The included studies' reporting quality and risk of bias were evaluated using the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD) statement and the Prediction Model Study Risk of Bias Assessment Tool (PROBAST), respectively. A meta-analysis was performed using contingency tables to estimate diagnostic performance metrics (sensitivity and specificity), and a meta-regression analysis investigated the effect of the following variables on the main outcome: imaging view, use of data augmentation and transfer learning, and presence of a meniscal tear in the injury, with corresponding 95% confidence intervals (CI) and a p-value of 0.05 as the threshold for significance. Among the 28 included studies, 92 contingency tables were extracted from 15 studies. The reference standards of the studies were mostly expert radiologists, orthopedists, or surgical reports. On internal validation, the pooled sensitivity and specificity were 81% (95% CI: 78, 85) and 78% (95% CI: 72, 83) for AI algorithms, and 85% (95% CI: 76, 91) and 88% (95% CI: 83, 92) for clinicians, respectively. For studies validating algorithms on an external test set, the pooled sensitivity and specificity were 82% (95% CI: 74, 88) and 88% (95% CI: 84, 91), respectively. These results imply lower diagnostic performance of AI-based algorithms for knee meniscal injuries compared with clinicians.
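Meta-analyses of diagnostic accuracy typically pool contingency tables with bivariate random-effects models; the sketch below shows only the first step, computing each study's sensitivity and specificity from its 2x2 table, with invented counts and a naive fixed-effect pooling standing in for the proper model:

```python
def sens_spec(tp, fn, fp, tn):
    """Sensitivity and specificity from one diagnostic 2x2 table."""
    return tp / (tp + fn), tn / (tn + fp)

# Invented per-study tables as (tp, fn, fp, tn). A real meta-analysis
# would fit a bivariate random-effects model rather than summing cells.
studies = [(45, 9, 12, 50), (88, 22, 15, 70), (30, 8, 6, 41)]

tp, fn, fp, tn = (sum(s[i] for s in studies) for i in range(4))
sens, spec = sens_spec(tp, fn, fp, tn)
print(f"pooled sensitivity = {sens:.1%}, pooled specificity = {spec:.1%}")
```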
AI technologies and their impact on supply chain resilience during COVID-19
Purpose: COVID-19 has pushed many supply chains to re-think and strengthen their resilience, and to consider how resilience can help organisations survive in difficult times. Considering the availability of data and the huge number of supply chains that had their weak links exposed during COVID-19, the objective of the study is to employ artificial intelligence to develop supply chain resilience to withstand extreme disruptions such as COVID-19.
Design/methodology/approach: We adopted a qualitative approach, interviewing respondents with a semi-structured interview schedule through the lens of organisational information processing theory. A total of 31 respondents from the supply chain and information systems field shared their views on employing artificial intelligence (AI) for supply chain resilience during COVID-19. We used a process of open, axial and selective coding to extract interrelated themes and proposals that resulted in the establishment of our framework.
Findings: An AI-facilitated supply chain helps systematically develop resilience in its structure and network. Resilient supply chains in dynamic settings and during extreme disruption scenarios are capable of recognising (sensing risks, degree of localisation, failure modes and data trends), analysing (what-if scenarios, realistic customer demand, stress-test simulation and constraints), reconfiguring (automation, re-alignment of a network, tracking effort, physical security threats and control) and activating (establishing operating rules, contingency management, managing demand volatility and mitigating supply chain shock) operations quickly.
Research limitations/implications: As the present research was conducted through semi-structured qualitative interviews to understand the role of AI in supply chain resilience during COVID-19, the respondents may have an inclination towards a specific role of AI due to their limited exposure.
Practical implications: Supply chain managers can utilise data to embed the required degree of resilience in their supply chains by considering the proposed framework elements and phases.
Originality/value: The present research contributes a framework that presents a four-phased, structured and systematic platform considering the required information processing capabilities across the recognise, analyse, reconfigure and activate phases to ensure supply chain resilience.
Bolstering green supply chain integration via big data analytics capability: the moderating role of data-driven decision culture
Purpose: Based on organizational information processing theory, this research explores how big data analytics capability (BDAC) contributes to green supply chain integration (GSCI) and the contingency role that data-driven decision culture plays.
Design/methodology/approach: Using two-wave survey data collected from 317 Chinese manufacturing firms, the authors validate the hypotheses.
Findings: The results show that big data managerial capability has positive impacts on three dimensions of GSCI, while big data technical capability has positive impacts on green internal and customer integration. Moreover, green internal integration mediates the impacts of big data technical capability and managerial capability on green supplier and customer integration. Finally, data-driven decision culture alleviates the positive impacts of big data technical and managerial capability on green internal integration.
Practical implications: The findings suggest that firms can leverage big data technical and managerial capability to enhance information processing capability for achieving a higher degree of GSCI. Further, the critical role of data-driven decision culture in affecting the link between BDAC and GSCI should not be overlooked.
Originality/value: This research contributes to the literature on green supply chain management by revealing the role of BDAC in improving GSCI.
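As with the moderation sketch above, the mediation finding can be illustrated generically: the snippet below estimates an indirect effect as the product of two regression coefficients and bootstraps a confidence interval, on synthetic data with invented variables (x, m, y) and the paper's sample size of 317. The authors' actual estimation strategy is not specified in the abstract.

```python
import numpy as np
import statsmodels.api as sm

# Synthetic stand-ins: x = big data technical capability, m = green
# internal integration (mediator), y = green customer integration.
rng = np.random.default_rng(11)
n = 317  # echoing the paper's sample size
x = rng.normal(size=n)
m = 0.5 * x + rng.normal(size=n)
y = 0.4 * m + 0.1 * x + rng.normal(size=n)

def indirect_effect(x, m, y):
    """Product-of-coefficients indirect effect a*b (x -> m -> y)."""
    a = sm.OLS(m, sm.add_constant(x)).fit().params[1]
    b = sm.OLS(y, sm.add_constant(np.column_stack([m, x]))).fit().params[1]
    return a * b

# Percentile bootstrap CI for the mediated effect.
boot = []
for _ in range(2000):
    idx = rng.integers(0, n, n)
    boot.append(indirect_effect(x[idx], m[idx], y[idx]))
print("95% CI for indirect effect:", np.percentile(boot, [2.5, 97.5]).round(3))
```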
Drought characterisation based on an agriculture-oriented standardised precipitation index
Drought is a major natural hazard with significant effects in the agricultural sector, especially in arid and semi-arid regions. The accurate and timely characterisation of agricultural drought is crucial for devising contingency plans, including the necessary mitigation measures. Many drought indices have been developed during the last decades for drought characterisation and analysis. One of the most widely used indices worldwide is the Standardised Precipitation Index (SPI). Although other comprehensive indices have been introduced over the years, SPI remains the most broadly accepted index due to a number of reasons, the most important of which are its simple structure and the fact that it uses only precipitation data. In this paper, a modified version of SPI is proposed, namely the Agricultural Standardised Precipitation Index (aSPI), based on the substitution of the total precipitation by the effective precipitation, which describes more accurately the amount of water that can be used productively by the plants. Further, the selection of the most suitable reference periods and time steps for agricultural drought identification using aSPI is discussed. This conceptual enhancement of SPI aims at improving the suitability of the index for agricultural drought characterisation, while retaining the advantages of the original index, including its dependence only on precipitation data. The evaluation of the performance of both SPI and aSPI in terms of correlating drought magnitude with crop yield response in four regions of Greece under Mediterranean conditions indicated that aSPI is more robust than the original index in identifying agricultural drought.
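The mechanics of SPI (and hence aSPI) are compact enough to sketch: fit a gamma distribution to precipitation totals accumulated over the chosen time step, then map each total's cumulative probability to a standard normal score. The Python sketch below uses fake precipitation data, omits the zero-precipitation correction and per-calendar-month fitting used in practice, and stubs the effective-precipitation step with a placeholder rule that is not the paper's formulation:

```python
import numpy as np
from scipy.stats import gamma, norm

def spi(precip, scale_months=3):
    """Standardised Precipitation Index: fit a gamma distribution to
    rolling precipitation totals, then map cumulative probabilities to
    standard normal scores. (Zero-precipitation handling and fitting
    per calendar month, both used in practice, are omitted here.)"""
    totals = np.convolve(precip, np.ones(scale_months), mode="valid")
    a, loc, b = gamma.fit(totals, floc=0)  # fix the location at zero
    return norm.ppf(gamma.cdf(totals, a, loc=loc, scale=b))

def effective_precip(precip, cap=100.0):
    """Placeholder for effective precipitation (the share of rainfall
    actually usable by crops): cap heavy events whose excess is assumed
    lost to runoff. This is NOT the paper's formulation."""
    return np.minimum(np.asarray(precip), cap)

monthly_rain = np.random.default_rng(0).gamma(2.0, 30.0, size=240)  # fake data
print("SPI:      ", spi(monthly_rain)[:5].round(2))
print("aSPI-like:", spi(effective_precip(monthly_rain))[:5].round(2))
```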
BET on Independence
We study the problem of nonparametric dependence detection. Many existing methods may suffer severe power loss due to nonuniform consistency, which we illustrate with a paradox. To avoid such power loss, we approach the nonparametric test of independence through the new framework of binary expansion statistics (BEStat) and binary expansion testing (BET), which examine dependence through a novel binary expansion filtration approximation of the copula. Through a Hadamard transform, we find that the symmetry statistics in the filtration are complete sufficient statistics for dependence. These statistics are also uncorrelated under the null. By using symmetry statistics, the BET avoids the problem of nonuniform consistency and improves upon a wide class of commonly used methods (a) by achieving the minimax rate in sample size requirement for reliable power and (b) by providing clear interpretations of global relationships upon rejection of independence. The binary expansion approach also connects the symmetry statistics with the current computing system to facilitate efficient bitwise implementation. We illustrate the BET with a study of the distribution of stars in the night sky and with an exploratory data analysis of the TCGA breast cancer data. Supplementary materials for this article are available online.
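At depth 1, the binary expansion reduces to a classical quadrant (Blomqvist-type) statistic: take the first binary digit of each rank-transformed margin and count sign agreements. The toy Python sketch below shows only this depth-1 case on simulated data; the actual BET examines all cross-interactions of binary digits up to a chosen depth:

```python
import numpy as np
from scipy.stats import binomtest, rankdata

def bet_depth1(x, y):
    """Toy depth-1 binary expansion test: the first binary digit of each
    rank-transformed margin splits the data at its median; dependence
    shows up as an excess (or deficit) of sign agreements."""
    n = len(x)
    a = rankdata(x) / (n + 1) > 0.5  # first bit of the binary expansion
    b = rankdata(y) / (n + 1) > 0.5
    agreements = int(np.sum(a == b))
    # Under independence, agreements ~ Binomial(n, 1/2), approximately.
    return binomtest(agreements, n, 0.5).pvalue

rng = np.random.default_rng(1)
x = rng.normal(size=500)
print("independent pair:", bet_depth1(x, rng.normal(size=500)))
print("dependent pair:  ", bet_depth1(x, x + rng.normal(scale=0.5, size=500)))
```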
Bayesian Nonparametric Ordination for the Analysis of Microbial Communities
Human microbiome studies use sequencing technologies to measure the abundance of bacterial species or Operational Taxonomic Units (OTUs) in samples of biological material. Typically the data are organized in contingency tables with OTU counts across heterogeneous biological samples. In the microbial ecology community, ordination methods are frequently used to investigate latent factors or clusters that capture and describe variations of OTU counts across biological samples. It remains important to evaluate how uncertainty in estimates of each biological sample's microbial distribution propagates to ordination analyses, including visualization of clusters and projections of biological samples on low-dimensional spaces. We propose a Bayesian analysis for dependent distributions to endow frequently used ordinations with estimates of uncertainty. A Bayesian nonparametric prior for dependent normalized random measures is constructed, which is marginally equivalent to the normalized generalized Gamma process, a well-known prior for nonparametric analyses. In our prior, the dependence and similarity between microbial distributions is represented by latent factors that concentrate in a low-dimensional space. We use a shrinkage prior to tune the dimensionality of the latent factors. The resulting posterior samples of model parameters can be used to evaluate uncertainty in analyses routinely applied in microbiome studies. Specifically, by combining them with multivariate data analysis techniques we can visualize credible regions in ecological ordination plots. The characteristics of the proposed model are illustrated through a simulation study and applications in two microbiome datasets. Supplementary materials for this article are available online.
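A crude frequentist analogue of the paper's goal, propagating uncertainty in each sample's microbial distribution into the ordination, can be sketched by drawing each sample's OTU proportions from an independent Dirichlet posterior and projecting every draw onto fixed ordination axes. The counts below are simulated, and the independent Dirichlet is far simpler than the paper's dependent nonparametric prior:

```python
import numpy as np

rng = np.random.default_rng(3)
counts = rng.poisson(5, size=(10, 40))  # fake table: samples x OTUs

# Fixed ordination axes from the point estimate of each sample's
# composition (add-one smoothing), via SVD of the centered proportions.
probs_hat = (counts + 1) / (counts + 1).sum(axis=1, keepdims=True)
center = probs_hat.mean(axis=0)
_, _, vt = np.linalg.svd(probs_hat - center, full_matrices=False)
axes = vt[:2].T

# Draw each sample's OTU distribution from an independent Dirichlet
# posterior and project onto the same axes; the scatter of each sample's
# coordinates gives a rough uncertainty region for the ordination plot.
draws = np.stack([
    (np.vstack([rng.dirichlet(row + 1) for row in counts]) - center) @ axes
    for _ in range(200)
])
print(draws.std(axis=0).round(3))  # per-sample spread on each axis
```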