Catalogue Search | MBRL

Handbook of research on opinion mining and text analytics on literary works and social media

by Keikhosrokiani, Pantea, 1982- editor , Pourya Asl, Moussa, 1986- editor in Text processing (Computer science) , Content analysis (Communication) Data processing. , Social media Data processing.

"This book uses artificial intelligence and big data analytics to conduct opinion mining and text analytics on literary works and social media, focusing on theories, method, applications and approaches of data analytic techniques that can be used to extract and analyze data from literary books and social media, in a meaningful pattern"-- Provided by publisher.

Book

Practical text mining and statistical analysis for non-structured text data applications

by Delen, Dursun , Elder, John , Miner, Gary in Data mining , Statistics

2012

Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications brings together all the information, tools and methods a professional will need to efficiently use text mining applications and statistical analysis. Winner of a 2012 PROSE Award in Computing and Information Sciences from the Association of American Publishers.

eBook

Subjective well-being and social media

by Iacus, Stefano M. (Stefano Maria), author , Porro, Giuseppe, author in Online social networks Use studies. , Online social networks Research Data processing. , Online social networks Psychological aspects.

"Subjective Well-Being and Social Media shows how, by exploiting the unprecedented amount of information provided by the social networking sites, it is possible to build new composite indicators of subjective well-being. These new social media indicators are complementary to official statistics and surveys, whose data are collected at very low temporal and geographical resolution. The book also explains in full detail how to solve the problem of selection bias coming from social media data. Mixing textual analysis, machine learning and time series analysis, the book also shows how to extract both the structural and the temporary components of subjective well-being. Cross-country analysis confirms that well-being is a complex phenomenon that is governed by macroeconomic and health factors, ageing, temporary shocks and cultural and psychological aspects. As an example, the last part of the book focuses on the impact of the prolonged stress due to the COVID-19 pandemic on subjective well-being in both Japan and Italy. Through a data science approach, the results show that a consistent and persistent drop occurred throughout 2020 in the overall level of well-being in both countries. The methodology presented in this book: enables social scientists and policy makers to know what people think about the quality of their own life, minimizing the bias induced by the interaction between the researcher and the observed individuals; being language-free, it allows for comparing the well-being perceived in different linguistic and socio-cultural contexts, disentangling differences due to objective events and life conditions from dissimilarities related to social norms or language specificities; provides a solution to the problem of selection bias in social media data through a systematic approach based on time-space small area estimation models. The book also comes with replication R scripts and data. Stefano M. Iacus is full professor of Statistics at the University of Milan, on leave at the Joint Research Centre of the European Commission. Former R-core member (1999-2017) and R Foundation Member. Giuseppe Porro is full professor of Economic Policy at the University of Insubria. An earlier version of this project was awarded the Italian Institute of Statistics-Google prize for 'official statistics and big data'"-- Provided by publisher.

Book

A systematic review of text mining approaches applied to various application areas in the biomedical domain

by Khedo, Kavi Kumar , Cheerkoot-Jalim, Sudha in Algorithms , Application , Automation

2021

Purpose: This work shows the results of a systematic literature review on biomedical text mining. The purpose of this study is to identify the different text mining approaches used in different application areas of the biomedical domain, the common tools used and the challenges of biomedical text mining as compared to generic text mining algorithms. This study will be of value to biomedical researchers by allowing them to correlate text mining approaches to specific biomedical application areas. Implications for future research are also discussed. Design/methodology/approach: The review was conducted following the principles of the Kitchenham method. A number of research questions were first formulated, followed by the definition of the search strategy. The papers were then selected based on a list of assessment criteria. Each of the papers was analyzed and information relevant to the research questions was extracted. Findings: It was found that researchers have mostly harnessed data sources such as electronic health records, biomedical literature, social media and health-related forums. The most common text mining technique was natural language processing using tools such as MetaMap and Unstructured Information Management Architecture, alongside the use of medical terminologies such as Unified Medical Language System. The main application area was the detection of adverse drug events. Challenges identified included the need to deal with huge amounts of text, the heterogeneity of the different data sources, the duality of meaning of words in biomedical text and the amount of noise introduced mainly from social media and health-related forums. Originality/value: To the best of the authors’ knowledge, other reviews in this area have focused on either specific techniques, specific application areas or specific data sources. The results of this review will help researchers to correlate most relevant and recent advances in text mining approaches to specific biomedical application areas by providing an up-to-date and holistic view of work done in this research area. The use of emerging text mining techniques has great potential to spur the development of innovative applications, thus considerably impacting on the advancement of biomedical research.

Journal Article

Comparative analysis of text mining and clustering techniques for assessing functional dependency between manual test cases

by de Oliveira Neto, Francisco Gomes , Hatvani, Leo , Feldt, Robert in Accuracy , Algorithms , Artificial intelligence

2025

Text mining techniques, particularly those leveraging machine learning for natural language processing, have gained significant attention for qualitative data analysis in software testing. However, their complexity and lack of transparency can pose challenges, especially in safety-critical domains where simpler, interpretable solutions are often preferred unless accuracy is heavily compromised. This study investigates the trade-offs between complexity, effort, accuracy, and utility in text mining and clustering techniques, focusing on their application for detecting functional dependencies among manual integration test cases in safety-critical systems. Using empirical data from an industrial testing project at ALSTOM Sweden, we evaluate various string distance methods, NCD compressors, and machine learning approaches. The results highlight the impact of preprocessing techniques, such as tokenization, and intrinsic factors, such as text length, on algorithm performance. Findings demonstrate how text mining and clustering can be optimized for safety-critical contexts, offering actionable insights for researchers and practitioners aiming to balance simplicity and effectiveness in their testing workflows.

Journal Article

Functional Applications of Text Analytics Systems

by Simske, Steven in Computing and Processing , General Topics for Engineers , Text data mining

2020, 2021

Text analytics consist of the statistics about a text element, which includes the word count, the word histogram, and the word frequency histogram. Most text documents of value are related to other (sometimes many other) documents, and so analytics describing the relative frequency of terms in a document compared to its peers are important for defining key words (tagging, labeling, indexing), search-responsive terms (query terms), and compressed versions of the documents (key words, summary, etc.). This clearly written text explains the functional applications of search, translation, optimization, and learning with regard to text analytics. Generation of analytics is aided by a hybrid, ensemble, or other combinatorial approach in which two or more effective analytic processes are used simultaneously, and their outputs combined to form a better “consensus”. Additional value to the preservation of the information is provided through these methods. Also, since they encompass capabilities of two or more knowledge-generating systems, they can create a “superset” of access points to the data generated. The book also describes the role of functional approaches in the testing and configuration of these systems.

eBook

Psychological Well-Being of Left-Behind Children in China: Text Mining of the Social Media Website Zhihu

by Julian Chun-Chung Chow , Yuwen Lyu , Cheng Ren in Artificial intelligence , Behavioral and Social Science , Big Data

2022

China’s migrant population has significantly contributed to its economic growth; however, the impact on the well-being of left-behind children (LBC) has become a serious public health problem. Text mining is an effective tool for identifying people’s mental state, and is therefore beneficial in exploring the psychological mindset of LBC. Traditional data collection methods, which use questionnaires and standardized scales, are limited by their sample sizes. In this study, we created a computational application to quantitatively collect personal narrative texts posted by LBC on Zhihu, which is a Chinese question-and-answer online community website; 1475 personal narrative texts posted by LBC were gathered. We used four types of words, i.e., first-person singular pronouns, negative words, past tense verbs, and death-related words, all of which have been associated with depression and suicidal ideations in the Chinese Linguistic Inquiry Word Count (CLIWC) dictionary. We conducted vocabulary statistics on the personal narrative texts of LBC, and bilateral t-tests, with a control group, to analyze the psychological well-being of LBC. The results showed that the proportion of words related to depression and suicidal ideations in the texts of LBC was significantly higher than in the control group. The differences, with respect to the four word types (i.e., first-person singular pronouns, negative words, past tense verbs, and death-related words), were 5.37, 2.99, 2.65, and 2.00 times, respectively, suggesting that LBC are at a higher risk of depression and suicide than their counterparts. By sorting the texts of LBC, this research also found that child neglect is a main contributing factor to psychological difficulties of LBC. Furthermore, mental health problems and the risk of suicide in vulnerable groups, such as LBC, are a global public health issue, as well as an important research topic in the era of digital public health. Through a linguistic analysis, the results of this study confirmed that the experiences of left-behind children negatively impact their mental health. The present findings suggest that it is vital for the public and nonprofit sectors to establish online suicide prevention and intervention systems to improve the well-being of LBC through digital technology.

Journal Article

Demystifying user needs in wardrobe furniture design: A network analysis via text mining and DEMATEL-ANP integration

by Xu, Jinyang , Li, Xinlian , Jia, Peijun in Aesthetics , Consumers , Data analysis

2025

Core user demands of wardrobe furniture design are becoming increasingly complex. Traditional design methods fail to systematically analyze the interrelationships among these multidimensional factors. This study integrated web text mining, the Decision-Making Trial and Evaluation Laboratory (DEMATEL) method, and the Analytic Network Process (ANP) to construct a causal network model for wardrobe design, and further optimized design proposals through the Technique for Order Preference by Similarity to Ideal Solution (TOPSIS). By applying Python technology, user evaluation data were extracted from mainstream e-commerce platforms, with high-frequency user demand keywords being identified and categorized into four key dimensions. DEMATEL was employed to quantify the causal intensity and centrality of the identified factors; ANP was subsequently utilized to construct a network hierarchy, revealing the feedback mechanisms between functional modules and user experience. Finally, TOPSIS was applied to rank three design proposals, among which Option 3—featuring flexible space partitioning, auto-sensing lighting, and anti-tip design—was selected as the optimal solution. The findings demonstrate that integrating text mining with the DEMATEL-ANP-TOPSIS framework can effectively identify the prioritization of user needs, thereby providing scientific decision support for furniture design.

Journal Article

A RESEARCH-BASED ONTOLOGY FOR COLLABORATIVE INNOVATION: A METHODOLOGY LEVERAGING AI AND DOMAIN EXPERT KNOWLEDGE

by Sharairi, Mohammad , Alshawabkeh, Abdallah , Kharbat, Faten in Artificial intelligence , artificial intelligence; ontology; research-based; accounting; text mining , Automation

2024

This paper introduces a method for creating a research-driven ontology to foster collaboration and innovation. The concept of collaborative innovation implies a process where multiple stakeholders work together to generate novel ideas, solutions or products. The suggested approach combines Artificial Intelligence (AI) and expert knowledge to build a comprehensive model encompassing various aspects of research, development and innovation. To demonstrate the feasibility of this method, the paper showcases its implementation in the field of accounting science. First, AI-powered machine-learning algorithms and text-mining techniques are used to extract the main ontological elements from a large corpus of accounting literature. Subsequently, expert knowledge is utilized to refine and validate these identified elements. The resulting ontology can be used as the foundation of a knowledge-based system to promote collaboration and analyze the state of innovation.

Journal Article

Edge Weight Updating Neural Network for Named Entity Normalization

by Cho, Sungzoon , Jeon, Sung Hwan in Automation , Bioinformatics , Canonical forms

2023

Discriminating matched named entity pairs and identifying the entities' canonical forms are critical in text mining tasks. More precise named entity normalization in text mining will benefit other subsequent text analytic applications. We built the named entity normalization model with a novel edge weight updating neural network. We then verified our model's performance on the NCBI disease, BC5CDR disease, and BC5CDR chemical databases, which are widely used named entity normalization datasets in the bioinformatics field. We also tested our model with our own financial named entity normalization dataset to validate the efficacy for more general applications. Using the constructed dataset, we differentiate named entity pairs. Our model achieved the highest named entity normalization performance in terms of various evaluation metrics. Our proposed model, when tested on four different datasets, achieved state-of-the-art results.

Journal Article
