Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
14
result(s) for
"Association relation mining algorithm"
Sort by:
Exploring a Student-Centered One-Stop Community Service Model
2024
The student community in colleges and universities is based on students’ common living areas, and the community service model in colleges and universities should be student-oriented and centered on students’ development. This paper proposes a one-stop community service model from a student-oriented perspective, with the service community model and service recommendation model being the main component modules. In the service community model, a context-based association relationship mining algorithm is proposed to add time and location contexts to the collaborative filtering algorithm in order to obtain a collection of similar users and services. After constructing the one-stop service community, a service recommendation algorithm based on a trusted coalition is proposed to introduce student credibility and service usage frequency to achieve personalized recommendations of services. University H’s student community implemented the one-stop community service model. After the practice, the mean value of each dimension of the community’s service mode and content evaluation was greater than 3, and the overall satisfaction evaluation value of the community was 39.49, which was extremely significant compared with the evaluation value of University C (P<0.01). The mean value of students’ mental health evaluation reached 3.33.
Journal Article
TBGA: a large-scale Gene-Disease Association dataset for Biomedical Relation Extraction
2022
Background
Databases are fundamental to advance biomedical science. However, most of them are populated and updated with a great deal of human effort. Biomedical Relation Extraction (BioRE) aims to shift this burden to machines. Among its different applications, the discovery of Gene-Disease Associations (GDAs) is one of BioRE most relevant tasks. Nevertheless, few resources have been developed to train models for GDA extraction. Besides, these resources are all limited in size—preventing models from scaling effectively to large amounts of data.
Results
To overcome this limitation, we have exploited the DisGeNET database to build a large-scale, semi-automatically annotated dataset for GDA extraction. DisGeNET stores one of the largest available collections of genes and variants involved in human diseases. Relying on DisGeNET, we developed TBGA: a GDA extraction dataset generated from more than 700K publications that consists of over 200K instances and 100K gene-disease pairs. Each instance consists of the sentence from which the GDA was extracted, the corresponding GDA, and the information about the gene-disease pair.
Conclusions
TBGA is amongst the largest datasets for GDA extraction. We have evaluated state-of-the-art models for GDA extraction on TBGA, showing that it is a challenging and well-suited dataset for the task. We made the dataset publicly available to foster the development of state-of-the-art BioRE models for GDA extraction.
Journal Article
Disaster Precursor Identification and Early Warning of the Lishanyuan Landslide Based on Association Rule Mining
by
Xu, Junwei
,
Luo, Jianlan
,
He, Hongsheng
in
Algorithms
,
Apriori algorithm
,
association rule mining
2022
It is the core prerequisite of landslide warning to mine short-term deformation patterns and extract disaster precursors from real-time and multi-source monitoring data. This study used the sliding window method and gray relation analysis to obtain features from multi-source, real-time monitoring data of the Lishanyuan landslide in Hunan Province, China. Then, the k-means algorithm with particle swarm optimization was used for clustering. Finally, the Apriori algorithm is used to mine strong association rules between the high-speed deformation process and rainfall features of this landslide to obtain short-term deformation patterns and precursors of the disaster. The data mining results show that the landslide has a high-speed deformation probability of more than 80% when rainfall occurs within 24 h and the cumulative rainfall is greater than 130.60 mm within 7 days. It is of great significance to extract the short-term deformation pattern of landslides by data mining technology to improve the accuracy and reliability of early warning.
Journal Article
Mining microbe–disease interactions from literature via a transfer learning model
2021
Background
Interactions of microbes and diseases are of great importance for biomedical research. However, large-scale of microbe–disease interactions are hidden in the biomedical literature. The structured databases for microbe–disease interactions are in limited amounts. In this paper, we aim to construct a large-scale database for microbe–disease interactions automatically. We attained this goal via applying text mining methods based on a deep learning model with a moderate curation cost. We also built a user-friendly web interface that allows researchers to navigate and query required information.
Results
Firstly, we manually constructed a golden-standard corpus and a sliver-standard corpus (SSC) for microbe–disease interactions for curation. Moreover, we proposed a text mining framework for microbe–disease interaction extraction based on a pretrained model BERE. We applied named entity recognition tools to detect microbe and disease mentions from the free biomedical texts. After that, we fine-tuned the pretrained model BERE to recognize relations between targeted entities, which was originally built for drug–target interactions or drug–drug interactions. The introduction of SSC for model fine-tuning greatly improved detection performance for microbe–disease interactions, with an average reduction in error of approximately 10%. The MDIDB website offers data browsing, custom searching for specific diseases or microbes, and batch downloading.
Conclusions
Evaluation results demonstrate that our method outperform the baseline model (rule-based PKDE4J) with an average
F
1
-score of 73.81%. For further validation, we randomly sampled nearly 1000 predicted interactions by our model, and manually checked the correctness of each interaction, which gives a 73% accuracy. The MDIDB webiste is freely avaliable throuth
http://dbmdi.com/index/
Journal Article
Assigning factuality values to semantic relations extracted from biomedical research literature
by
Rosemblat, Graciela
,
Kilicoglu, Halil
,
Rindflesch, Thomas C.
in
Annotations
,
Artificial intelligence
,
Associations
2017
Biomedical knowledge claims are often expressed as hypotheses, speculations, or opinions, rather than explicit facts (propositions). Much biomedical text mining has focused on extracting propositions from biomedical literature. One such system is SemRep, which extracts propositional content in the form of subject-predicate-object triples called predications. In this study, we investigated the feasibility of assessing the factuality level of SemRep predications to provide more nuanced distinctions between predications for downstream applications. We annotated semantic predications extracted from 500 PubMed abstracts with seven factuality values (fact, probable, possible, doubtful, counterfact, uncommitted, and conditional). We extended a rule-based, compositional approach that uses lexical and syntactic information to predict factuality levels. We compared this approach to a supervised machine learning method that uses a rich feature set based on the annotated corpus. Our results indicate that the compositional approach is more effective than the machine learning method in recognizing the factuality values of predications. The annotated corpus as well as the source code and binaries for factuality assignment are publicly available. We will also incorporate the results of the better performing compositional approach into SemMedDB, a PubMed-scale repository of semantic predications extracted using SemRep.
Journal Article
Electronic medical records imputation by temporal Generative Adversarial Network
2024
The loss of electronic medical records has seriously affected the practical application of biomedical data. Therefore, it is a meaningful research effort to effectively fill these lost data. Currently, state-of-the-art methods focus on using Generative Adversarial Networks (GANs) to fill the missing values of electronic medical records, achieving breakthrough progress. However, when facing datasets with high missing rates, the imputation accuracy of these methods sharply deceases. This motivates us to explore the uncertainty of GANs and improve the GAN-based imputation methods. In this paper, the GRUD (Gate Recurrent Unit Decay) network and the UGAN (Uncertainty Generative Adversarial Network) are proposed and organically combined, called UGAN-GRUD. In UGAN-GRUD, it highlights using GAN to generate imputation values and then leveraging GRUD to compensate them. We have designed the UGAN and the GRUD network. The former is employed to learn the distribution pattern and uncertainty of data through the Generator and Discriminator, iteratively. The latter is exploited to compensate the former by leveraging the GRUD based on time decay factor, which can learn the specific temporal relations in electronic medical records. Through experimental research on publicly available biomedical datasets, the results show that UGAN-GRUD outperforms the current state-of-the-art methods, with average 13% RMSE (Root Mean Squared Error) and 24.5% MAPE (Mean Absolute Percentage Error) improvements.
Journal Article
Maternal epigenetic index links early neglect to later neglectful care and other psychopathological, cognitive, and bonding effects
by
López Rodríguez, Maykel
,
Mitchell, Colter
,
Góngora, Daylín
in
Adult
,
Adult Survivors of Child Abuse - psychology
,
Algorithms
2025
Background
Past experiences of maltreatment and life adversity induce DNA methylation changes in adults, but less is known about their impact on mothers’ maladaptive neglectful parenting and its negative effects. We performed an epigenome-wide association study to investigate the role of DNA methylation levels in mothers with neglectful care, who were exposed to childhood maltreatment and neglect, and their current negative effects. Saliva DNA methylation was determined with the Illumina Human Methylation EPIC BeadChip v1. The individual epigenome was the input to a machine learning algorithm for trajectory inference, which assigned a specific state to each mother in the progression from healthy controls to the extreme neglect condition. A compound epigenetic maternal neglect score (EMN) was derived from 138 mothers (
n
= 51 in the neglectful group;
n
= 87 in the control non-neglectful group) having young children. Differential methylation between groups was utilized to derive the EMNs adjusted for education level, age, experimental variables, and blood cell types in saliva samples.
Results
Structural equation modeling:
X
2
(29) = 37.81;
p
= 0.127; RMSEA = 0.048, confirmed that EMNs link their early experience of physical neglect to current reports of psychopathological symptoms, lower cognitive status, and observed poor mother–child emotional availability. A third of the genes annotated to the CpGs that affect EMNs are related to cognitive impairment and neurodegenerative and psychopathological disorders.
Conclusions
EMNs are a novel index to assess the contribution of DNA methylations as a neglected girl to later neglectful caregiving behavior and other negative effects. The evidence provided expands the possibilities for earlier interventions on the neglect condition to prevent and ameliorate the direct or indirect epigenetic impact of maternal adversities on mother–child care, helping to break the cycle of maltreatment.
Journal Article
A Correlation Analysis Method for Geographical Object Flows from a Geoeconomic Perspective
2022
Geographic object flow is the reason behind the relationship of geographic units. There are interactions in the process of dynamic change of a geographic object flow, and its regularity, which can reflect the relationship or pattern of geographic units in a region. In this paper, an association rule mining method for the geographic object flow linkage process is studied from a geoeconomics perspective. Additionally, an association rule mining algorithm with hierarchical constraints is proposed. Data segmentation is performed according to the time series characteristics of geographic object flow data. The basic attributes for the association rule mining are determined based on the basic parameters of geographic object flows, and a database for the association rule mining is formed according to the characteristics of the hierarchical structure of the geographic object flows. Based on the obtained data, the association rule mining algorithm with hierarchical constraints obtained using a parent–child matrix is improved by adding the Apriori algorithm. With the Indo-Pacific region as an example, the trade flow association rules for 25 countries in the region from 2010 to 2021 are selected. In addition, a mathematical statistical analysis of the strongly associated mined trade flows and geoeconomic factors is conducted in terms of both a basic feature analysis of trade flow associations and a country-oriented trade flow association analysis by considering domain knowledge. The effectiveness of the method has been evaluated from various perspectives such as correlation analysis, mathematical statistics, and comparison with the findings of existing studies and proved the validity of the method.
Journal Article
miRiaD: A Text Mining Tool for Detecting Associations of microRNAs with Diseases
by
Wu, Cathy H.
,
Gupta, Samir
,
Ross, Karen E.
in
Algorithms
,
Bioinformatics
,
Biological Ontologies
2016
Background
MicroRNAs are increasingly being appreciated as critical players in human diseases, and questions concerning the role of microRNAs arise in many areas of biomedical research. There are several manually curated databases of microRNA-disease associations gathered from the biomedical literature; however, it is difficult for curators of these databases to keep up with the explosion of publications in the microRNA-disease field. Moreover, automated literature mining tools that assist manual curation of microRNA-disease associations currently capture only one microRNA property (expression) in the context of one disease (cancer). Thus, there is a clear need to develop more sophisticated automated literature mining tools that capture a variety of microRNA properties and relations in the context of multiple diseases to provide researchers with fast access to the most recent published information and to streamline and accelerate manual curation.
Methods
We have developed
miRiaD
(
mi
cro
R
NAs
i
n
a
ssociation with
D
isease), a text-mining tool that automatically extracts associations between microRNAs and diseases from the literature. These associations are often not directly linked, and the intermediate relations are often highly informative for the biomedical researcher. Thus, miRiaD extracts the miR-disease pairs together with an explanation for their association. We also developed a procedure that assigns scores to sentences, marking their informativeness, based on the microRNA-disease relation observed within the sentence.
Results
miRiaD was applied to the entire Medline corpus, identifying 8301 PMIDs with miR-disease associations. These abstracts and the miR-disease associations are available for browsing at
http://biotm.cis.udel.edu/miRiaD
. We evaluated the recall and precision of miRiaD with respect to information of high interest to public microRNA-disease database curators (expression and target gene associations), obtaining a recall of 88.46–90.78. When we expanded the evaluation to include sentences with a wide range of microRNA-disease information that may be of interest to biomedical researchers, miRiaD also performed very well with a F-score of 89.4. The informativeness ranking of sentences was evaluated in terms of nDCG (0.977) and correlation metrics (0.678-0.727) when compared to an annotator’s ranked list.
Conclusions
miRiaD, a high performance system that can capture a wide variety of microRNA-disease related information, extends beyond the scope of existing microRNA-disease resources. It can be incorporated into manual curation pipelines and serve as a resource for biomedical researchers interested in the role of microRNAs in disease. In our ongoing work we are developing an improved miRiaD web interface that will facilitate complex queries about microRNA-disease relationships, such as “In what diseases does microRNA regulation of apoptosis play a role?” or “Is there overlap in the sets of genes targeted by microRNAs in different types of dementia?”.”
Journal Article
Mining Keywords from Short Text Based on LDA-Based Hierarchical Semantic Graph Model
2020
Extracting keywords from a text set is an important task. Most of the previous studies extract keywords from a single text. Using the key topics in the text collection, the association relationship between the topic and the topic in the cross-text, and the association relationship between the words and the words in the cross-text has not played an important role in the previous method of extracting keywords from the text collection. In order to improve the accuracy of extracting keywords from text collections, using the semantic relationship between topics and topics in texts and highlighting the semantic relationship between words and words under the key topics, this article proposes an unsupervised method for mining keywords from short text collections. In this method, a two level semantic association model is used to link the semantic relations between topics and the semantic relations between words, and extract the key words based on the combined action. First, the text is represented with LDA; the authors used word2vec to calculate the semantic association between topic and topic, and build a semantic relation graph between topics, that is the upper level graph, and use a graph ranking algorithm to calculate each topic score. In the lower layer, the semantic association between words and words is calculated by using the topic scores and the relationship between topics in the upper network allow a graph to be constructed. Using a graph sorting algorithm sorts the words in short text sets to determine the keywords. The experimental results show that the method is better for extracting keywords from the text set, especially in short articles. In the text, the important topics, the relationship between topics and the correlation between words can improve the accuracy of extracting keywords from the text set.
Journal Article