Catalogue Search | MBRL

Multi‐Task Learning for Simultaneous Retrievals of Passive Microwave Precipitation Estimates and Rain/No‐Rain Classification

by Bannai, Takumi; Kim, Hyungjun; Lu, Keming in AI retrieval algorithm, Atmospheric precipitations, Classification

2023

Satellite‐based precipitation estimates provide frequent, large‐scale measurements. Deep learning has recently shown significant potential for improving estimation accuracy. Most studies have employed a two‐stage framework: a sequential architecture in which a rain/no‐rain binary classification task is followed by a rain rate regression task. This study proposes a novel precipitation retrieval framework in which these two tasks are trained simultaneously using a multi‐task learning (MTL) approach. Furthermore, a novel network architecture and loss function were designed to reap the benefits of MTL. The proposed two‐task model performed better than the conventional single‐task model, possibly due to efficient knowledge transfer between tasks. In addition, a product intercomparison showed that our product outperformed existing products in rain rate retrieval and also yielded better skill in the rain/no‐rain retrieval task.

Plain Language Summary: Satellite‐based observation can provide frequent, large‐scale precipitation measurements. Recently, machine learning techniques have been widely used in satellite precipitation estimates. This study introduces a novel deep learning (DL) method using a multi‐task approach. The proposed method enables the simultaneous learning of rain/no‐rain classification and rain rate estimates. The experiment determined that our method achieved a better result than conventional DL. Furthermore, a comparison with existing products demonstrated that our method provided a better rain rate estimate and comparable rain/no‐rain classification.

Key Points:
Multi‐task learning was devised to infer precipitation intensity and rain/no‐rain classification simultaneously.
Simultaneous learning demonstrated better performance than conventional single‐task learning.
Retrieval based on the proposed algorithm outperformed existing satellite precipitation products.
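
The abstract does not spell out the network or the loss function, so the following is only a minimal sketch of the general idea of training both tasks against one objective. It assumes a weighted sum of a binary cross-entropy term (rain/no-rain) and a squared-error term (rain rate); the names `multitask_loss`, `alpha`, and `threshold` are hypothetical and are not taken from the paper.

```python
import math

def binary_cross_entropy(p, y, eps=1e-7):
    """Cross-entropy for one predicted probability p and a 0/1 label y."""
    p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(y * math.log(p) + (1.0 - y) * math.log(1.0 - p))

def multitask_loss(rain_prob, rain_rate_pred, rain_rate_true,
                   threshold=0.1, alpha=0.5):
    """Weighted sum of a classification loss and a regression loss.

    Assumption: the rain/no-rain label is derived from the reference rain
    rate with a hypothetical threshold, and both terms are sample means.
    """
    n = len(rain_rate_true)
    cls_total = 0.0
    reg_total = 0.0
    for p, r_hat, r in zip(rain_prob, rain_rate_pred, rain_rate_true):
        label = 1.0 if r >= threshold else 0.0
        cls_total += binary_cross_entropy(p, label)
        reg_total += (r_hat - r) ** 2
    return alpha * cls_total / n + (1.0 - alpha) * reg_total / n

# Illustrative values only (not data from the study).
good = multitask_loss([0.99, 0.01], [2.0, 0.0], [2.0, 0.0])
bad = multitask_loss([0.20, 0.80], [0.5, 1.0], [2.0, 0.0])
print(good < bad)
```

In a two-stage pipeline the two terms would be optimised by separate models; here a single scalar drives both outputs, which is the property the abstract attributes to MTL.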

Journal Article

Can natural language processing help differentiate inflammatory intestinal diseases in China? Models applying random forest and convolutional neural network approaches

by Lin, Yucong; Yang, Aiming; Tong, Yuanren in Algorithms, Artificial Intelligence, Artificial neural networks

2020

Background: Differentiating between ulcerative colitis (UC), Crohn’s disease (CD) and intestinal tuberculosis (ITB) using endoscopy is challenging. We aimed to realize automatic differential diagnosis among these diseases through machine learning algorithms.

Methods: A total of 6399 consecutive patients (5128 UC, 875 CD and 396 ITB) who had undergone colonoscopy examinations in the Peking Union Medical College Hospital from January 2008 to November 2018 were enrolled. The input was the description of the endoscopic image in the form of free text. Word segmentation and keyword filtering were conducted as data preprocessing. Random forest (RF) and convolutional neural network (CNN) approaches were applied to different disease entities. Three two-class classifiers (UC and CD, UC and ITB, and CD and ITB) and a three-class classifier (UC, CD and ITB) were built.

Results: The classifiers built in this research performed well, and the CNN had better performance in general. The RF sensitivities/specificities of UC-CD, UC-ITB, and CD-ITB were 0.89/0.84, 0.83/0.82, and 0.72/0.77, respectively, while the values for the CNN of CD-ITB were 0.90/0.77. The precisions/recalls of UC-CD-ITB when employing RF were 0.97/0.97, 0.65/0.53, and 0.68/0.76, respectively, and when employing the CNN were 0.99/0.97, 0.87/0.83, and 0.52/0.81, respectively.

Conclusions: Classifiers built by RF and CNN approaches had excellent performance when classifying UC with CD or ITB. For the differentiation of CD and ITB, high specificity and sensitivity were achieved as well. Artificial intelligence through machine learning is very promising in helping inexperienced endoscopists differentiate inflammatory intestinal diseases.

Conference: The abstract of this article won the first prize of the Young Investigator Award during the Asian Pacific Digestive Week (APDW) 2019 held in Kolkata, India.
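
The results above are reported as sensitivity/specificity pairs for the two-class classifiers and precision/recall pairs for the three-class classifier. As a reminder of how these quantities relate, here is a small sketch computing them from confusion-matrix counts. The function name `binary_metrics` and the counts in the example are made up for illustration and are not data from the study.

```python
def binary_metrics(tp, fp, tn, fn):
    """Standard metrics from the four cells of a 2x2 confusion matrix.

    tp/fn: positive cases predicted positive/negative
    tn/fp: negative cases predicted negative/positive
    """
    def ratio(num, den):
        return num / den if den else 0.0  # guard against empty classes

    return {
        "sensitivity": ratio(tp, tp + fn),  # identical to recall
        "specificity": ratio(tn, tn + fp),
        "precision": ratio(tp, tp + fp),
    }

# Hypothetical counts: 100 positive cases and 100 negative cases.
metrics = binary_metrics(tp=80, fp=30, tn=70, fn=20)
print(metrics)
```

Sensitivity and recall are the same quantity, whereas precision also depends on class balance, which matters for a cohort as imbalanced as the one described (5128 UC versus 396 ITB).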

Journal Article

Building a trustworthy AI differential diagnosis application for Crohn’s disease and intestinal tuberculosis

by Lin, Yucong; Lu, Keming; Tong, Yuanren in Accuracy, Analysis, Artificial intelligence

2023

Background: Differentiating between Crohn’s disease (CD) and intestinal tuberculosis (ITB) with endoscopy is challenging. We aim to perform more accurate endoscopic diagnosis between CD and ITB by building a trustworthy AI differential diagnosis application.

Methods: A total of 1271 electronic health record (EHR) patients who had undergone colonoscopies at Peking Union Medical College Hospital (PUMCH) and were clinically diagnosed with CD (n = 875) or ITB (n = 396) were used in this study. We build a workflow to make diagnoses with EHRs and mine differential diagnosis features; this involves fine-tuning the pretrained language models, distilling them into a light and efficient TextCNN model, interpreting the neural network and selecting differential attribution features, and then adopting manual feature checking and carrying out debias training.

Results: The accuracy of the debiased TextCNN on differential diagnosis between CD and ITB is 0.83 (CR F1: 0.87, ITB F1: 0.77), which is the best among the baselines. On the noisy validation set, its accuracy was 0.70 (CR F1: 0.87, ITB: 0.69), which was significantly higher than that of models without debiasing. We also find that the debiased model more easily mines the diagnostically significant features. The debiased TextCNN unearthed 39 diagnostic features in the form of phrases, 17 of which were key diagnostic features recognized by the guidelines.

Conclusion: We build a trustworthy AI differential diagnosis application for differentiating between CD and ITB, focusing on accuracy, interpretability and robustness. The classifiers perform well, and the features that had statistical significance were in agreement with clinical guidelines.

Journal Article

The Role of In-Group and Out-Group Facial Feedback in Implicit Rule Learning

by Peng, Wenjie; Lu, Keming; Ouyang, Muxin in Datasets, Experiments, facial expression

2023

Implicit learning refers to the fact that people acquire new knowledge (structures or rules) without conscious awareness. Previous studies have shown that implicit learning is affected by feedback. However, few studies have specifically investigated the role of social feedback in implicit learning. Here, we conducted two experiments to explore how in-group and out-group facial feedback affect implicit rule learning at different difficulty levels. In Experiment 1, the Chinese participants in each group could only see one type of facial feedback, i.e., either in-group (East Asian) or out-group (Western) faces, and learned the implicit rule through happy and sad facial expressions. The only difference between Experiment 2 and Experiment 1 was that the participants saw both the in-group and out-group faces before group assignment to strengthen the contrast between the two group identities. The results showed a significant interaction between difficulty level and group in task accuracy in Experiment 2 but not in Experiment 1. For the lowest difficulty level, the learning accuracy of the in-group facial feedback group was significantly higher than that of the out-group facial feedback group, whereas this did not happen at the two highest levels of difficulty. In conclusion, when the contrast of group identities was highlighted, out-group feedback reduced the accuracy of the least difficult task; by contrast, at the two highest difficulty levels there was no accuracy difference between the out-group and in-group feedback conditions. These findings have important implications for our understanding of implicit learning and for improving teaching achievement in the context of educational internationalization.

Journal Article

By Carrot or by Stick: The Influence of Encouraging and Discouraging Facial Feedback on Implicit Rule Learning

by Peng, Wenjie; Lu, Keming; Ouyang, Muxin in Attentional bias, discouraging, Employees

2024

Implicit learning refers to the process of unconsciously learning complex knowledge through feedback. Previous studies investigated the influences of different types of feedback (e.g., social and non-social feedback) on implicit learning. This study focused on the social information presented in the learning situation and tried to explore the effects of different social feedback on implicit rule learning. We assigned participants randomly into an encouraging facial feedback group (happy expression for correct answer, neutral but not negative expression for incorrect answer) and a discouraging facial feedback group (neutral but not happy expression for correct answer, negative expression for incorrect answer). The implicit learning task included four difficulty levels, and social feedback was presented in the learning phase but not the testing phase in two experiments. The only difference between the two experiments was that the sad face used as negative feedback in Experiment 1 was replaced with an angry face in Experiment 2 to enhance the ecological validity of the discouraging facial feedback group. These two experiments yielded consistent results: the performances in the encouraging facial feedback group were more accurate in both the learning and the testing phases at all difficulty levels. These findings indicated that the influence of encouraging social feedback for a better implicit learning achievement was stable and established a new groundwork for future research on incentive-based education, making it critical to investigate the impact of various forms of encouraging-based education on learning.

Journal Article

Detection of heterozygous mutation in hook microtubule-tethering protein 1 in three patients with decapitated and decaudated spermatozoa syndrome

by Lu, Keming; Li, Zheng; Xia, Weiliang in Deoxyribonucleic acid, Ethics, Fertilization

2018

Background: The mechanism of intramanchette transport is crucial to the transformation of the sperm tail and to nuclear condensation during spermiogenesis. Although a few dysfunctional proteins could result in an abnormal junction between the head and tail of the spermatozoon, little is known about the genetic cues in this process.

Objective: Based on patients with severe decapitated and decaudated spermatozoa (DDS) syndrome, the study aimed to validate whether a new mutation exists in their Hook microtubule-tethering protein 1 (HOOK1) genes and to follow their results of assisted reproduction treatment (ART).

Methods: Seven severe teratozoospermia patients with DDS (proportion >95%) and three relatives in one pedigree were recruited for sequencing of whole genomic DNA. The fertilisation rates (FRs) of these patients were followed. Morphological observation and interspecies intracytoplasmic sperm injection (ICSI) assays were applied.

Results: A novel missense mutation of A to G (p.Q286R) in patients with DDS (n=3/7) was found in the HOOK1 gene, which was inherited from the mother in one patient. This variant was absent in 160 fertile population-matched control individuals. Morphological observation showed that almost all the DDS broke into decaudated heads and headless tails at the implantation fossa or the basal plate. The clinical studies indicated that the mutation might cause reduced FRs in both ART (FR=18.07%) and interspecies ICSI (FR=16.98%).

Conclusions: An unreported mutation in the HOOK1 gene was identified, which might be responsible for DDS in some patients. Further studies need to uncover the molecular mechanism of spermiogenesis for genomic therapy.

Journal Article

Exploring Partial Knowledge Base Inference in Biomedical Entity Linking

by Lu, Keming; Yuan, Hongyi; Zheng, Yuan in Inference, Knowledge bases (artificial intelligence), Training

2023

Biomedical entity linking (EL) consists of named entity recognition (NER) and named entity disambiguation (NED). EL models are trained on corpora labeled by a predefined knowledge base (KB). However, it is a common scenario that only entities within a subset of the KB are of value to stakeholders. We name this scenario partial knowledge base inference: training an EL model with one KB and inferring on a part of it without further training. In this work, we give a detailed definition and evaluation procedures for this practically valuable but significantly understudied scenario and evaluate methods from three representative EL paradigms. We construct partial KB inference benchmarks and witness a catastrophic degradation in EL performance due to a dramatic drop in precision. Our findings reveal that these EL paradigms cannot correctly handle unlinkable mentions (NIL), so they are not robust to partial KB inference. We also propose two simple and effective redemption methods to combat the NIL issue with little computational overhead. Code is released at https://github.com/Yuanhy1997/PartialKB-EL.
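
To make the precision drop concrete, the sketch below evaluates a toy linker against a partial KB. It is not the paper's benchmark or either of its redemption methods: the entity IDs, the helper names `restrict_gold` and `precision_recall`, and the predictions are all hypothetical. Mentions whose gold entity falls outside the partial KB become NIL, and a linker that always outputs some in-KB entity is penalised for every one of them.

```python
NIL = "NIL"  # marker for an unlinkable mention

def restrict_gold(gold, partial_kb):
    """Relabel gold entities that are outside the partial KB as NIL."""
    return [g if g in partial_kb else NIL for g in gold]

def precision_recall(predictions, gold):
    """Mention-level precision and recall, ignoring NIL as a positive."""
    tp = sum(1 for p, g in zip(predictions, gold) if p != NIL and p == g)
    predicted = sum(1 for p in predictions if p != NIL)
    linkable = sum(1 for g in gold if g != NIL)
    precision = tp / predicted if predicted else 0.0
    recall = tp / linkable if linkable else 0.0
    return precision, recall

# Hypothetical full-KB gold labels and a partial KB covering half of them.
gold_full = ["C1", "C2", "C3", "C4"]
partial_kb = {"C1", "C2"}
gold_partial = restrict_gold(gold_full, partial_kb)

# A linker forced to choose inside the partial KB links every mention.
forced = ["C1", "C2", "C1", "C2"]
# A linker that can abstain returns NIL for out-of-KB mentions.
abstaining = ["C1", "C2", NIL, NIL]

print(precision_recall(forced, gold_partial))
print(precision_recall(abstaining, gold_partial))
```

In this toy case recall is unchanged while precision halves for the forced linker, which mirrors the failure mode the abstract attributes to poor NIL handling.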

Paper

Speculative Contrastive Decoding

by Lu, Keming; Huang, Fei; Yuan, Hongyi in Large language models

2024

Large language models (LLMs) exhibit exceptional performance in language tasks, yet their auto-regressive inference is limited due to high computational requirements and is sub-optimal due to the exposure bias. Inspired by speculative decoding and contrastive decoding, we introduce Speculative Contrastive Decoding (SCD), a straightforward yet powerful decoding approach that leverages predictions from smaller language models (LMs) to achieve both decoding acceleration and quality improvement. Extensive evaluations and analyses on four diverse language tasks demonstrate the effectiveness of SCD, showing that decoding efficiency and quality can compatibly benefit from one smaller LM.
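
The abstract does not give the SCD algorithm itself. As background, the sketch below shows only the contrastive half of the idea in its commonly described form: score each candidate token by the log-probability gap between a larger and a smaller model, restricted to tokens the larger model already finds plausible. The function name `contrastive_pick`, the `alpha` cutoff value, and the toy distributions are assumptions for illustration; the speculative (draft-and-verify) half is not shown.

```python
import math

def contrastive_pick(expert_probs, amateur_probs, alpha=0.1):
    """Pick the token maximising log p_expert - log p_amateur.

    Only tokens with p_expert >= alpha * max(p_expert) are considered,
    so that rare tokens with a large ratio cannot win.
    """
    cutoff = alpha * max(expert_probs.values())
    best_token, best_score = None, -math.inf
    for token, p_expert in expert_probs.items():
        if p_expert < cutoff:
            continue  # fails the plausibility constraint
        p_amateur = max(amateur_probs.get(token, 0.0), 1e-12)
        score = math.log(p_expert) - math.log(p_amateur)
        if score > best_score:
            best_token, best_score = token, score
    return best_token

# Toy next-token distributions (hypothetical values).
expert = {"the": 0.50, "Paris": 0.45, "xq": 0.05}
amateur = {"the": 0.60, "Paris": 0.10, "xq": 0.30}
print(contrastive_pick(expert, amateur))
```

Because the smaller model's distribution is needed for the contrast anyway, reusing that same model as the draft model for speculative decoding is what lets one small LM serve both purposes, as the abstract notes.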

Paper

Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment

by Lu, Keming; Bowen, Yu; Zhou, Chang in Datasets, Large language models, Role playing

2024

Considerable efforts have been invested in augmenting the role-playing proficiency of open-source large language models (LLMs) by emulating proprietary counterparts. Nevertheless, we posit that LLMs inherently harbor role-play capabilities, owing to the extensive knowledge of characters and potential dialogues ingrained in their vast training corpora. Thus, in this study, we introduce Ditto, a self-alignment method for role-play. Ditto capitalizes on character knowledge, encouraging an instruction-following LLM to simulate role-play dialogues as a variant of reading comprehension. This method creates a role-play training set comprising 4,000 characters, a tenfold increase over currently available datasets in the number of roles. Subsequently, we fine-tune the LLM using this self-generated dataset to augment its role-playing capabilities. When evaluated on our meticulously constructed, reproducible role-play benchmark and the role-play subset of MT-Bench, Ditto, at various parameter scales, consistently maintains role identity and provides accurate role-specific knowledge in multi-turn role-play conversations. Notably, it outperforms all open-source role-play baselines, showcasing performance levels comparable to advanced proprietary chatbots. Furthermore, we present the first comprehensive cross-supervision alignment experiment in the role-play domain, revealing that the intrinsic capabilities of LLMs confine the knowledge within role-play. Meanwhile, the role-play styles can be easily acquired with the guidance of smaller models. We open-source related resources at https://github.com/OFA-Sys/Ditto.

Paper

MARGE: Improving Math Reasoning for LLMs with Guided Exploration

by Lu, Keming; Lin, Junyang; Chen, Jianyu in Annotations, Effectiveness, Large language models

2025

Large Language Models (LLMs) exhibit strong potential in mathematical reasoning, yet their effectiveness is often limited by a shortage of high-quality queries. This limitation necessitates scaling up computational responses through self-generated data, yet current methods struggle due to spurious correlated data caused by ineffective exploration across all reasoning stages. To address this challenge, we introduce MARGE: Improving Math Reasoning with Guided Exploration, a novel method that enhances mathematical reasoning through hit-guided exploration. MARGE systematically explores intermediate reasoning states derived from self-generated solutions, enabling adequate exploration and improved credit assignment throughout the reasoning process. Through extensive experiments across multiple backbone models and benchmarks, we demonstrate that MARGE significantly improves reasoning capabilities without requiring external annotations or training additional value models. Notably, MARGE improves both single-shot accuracy and exploration diversity, mitigating a common trade-off in alignment methods. These results demonstrate MARGE's effectiveness in enhancing mathematical reasoning capabilities and unlocking the potential of scaling self-generated training data. Our code and models are available at https://github.com/georgao35/MARGE.

Paper
