Search Results Heading

MBRLSearchResults

mbrl.module.common.modules.added.book.to.shelf
Title added to your shelf!
View what I already have on My Shelf.
Oops! Something went wrong.
Oops! Something went wrong.
While trying to add the title to your shelf something went wrong :( Kindly try again later!
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
    Done
    Filters
    Reset
  • Discipline
      Discipline
      Clear All
      Discipline
  • Is Peer Reviewed
      Is Peer Reviewed
      Clear All
      Is Peer Reviewed
  • Item Type
      Item Type
      Clear All
      Item Type
  • Subject
      Subject
      Clear All
      Subject
  • Year
      Year
      Clear All
      From:
      -
      To:
  • More Filters
8 result(s) for "Chenwei Lou"
Sort by:
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via Reinforcement Learning
Large Language Models (LLMs) have demonstrated remarkable capabilities but often face challenges with tasks requiring sophisticated reasoning. While Chain-of-Thought (CoT) prompting significantly enhances reasoning, it indiscriminately generates lengthy reasoning steps for all queries, leading to substantial computational costs and inefficiency, especially for simpler inputs. To address this critical issue, we introduce AdaCoT (Adaptive Chain-of-Thought), a novel framework enabling LLMs to adaptively decide when to invoke CoT. AdaCoT framed adaptive reasoning as a Pareto optimization problem that seeks to balance model performance with the costs associated with CoT invocation (both frequency and computational overhead). We propose a reinforcement learning (RL) based method, specifically utilizing Proximal Policy Optimization (PPO), to dynamically control the CoT triggering decision boundary by adjusting penalty coefficients, thereby allowing the model to determine CoT necessity based on implicit query complexity. A key technical contribution is Selective Loss Masking (SLM), designed to counteract decision boundary collapse during multi-stage RL training, ensuring robust and stable adaptive triggering. Experimental results demonstrate that AdaCoT successfully navigates the Pareto frontier, achieving substantial reductions in CoT usage for queries not requiring elaborate reasoning. For instance, on our production traffic testset, AdaCoT reduced CoT triggering rates to as low as 3.18\\% and decreased average response tokens by 69.06%, while maintaining high performance on complex tasks.
StructVRM: Aligning Multimodal Reasoning with Structured and Verifiable Reward Models
Existing Vision-Language Models often struggle with complex, multi-question reasoning tasks where partial correctness is crucial for effective learning. Traditional reward mechanisms, which provide a single binary score for an entire response, are too coarse to guide models through intricate problems with multiple sub-parts. To address this, we introduce StructVRM, a method that aligns multimodal reasoning with Structured and Verifiable Reward Models. At its core is a model-based verifier trained to provide fine-grained, sub-question-level feedback, assessing semantic and mathematical equivalence rather than relying on rigid string matching. This allows for nuanced, partial credit scoring in previously intractable problem formats. Extensive experiments demonstrate the effectiveness of StructVRM. Our trained model, Seed-StructVRM, achieves state-of-the-art performance on six out of twelve public multimodal benchmarks and our newly curated, high-difficulty STEM-Bench. The success of StructVRM validates that training with structured, verifiable rewards is a highly effective approach for advancing the capabilities of multimodal models in complex, real-world reasoning domains.
\Tom\ pet robot applied to urban autism
With the fast development of network information technology, more and more people are immersed in the virtual community environment brought by the network, ignoring the social interaction in real life. The consequent urban autism problem has become more and more serious. Promoting offline communication between people \" and \"eliminating loneliness through emotional communication between pet robots and breeders\" to solve this problem, and has developed a design called \"Tom\". \"Tom\" is a smart pet robot with a pet robot-based social mechanism Called \"Tom-Talker\". The main contribution of this paper is to propose a social mechanism called \"Tom-Talker\" that encourages users to socialize offline. And \"Tom-Talker\" also has a corresponding reward mechanism and a friend recommendation algorithm. It also proposes a pet robot named \"Tom\" with an emotional interaction algorithm to recognize users' emotions, simulate animal emotions and communicate emotionally with use s. This paper designs experiments and analyzes the results. The results show that our pet robots have a good effect on solving urban autism problems.
A retrospective cohort study of gastrografin in the management of adhesive small bowel obstruction during pregnancy
Adhesive small bowel obstruction (ASBO) during pregnancy is a critical condition that remains understudied. This study aimed to address management challenges of ASBO in pregnancy by evaluating the dual diagnostic and therapeutic role of gastrografin. We retrospectively analyzed medical records of pregnant patients with ASBO admitted between September 2018 and September 2023. Patients were categorized into conventional treatment or gastrografin groups based on received therapy. The groups showed no significant differences in demographics, baseline characteristics, or adverse events. However, the gastrografin group demonstrated superior outcomes, including a higher conservative treatment success rate, reduced need for surgery, shorter hospital stays, lower medical costs, and enhanced diagnostic utility. Gastrografin appears relatively safe in pregnancy, with its effective therapeutic action and precise diagnostic capability proving instrumental in managing ASBO.
Prevalence and Prognostic Significance of HPV in Laryngeal Squamous Cell Carcinoma in Northeast China
Background/Aims: Human papillomavirus (HPV) is an etiological risk factor for a subset of head and neck squamous cell carcinomas. HPV has been proven to be a powerful prognostic biomarker for oropharyngeal cancer, but its role in the larynx has not been explored in depth. Here, we sought to evaluate the prevalence and genotype distribution of HPV in patients with laryngeal squamous cell carcinoma (LSCC) in northeast China. Methods: HPV DNA in specimens from 211 patients diagnosed with LSCC was analyzed by the polymerase chain reaction and in situ hybridization, and p16 overexpression was evaluated by immunohistochemistry. p16 expression was scored positive if strong and diffuse nuclear and cytoplasmic staining was present in > 75% of tumor cells. Results: In this study, infection with HPV and p16 expression were not absolutely consistent. Among all patients, 132 (62.6%) were positive for HPV DNA (HPV+), while 23 (10.9%) were inconsistent for HPV and p16. Multivariate analysis indicated that HPV, but not p16, is an independent prognostic factor for overall survival in LSCC. Overall survival was significantly improved in HPV+ LSCC patients compared with the HPV-negative group (hazard ratio, 0.395; 95% confidence interval, 0.185–0.843; p = 0.016). Among the 132 HPV+ patients, 28 (21.2%) were HPV-16 single infection. Conclusion: This study indicates that HPV DNA is a more reliable surrogate marker than p16 for the prediction of survival in patients with LSCC.
The Role of Fever Clinics in the Strategic Triage of Suspected Cases of Imported COVID-19
Novel coronavirus pneumonia (COVID-19) is an acute respiratory infectious disease, which has the characteristic of human-to-human transmission and is extremely contagious. Correctly standardizing the process of early screening of infection or suspected cases in the fever clinic has become a key part of the fight against the pandemic. A retrospective analysis of patients in the fever clinic of Shenyang Medical College Affiliated Central Hospital from January 23 to March 1, 2020, was conducted in the present study. It was found that 16 suspected cases of COVID-19 in the fever clinic were diagnosed with respiratory infections, accounting for 0.59%. In case of a negative result in the second nucleic acid test, strategic triage and typing might be more conducive for the following nucleic acid tests for suspected cases in order to prevent the spread of the epidemic caused by missed diagnosis.
Quantum plasmonic hot-electron injection in lateral WSe2/MoSe2 heterostructures
Lateral two-dimensional (2D) transitional metal dichalcogenide (TMD) heterostructures have recently attracted a wide attention as promising materials for optoelectronic nanodevices. Due to the nanoscale width of lateral heterojunctions, the study of their optical properties is challenging and requires using subwavelength optical characterization techniques. We investigated the photoresponse of a lateral 2D WSe2/MoSe2 heterostructure using tip-enhanced photoluminescence (TEPL) with nanoscale spatial resolution and with picoscale tip-sample distance dependence. We demonstrate the observation of quantum plasmonic effects in 2D heterostructures on a non-metallic substrate, and we report the nano-optical measurements of the lateral 2D TMD heterojunction width of ~ 150 nm and the charge tunneling distance of ~ 20 pm. Controlling the plasmonic tip location allows for both nano-optical imaging and plasmon-induced hot electron injection into the heterostructure. By adjusting the tip-sample distance, we demonstrated the controllability of the hot-electron injection via the competition of two quantum plasmonic photoluminescence (PL) enhancement and quenching mechanisms. The directional charge transport in the depletion region leads to the increased hot electron injection, enhancing the MoSe2 PL signal. The properties of the directional hot-electron injection in the quantum plasmonic regime make the lateral 2D MoSe2/WSe2 heterostructures promising for quantum nanodevices with tunable photoresponse.
Nano-optical imaging of monolayer MoSe2 using tip-enhanced photoluminescence
Band gap tuning in two-dimensional transitional metal dichalcogenides (TMDs) is crucial in fabricating new optoelectronic devices. High resolution photoluminescence (PL) microscopy is needed for accurate band gap characterization. We performed tip-enhanced photoluminescence (TEPL) measurements of monolayer MoSe2 with nanoscale spatial resolution, providing an improved characterization of the band gap correlated with the topography compared with the conventional far field spectroscopy. We also observed PL shifts at the edges and investigated the spatial dependence of the TEPL enhancement factors.