Catalogue Search | MBRL

Multiple social platforms reveal actionable signals for software vulnerability awareness: A study of GitHub, Twitter and Reddit

by Volkova, Svitlana , Sathanur, Arun , Shrestha, Prasha in Community structure , Computer and Information Sciences , Computer programs

2020

The awareness about software vulnerabilities is crucial to ensure effective cybersecurity practices, the development of high-quality software, and, ultimately, national security. This awareness can be better understood by studying the spread, structure and evolution of software vulnerability discussions across online communities. This work is the first to evaluate and contrast how discussions about software vulnerabilities spread on three social platforms-Twitter, GitHub, and Reddit. Moreover, we measure how user-level e.g., bot or not, and content-level characteristics e.g., vulnerability severity, post subjectivity, targeted operating systems as well as social network topology influence the rate of vulnerability discussion spread. To lay the groundwork, we present a novel fundamental framework for measuring information spread in multiple social platforms that identifies spread mechanisms and observables, units of information, and groups of measurements. We then contrast topologies for three social networks and analyze the effect of the network structure on the way discussions about vulnerabilities spread. We measure the scale and speed of the discussion spread to understand how far and how wide they go, how many users participate, and the duration of their spread. To demonstrate the awareness of more impactful vulnerabilities, a subset of our analysis focuses on vulnerabilities targeted during recent major cyber-attacks and those exploited by advanced persistent threat groups. One of our major findings is that most discussions start on GitHub not only before Twitter and Reddit, but even before a vulnerability is officially published. The severity of a vulnerability contributes to how much it spreads, especially on Twitter. Highly severe vulnerabilities have significantly deeper, broader and more viral discussion threads. When analyzing vulnerabilities in software products we found that different flavors of Linux received the highest discussion volume. We also observe that Twitter discussions started by humans have larger size, breadth, depth, adoption rate, lifetime, and structural virality compared to those started by bots. On Reddit, discussion threads of positive posts are larger, wider, and deeper than negative or neutral posts. We also found that all three networks have high modularity that encourages spread. However, the spread on GitHub is different from other networks, because GitHub is more dense, has stronger community structure and assortativity that enhances information diffusion. We anticipate the results of our analysis to not only increase the understanding of software vulnerability awareness but also inform the existing and new analytical frameworks for simulating information spread e.g., disinformation across multiple social environments online.

Journal Article

Share this book

Add to My Shelf

Evaluating uncertainty-based active learning for accelerating the generalization of molecular property prediction

by Panapitiya, Gihan , Coda, Elizabeth D. , Saldanha, Emily G. in Active learning , Chemistry , Chemistry and Materials Science

2023

Deep learning models have proven to be a powerful tool for the prediction of molecular properties for applications including drug design and the development of energy storage materials. However, in order to learn accurate and robust structure–property mappings, these models require large amounts of data which can be a challenge to collect given the time and resource-intensive nature of experimental material characterization efforts. Additionally, such models fail to generalize to new types of molecular structures that were not included in the model training data. The acceleration of material development through uncertainty-guided experimental design has the promise to significantly reduce the data requirements and enable faster generalization to new types of materials. To evaluate the potential of such approaches for electrolyte design applications, we perform comprehensive evaluation of existing uncertainty quantification methods on the prediction of two relevant molecular properties - aqueous solubility and redox potential. We develop novel evaluation methods to probe the utility of the uncertainty estimates for both in-domain and out-of-domain data sets. Finally, we leverage selected uncertainty estimation methods for active learning to evaluate their capacity to support experimental design.

Journal Article

Share this book

Add to My Shelf

The Effects of Maternal and Paternal Helicopter Parenting on the Self-determination and Well-being of Emerging Adults

by Schiffrin, Holly H , Power, Victoria , Yost, Jennaveve C in Adults , Anxiety , Autonomy

2019

ObjectivesWe examined gender differences in helicopter parenting and emerging adults’ well-being through the basic psychological needs of autonomy, competence, and relatedness. Based on gender congruence theory, we hypothesized that daughters’ well-being would be more adversely impacted by their mothers’ helicopter parenting compared to fathers’, while the opposite pattern would emerge for sons.MethodParticipants were 446 college students between 18–25 years old who completed an online survey. The majority of participants were white, female, underclassman from middle to upper-middle class families.ResultsParticipants reported that their mothers engaged in more helicopter parenting than their fathers. Male and female participants did not differ in the amount of helicopter parenting they experienced, so we tested a model combining these sub-samples. Two minor differences were identified: Daughters reported maternal helicopter parenting was more strongly associated with decreased autonomy and sons reported paternal helicopter parenting was more strongly associated with a decreased relatedness. Thus, a partial equivalence model was tested with only these two paths free to vary between groups. Maternal helicopter parenting was indirectly associated with their children’s reduced well-being on all three measures (i.e., anxiety, depression, and satisfaction with life) through a reduced sense of autonomy and competence. Paternal helicopter parenting was only indirectly associated with their offspring’s well-being through autonomy.ConclusionsResults supported prior research suggesting helicopter parenting adversely affects emerging adults’ well-being through its negative impact on the basic psychological needs of self-determination. There was limited support for gender differences in the impact of helicopter parenting on emerging adults.

Journal Article

Share this book

Add to My Shelf

Studying information recurrence, gatekeeping, and the role of communities during internet outages in Venezuela

by Saldanha, Emily , Thomas, Pamela Bilo , Volkova, Svitlana in 639/705 , 639/705/1042 , 639/705/117

2021

Many authoritarian regimes have taken to censoring internet access in order to stop the spread of misinformation, restrict citizens from discussing certain topics, and prevent mobilization, among other reasons. There are several theories about the effectiveness of censorship. Some suggest that censorship will effectively limit the flow of information, whereas others predict that a backlash will form, resulting in ultimately more discussion about the topic. In this work, we analyze the role of communities and gatekeepers during multiple internet outages in Venezuela in January 2019. First, we measure how critical information (e.g., entities and hashtags) spreads during outages focusing on information recurrence and burstiness within and across language and location communities. We discover that information bursts tend to cross both language and location community boundaries rather than being limited to a single community during several outages. Then we identify users who play central roles and propose a novel method to detect gatekeepers—users who prevent critical information from spreading across communities during outages. We show that bilingual and English-speaking users play more central roles compared to Spanish-speaking users, but users inside and outside Venezuela have similar distribution of centrality. Finally, we measure the differences in social network structure before and after each outage event and discuss its effect on how information spreads. We find that with each outage event social connections tend to get less connected with higher mean shortest path, indicating that the effect of censorship makes it harder for information to spread.

Journal Article

Share this book

Add to My Shelf

Examining the Relationship between Helicopter Parenting and Emerging Adults’ Mindsets Using the Consolidated Helicopter Parenting Scale

by Schiffrin, Holly H , Power, Victoria , Yost, Jennaveve C in Academic achievement , Adults , Aviation

2019

ObjectivesThe purpose of this study was to develop a consolidated helicopter parenting scale (CHPS) from five existing measures of helicopter parenting and utilize the new measure to examine the relationship between helicopter parenting and intelligence mindset.MethodsParticipants were 275 emerging adults between 18–25 years of age who completed an online survey. First, we conducted an Exploratory Factor Analysis of five helicopter parenting measures to develop a scale that reliably measured participants’ reports of helicopter parenting by both their mothers and fathers. Then, we utilized the new measure to examine whether helicopter parenting mediated the relationship between emerging adults’ report of their parents’ failure mindsets and their own intelligence mindsets.ResultsThe 10 items retained in the factor analysis primarily captured emerging adults’ perception that their parents’ involvement was inappropriate rather than delineating objective behaviors in which their parents engaged. Both mothers and fathers were more likely to engage in helicopter parenting when emerging adults reported their parents had a failure mindset. However, only fathers’ helicopter parenting mediated the relationship between parents’ failure mindsets and their children’s intelligence mindsets.ConclusionsWhen parents were viewed as having a failure-is-debilitating mindset, emerging adults also reported that fathers were more likely to participate in helicopter parenting behaviors, which was associated with fixed mindsets in emerging adults. People with fixed mindsets have been found to have decreased motivation (e.g., avoiding challenges and less perseverance) and academic achievement in prior research.

Journal Article

Share this book

Add to My Shelf

SOMAS: a platform for data-driven material discovery in redox flow battery development

by Murugesan, Vijayakumar , Andersen, Amity , Saldanha, Emily G. in 639/301/299/891 , 639/4077/4079/891 , Accuracy

2022

Aqueous organic redox flow batteries offer an environmentally benign, tunable, and safe route to large-scale energy storage. The energy density is one of the key performance parameters of organic redox flow batteries, which critically depends on the solubility of the redox-active molecule in water. Prediction of aqueous solubility remains a challenge in chemistry. Recently, machine learning models have been developed for molecular properties prediction in chemistry and material science. The fidelity of a machine learning model critically depends on the diversity, accuracy, and abundancy of the training datasets. We build a comprehensive open access organic molecular database “Solubility of Organic Molecules in Aqueous Solution” (SOMAS) containing about 12,000 molecules that covers wider chemical and solubility regimes suitable for aqueous organic redox flow battery development efforts. In addition to experimental solubility, we also provide eight distinctive quantum descriptors including optimized geometry derived from high-throughput density functional theory calculations along with six molecular descriptors for each molecule. SOMAS builds a critical foundation for future efforts in artificial intelligence-based solubility prediction models. Measurement(s) quantum descriptors • molecular descriptors • physical descriptors Technology Type(s) Computation • experiment

Journal Article

Share this book

Add to My Shelf

Correction to: Examining the Relationship between Helicopter Parenting and Emerging Adults’ Mindsets Using the Consolidated Helicopter Parenting Scale

by Power, Victoria , Yost, Jennaveve C. , Sendrick, Erynn in Author Correction , Behavioral Science and Psychology , Child and School Psychology

2021

Journal Article

Share this book

Add to My Shelf

Explaining and predicting human behavior and social dynamics in simulated virtual worlds: reproducibility, generalizability, and robustness of causal discovery methods

by Volkova, Svitlana , Saldanha, Emily , Aksoy, Sinan in Artificial intelligence , Behavior , Causal models

2023

Ground Truth program was designed to evaluate social science modeling approaches using simulation test beds with ground truth intentionally and systematically embedded to understand and model complex Human Domain systems and their dynamics Lazer et al. (Science 369:1060–1062, 2020). Our multidisciplinary team of data scientists, statisticians, experts in Artificial Intelligence (AI) and visual analytics had a unique role on the program to investigate accuracy, reproducibility, generalizability, and robustness of the state-of-the-art (SOTA) causal structure learning approaches applied to fully observed and sampled simulated data across virtual worlds. In addition, we analyzed the feasibility of using machine learning models to predict future social behavior with and without causal knowledge explicitly embedded. In this paper, we first present our causal modeling approach to discover the causal structure of four virtual worlds produced by the simulation teams—Urban Life, Financial Governance, Disaster and Geopolitical Conflict. Our approach adapts the state-of-the-art causal discovery (including ensemble models), machine learning, data analytics, and visualization techniques to allow a human-machine team to reverse-engineer the true causal relations from sampled and fully observed data. We next present our reproducibility analysis of two research methods team’s performance using a range of causal discovery models applied to both sampled and fully observed data, and analyze their effectiveness and limitations. We further investigate the generalizability and robustness to sampling of the SOTA causal discovery approaches on additional simulated datasets with known ground truth. Our results reveal the limitations of existing causal modeling approaches when applied to large-scale, noisy, high-dimensional data with unobserved variables and unknown relationships between them. We show that the SOTA causal models explored in our experiments are not designed to take advantage from vasts amounts of data and have difficulty recovering ground truth when latent confounders are present; they do not generalize well across simulation scenarios and are not robust to sampling; they are vulnerable to data and modeling assumptions, and therefore, the results are hard to reproduce. Finally, when we outline lessons learned and provide recommendations to improve models for causal discovery and prediction of human social behavior from observational data, we highlight the importance of learning data to knowledge representations or transformations to improve causal discovery and describe the benefit of causal feature selection for predictive and prescriptive modeling.

Journal Article

Share this book

Add to My Shelf

Outlier-Based Domain of Applicability Identification for Materials Property Prediction Models

by Saldanha, Emily , Panapitiya, Gihan in Accuracy , Confidence intervals , Data analysis

2023

Machine learning models have been widely applied for material property prediction. However, practical application of these models can be hindered by a lack of information about how well they will perform on previously unseen types of materials. Because machine learning model predictions depend on the quality of the available training data, different domains of the material feature space are predicted with different accuracy levels by such models. The ability to identify such domains enables the ability to find the confidence level of each prediction, to determine when and how the model should be employed depending on the prediction accuracy requirements of different tasks, and to improve the model for domains with high errors. In this work, we propose a method to find domains of applicability using a large feature space and also introduce analysis techniques to gain more insight into the detected domains and subdomains.

Paper

Share this book

Add to My Shelf

AutoLabs: Cognitive Multi-Agent Systems with Self-Correction for Autonomous Chemical Experimentation

by Saldanha, Emily , Job, Heather , Panapitiya, Gihan in Ablation , Architecture , Error reduction

2025

The automation of chemical research through self-driving laboratories (SDLs) promises to accelerate scientific discovery, yet the reliability and granular performance of the underlying AI agents remain critical, under-examined challenges. In this work, we introduce AutoLabs, a self-correcting, multi-agent architecture designed to autonomously translate natural-language instructions into executable protocols for a high-throughput liquid handler. The system engages users in dialogue, decomposes experimental goals into discrete tasks for specialized agents, performs tool-assisted stoichiometric calculations, and iteratively self-corrects its output before generating a hardware-ready file. We present a comprehensive evaluation framework featuring five benchmark experiments of increasing complexity, from simple sample preparation to multi-plate timed syntheses. Through a systematic ablation study of 20 agent configurations, we assess the impact of reasoning capacity, architectural design (single- vs. multi-agent), tool use, and self-correction mechanisms. Our results demonstrate that agent reasoning capacity is the most critical factor for success, reducing quantitative errors in chemical amounts (nRMSE) by over 85% in complex tasks. When combined with a multi-agent architecture and iterative self-correction, AutoLabs achieves near-expert procedural accuracy (F1-score > 0.89) on challenging multi-step syntheses. These findings establish a clear blueprint for developing robust and trustworthy AI partners for autonomous laboratories, highlighting the synergistic effects of modular design, advanced reasoning, and self-correction to ensure both performance and reliability in high-stakes scientific applications. Code: https://github.com/pnnl/autolabs

Paper

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter