Catalogue Search | MBRL

Retraction Note: Utilizing hybrid computing models for network monitoring and security analysis through optical network modeling and data analytics

by Rastogi, Ajay , Reddy, Sudhakar , Gupta, Mukur in Characterization and Evaluation of Materials , Computer Communication Networks , Electrical Engineering

2024

Journal Article

Share this book

Add to My Shelf

RETRACTED ARTICLE: Utilizing hybrid computing models for network monitoring and security analysis through optical network modeling and data analytics

by Rastogi, Ajay , Reddy, Sudhakar , Gupta, Mukur in Characterization and Evaluation of Materials , Computer Communication Networks , Electrical Engineering

2024

Safe and reliable data transport over optical networks is essential for a high-speed Internet. Optical fibres are the backbone of the Internet, allowing billions of people around the world to connect and exchange data. However, these cables can be compromised by both deliberate attacks and accidental damage. The disruption caused by these abnormalities may result in significant monetary and data losses, undermine the security of optical networks by enabling unauthorised parties to view the sent data, or progressively lower network performance. Based on an examination of data from optical networks, this research proposes an original strategy for hybrid machine learning-based network monitoring and security improvement. Here, an anomaly detection method called convolutional principal component network is utilised to keep an eye on the optical infrastructure. Then, we apply hybrid cloud vector Bayesian graph networks to fortify the system. Anomaly detection rate, accuracy, precision rate, and quality of service are all experimentally evaluated across a variety of security analysis datasets. We validate our research using data from operational networks and simulated data created in a realistic-looking simulator. The findings verify the efficacy of our approach for optimising routing decisions in real time and designing networks for the future.

Journal Article

Share this book

Add to My Shelf

Curriculum generation using Autoencoder based continuous optimization

by Sarkar, Dipankar , Gupta, Mukur in Ablation , Algorithms , Coders

2022

Research in Curriculum Learning has shown better performance on the task by optimizing the sequence of the training data. Recent works have focused on using complex reinforcement learning techniques to find the optimal data ordering strategy to maximize learning for a given network. In this paper, we present a simple yet efficient technique based on continuous optimization trained with auto-encoding procedure. We call this new approach Training Sequence Optimization (TSO). With a usual encoder-decoder setup we try to learn the latent space continuous representation of the training strategy and a predictor network is used on the continuous representation to predict the accuracy of the strategy on the fixed network architecture. The performance predictor and encoder enable us to perform gradient-based optimization by gradually moving towards the latent space representation of training data ordering with potentially better accuracy. We show an empirical gain of 2AP with our generated optimal curriculum strategy over the random strategy using the CIFAR-100 and CIFAR-10 datasets and have better boosts than the existing state-of-the-art CL algorithms.

Paper

Share this book

Add to My Shelf

CodeSCM: Causal Analysis for Multi-Modal Code Generation

by Suman, Jana , Bhatt, Noopur , Gupta, Mukur in Large language models , Natural language processing , Semantics

2025

In this paper, we propose CodeSCM, a Structural Causal Model (SCM) for analyzing multi-modal code generation using large language models (LLMs). By applying interventions to CodeSCM, we measure the causal effects of different prompt modalities, such as natural language, code, and input-output examples, on the model. CodeSCM introduces latent mediator variables to separate the code and natural language semantics of a multi-modal code generation prompt. Using the principles of Causal Mediation Analysis on these mediators we quantify direct effects representing the model's spurious leanings. We find that, in addition to natural language instructions, input-output examples significantly influence code generation.

Paper

Share this book

Add to My Shelf

Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning

by Suman, Jana , Srivastava, Prashast , Hajizadeh, Samira in Context , Evaluation , Large language models

2026

Large language models (LLMs) are increasingly deployed for understanding large codebases, but whether they understand operational semantics of long code context or rely on pattern matching shortcuts remains unclear. We distinguish between lexical recall (retrieving code verbatim) and semantic recall (understanding operational semantics). Evaluating 10 state-of-the-art LLMs, we find that while frontier models achieve near-perfect, position-independent lexical recall, semantic recall degrades severely when code is centrally positioned in long contexts. We introduce semantic recall sensitivity to measure whether tasks require understanding of code's operational semantics vs. permit pattern matching shortcuts. Through a novel counterfactual measurement method, we show that models rely heavily on pattern matching shortcuts to solve existing code understanding benchmarks. We propose a new task SemTrace, which achieves high semantic recall sensitivity through unpredictable operations; LLMs' accuracy exhibits severe positional effects, with median accuracy drops of 92.73% versus CRUXEval's 53.36% as the relevant code snippet approaches the middle of the input code context. Our findings suggest current evaluations substantially underestimate semantic recall failures in long context code understanding.

Paper

Share this book

Add to My Shelf

TrajSyn: Privacy-Preserving Dataset Distillation from Federated Model Trajectories for Server-Side Adversarial Training

by Rahman, Saifur , Karmakar, Chandan , Gupta, Niharika in Datasets , Deep learning , Federated learning

2025

Deep learning models deployed on edge devices are increasingly used in safety-critical applications. However, their vulnerability to adversarial perturbations poses significant risks, especially in Federated Learning (FL) settings where identical models are distributed across thousands of clients. While adversarial training is a strong defense, it is difficult to apply in FL due to strict client-data privacy constraints and the limited compute available on edge devices. In this work, we introduce TrajSyn, a privacy-preserving framework that enables effective server-side adversarial training by synthesizing a proxy dataset from the trajectories of client model updates, without accessing raw client data. We show that TrajSyn consistently improves adversarial robustness on image classification benchmarks with no extra compute burden on the client device.

Paper

Share this book

Add to My Shelf

Intent Detection and Entity Extraction from BioMedical Literature

by Mullick, Ankan , Goyal, Pawan , Gupta, Mukur in Large language models , Natural language processing

2024

Biomedical queries have become increasingly prevalent in web searches, reflecting the growing interest in accessing biomedical literature. Despite recent research on large-language models (LLMs) motivated by endeavours to attain generalized intelligence, their efficacy in replacing task and domain-specific natural language understanding approaches remains questionable. In this paper, we address this question by conducting a comprehensive empirical evaluation of intent detection and named entity recognition (NER) tasks from biomedical text. We show that Supervised Fine Tuned approaches are still relevant and more effective than general-purpose LLMs. Biomedical transformer models such as PubMedBERT can surpass ChatGPT on NER task with only 5 supervised examples.

Paper

Share this book

Add to My Shelf

Generating from Discrete Distributions Using Diffusions: Insights from Random Constraint Satisfaction Problems

by Bhatt, Alankrita , Germain Kolossov , Montanari, Andrea in Benchmarks , Constraints

2026

Generating data from discrete distributions is important for a number of application domains including text, tabular data, and genomic data. Several groups have recently used random \\(k\\)-satisfiability (\\(k\\)-SAT) as a synthetic benchmark for new generative techniques. In this paper, we show that fundamental insights from the theory of random constraint satisfaction problems have observable implications (sometime contradicting intuition) on the behavior of generative techniques on such benchmarks. More precisely, we study the problem of generating a uniformly random solution of a given (random) \\(k\\)-SAT or \\(k\\)-XORSAT formula. Among other findings, we observe that: \\((i)\\)~Continuous diffusions outperform masked discrete diffusions; \\((ii)\\)~Learned diffusions can match the theoretical `ideal' accuracy; \\((iii)\\)~Smart ordering of the variables can significantly improve accuracy, although not following popular heuristics.

Paper

Share this book

Add to My Shelf

Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning

by Suman, Jana , Srivastava, Prashast , Hajizadeh, Samira in Context , Evaluation , Large language models

2026

Large language models (LLMs) are increasingly deployed for understanding large codebases, but whether they understand operational semantics of long code context or rely on pattern matching shortcuts remains unclear. We distinguish between lexical recall (retrieving code verbatim) and semantic recall (understanding operational semantics). Evaluating 10 state-of-the-art LLMs, we find that while frontier models achieve near-perfect, position-independent lexical recall, semantic recall degrades severely when code is centrally positioned in long contexts. We introduce semantic recall sensitivity to measure whether tasks require understanding of code's operational semantics vs. permit pattern matching shortcuts. Through a novel counterfactual measurement method, we show that models rely heavily on pattern matching shortcuts to solve existing code understanding benchmarks. We propose a new task SemTrace, which achieves high semantic recall sensitivity through unpredictable operations; LLMs' accuracy exhibits severe positional effects, with median accuracy drops of 92.73% versus CRUXEval's 53.36% as the relevant code snippet approaches the middle of the input code context. Our findings suggest current evaluations substantially underestimate semantic recall failures in long context code understanding.

Paper

Share this book

Add to My Shelf

XOXO: Stealthy Cross-Origin Context Poisoning Attacks against AI Coding Assistants

by Suman, Jana , Kim, Janie , Srivastava, Prashast in Coding , Poisoning , Poisons

2026

AI coding assistants are widely used for tasks like code generation. These tools now require large and complex contexts, automatically sourced from various origins\\(x2014\\)across files, projects, and contributors\\(x2014\\)forming part of the prompt fed to underlying LLMs. This automatic context-gathering introduces new vulnerabilities, allowing attackers to subtly poison input to compromise the assistant's outputs, potentially generating vulnerable code or introducing critical errors. We propose a novel attack, Cross-Origin Context Poisoning (XOXO), that is challenging to detect as it relies on adversarial code modifications that are semantically equivalent. Traditional program analysis techniques struggle to identify these perturbations since the semantics of the code remains correct, making it appear legitimate. This allows attackers to manipulate coding assistants into producing incorrect outputs, while shifting the blame to the victim developer. We introduce a novel, task-agnostic, black-box attack algorithm GCGS that systematically searches the transformation space using a Cayley Graph, achieving a 75.72% attack success rate on average across five tasks and eleven models, including GPT 4.1 and Claude 3.5 Sonnet v2 used by popular AI coding assistants. Furthermore, defenses like adversarial fine-tuning are ineffective against our attack, underscoring the need for new security measures in LLM-powered coding tools.

Paper

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter