Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Item TypeItem Type
-
SubjectSubject
-
YearFrom:-To:
-
More FiltersMore FiltersSourceLanguage
Done
Filters
Reset
15
result(s) for
"Gupta, Mukur"
Sort by:
Retraction Note: Utilizing hybrid computing models for network monitoring and security analysis through optical network modeling and data analytics
by
Rastogi, Ajay
,
Reddy, Sudhakar
,
Gupta, Mukur
in
Characterization and Evaluation of Materials
,
Computer Communication Networks
,
Electrical Engineering
2024
Journal Article
RETRACTED ARTICLE: Utilizing hybrid computing models for network monitoring and security analysis through optical network modeling and data analytics
by
Rastogi, Ajay
,
Reddy, Sudhakar
,
Gupta, Mukur
in
Characterization and Evaluation of Materials
,
Computer Communication Networks
,
Electrical Engineering
2024
Safe and reliable data transport over optical networks is essential for a high-speed Internet. Optical fibres are the backbone of the Internet, allowing billions of people around the world to connect and exchange data. However, these cables can be compromised by both deliberate attacks and accidental damage. The disruption caused by these abnormalities may result in significant monetary and data losses, undermine the security of optical networks by enabling unauthorised parties to view the sent data, or progressively lower network performance. Based on an examination of data from optical networks, this research proposes an original strategy for hybrid machine learning-based network monitoring and security improvement. Here, an anomaly detection method called convolutional principal component network is utilised to keep an eye on the optical infrastructure. Then, we apply hybrid cloud vector Bayesian graph networks to fortify the system. Anomaly detection rate, accuracy, precision rate, and quality of service are all experimentally evaluated across a variety of security analysis datasets. We validate our research using data from operational networks and simulated data created in a realistic-looking simulator. The findings verify the efficacy of our approach for optimising routing decisions in real time and designing networks for the future.
Journal Article
Curriculum generation using Autoencoder based continuous optimization
2022
Research in Curriculum Learning has shown better performance on the task by optimizing the sequence of the training data. Recent works have focused on using complex reinforcement learning techniques to find the optimal data ordering strategy to maximize learning for a given network. In this paper, we present a simple yet efficient technique based on continuous optimization trained with auto-encoding procedure. We call this new approach Training Sequence Optimization (TSO). With a usual encoder-decoder setup we try to learn the latent space continuous representation of the training strategy and a predictor network is used on the continuous representation to predict the accuracy of the strategy on the fixed network architecture. The performance predictor and encoder enable us to perform gradient-based optimization by gradually moving towards the latent space representation of training data ordering with potentially better accuracy. We show an empirical gain of 2AP with our generated optimal curriculum strategy over the random strategy using the CIFAR-100 and CIFAR-10 datasets and have better boosts than the existing state-of-the-art CL algorithms.
CodeSCM: Causal Analysis for Multi-Modal Code Generation
by
Suman, Jana
,
Bhatt, Noopur
,
Gupta, Mukur
in
Large language models
,
Natural language processing
,
Semantics
2025
In this paper, we propose CodeSCM, a Structural Causal Model (SCM) for analyzing multi-modal code generation using large language models (LLMs). By applying interventions to CodeSCM, we measure the causal effects of different prompt modalities, such as natural language, code, and input-output examples, on the model. CodeSCM introduces latent mediator variables to separate the code and natural language semantics of a multi-modal code generation prompt. Using the principles of Causal Mediation Analysis on these mediators we quantify direct effects representing the model's spurious leanings. We find that, in addition to natural language instructions, input-output examples significantly influence code generation.
Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning
by
Suman, Jana
,
Srivastava, Prashast
,
Hajizadeh, Samira
in
Context
,
Evaluation
,
Large language models
2026
Large language models (LLMs) are increasingly deployed for understanding large codebases, but whether they understand operational semantics of long code context or rely on pattern matching shortcuts remains unclear. We distinguish between lexical recall (retrieving code verbatim) and semantic recall (understanding operational semantics). Evaluating 10 state-of-the-art LLMs, we find that while frontier models achieve near-perfect, position-independent lexical recall, semantic recall degrades severely when code is centrally positioned in long contexts. We introduce semantic recall sensitivity to measure whether tasks require understanding of code's operational semantics vs. permit pattern matching shortcuts. Through a novel counterfactual measurement method, we show that models rely heavily on pattern matching shortcuts to solve existing code understanding benchmarks. We propose a new task SemTrace, which achieves high semantic recall sensitivity through unpredictable operations; LLMs' accuracy exhibits severe positional effects, with median accuracy drops of 92.73% versus CRUXEval's 53.36% as the relevant code snippet approaches the middle of the input code context. Our findings suggest current evaluations substantially underestimate semantic recall failures in long context code understanding.
TrajSyn: Privacy-Preserving Dataset Distillation from Federated Model Trajectories for Server-Side Adversarial Training
by
Rahman, Saifur
,
Karmakar, Chandan
,
Gupta, Niharika
in
Datasets
,
Deep learning
,
Federated learning
2025
Deep learning models deployed on edge devices are increasingly used in safety-critical applications. However, their vulnerability to adversarial perturbations poses significant risks, especially in Federated Learning (FL) settings where identical models are distributed across thousands of clients. While adversarial training is a strong defense, it is difficult to apply in FL due to strict client-data privacy constraints and the limited compute available on edge devices. In this work, we introduce TrajSyn, a privacy-preserving framework that enables effective server-side adversarial training by synthesizing a proxy dataset from the trajectories of client model updates, without accessing raw client data. We show that TrajSyn consistently improves adversarial robustness on image classification benchmarks with no extra compute burden on the client device.
Intent Detection and Entity Extraction from BioMedical Literature
by
Mullick, Ankan
,
Goyal, Pawan
,
Gupta, Mukur
in
Large language models
,
Natural language processing
2024
Biomedical queries have become increasingly prevalent in web searches, reflecting the growing interest in accessing biomedical literature. Despite recent research on large-language models (LLMs) motivated by endeavours to attain generalized intelligence, their efficacy in replacing task and domain-specific natural language understanding approaches remains questionable. In this paper, we address this question by conducting a comprehensive empirical evaluation of intent detection and named entity recognition (NER) tasks from biomedical text. We show that Supervised Fine Tuned approaches are still relevant and more effective than general-purpose LLMs. Biomedical transformer models such as PubMedBERT can surpass ChatGPT on NER task with only 5 supervised examples.
Generating from Discrete Distributions Using Diffusions: Insights from Random Constraint Satisfaction Problems
2026
Generating data from discrete distributions is important for a number of application domains including text, tabular data, and genomic data. Several groups have recently used random \\(k\\)-satisfiability (\\(k\\)-SAT) as a synthetic benchmark for new generative techniques. In this paper, we show that fundamental insights from the theory of random constraint satisfaction problems have observable implications (sometime contradicting intuition) on the behavior of generative techniques on such benchmarks. More precisely, we study the problem of generating a uniformly random solution of a given (random) \\(k\\)-SAT or \\(k\\)-XORSAT formula. Among other findings, we observe that: \\((i)\\)~Continuous diffusions outperform masked discrete diffusions; \\((ii)\\)~Learned diffusions can match the theoretical `ideal' accuracy; \\((iii)\\)~Smart ordering of the variables can significantly improve accuracy, although not following popular heuristics.
Sense and Sensitivity: Examining the Influence of Semantic Recall on Long Context Code Reasoning
by
Suman, Jana
,
Srivastava, Prashast
,
Hajizadeh, Samira
in
Context
,
Evaluation
,
Large language models
2026
Large language models (LLMs) are increasingly deployed for understanding large codebases, but whether they understand operational semantics of long code context or rely on pattern matching shortcuts remains unclear. We distinguish between lexical recall (retrieving code verbatim) and semantic recall (understanding operational semantics). Evaluating 10 state-of-the-art LLMs, we find that while frontier models achieve near-perfect, position-independent lexical recall, semantic recall degrades severely when code is centrally positioned in long contexts. We introduce semantic recall sensitivity to measure whether tasks require understanding of code's operational semantics vs. permit pattern matching shortcuts. Through a novel counterfactual measurement method, we show that models rely heavily on pattern matching shortcuts to solve existing code understanding benchmarks. We propose a new task SemTrace, which achieves high semantic recall sensitivity through unpredictable operations; LLMs' accuracy exhibits severe positional effects, with median accuracy drops of 92.73% versus CRUXEval's 53.36% as the relevant code snippet approaches the middle of the input code context. Our findings suggest current evaluations substantially underestimate semantic recall failures in long context code understanding.
XOXO: Stealthy Cross-Origin Context Poisoning Attacks against AI Coding Assistants
2026
AI coding assistants are widely used for tasks like code generation. These tools now require large and complex contexts, automatically sourced from various origins\\(x2014\\)across files, projects, and contributors\\(x2014\\)forming part of the prompt fed to underlying LLMs. This automatic context-gathering introduces new vulnerabilities, allowing attackers to subtly poison input to compromise the assistant's outputs, potentially generating vulnerable code or introducing critical errors. We propose a novel attack, Cross-Origin Context Poisoning (XOXO), that is challenging to detect as it relies on adversarial code modifications that are semantically equivalent. Traditional program analysis techniques struggle to identify these perturbations since the semantics of the code remains correct, making it appear legitimate. This allows attackers to manipulate coding assistants into producing incorrect outputs, while shifting the blame to the victim developer. We introduce a novel, task-agnostic, black-box attack algorithm GCGS that systematically searches the transformation space using a Cayley Graph, achieving a 75.72% attack success rate on average across five tasks and eleven models, including GPT 4.1 and Claude 3.5 Sonnet v2 used by popular AI coding assistants. Furthermore, defenses like adversarial fine-tuning are ineffective against our attack, underscoring the need for new security measures in LLM-powered coding tools.