Asset Details
MbrlCatalogueTitleDetail
Do you wish to reserve the book?
MalBERTv2: Code Aware BERT-Based Model for Malware Identification
by
Akhloufi, Moulay A.
, Rahali, Abir
in
Algorithms
/ Anti-virus software
/ Artificial intelligence
/ Classification
/ Coders
/ Computational linguistics
/ Computer viruses
/ Cybersecurity
/ Cyberterrorism
/ Datasets
/ Deep learning
/ Language
/ Language processing
/ Malware
/ malware detection
/ Natural language interfaces
/ Natural language processing
/ Neural networks
/ Rankings
/ Security software
/ Social networks
/ Software
/ Source code
/ Spyware
/ transformer-based model
2023
Hey, we have placed the reservation for you!
By the way, why not check out events that you can attend while you pick your title.
You are currently in the queue to collect this book. You will be notified once it is your turn to collect the book.
Oops! Something went wrong.
Looks like we were not able to place the reservation. Kindly try again later.
Are you sure you want to remove the book from the shelf?
MalBERTv2: Code Aware BERT-Based Model for Malware Identification
by
Akhloufi, Moulay A.
, Rahali, Abir
in
Algorithms
/ Anti-virus software
/ Artificial intelligence
/ Classification
/ Coders
/ Computational linguistics
/ Computer viruses
/ Cybersecurity
/ Cyberterrorism
/ Datasets
/ Deep learning
/ Language
/ Language processing
/ Malware
/ malware detection
/ Natural language interfaces
/ Natural language processing
/ Neural networks
/ Rankings
/ Security software
/ Social networks
/ Software
/ Source code
/ Spyware
/ transformer-based model
2023
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
Do you wish to request the book?
MalBERTv2: Code Aware BERT-Based Model for Malware Identification
by
Akhloufi, Moulay A.
, Rahali, Abir
in
Algorithms
/ Anti-virus software
/ Artificial intelligence
/ Classification
/ Coders
/ Computational linguistics
/ Computer viruses
/ Cybersecurity
/ Cyberterrorism
/ Datasets
/ Deep learning
/ Language
/ Language processing
/ Malware
/ malware detection
/ Natural language interfaces
/ Natural language processing
/ Neural networks
/ Rankings
/ Security software
/ Social networks
/ Software
/ Source code
/ Spyware
/ transformer-based model
2023
Please be aware that the book you have requested cannot be checked out. If you would like to checkout this book, you can reserve another copy
We have requested the book for you!
Your request is successful and it will be processed during the Library working hours. Please check the status of your request in My Requests.
Oops! Something went wrong.
Looks like we were not able to place your request. Kindly try again later.
MalBERTv2: Code Aware BERT-Based Model for Malware Identification
Journal Article
MalBERTv2: Code Aware BERT-Based Model for Malware Identification
2023
Request Book From Autostore
and Choose the Collection Method
Overview
To proactively mitigate malware threats, cybersecurity tools, such as anti-virus and anti-malware software, as well as firewalls, require frequent updates and proactive implementation. However, processing the vast amounts of dataset examples can be overwhelming when relying solely on traditional methods. In cybersecurity workflows, recent advances in natural language processing (NLP) models can aid in proactively detecting various threats. In this paper, we present a novel approach for representing the relevance and significance of the Malware/Goodware (MG) datasets, through the use of a pre-trained language model called MalBERTv2. Our model is trained on publicly available datasets, with a focus on the source code of the apps by extracting the top-ranked files that present the most relevant information. These files are then passed through a pre-tokenization feature generator, and the resulting keywords are used to train the tokenizer from scratch. Finally, we apply a classifier using bidirectional encoder representations from transformers (BERT) as a layer within the model pipeline. The performance of our model is evaluated on different datasets, achieving a weighted f1 score ranging from 82% to 99%. Our results demonstrate the effectiveness of our approach for proactively detecting malware threats using NLP techniques.
This website uses cookies to ensure you get the best experience on our website.