Neural Network-Based Bilingual Lexicon Induction for Indonesian Ethnic Languages
by Resiandi, Kartika; Murakami, Yohei; Nasution, Arbi Haza
in bilingual lexicon induction / Bilingualism / Dictionaries / Indonesian ethnic languages / Language / low-resource language / Morphology / Multilingualism / natural language processing / Neural networks / sequence-to-sequence model
2023
Journal Article
Overview
Indonesia has a variety of ethnic languages, most of which belong to the same language family: the Austronesian languages. Because of this shared family, words in Indonesian ethnic languages are very similar. However, previous research suggests that these languages are endangered. To help preserve them, we propose creating bilingual dictionaries between ethnic languages using a neural network approach that extracts transformation rules by combining character-level embedding with a Bi-LSTM in a sequence-to-sequence model. The model has an encoder and a decoder. The encoder reads the input sequence character by character and condenses it into a context summary. The decoder produces the output sequence one character per timestep, so that each output character is influenced by the previous character. The first experiment focuses on the Indonesian and Minangkabau languages, with 10,277 word pairs. Five-fold cross-validation was used to evaluate the model’s performance. The character-level seq2seq method (Bi-LSTM as the encoder and LSTM as the decoder) achieved an average precision of 83.92%, outperforming SentencePiece byte pair encoding (vocab size of 33), which achieved an average precision of 79.56%. Furthermore, to evaluate how well the neural network model finds transformation patterns, a rule-based approach was used as the baseline. The neural network approach obtained 542 more correct translations than the baseline. We then applied the best setting (character-level embedding with Bi-LSTM as the encoder and LSTM as the decoder) to four other Indonesian ethnic languages: Malay, Palembang, Javanese, and Sundanese. Their input dictionaries are half the size. The average precision scores for these languages are 65.08%, 62.52%, 59.69%, and 58.46%, respectively. This shows that the neural network approach identifies transformation patterns from Indonesian to closely related languages (such as Malay and Palembang) better than to distantly related languages (such as Javanese and Sundanese).
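The evaluation protocol described above (five-fold cross-validation with an average precision score) can be sketched roughly as follows. This is a minimal illustration, not the authors' code: it assumes precision means the fraction of test words whose predicted translation exactly matches the dictionary entry, which the overview does not spell out. The names `k_fold_splits`, `precision`, `cross_validate`, and `suffix_rule` are hypothetical, and the word pairs are toy examples rather than entries from the paper's dictionary. A single hand-written suffix rule stands in for a trained model.

```python
import random
from statistics import mean


def k_fold_splits(pairs, k=5, seed=0):
    """Yield (train, test) lists for k-fold cross-validation."""
    shuffled = pairs[:]
    random.Random(seed).shuffle(shuffled)
    folds = [shuffled[i::k] for i in range(k)]
    for i in range(k):
        train = [p for j, fold in enumerate(folds) if j != i for p in fold]
        yield train, folds[i]


def precision(predict, test_pairs):
    """Assumed metric: share of source words translated exactly right."""
    correct = sum(1 for src, tgt in test_pairs if predict(src) == tgt)
    return correct / len(test_pairs)


def cross_validate(pairs, fit, k=5, seed=0):
    """Average precision over k folds; fit(train) returns a predict(src) function."""
    return mean(precision(fit(train), test)
                for train, test in k_fold_splits(pairs, k, seed))


def suffix_rule(train):
    """Hypothetical stand-in for a model: rewrite a final 'a' as 'o', ignoring train."""
    return lambda word: word[:-1] + "o" if word.endswith("a") else word


if __name__ == "__main__":
    # Toy source/target pairs for illustration only.
    toy_pairs = [("mata", "mato"), ("kuda", "kudo"), ("tiga", "tigo"),
                 ("lima", "limo"), ("nama", "namo"), ("kata", "kato"),
                 ("raja", "rajo"), ("bunga", "bungo"), ("dua", "duo"),
                 ("air", "aia")]
    print(f"average precision: {cross_validate(toy_pairs, suffix_rule):.2%}")
```

In a setup like the one described, `fit` would train the character-level seq2seq model on the training folds and return its decoding function. Swapping in a rule-based `fit` gives the baseline comparison under the same splits.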
Publisher
MDPI AG