Catalogue Search | MBRL

British Sign Language Recognition via Late Fusion of Computer Vision and Leap Motion with Transfer Learning to American Sign Language

by Ekárt, Anikó , Bird, Jordan J. , Faria, Diego R. in Computers , Humans , late fusion

2020

In this work, we show that a late fusion approach to multimodality in sign language recognition improves the overall ability of the model in comparison to the singular approaches of image classification (88.14%) and Leap Motion data classification (72.73%). With a large synchronous dataset of 18 BSL gestures collected from multiple subjects, two deep neural networks are benchmarked and compared to derive a best topology for each. The Vision model is implemented by a Convolutional Neural Network and optimised Artificial Neural Network, and the Leap Motion model is implemented by an evolutionary search of Artificial Neural Network topology. Next, the two best networks are fused for synchronised processing, which results in a better overall result (94.44%) as complementary features are learnt in addition to the original task. The hypothesis is further supported by application of the three models to a set of completely unseen data where a multimodality approach achieves the best results relative to the single sensor method. When transfer learning with the weights trained via British Sign Language, all three models outperform standard random weight distribution when classifying American Sign Language (ASL), and the best model overall for ASL classification was the transfer learning multimodality approach, which scored 82.55% accuracy.

Journal Article

Share this book

Add to My Shelf

Chatbot Interaction with Artificial Intelligence: human data augmentation with T5 and language transformer ensemble for text classification

by Ekárt, Anikó , Bird, Jordan J. , Faria, Diego R. in Academic disciplines , Accuracy , Algorithms

2023

In this work we present the Chatbot Interaction with Artificial Intelligence (CI-AI) framework as an approach to the training of a transformer based chatbot-like architecture for task classification with a focus on natural human interaction with a machine as opposed to interfaces, code, or formal commands. The intelligent system augments human-sourced data via artificial paraphrasing in order to generate a large set of training data for further classical, attention, and language transformation-based learning approaches for Natural Language Processing (NLP). Human beings are asked to paraphrase commands and questions for task identification for further execution of algorithms as skills. The commands and questions are split into training and validation sets. A total of 483 responses were recorded. Secondly, the training set is paraphrased by the T5 model in order to augment it with further data. Seven state-of-the-art transformer-based text classification algorithms (BERT, DistilBERT, RoBERTa, DistilRoBERTa, XLM, XLM-RoBERTa, and XLNet) are benchmarked for both sets after fine-tuning on the training data for two epochs. We find that all models are improved when training data is augmented by the T5 model, with an average increase of classification accuracy by 4.01%. The best result was the RoBERTa model trained on T5 augmented data which achieved 98.96% classification accuracy. Finally, we found that an ensemble of the five best-performing transformer models via Logistic Regression of output label predictions led to an accuracy of 99.59% on the dataset of human responses. A highly-performing model allows the intelligent system to interpret human commands at the social-interaction level through a chatbot-like interface (e.g. “Robot, can we have a conversation?”) and allows for better accessibility to AI by non-technical users.

Journal Article

Share this book

Add to My Shelf

A Deep Evolutionary Approach to Bioinspired Classifier Optimisation for Brain-Machine Interaction

by Ekárt, Anikó , Bird, Jordan J. , Manso, Luis J. in Accuracy , Algorithms , Analysis

2019

This study suggests a new approach to EEG data classification by exploring the idea of using evolutionary computation to both select useful discriminative EEG features and optimise the topology of Artificial Neural Networks. An evolutionary algorithm is applied to select the most informative features from an initial set of 2550 EEG statistical features. Optimisation of a Multilayer Perceptron (MLP) is performed with an evolutionary approach before classification to estimate the best hyperparameters of the network. Deep learning and tuning with Long Short-Term Memory (LSTM) are also explored, and Adaptive Boosting of the two types of models is tested for each problem. Three experiments are provided for comparison using different classifiers: one for attention state classification, one for emotional sentiment classification, and a third experiment in which the goal is to guess the number a subject is thinking of. The obtained results show that an Adaptive Boosted LSTM can achieve an accuracy of 84.44%, 97.06%, and 9.94% on the attentional, emotional, and number datasets, respectively. An evolutionary-optimised MLP achieves results close to the Adaptive Boosted LSTM for the two first experiments and significantly higher for the number-guessing experiment with an Adaptive Boosted DEvo MLP reaching 31.35%, while being significantly quicker to train and classify. In particular, the accuracy of the nonboosted DEvo MLP was of 79.81%, 96.11%, and 27.07% in the same benchmarks. Two datasets for the experiments were gathered using a Muse EEG headband with four electrodes corresponding to TP9, AF7, AF8, and TP10 locations of the international EEG placement standard. The EEG MindBigData digits dataset was gathered from the TP9, FP1, FP2, and TP10 locations.

Journal Article

Share this book

Add to My Shelf

Country-level pandemic risk and preparedness classification based on COVID-19 data: A machine learning approach

by Ekárt, Anikó , Bird, Jordan J. , Premebida, Cristiano in Algorithms , Betacoronavirus , Biology and Life Sciences

2020

In this work we present a three-stage Machine Learning strategy to country-level risk classification based on countries that are reporting COVID-19 information. A K% binning discretisation (K = 25) is used to create four risk groups of countries based on the risk of transmission (coronavirus cases per million population), risk of mortality (coronavirus deaths per million population), and risk of inability to test (coronavirus tests per million population). The four risk groups produced by K% binning are labelled as 'low', 'medium-low', 'medium-high', and 'high'. Coronavirus-related data are then removed and the attributes for prediction of the three types of risk are given as the geopolitical and demographic data describing each country. Thus, the calculation of class label is based on coronavirus data but the input attributes are country-level information regardless of coronavirus data. The three four-class classification problems are then explored and benchmarked through leave-one-country-out cross validation to find the strongest model, producing a Stack of Gradient Boosting and Decision Tree algorithms for risk of transmission, a Stack of Support Vector Machine and Extra Trees for risk of mortality, and a Gradient Boosting algorithm for the risk of inability to test. It is noted that high risk for inability to test is often coupled with low risks for transmission and mortality, therefore the risk of inability to test should be interpreted first, before consideration is given to the predicted transmission and mortality risks. Finally, the approach is applied to more recent risk levels to data from September 2020 and weaker results are noted due to the growth of international collaboration detracting useful knowledge from country-level attributes which suggests that similar machine learning approaches are more useful prior to situations later unfolding.

Journal Article

Share this book

Add to My Shelf

Thumbs up, thumbs down: non-verbal human-robot interaction through real-time EMG classification via inductive and supervised transductive transfer learning

by Ekárt, Anikó , Bird, Jordan J. , Kobylarz, Jhonatan in Accuracy , Algorithms , Amputation

2020

In this study, we present a transfer learning method for gesture classification via an inductive and supervised transductive approach with an electromyographic dataset gathered via the Myo armband. A ternary gesture classification problem is presented by states of ’thumbs up’ , ’thumbs down’ , and ’relax’ in order to communicate in the affirmative or negative in a non-verbal fashion to a machine. Of the nine statistical learning paradigms benchmarked over 10-fold cross validation (with three methods of feature selection), an ensemble of Random Forest and Support Vector Machine through voting achieves the best score of 91.74% with a rule-based feature selection method. When new subjects are considered, this machine learning approach fails to generalise new data, and thus the processes of Inductive and Supervised Transductive Transfer Learning are introduced with a short calibration exercise (15 s). Failure of generalisation shows that 5 s of data per-class is the strongest for classification (versus one through seven seconds) with only an accuracy of 55%, but when a short 5 s per class calibration task is introduced via the suggested transfer method, a Random Forest can then classify unseen data from the calibrated subject at an accuracy of around 97%, outperforming the 83% accuracy boasted by the proprietary Myo system. Finally, a preliminary application is presented through social interaction with a humanoid Pepper robot, where the use of our approach and a most-common-class metaclassifier achieves 100% accuracy for all trials of a ‘20 Questions’ game.

Journal Article

Share this book

Add to My Shelf

On the effects of pseudorandom and quantum-random number generators in soft computing

by Ekárt, Anikó , Bird, Jordan J. , Faria, Diego R. in 20th century , Accuracy , Algorithms

2020

In this work, we argue that the implications of pseudorandom and quantum-random number generators (PRNG and QRNG) inexplicably affect the performances and behaviours of various machine learning models that require a random input. These implications are yet to be explored in soft computing until this work. We use a CPU and a QPU to generate random numbers for multiple machine learning techniques. Random numbers are employed in the random initial weight distributions of dense and convolutional neural networks, in which results show a profound difference in learning patterns for the two. In 50 dense neural networks (25 PRNG/25 QRNG), QRNG increases over PRNG for accent classification at + 0.1%, and QRNG exceeded PRNG for mental state EEG classification by + 2.82%. In 50 convolutional neural networks (25 PRNG/25 QRNG), the MNIST and CIFAR-10 problems are benchmarked, and in MNIST the QRNG experiences a higher starting accuracy than the PRNG but ultimately only exceeds it by 0.02%. In CIFAR-10, the QRNG outperforms PRNG by + 0.92%. The n -random split of a Random Tree is enhanced towards and new Quantum Random Tree (QRT) model, which has differing classification abilities to its classical counterpart, 200 trees are trained and compared (100 PRNG/100 QRNG). Using the accent and EEG classification data sets, a QRT seemed inferior to a RT as it performed on average worse by − 0.12%. This pattern is also seen in the EEG classification problem, where a QRT performs worse than a RT by − 0.28%. Finally, the QRT is ensembled into a Quantum Random Forest (QRF), which also has a noticeable effect when compared to the standard Random Forest (RF). Ten to 100 ensembles of trees are benchmarked for the accent and EEG classification problems. In accent classification, the best RF (100 RT) outperforms the best QRF (100 QRF) by 0.14% accuracy. In EEG classification, the best RF (100 RT) outperforms the best QRF (100 QRT) by 0.08% but is extremely more complex, requiring twice the amount of trees in committee. All differences are observed to be situationally positive or negative and thus are likely data dependent in their observed functional behaviour.

Journal Article

Share this book

Add to My Shelf

Advanced predictive-analysis-based decision support for collaborative logistics networks

by Ekárt, Anikó , Ilie-Zudor, Elisabeth , Buckingham, Christopher in Algorithms , Big Data , Collaboration

2015

Purpose – The purpose of this paper is to examine challenges and potential of big data in heterogeneous business networks and relate these to an implemented logistics solution. Design/methodology/approach – The paper establishes an overview of challenges and opportunities of current significance in the area of big data, specifically in the context of transparency and processes in heterogeneous enterprise networks. Within this context, the paper presents how existing components and purpose-driven research were combined for a solution implemented in a nationwide network for less-than-truckload consignments. Findings – Aside from providing an extended overview of today’s big data situation, the findings have shown that technical means and methods available today can comprise a feasible process transparency solution in a large heterogeneous network where legacy practices, reporting lags and incomplete data exist, yet processes are sensitive to inadequate policy changes. Practical implications – The means introduced in the paper were found to be of utility value in improving process efficiency, transparency and planning in logistics networks. The particular system design choices in the presented solution allow an incremental introduction or evolution of resource handling practices, incorporating existing fragmentary, unstructured or tacit knowledge of experienced personnel into the theoretically founded overall concept. Originality/value – The paper extends previous high-level view on the potential of big data, and presents new applied research and development results in a logistics application.

Journal Article

Share this book

Add to My Shelf

The Use of Retinal Microvascular Function and Telomere Length in Age and Blood Pressure Prediction in Individuals with Low Cardiovascular Risk

by Ekárt, Anikó , Dias, Irundika HK , Shokr, Hala in Adult , Aged , Aging

2022

Ageing represents a major risk factor for many pathologies that limit human lifespan, including cardiovascular diseases. Biological ageing is a good biomarker to assess early individual risk for CVD. However, finding good measurements of biological ageing is an ongoing quest. This study aims to assess the use retinal microvascular function, separate or in combination with telomere length, as a predictor for age and systemic blood pressure in individuals with low cardiovascular risk. In all, 123 healthy participants with low cardiovascular risk were recruited and divided into three groups: group 1 (less than 30 years old), group 2 (31–50 years old) and group 3 (over 50 years old). Relative telomere length (RTL), parameters of retinal microvascular function, CVD circulatory markers and blood pressure (BP) were measured in all individuals. Symbolic regression- analysis was used to infer chronological age and systemic BP measurements using either RTL or a combination of RTL and parameters for retinal microvascular function. RTL decreased significantly with age (p = 0.010). There were also age-related differences between the study groups in retinal arterial time to maximum dilation (p = 0.005), maximum constriction (p = 0.007) and maximum constriction percentage (p = 0.010). In the youngest participants, the error between predicted versus actual values for the chronological age were smallest in the case of using both retinal vascular functions only (p = 0.039) or the combination of this parameter with RTL (p = 0.0045). Systolic BP was better predicted by RTL also only in younger individuals (p = 0.043). The assessment of retinal arterial vascular function is a better predictor than RTL for non-modifiable variables such as age, and only in younger individuals. In the same age group, RTL is better than microvascular function when inferring modifiable risk factors for CVDs. In older individuals, the accumulation of physiological and structural biological changes makes such predictions unreliable.

Journal Article

Share this book

Add to My Shelf

Axial Generation: Mixing Colour and Shapes to Automatically Form Diverse Digital Sculptures

by Ekárt, Anikó , Easton, Edward , Bernardet, Ulysses in Algorithms , Art works , Banksy

2022

Automated computer generation of aesthetically pleasing artwork has been the subject of research for several decades. The unsolved problem of interest is how to please any audience without requiring too much of their involvement in the process of creation. Two-dimensional pictures have received a lot of attention; however, 3D artwork has remained relatively unexplored. This paper showcases an extended version of the Axial Generation Process (AGP), a versatile generation algorithm that can create both 2D and 3D items within the Concretism art style. The extensions presented here include calculating colour values for the artwork, increasing the range of forms that can be created through dynamic sizing of shapes and including more primitive shape types, finally, 2D items can be created from multiple viewpoints. Both 2D and 3D items generated through the AGP were evaluated against a set of formal aesthetic measures and compared against two established generation systems, one based on manipulating pixels/voxels and another tracking the path of particles through 2D and 3D space. This initial evaluation shows that the process is capable of generating visually varied items which exhibit a generally diverse range of values across the measures used, in both two and three dimensions. Comparatively, against the established generation processes, the AGP shows a good balance of performance and ability to create complex and visually varied items.

Journal Article

Share this book

Add to My Shelf