Catalogue Search | MBRL

Neural network models for software development effort estimation: a comparative study

by Azzeh, Mohammad , Capretz, Luiz Fernando , Ho, Danny in Artificial Intelligence , Bids , Comparative studies

2016

Software development effort estimation (SDEE) is one of the main tasks in software project management. It is crucial for a project manager to efficiently predict the effort or cost of a software project in a bidding process, since overestimation will lead to bidding loss and underestimation will cause the company to lose money. Several SDEE models exist; machine learning models, especially neural network models, are among the most prominent in the field. In this study, four different neural network models—multilayer perceptron, general regression neural network, radial basis function neural network, and cascade correlation neural network—are compared with each other based on: (1) predictive accuracy centred on the mean absolute error criterion, (2) whether such a model tends to overestimate or underestimate, and (3) how each model classifies the importance of its inputs. Industrial datasets from the International Software Benchmarking Standards Group (ISBSG) are used to train and validate the four models. The main ISBSG dataset was filtered and then divided into five datasets based on the productivity value of each project. Results show that the four models tend to overestimate in 80 % of the datasets, and the significance of the model inputs varies based on the selected model. Furthermore, the cascade correlation neural network outperforms the other three models in the majority of the datasets constructed on the mean absolute residual criterion.

Journal Article

Share this book

Add to My Shelf

Dynamic neural networks for gas turbine engine degradation prediction, health monitoring and prognosis

by Kiakojoori, S. , Khorasani, K. in Aircraft engines , Artificial Intelligence , Computational Biology/Bioinformatics

2016

In this paper, the problem of health monitoring and prognosis of aircraft gas turbine engines is considered by using computationally intelligent methodologies. Two different dynamic neural networks, namely the nonlinear autoregressive with exogenous input neural networks and the Elman neural networks, are developed and designed for this purpose. The proposed dynamic neural networks are designed to capture the dynamics of two main degradations in the gas turbine engine, namely the compressor fouling and the turbine erosion. The health status and condition of the engine in terms of the turbine output temperature (TT) are then predicted subject to occurrence of these deteriorations. Various scenarios consisting of fouling and erosion separately as well as combined are considered. For each scenario, several neural networks are trained and their performance in predicting multiple flights ahead TTs is evaluated. Finally, the most suitable neural networks for achieving the best prediction are selected by using the normalized Bayesian information criterion model selection. Simulation results presented demonstrate and illustrate the effective performance of our proposed neural network-based prediction and prognosis strategies.

Journal Article

Share this book

Add to My Shelf

Combining time series prediction models using genetic algorithm to autoscaling Web applications hosted in the cloud infrastructure

by Messias, Valter Rogério , Ehlers, Ricardo , Estrella, Julio Cezar in Applications programs , Artificial Intelligence , Cloud computing

2016

In a cloud computing environment, companies have the ability to allocate resources according to demand. However, there is a delay that may take minutes between the request for a new resource and it being ready for using. This causes the reactive techniques, which request a new resource only when the system reaches a certain load threshold, to be not suitable for the resource allocation process. To address this problem, it is necessary to predict requests that arrive at the system in the next period of time to allocate the necessary resources, before the system becomes overloaded. There are several time series forecasting models to calculate the workload predictions based on history of monitoring data. However, it is difficult to know which is the best time series forecasting model to be used in each case. The work becomes even more complicated when the user does not have much historical data to be analyzed. Most related work considers only single methods to evaluate the results of the forecast. Other works propose an approach that selects suitable forecasting methods for a given context. But in this case, it is necessary to have a significant amount of data to train the classifier. Moreover, the best solution may not be a specific model, but rather a combination of models. In this paper we propose an adaptive prediction method using genetic algorithms to combine time series forecasting models. Our method does not require a previous phase of training, because it constantly adapts the extent to which the data are coming. To evaluate our proposal, we use three logs extracted from real Web servers. The results show that our proposal often brings the best result and is generic enough to adapt to various types of time series.

Journal Article

Share this book

Add to My Shelf

A Q-learning-based swarm optimization algorithm for economic dispatch problem

by Su, Mu-Chun , Hsieh, Yi-Zeng in Algorithms , Artificial Intelligence , Benchmarks

2016

In this paper, we treat optimization problems as a kind of reinforcement learning problems regarding an optimization procedure for searching an optimal solution as a reinforcement learning procedure for finding the best policy to maximize the expected rewards. This viewpoint motivated us to propose a Q -learning-based swarm optimization (QSO) algorithm. The proposed QSO algorithm is a population-based optimization algorithm which integrates the essential properties of Q -learning and particle swarm optimization. The optimization procedure of the QSO algorithm proceeds as each individual imitates the behavior of the global best one in the swarm. The best individual is chosen based on its accumulated performance instead of its momentary performance at each evaluation. Two data sets including a set of benchmark functions and a real-world problem—the economic dispatch (ED) problem for power systems—were used to test the performance of the proposed QSO algorithm. The simulation results on the benchmark functions show that the proposed QSO algorithm is comparable to or even outperforms several existing optimization algorithms. As for the ED problem, the proposed QSO algorithm has found solutions better than all previously found solutions.

Journal Article

Share this book

Add to My Shelf

A fine-grained Random Forests using class decomposition: an application to medical diagnosis

by Gaber, Mohamed Medhat , Elyan, Eyad in Artificial Intelligence , Clustering , Computational Biology/Bioinformatics

2016

Class decomposition describes the process of segmenting each class into a number of homogeneous subclasses. This can be naturally achieved through clustering. Utilising class decomposition can provide a number of benefits to supervised learning, especially ensembles. It can be a computationally efficient way to provide a linearly separable data set without the need for feature engineering required by techniques like support vector machines and deep learning. For ensembles, the decomposition is a natural way to increase diversity, a key factor for the success of ensemble classifiers. In this paper, we propose to adopt class decomposition to the state-of-the-art ensemble learning Random Forests. Medical data for patient diagnosis may greatly benefit from this technique, as the same disease can have a diverse of symptoms. We have experimentally validated our proposed method on a number of data sets that are mainly related to the medical domain. Results reported in this paper show clearly that our method has significantly improved the accuracy of Random Forests.

Journal Article

Share this book

Add to My Shelf

A hybrid grid-GA-based LSSVR learning paradigm for crude oil price forecasting

by Wu, Jiaqian , Yu, Lean , Tang, Ling in Artificial Intelligence , Computational Biology/Bioinformatics , Computational Science and Engineering

2016

In order to effectively model crude oil spot price with inherently high complexity, a hybrid learning paradigm integrating least squares support vector regression (LSSVR) with a hybrid optimization searching approach for the parameters selection in the LSSVR [consisting of grid method and genetic algorithm (GA)], i.e., a hybrid grid-GA-based LSSVR model, is proposed in this study. In the proposed hybrid learning paradigm, the grid method, a simple but efficient searching method, is first applied to roughly but rapidly determine the proper boundaries of the parameters in the LSSVR; then, the GA, an effective and powerful intelligent searching algorithm, is further implemented to select the most suitable parameters. For illustration and verification, the proposed learning paradigm is used to predict the crude oil spot prices of the West Texas Intermediate and the Brent markets. The empirical results demonstrate that the proposed hybrid grid-GA-based LSSVR learning paradigm can outperform its benchmarking models (including some popular forecasting techniques and similar LSSVRs with other parameter searching algorithms) in terms of both prediction accuracy and time-savings, indicating that it can be utilized as one effective forecasting tool for crude oil price with high volatility and irregularity.

Journal Article

Share this book

Add to My Shelf

Deep multilayer multiple kernel learning

by Rebai, Ilyes , Mahdi, Walid , BenAyed, Yassine in Adaptive algorithms , Artificial Intelligence , Back propagation

2016

Multiple kernel learning (MKL) approach has been proposed for kernel methods and has shown high performance for solving some real-world applications. It consists on learning the optimal kernel from one layer of multiple predefined kernels. Unfortunately, this approach is not rich enough to solve relatively complex problems. With the emergence and the success of the deep learning concept, multilayer of multiple kernel learning (MLMKL) methods were inspired by the idea of deep architecture. They are introduced in order to improve the conventional MKL methods. Such architectures tend to learn deep kernel machines by exploring the combinations of multiple kernels in a multilayer structure. However, existing MLMKL methods often have trouble with the optimization of the network for two or more layers. Additionally, they do not always outperform the simplest method of combining multiple kernels (i.e., MKL). In order to improve the effectiveness of MKL approaches, we introduce, in this paper, a novel backpropagation MLMKL framework. Specifically, we propose to optimize the network over an adaptive backpropagation algorithm. We use the gradient ascent method instead of dual objective function, or the estimation of the leave-one-out error. We test our proposed method through a large set of experiments on a variety of benchmark data sets. We have successfully optimized the system over many layers. Empirical results over an extensive set of experiments show that our algorithm achieves high performance compared to the traditional MKL approach and existing MLMKL methods.

Journal Article

Share this book

Add to My Shelf

Pareto efficient multi-objective optimization for local tuning of analogy-based estimation

by Banitaan, Shadi , Azzeh, Mohammad , Almasalha, Fadi in Adaptation , Algorithms , Artificial Intelligence

2016

Analogy-based effort estimation (ABE) is one of the prominent methods for software effort estimation. The fundamental concept of ABE is closer to the mentality of expert estimation but with an automated procedure in which the final estimate is generated by reusing similar historical projects. The main key issue when using ABE is how to adapt the effort of the retrieved nearest neighbors. The adaptation process is an essential part of ABE to generate more successful accurate estimation based on tuning the selected raw solutions, using some adaptation strategy. In this study, we show that there are three interrelated decision variables that have great impact on the success of adaptation method: (1) number of nearest analogies ( k ), (2) optimum feature set needed for adaptation and (3) adaptation weights. To find the right decision regarding these variables, one need to study all possible combinations and evaluate them individually to select the one that can improve all prediction evaluation measures. The existing evaluation measures usually behave differently, presenting sometimes opposite trends in evaluating prediction methods. This means that changing one decision variable could improve one evaluation measure while it is decreasing the others. Therefore, the main theme of this research is how to come up with best decision variables that improve adaptation strategy and thus the overall evaluation measures without degrading the others. The impact of these decisions together has not been investigated before; therefore, we propose to view the building of adaptation procedure as a multi-objective optimization problem. The Particle swarm optimization algorithm (PSO) is utilized to find the optimum solutions for such decision variables based on optimizing multiple evaluation measures. We evaluated the proposed approaches over 15 datasets and using four evaluation measures. After extensive experimentation, we found that: (1) predictive performance of ABE has noticeably been improved, (2) optimizing all decision variables together is more efficient than ignoring any one of them, and (3) optimizing decision variables for each project individually yields better accuracy than optimizing them for the whole dataset.

Journal Article

Share this book

Add to My Shelf

Two-level matrix factorization for recommender systems

by Cao, Longbing , Xu, Guandong , Li, Fangfang in Artificial Intelligence , Cold starts , Computational Biology/Bioinformatics

2016

Many existing recommendation methods such as matrix factorization (MF) mainly rely on user–item rating matrix, which sometimes is not informative enough, often suffering from the cold-start problem. To solve this challenge, complementary textual relations between items are incorporated into recommender systems (RS) in this paper. Specifically, we first apply a novel weighted textual matrix factorization (WTMF) approach to compute the semantic similarities between items, then integrate the inferred item semantic relations into MF and propose a two-level matrix factorization (TLMF) model for RS. Experimental results on two open data sets not only demonstrate the superiority of TLMF model over benchmark methods, but also show the effectiveness of TLMF for solving the cold-start problem.

Journal Article

Share this book

Add to My Shelf

Supervised classification of spam emails with natural language stylometry

by Shams, Rushdi , Mercer, Robert E. in Algorithms , Artificial Intelligence , Bayesian analysis

2016

Email spam is one of the biggest threats to today’s Internet. To deal with this threat, there are long-established measures like supervised anti-spam filters. In this paper, we report the development and evaluation of sentinel —an anti-spam filter based on natural language and stylometry attributes. The performance of the filter is evaluated not only on non-personalized emails (i.e., emails collected randomly) but also on personalized emails (i.e., emails collected from particular individuals). Among the non-personalized datasets are CSDMC2010, SpamAssassin, and LingSpam, while the Enron-Spam collection comprises personalized emails. The proposed filter extracts natural language attributes from email text that are closely related to writer stylometry and generate classifiers using multiple learning algorithms. Experimental outcomes show that classifiers generated by meta-learning algorithms such as adaboostm1 and bagging are the best, performing equally well and surpassing the performance of a number of filters proposed in previous studies, while a random forest generated classifier is a close second. On the other hand, the performance of classifiers using support vector machine and Naïve Bayes is not satisfactory. In addition, we find much improved results on personalized emails and mixed results on non-personalized emails.

Journal Article

Share this book

Add to My Shelf

Language Selector

MBRLGlobalSearch

Language Selector

Catalogue Search | MBRL

Search Results Heading

Explore the vast range of titles available.

MBRLSearchResults

MBRLHappinessMeter