Catalogue Search | MBRL
Search Results Heading
Explore the vast range of titles available.
MBRLSearchResults
-
DisciplineDiscipline
-
Is Peer ReviewedIs Peer Reviewed
-
Series TitleSeries Title
-
Reading LevelReading Level
-
YearFrom:-To:
-
More FiltersMore FiltersContent TypeItem TypeIs Full-Text AvailableSubjectCountry Of PublicationPublisherSourceTarget AudienceDonorLanguagePlace of PublicationContributorsLocation
Done
Filters
Reset
9,643
result(s) for
"Big Data Mining"
Sort by:
RETRACTED: A Smart Social Insurance Big Data Analytics Framework Based on Machine Learning Algorithms
by
Riad, Alaa M.
,
Elkhamisy, Nashaat
,
Senousy, Youssef
in
Big Data Mining and Big Data Analytics
,
Data Integration
,
Social Insurance
2020
Social insurance is an individual’s protection against risks such as retirement, death or disability. Big data mining and analytics are a way that could help the insurers and the actuaries to get the optimal decision for the insured individuals. Dependently, this paper proposes a novel analytic framework for Egyptian Social insurance big data. NOSI’s data contains data, which need some pre-processing methods after extraction like replacing missing values, standardization and outlier/extreme data. The paper also presents using some mining methods, such as clustering and classification algorithms on the Egyptian social insurance dataset through an experiment. In clustering, we used K-means clustering and the result showed a silhouette score 0.138 with two clusters in the dataset features. In classification, we used the Support Vector Machine (SVM) classifier and classification results showed a high accuracy percentage of 94%.
Journal Article
Beginning Apache Pig : big data processing made easy
\"Learn to use Apache Pig to develop lightweight big data applications easily and quickly. This book shows you many optimization techniques and covers every context where Pig is used in big data analytics. Beginning Apache Pig shows you how Pig is easy to learn and requires relatively little time to develop big data applications. The book is divided into four parts: the complete features of Apache Pig; integration with other tools; how to solve complex business problems; and optimization of tools. You'll discover topics such as MapReduce and why it cannot meet every business need; the features of Pig Latin such as data types for each load, store, joins, groups, and ordering; how Pig workflows can be created; submitting Pig jobs using Hue; and working with Oozie. You'll also see how to extend the framework by writing UDFs and custom load, store, and filter functions. Finally you'll cover different optimization techniques such as gathering statistics about a Pig script, joining strategies, parallelism, and the role of data formats in good performance. What You Will Learn* Use all the features of Apache Pig* Integrate Apache Pig with other tools* Extend Apache Pig* Optimize Pig Latin code* Solve different use cases for Pig LatinWho This Book Is ForAll levels of IT professionals: architects, big data enthusiasts, engineers, developers, and big data administrators.\"-- Provided by publisher.
Development and validation of a model for assessing potential strategic innovation risk in banks based on data mining-Monte-Carlo in the 'Open Innovation' System
by
Milenkov, Alexander Vladimirovich
,
Borlakova, Aminat Islamovna
,
Manuylenko, Viktoriya Valeryevna
in
bank innovations
,
Banking industry
,
Banks
2021
Innovation risk in banks, a formalized instrument that is part of banks' financial and innovative strategies, influences the assessment of innovative activity, demonstrating the importance of forecasting and assessment models of potential innovation risks. Our research into general scientific and specific methods allowed us to: (1) distinguish hierarchical concepts and their order-namely, \"banking innovation\", \"economic effects of innovational activities\", \"financial and innovative strategy\", and \"innovation risk\"; (2) identify links between innovative and strategic bank management, since bank innovations are carried out in conjunction with strategies and imply positive strategic economic effects, making the assessment of potential innovation risk necessary for the current moment and the future; (3) note that the launching and use of new technologies on economic cycles and phases involving a necessary correlation between innovative profit and these phases; (4) provide preferable measurements of banks' innovative activity and financial performance against commission income; (5) assess the potential financial performance of banks' financial and innovative strategies within economic cycles and phases and in accordance with the nature of income; (6) present general areas for the practical application of an adapted data mining-Monte Carlo method, based on a proprietary software product. The model's application in the \"open innovation\" system exhibits its multipurpose nature and allows for the selection of alternative strategic innovative solutions within economic cycle phases. It also serves in the promotion of Big Data technology in relation to finance and innovation, which is a promising area, and determines the values of the desired indicators for the \"bank of the future\" concept.
Journal Article
A Workload-Driven Approach for View Selection in Large Dimensional Datasets
2020
The information explosion the world has witnessed in the last two decades has forced businesses to adopt a data-driven culture for them to be competitive. These data-driven businesses have access to countless sources of information, and face the challenge of making sense of overwhelming amounts of data in a efficient and reliable manner, which implies the execution of read-intensive operations. In the context of this challenge, a framework for the dynamic read-optimization of large dimensional datasets has been designed, and on top of it a workload-driven mechanism for automatic materialized view selection and creation has been developed. This paper presents an extensive description of this mechanism, along with a proof-of-concept implementation of it and its corresponding performance evaluation. Results show that the proposed mechanism is able to derive a limited but comprehensive set of views leading to a drop in query latency ranging from 80% to 99.99% at the expense of 13% of the disk space used by the base dataset. This way, the devised mechanism enables speeding up query execution by building materialized views that match the actual demand of query workloads.
Journal Article
Ship Spatiotemporal Key Feature Point Online Extraction Based on AIS Multi-Sensor Data Using an Improved Sliding Window Algorithm
2019
Large volumes of automatic identification system (AIS) data provide new ideas and methods for ship data mining and navigation behavior pattern analysis. However, large volumes of big data have low unit values, resulting in the need for large-scale computing, storage, and display. Learning efficiency is low and learning direction is blind and untargeted. Therefore, key feature point (KFP) extraction from the ship trajectory plays an important role in fields such as ship navigation behavior analysis and big data mining. In this paper, we propose a ship spatiotemporal KFP online extraction algorithm that is applied to AIS trajectory data. The sliding window algorithm is modified for application to ship navigation angle deviation, position deviation, and the spatiotemporal characteristics of AIS data. Next, in order to facilitate the subsequent use of the algorithm, a recommended threshold range for the corresponding two parameters is discussed. Finally, the performance of the proposed method is compared with that of the Douglas–Peucker (DP) algorithm to assess its feature extraction accuracy and operational efficiency. The results show that the proposed improved sliding window algorithm can be applied to rapidly and easily extract the KFPs from AIS trajectory data. This ability provides significant benefits for ship traffic flow and navigational behavior learning.
Journal Article
College students’ Network behavior Using data mining and feature analysis
by
HeenaKauser, Sk
,
Madala, Srinivasa Rao
,
Sravani, Thatiparthi
in
Big Data
,
Data mining
,
Undergraduate Network Behavior
2021
Teachers may use advanced analytics to rapidly and correctly understand undergraduate behavior trends, especially when it comes to identifying undergraduate groupings that need to be focused on at a later time. This study uses data mining cluster analysis to analyze the constituent behavior of 3,245 undergraduates in a specific level ‘B’ institution’s college network. According to the data, there are four different undergraduate groups with different Web access features, with 350 participants using the accomplishments and other variables of their success have an influence on these students. As a result of this research, we were able to collect data on undergraduate college network activity, which may be used to aid in the development of academic advising management.
Journal Article