Javascript must be enabled to continue!
Optimizing Random Forests: Spark Implementations of Random Genetic Forests
View through CrossRef
The Random Forest (RF) algorithm, originally proposed by Breiman [7], is a widely used machine learning algorithm that gains its merit from its fast learning speed as well as high classification accuracy. However, despite its widespread use, the different mechanisms at work in Breiman’s RF are not yet fully understood, and there is still on-going research on several aspects of optimizing the RF algorithm, especially in the big data environment. To optimize the RF algorithm, this work builds new ensembles that optimize the random portions of the RF algorithm using genetic algorithms, yielding Random Genetic Forests (RGF), Negatively Correlated RGF (NC-RGF), and Preemptive RGF (PFS-RGF). These ensembles are compared with Breiman’s classic RF algorithm in Hadoop’s big data framework using Spark on a large, high-dimensional network intrusion dataset, UNSW-NB15.
Title: Optimizing Random Forests: Spark Implementations of Random Genetic Forests
Description:
The Random Forest (RF) algorithm, originally proposed by Breiman [7], is a widely used machine learning algorithm that gains its merit from its fast learning speed as well as high classification accuracy.
However, despite its widespread use, the different mechanisms at work in Breiman’s RF are not yet fully understood, and there is still on-going research on several aspects of optimizing the RF algorithm, especially in the big data environment.
To optimize the RF algorithm, this work builds new ensembles that optimize the random portions of the RF algorithm using genetic algorithms, yielding Random Genetic Forests (RGF), Negatively Correlated RGF (NC-RGF), and Preemptive RGF (PFS-RGF).
These ensembles are compared with Breiman’s classic RF algorithm in Hadoop’s big data framework using Spark on a large, high-dimensional network intrusion dataset, UNSW-NB15.
Related Results
Muriel Spark and the Art of Deception: Constructing Plausibility with the Methods of WWII Black Propaganda
Muriel Spark and the Art of Deception: Constructing Plausibility with the Methods of WWII Black Propaganda
Abstract
From May to October 1944, Muriel Spark was employed by the Political Warfare Executive (PWE), a secret service created by Britain during the Second World Wa...
How Terms Shape Forests: 'Niederwald', 'Mittelwald' and 'Hochwald', and their Interaction with Forest Development in the Canton of Zurich, Switzerland
How Terms Shape Forests: 'Niederwald', 'Mittelwald' and 'Hochwald', and their Interaction with Forest Development in the Canton of Zurich, Switzerland
Changes in forests are influenced by, and themselves influence, such local conditions as soil, climate and exposure, and also the demands put on the forests by society. Forestry ha...
Filming Non-Normative Embodiment: Power, Sexuality and Genetic Difference in Jabe Babe – A Heightened Life, a Documentary about a Dominatrix with Marfan Syndrome
Filming Non-Normative Embodiment: Power, Sexuality and Genetic Difference in Jabe Babe – A Heightened Life, a Documentary about a Dominatrix with Marfan Syndrome
This article will examine the ethical and directorial challenges faced by the documentary filmmaker when collaborating with a central subject who lives with a potentially fatal gen...
Perfecting Bodies: Who Are the Disabled in Andrew Niccol’s Gattaca?
Perfecting Bodies: Who Are the Disabled in Andrew Niccol’s Gattaca?
This paper will examine the impact of genetic technologies on the corporeal and economical aspects of human lives while emphasizing the ambiguity of disability under these subversi...
Genetic diversity, polyphenolic composition and fruit quality trait phenotypic analyses of a Chilean heritage blood-flesh peach (Prunus persica L.)
Genetic diversity, polyphenolic composition and fruit quality trait phenotypic analyses of a Chilean heritage blood-flesh peach (Prunus persica L.)
This study reports the genetic diversity among Chilean heritage blood-flesh peaches and the characterization of phytochemicals and bioactive compounds present in these fruits. A ge...
Cultural heritage preservation by using blockchain technologies
Cultural heritage preservation by using blockchain technologies
AbstractUbiquitous digitization enables promising options for cultural heritage preservation. Therefore, a new approach is presented that considers deployment scenarios by linking ...
Development of Optimized Phenomic Predictors for Efficient Plant Breeding Decisions Using Phenomic-Assisted Selection in Soybean
Development of Optimized Phenomic Predictors for Efficient Plant Breeding Decisions Using Phenomic-Assisted Selection in Soybean
The rate of advancement made in phenomic-assisted breeding methodologies has lagged those of genomic-assisted techniques, which is now a critical component of mainstream cultivar d...
Renewal of Tidal Forests in Washington State after a Subduction Earthquake in A.D. 1700
Renewal of Tidal Forests in Washington State after a Subduction Earthquake in A.D. 1700
AbstractWith few exceptions, today's tidal trees near Washington's Pacific coast postdate an earthquake that lowered the region by 1 m or more. The earthquake, which occurred in A....
Recent Results
ENGINEERING ARCHIE: ARCHIBALD LEITCH: FOOTBALL GROUND DESIGNER
ENGINEERING ARCHIE: ARCHIBALD LEITCH: FOOTBALL GROUND DESIGNER
SIMON INGLIS, Engineers, biography, ENGLISH HERITAGE...
Tire choices in Roman chariot racing
Tire choices in Roman chariot racing
Formal chariot racing was a sophisticated and popular sport for over 1800 years, from Etruria in the 6th c. B.C. down to the fall of Constantinople, and the races held in a large n...
Women as mythmakers
Women as mythmakers
Estella Lauter, Art and mythology, 1984, Indiana University Press...