Publications

Show all

2021

Juez-Gil, Mario; Arnaiz-González, Álvar; Rodríguez, Juan José; López-Nozal, Carlos; García-Osorio, César

Approx-SMOTE: Fast SMOTE for Big Data on Apache Spark Journal Article

In: Neurocomputing, vol. 464, pp. 432-437, 2021, ISSN: 0925-2312.

Abstract | Links | BibTeX | Tags: Big data, Data Mining, imbalance, SMOTE, Spark

Juez-Gil, Mario; Arnaiz-González, Álvar; Rodríguez, Juan José; López-Nozal, Carlos; García-Osorio, César

Rotation Forest for Big Data Journal Article

In: Information Fusion, vol. 74, pp. 39-49, 2021, ISSN: 1566-2535.

Abstract | Links | BibTeX | Tags: Big data, Ensemble learning, Machine learning, Random forest, Rotation forest, Spark

Juez-Gil, Mario; Arnaiz-González, Álvar; Rodríguez, Juan José; García-Osorio, César

Experimental evaluation of ensemble classifiers for imbalance in Big Data Journal Article

In: Applied Soft Computing, vol. 108, no. 107447, 2021, ISSN: 1568-4946.

Abstract | Links | BibTeX | Tags: Big data, ensemble, imbalance, resampling, Spark, unbalance

2016

Arnaiz-González, Álvar; Díez-Pastor, José Francisco; Rodríguez, Juan José; García-Osorio, César

Instance selection of linear complexity for big data Journal Article

In: Knowledge-Based Systems, vol. 107, pp. 83–95, 2016, ISSN: 0950-7051.

Abstract | Links | BibTeX | Tags: Big data, Data Mining, Data reduction, Hashing, Instance selection, Nearest neighbors

2010

García-Osorio, César; de Haro-García, Aida; García-Pedrajas, Nicolás

Democratic instance selection: A linear complexity instance selection algorithm based on classifier ensemble concepts Journal Article

In: Artif. Intell., vol. 174, no. 5-6, pp. 410–441, 2010, ISSN: 0004-3702.

Links | BibTeX | Tags: Big data, Data Mining, Instance selection