Publications – PLAIBDE

Salma El Hajjami; Jamal Malki; Mohammed Berrada; Bouziane Fourka

Machine Learning for Anomaly Detection. Performance Study considering Anomaly Distribution in an Unbalanced Dataset Inproceedings

IEEE, (Ed.): The 5th International Conference on Cloud Computing and Artificial Intelligence: Technologies and Applications, IEEE Xplore Digital Library, 2020, ISBN: 978-1-7281-6175-4.

Résumé | Liens | BibTeX | Étiquettes:

Salma El Hajjami; Jamal Malki; Alain Bouju; Mohammed Berrada

Machine Learning Facing Behavioral Noise Problem in an Imbalanced Data Using One Side Behavioral Noise Reduction: Application to a Fraud Detection Article de journal

World Academy of Science, Engineering and Technology International. Journal of Computer and Information Engineering, 14 (9), 2020.

Résumé | Liens | BibTeX | Étiquettes:

@article{salma2020Lisbon,
title = {Machine Learning Facing Behavioral Noise Problem in an Imbalanced Data Using One Side Behavioral Noise Reduction: Application to a Fraud Detection},
author = {Salma El Hajjami and Jamal Malki and Alain Bouju and Mohammed Berrada},
editor = {publications.waset.org},
url = {https://publications.waset.org/abstracts/127869/pdf},
year = {2020},
date = {2020-09-14},
journal = {World Academy of Science, Engineering and Technology International. Journal of Computer and Information Engineering},
volume = {14},
number = {9},
abstract = {With the expansion of machine learning and data mining in the context of Big Data analytics, the common problem
that affects data is class imbalance. It refers to an imbalanced distribution of instances belonging to each class. This problem is
present in many real world applications such as fraud detection, network intrusion detection, medical diagnostics, etc. In these
cases, data instances labeled negatively are significantly more numerous than the instances labeled positively. When this
difference is too large, the learning system may face difficulty when tackling this problem, since it is initially designed to work
in relatively balanced class distribution scenarios. Another important problem, which usually accompanies these imbalanced
data, is the overlapping instances between the two classes. It’s commonly referred as noise or overlapping data. In this article, we propose an approach called: One Side Behavioral Noise Reduction (OSBNR). This approach presents a way to deal with the problem of class imbalance in the presence of a high noise level. OSBNR is based on two steps. Firstly, a cluster analysis is applied to groups similar instances from the minority class into several behavior clusters. Secondly, we select and eliminate the instances of the majority class, considered as behavioral noise, which overlaps with behavior clusters of the minority class. The results of experiments carried out on a representative public dataset confirm that the proposed approach is efficient for the treatment of class imbalances in the presence of noise.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}

Fermer

Thouraya Sakouhi; Jamal Malki; Jalel Akaichi

A mobility data model for web-based tourists tracking Inproceedings

CEUR-WS.org, (Ed.): The 14th international baltic conference on databases and information systems (balticdb&is 2020), Tallinn, Estonia, 2020, ISSN: 1613-0073.

Résumé | Liens | BibTeX | Étiquettes:

Issam Ghabri; Ladjel Bellatreche; Sadok Ben Yahia

Selection of a Green Logical Data Warehouse Schema by Anti-monotonicity Constraint Inproceedings

Chatzigeorgiou, Alexander; Dondi, Riccardo; Herodotou, Herodotos; Kapoutsis, Christos A; Manolopoulos, Yannis; Papadopoulos, George A; Sikora, Florian (Ed.): SOFSEM 2020: Theory and Practice of Computer Science - 46th International Conference on Current Trends in Theory and Practice of Informatics, SOFSEM 2020, Limassol, Cyprus, January 20-24, 2020, Proceedings, p. 350–361, Springer, 2020.

Liens | BibTeX | Étiquettes:

Salma El Hajjami; Jamal Malki; Alain Bouju; Mohammed Berrada

A Machine Learning based Approach to Reduce Behavioral Noise Problem in an Imbalanced Data: Application to a fraud detection Inproceedings

2020 International Conference on Intelligent Data Science Technologies and Applications (IDSTA), p. 11-20, 2020.

Résumé | Liens | BibTeX | Étiquettes:

@inproceedings{9264114,
title = {A Machine Learning based Approach to Reduce Behavioral Noise Problem in an Imbalanced Data: Application to a fraud detection},
author = {Salma El Hajjami and Jamal Malki and Alain Bouju and Mohammed Berrada},
doi = {10.1109/IDSTA50958.2020.9264114},
year = {2020},
date = {2020-01-01},
booktitle = {2020 International Conference on Intelligent Data Science Technologies and Applications (IDSTA)},
pages = {11-20},
abstract = {The question of class imbalance has become more pronounced with the application of learning algorithms in real applications. It has received significant attention in the machine learning and data mining community. This problem is present in fraud detection, medical diagnostics, and a number of other areas where training data contains significantly more representatives of one class (called the majority class) than the other class (called the minority class). Machine learning techniques struggle to deal with imbalanced data by focusing on minimizing the error rate for the majority class while ignoring the minority class, which is the most interesting from a learning point of view and also involves a high cost when it is not well classified. However, the imbalance ratio is not the only cause of poor performance when learning from imbalanced data. Another critical factor that accompanies imbalanced data in the real world is the presence of a number of instances of the two classes being overlapped in feature space. This problem is commonly referred to as class overlap and we have called it “behavioral noise”. In this paper, we propose One Side Behavioral Noise Reduction (OSBNR) approach to deal with the problem of class imbalance in the presence of a behavioral noise level. OSBNR is based on two stages. Firstly, a clustering is applied to groups similar instances of the minority class in multiple behavior clusters. Secondly, we select and eliminate instances of the majority class, considered as behavioral noise, which overlap with the behavior clusters of the minority class. The results of experiments conducted on a representative public dataset confirm that the proposed approach is effective for class imbalance problem in the presence of behavioral noise.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}

Fermer

S E Hajjami; J Malki; A Bouju; M Berrada

A Machine Learning based Approach to Reduce Behavioral Noise Problem in an Imbalanced Data: Application to a fraud detection Inproceedings

2020 International Conference on Intelligent Data Science Technologies and Applications (IDSTA), p. 11-20, 2020.

Liens | BibTeX | Étiquettes:

Mahfoud Djedaini; Jamal Malki; Alain Bouju

Architectures Big Data basées sur l'Open-Sources pour le Data Analytics Technical Report

Lab. L3i et aYaline - Projet FEDER PLAIBDE 2019.

Résumé | Liens | BibTeX | Étiquettes:

Ladjel Bellatreche; Sharma Chakravarthy

A special issue in extending data warehouses to big data analytics Article de journal

Distributed Parallel Databases, 37 (3), p. 323–327, 2019.

Liens | BibTeX | Étiquettes:

Selma Khouri; Nabila Berkani; Ladjel Bellatreche; Dihia Lanasri

Data Cube Is Dead, Long Life to Data Cube in the Age of Web Data Inproceedings

Madria, Sanjay; -, Philippe Fournier; Chaudhary, Sanjay; Reddy, Krishna P (Ed.): Big Data Analytics - 7th International Conference, BDA 2019, Ahmedabad, India, December 17-20, 2019, Proceedings, p. 44–64, Springer, 2019.

Liens | BibTeX | Étiquettes:

Jorge Galicia; Amin Mesmoudi; Ladjel Bellatreche

RDFPartSuite: Bridging Physical and Logical RDF Partitioning Inproceedings

Ordonez, Carlos; -, Il; -, Gabriele Anderst; Tjoa, Min A; Khalil, Ismail (Ed.): Big Data Analytics and Knowledge Discovery - 21st International Conference, DaWaK 2019, Linz, Austria, August 26-29, 2019, Proceedings, p. 136–150, Springer, 2019.

Liens | BibTeX | Étiquettes:

Jorge Galicia; Amin Mesmoudi; Ladjel Bellatreche; Carlos Ordonez

Reverse Partitioning for SPARQL Queries: Principles and Performance Analysis Article de journal

DEXA (2), p. 174–183, 2019.

BibTeX | Étiquettes:

Selma Khouri; Dihia Lanasri; Roaya Saidoune; Kamila Boudoukha; Ladjel Bellatreche

LogLInc: LoG Queries of Linked Open Data Investigator for Cube Design Inproceedings

Hartmann, Sven; ü, Josef K; Chakravarthy, Sharma; -, Gabriele Anderst; Tjoa, Min A; Khalil, Ismail (Ed.): Database and Expert Systems Applications - 30th International Conference, DEXA 2019, Linz, Austria, August 26-29, 2019, Proceedings, Part I, p. 352–367, Springer, 2019.

Liens | BibTeX | Étiquettes:

Carlos Ordonez; Ladjel Bellatreche

Enhancing ER Diagrams to View Data Transformations Computed with Queries Inproceedings

-, Il; Romero, Oscar; Wrembel, Robert (Ed.): Proceedings of the 21st International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data, co-located with EDBT/ICDT Joint Conference, DOLAP@EDBT/ICDT 2019, Lisbon, Portugal, March 26, 2019, CEUR-WS.org, 2019.

Liens | BibTeX | Étiquettes:

Nabila Berkani; Ladjel Bellatreche; Selma Khouri; Carlos Ordonez

Value-driven Approach for Designing Extended Data Warehouses Inproceedings

-, Il; Romero, Oscar; Wrembel, Robert (Ed.): Proceedings of the 21st International Workshop on Design, Optimization, Languages and Analytical Processing of Big Data, co-located with EDBT/ICDT Joint Conference, DOLAP@EDBT/ICDT 2019, Lisbon, Portugal, March 26, 2019, CEUR-WS.org, 2019.

Liens | BibTeX | Étiquettes:

Selma Khouri; Ladjel Bellatreche; Abdessamed R é; Yasmine Aouimer

Intégrer les LOD dans un cube de données : Transformons une action technique en valeur organisationnelle Inproceedings

Lemire, Daniel; Sautot, Lucile (Ed.): Business Intelligence & Big Data, 15ème Edition de la conférence EDA, Montpellier, France, 3-4 octobre 2019, p. 61–76, Éditions RNTI, 2019.

Liens | BibTeX | Étiquettes:

Soumia Benkrid; Ladjel Bellatreche

Vers une Conception des Entrepôts de Données Parallèles Autonomes Inproceedings

Lemire, Daniel; Sautot, Lucile (Ed.): Business Intelligence & Big Data, 15ème Edition de la conférence EDA, Montpellier, France, 3-4 octobre 2019, p. 109–124, Éditions RNTI, 2019.

Liens | BibTeX | Étiquettes:

Dihia Lanasri; Carlos Ordonez; Ladjel Bellatreche; Selma Khouri

ER4ML: An ER Modeling Tool to Represent Data Transformations in Data Science Inproceedings

é, Jos; Guizzardi, Renata S S; Claro, Daniela Barreiro (Ed.): Proceedings of the ER Forum and Poster & Demos Session 2019 on Publishing Papers with CEUR-WS co-located with 38th International Conference on Conceptual Modeling (ER 2019), Salvador, Brazil, November 4, 2019, p. 123–127, CEUR-WS.org, 2019.

Liens | BibTeX | Étiquettes:

Dihia Lanasri; Selma Khouri; Roaya Saidoune; Kamila Boudoukha; Ladjel Bellatreche

Crumbs4Cube: Turning Breadcrumbs into Smart Enriched Data Cubes Inproceedings

é, Jos; Guizzardi, Renata S S; Claro, Daniela Barreiro (Ed.): Proceedings of the ER Forum and Poster & Demos Session 2019 on Publishing Papers with CEUR-WS co-located with 38th International Conference on Conceptual Modeling (ER 2019), Salvador, Brazil, November 4, 2019, p. 128–132, CEUR-WS.org, 2019.

Liens | BibTeX | Étiquettes:

Mahfoud Djedaini; Jamal Malki; Alain Bouju

Architecture Big Data open source pour l'Informatique Décisionnelle Technical Report

Lab. L3i et aYaline - Projet FEDER PLAIBDE 2018.

Résumé | Liens | BibTeX | Étiquettes:

Sirine Knaz; Jamal Malki

Architecture dórchestration et de communication des microservices à base de conteneurs pour le développement dápplications cloud-natives Masters Thesis

Lab. L3i et aYaline - Projet FEDER PLAIBDE, 2018.

Résumé | Liens | BibTeX | Étiquettes:

Anas El Majdoubi; Jamal Malki

Mise en place d’un OLAP entreprise basé sur l’écosystème Hadoop Masters Thesis

Lab. L3i et aYaline - Projet FEDER PLAIBDE, 2018.

Résumé | Liens | BibTeX | Étiquettes:

Ladjel Bellatreche; Carson Leung; Yinglong Xia; Didier El Baz

Advances in cloud and big data computing - Foreward to the special issue Article de journal

Concurrency and Computation: Practice and Experience, 31 (2), p. e5053, 2018.

Liens | BibTeX | Étiquettes:

Selma Khouri; Ladjel Bellatreche

LOD for Data Warehouses: Managing the Ecosystem Co-Evolution Article de journal

Inf., 9 (7), p. 174, 2018.

Liens | BibTeX | Étiquettes:

Nabila Berkani; Ladjel Bellatreche; Laurent Guittet

ETL Processes in the Era of Variety Article de journal

Trans. Large Scale Data Knowl. Centered Syst., 39 , p. 98–129, 2018.

Liens | BibTeX | Étiquettes:

Carlos Ordonez; Ladjel Bellatreche

A Survey on Parallel Database Systems from a Storage Perspective: Rows Versus Columns Inproceedings

Elloumi, Mourad; Granitzer, Michael; Hameurlain, Abdelkader; Seifert, Christin; Stein, Benno; Tjoa, Min A; Wagner, Roland R (Ed.): Database and Expert Systems Applications - DEXA 2018 International Workshops, BDMICS, BIOKDD, and TIR, Regensburg, Germany, September 3-6, 2018, Proceedings, p. 5–20, Springer, 2018.

Liens | BibTeX | Étiquettes:

Nabila Berkani; Selma Khouri; Ladjel Bellatreche

Linked Open Data pour les Entrepôts de Données: Opportunité et Défis Inproceedings

Badir, Hassan; Bentayeb, Fadila; ï, Omar Boussa (Ed.): Business Intelligence & Big Data, 14ème Edition de la conference EDA, Tanger, Maroc, 4-6 octobre 2018, p. 303–312, Éditions RNTI, 2018.

Liens | BibTeX | Étiquettes: