Publications

2025
BibTeX

—————————————————————————————–

Fatma Dhaoui, Kalthoum Rezgui, Nadia Ben Azzouna
Explaining MOOC Dropout Prediction Using ML and DL Models: An Empirical Study on the KDDCup 2015 Dataset

L’étude vise à prédire les abandons dans les MOOCs en comparant des modèles d’apprentissage automatique (ML) et d’apprentissage profond (DL), tout en intégrant des techniques d’explicabilité (XAI) pour comprendre les comportements des apprenants., 2025

Abstract

Massive Open Online Courses (MOOCs) face high
dropout rates, often exceeding 80%, undermining their educational
potential. This study presents a comparative evaluation
of Machine Learning (ML) and Deep Learning (DL) models for
early dropout prediction using the KDDCup2015 dataset, with a
dual focus on predictive performance and model interpretability
through eXplainable AI (XAI) techniques. Among traditional
ML models, the Decision Tree (DT) achieves the highest
performance (90.18% AUC-PR by Week 4), outperforming
Logistic Regression (LR) and Support Vector Machine (SVM).
In the ensemble category, AdaBoost leads with 90.35% AUCPR.
The hybrid CNN-LSTM outperforms standalone CNN and
LSTM models, reaching up to 93,76% AUC-PR. XAI analysis
reveals that frequent platform access, navigation patterns and
problem solving activities are key predictors of dropout. These
insights support early interpretable interventions to improve
learner retention while maintaining model transparency.
BibTeX

@INPROCEEDINGS {11272794,
author = { Chedly, Jihen and Maatouk, Ons and Ayadi, Wassim },
booktitle = { 2025 IEEE 37th International Conference on Tools with Artificial Intelligence (ICTAI) },
title = {{ Biological Knowledge-Driven Evolutionary Algorithm for Biclustering Gene Expression Data }},
year = {2025},
volume = {},
ISSN = {},
pages = {705-709},
abstract = { Biclustering is an important data analysis technique that simultaneously groups coherent rows and columns of a matrix. It has broad applications, especially in bioinformatics, where it helps identify groups of genes with similar behavior under specific experimental conditions, potentially revealing shared biological functions. Although many algorithms have been developed, including evolutionary approaches, most rely on statistical criteria that may not reflect true biological significance, making interpretation difficult for biologists. To address this limitation, we propose integrating biological knowledge directly into an evolutionary algorithm, particularly during the selection and crossover phases. We also employ a global alignment technique with a binary mask to guide crossover operations more effectively. Experiments conducted on real Saccharomyces cerevisiae data confirm that integrating biological knowledge yields biologically meaningful and non-trivial biclusters, demonstrating the effectiveness of our approach. },
keywords = {Data analysis;Annotations;Evolutionary computation;Coherence;Biology;Gene expression;Object recognition;Bioinformatics;Artificial intelligence},
doi = {10.1109/ICTAI66417.2025.00101},
url = {https://doi.ieeecomputersociety.org/10.1109/ICTAI66417.2025.00101},
publisher = {IEEE Computer Society},
address = {Los Alamitos, CA, USA},
month =Nov}

Jihen Chedli, Ons Maatouk, Wassim Ayadi
Biological Knowledge-Driven Evolutionary Algorithm for Biclustering Gene Expression Data

IEEE 37th International Conference on Tools with Artificial Intelligence (ICTAI), 2025

Abstract

Biclustering is an important data analysis technique
that simultaneously groups coherent rows and columns of a
matrix. It has broad applications, especially in bioinformatics,
where it helps identify groups of genes with similar behavior under
specific experimental conditions, potentially revealing shared
biological functions. Although many algorithms have been developed,
including evolutionary approaches, most rely on statistical
criteria that may not reflect true biological significance, making
interpretation difficult for biologists. To address this limitation,
we propose integrating biological knowledge directly into an
evolutionary algorithm, particularly during the selection and
crossover phases. We also employ a global alignment technique
with a binary mask to guide crossover operations more effectively.
Experiments conducted on real Saccharomyces cerevisiae data
confirm that integrating biological knowledge yields biologically
meaningful and non-trivial biclusters, demonstrating the effectiveness
of our approach.
2024
BibTeX

@inproceedings{boubaker2024personalized, title={Personalized E-Learning Knowledge Graph-based Recommender System using Ensemble Attention Networks}, author={Boubaker, Nadia Ben Hadj and Kodia, Zahra and Ayadi, Nadia Yacoubi}, booktitle={International Conference on Management of Digital}, pages={84--100}, year={2024}, organization={Springer} }

Nadia Ben Hadj Boubaker, Zahra Kodia, Nadia Yacoubi Ayadi
Personalized E-Learning Knowledge Graph-based Recommender System using Ensemble Attention Networks

Boubaker, N. B. H., Kodia, Z., & Ayadi, N. Y. (2024, November). Personalized E-Learning Knowledge Graph-based Recommender System using Ensemble Attention Networks. In International Conference on Management of Digital (pp. 84-100). Cham: Springer Nature Sw, 2024

Abstract

In a rapidly evolving digital landscape, recommender systems have become essential tools for helping users navigate overwhelming amounts of information in various domains. In e-learning contexts, these systems aim to support learners by identifying educational resources or relevant academic content that are significant for their learning experience. However, research incorporating domain knowledge, such as course-related concepts, to improve recommendation quality remains limited. This paper presents a new personalized recommender system in e-learning context, which is called KA-ERN: Knowledge-based Attention Ensemble Recurrent Network. Specifically, KA-ERN leverages a Knowledge Graph to capture the dependencies and semantic relationships between users, courses, and their related concepts and then, performs ensemble learning by combining the Bidirectional Long Short-Term Memory network (Bi-LSTM) with the Artificial Neural Network (ANN). An attention mechanism is added to enhance recommendation quality. Our approach is based on a two-stage architecture. First, entities and relations embeddings are generated and then concatenated as sequences allowing the model to capture complex relationships and contextual dependencies of user preferences. Secondly, these embeddings are provided as inputs to the KA-ERN model. The proposed combination of Bi-LSTM network, ANN, and attention mechanisms shows the advantage of using Knowledge graph embeddings over bipartite graph embeddings capturing only user-item interactions and experimental results achieving the best recommendation metrics, with RMSE of 0.045 and MAE of 0.034, outperforming baseline methods.
BibTeX

–

Mouhamed Gaith Ayadi, Haithem Mezni
Enabling Configurable Workflows in Smart Environments with Knowledge-based Process Fragment Reuse

Grid Computing, 2024

Abstract

In today’s smart environments, the serviceli-zation of various resources has produced a tremendous number of IoT- and cloud-based smart services. Thanks to the pivotal role of pillar paradigms, such as edge/cloud computing, Internet of Things, and business process management, it is now possible to combine and translate these service-like resources into configurable workflows, to cope with users’ complex needs. Examples include treatment workflows in smart healthcare, delivery plans in drone-based missions, transportation plans in smart urban networks, etc. Rather than composing atomic services to obtain these workflows, reusing existing process fragments has several advantages, mainly the fast, secure, and configurable compositions. However, reusing smart process fragments has not yet been addressed in the context of smart environments. In addition, existing solutions in smart environments suffer from the complexity (e.g., multi-modal transportation in smart mobility) and privacy issues caused by the heterogeneity (e.g., package delivery in smart economy) of aggregated services. Moreover, these services may be conflicting in specific domains (e.g. medication/treatment workflows in smart healthcare), and may affect user experience. To solve the above issues, the present paper aims to accelerate the process of generating configurable treatment workflows w.r.t. the users’ requirements and their smart environment specificity. We exploit the principles of software reuse to map each sub-request into smart process fragments, which we combine using Cocke-Kasami-Younger (CKY) method, to finally obtain the suitable workflow. This contribution is preceded by a knowledge graph modeling of smart environments in terms of available services, process fragments, as well as their dependencies. The built information network is, then, managed using a graph representation learning method, in order to facilitate its processing and composing high-quality smart services. Experimental results on a real-world dataset proved the effectiveness of our approach, compared to existing solutions.
BibTeX

–

Haithem Mezni, Mokhtar Sellami, Amal Al-Rasheed, Hela Elmannai
Cross-network service recommendation in smart cities

Concurrency and Computation: Practice and Experience, 2024

Abstract

Nowadays, Internet of Things, artificial intelligence, cloud computing, and other revolutionary technologies (e.g., edge and fog computing) have become the pillar of smart cities. These latter make users’ lives easier, thanks to a wide variety of smart services offered in different dimensions (e.g., smart living, smart mobility, smart economy, smart governance). However, the rapid adoption of smart services by users and the full servicelization of several cities around the world is faced with two major issues: the lack of knowledge regarding smart services’ capacities (e.g., features, contextual requirements, pricing models, privacy policies, provisioning terms, etc.), and the lack of unified rating and quantification of smart services’ QoS behavior. Indeed, interested users often exploit traditional search tools (e.g., Web search engines, social networks) to find and rate the needed services. This behavior has scattered the smart services’ usage data (e.g., users contexts, ratings) across multiple providers platforms, which makes the search task beyond the capacity of users and, even, other service providers. Although recommender systems are a natural solution to exempt users from exploring the huge space of the offered smart services, current recommendation approaches for smart city environments are unable to deliver correct recommendations. In fact, they have been initially designed to single-network settings (a single service repository), while smart services’ consumers often are involved in multiple provider platforms. To the best of our knowledge, there exists no approach that treated smart service recommendation across multiple information networks. Therefore, the goal of this paper is to propose a cross-network recommender system for smart cities. We first model the multiplex network of smart services’ providers as a multirelational fuzzy lattice family thanks to fuzzy relational concept analysis (fuzzy RCA), which is a powerful mathematical method for data analysis and clustering. We also use the concept of anchor users to connect providers networks via the users involved in more than one provider platform. Guided by anchors’ cross-network relations, we compute the similarity between users and we define algorithms for exploring the smart services’ information network, i.e. lattice family. Extensive experiments have proved the effectiveness of cross-network recommendation and the quality of produced recommendations, compared to state-of-the-art single-network recommendation.
BibTeX

–

Ameni Hedhli, Haithem Mezni, Lamjed Ben Said
BPaaS placement over optimum cloud availability zones

Cluster Computing, 2024

Abstract

Business Process as a Service (BPaaS) has recently emerged from the synergy between business process management and cloud computing, allowing companies to outsource and migrate their businesses to the cloud. BPaaS management refers to the set of operations (decomposition, customization, placement, etc.) that maintain a high-quality of the deployed cloud-based businesses. Like its ancestor SaaS, BPaaS placement consists on the dispersion of its composing fragments over multiple cloud availability zones (CAZ). These latter are characterized by their huge, diverse and dynamic data, which are exploited to select the high-performance servers holding BPaaS fragments, while preserving their constraints. These fragments’ relations and their placement schemes constitute a dynamic BPaaS information network. However, the few existing BPaaS solutions adopt a static placement strategy, while it is important to take the CAZ dynamic and uncertain nature into account. Also, current solutions do not properly model the BPaaS environment. To offer an efficient BPaaS placement scheme, we combine prediction and learning capabilities, which will help identify the migrating fragments and their new hosting servers. We first model the BPaaS context as a heterogeneous information network. Then, we apply an incremental representation learning approach to facilitate its processing. Using the principles of proximity-aware representation learning, we infer useful knowledge regarding BPaaS fragments and the available servers at different CAZ. Finally based on the degree of closeness between the BPaaS environment’s entities (e.g., fragments, servers), we select the optimum cloud availability zone on which the resource-consuming BPaaS fragments are migrated based on a proposed placement scheme. Obtained results were very promising compared to traditional BPaaS placement solutions.
BibTeX

@INPROCEEDINGS{10596210,
author={Godhbani, Salah and Elkosantini, Sabeur and Lee, Seongkwan M. and Suh, Wonho},
booktitle={2024 IEEE International Conference on Advanced Systems and Emergent Technologies (IC_ASET)},
title={A New Multimodal and Spatio-Temporal Dataset for Traffic Control: Development, Analysis, and Potential Applications},
year={2024},
volume={},
number={},
pages={1-5},
keywords={Training;Road transportation;Technological innovation;Soft sensors;Transportation;Data integration;Traffic control;multimodal;dataset;traffic control;image;speed;flow;Machine learning},
doi={10.1109/IC_ASET61847.2024.10596210}}

Salah Ghodhbani
A New Multimodal and Spatio-Temporal Dataset for Traffic Control: Development, Analysis, and Potential Applications

The dataset provides a comprehensive view of traffic behavior at specific junctions, enabling detailed analysis and real-world applications. By integrating previously disparate data sources, this dataset offers a valuable resource for understanding and op, 2024

Abstract

Multimodal data, which includes various data formats such as image, video, text, and sensor data, is essential for urban traffic management. The lack of proven multimodal transportation data has been a significant challenge for urban planners, leading to biased or incomplete estimates of travel demand, mode choice, and network performance. Multimodal data integration offers a valuable resource for understanding and optimizing traffic control and management. However, the heterogeneity of the data, various kinds of noise, alignment of modalities, and techniques to handle missing data are some of the challenges that arise. This paper presents a novel multimodal dataset which is the first of its kind, its scraped from England Highways, incorporating speed, flow, and camera images for the M60, M25, and M1 motorways. The dataset provides a comprehensive view of traffic behavior at specific junctions, enabling detailed analysis and real-world applications. By integrating previously disparate data sources, this dataset offers a valuable resource for understanding and optimizing traffic control and management. The paper outlines the dataset’s development, including the gathering of speed and flow data, and the use of image scraping techniques to capture CCTV images. The potential applications of the dataset for traffic control, planning, and optimization are also discussed. Overall, this multimodal dataset represents a significant contribution to the field, with implications for the development of advanced traffic management systems and the improvement of transportation infrastructure
BibTeX

@INPROCEEDINGS{10596197,
author={Godhbani, Salah and Elkosantini, Sabeur and Lee, Seongkwan M. and Suh, Wonho},
booktitle={2024 IEEE International Conference on Advanced Systems and Emergent Technologies (IC_ASET)},
title={A Spatial-Temporal DLApproach for Traffic Flow Prediction Using Attention Fusion Method},
year={2024},
volume={},
number={},
pages={1-6},
keywords={Recurrent neural networks;Accuracy;Roads;Feature extraction;Data models;Spatial databases;Spatiotemporal phenomena;multimodal;dataset;traffic control;image;speed;flow;Machine learning},
doi={10.1109/IC_ASET61847.2024.10596197}}

Salah Ghodhbani, Sabeur Elkosantini
A Spatial-Temporal DLApproach for Traffic Flow Prediction Using Attention Fusion Method

The proposed model can extract comprehensive features from various transportation data and effectively capture the spatial-temporal dependencies. By merging these features, it aims to generate more accurate and robust traffic flow predictions. This method, 2024

Abstract

in recent years, traffic flow prediction has presented challenges in the management of transportation systems. It is a crucial part of Intelligent Transportation Systems (ITS). The complexities of various transportation data, spatial and temporal dependencies on road networks, and multimodalities, such as public transit, pedestrian flow, and bike sharing, make it a challenging task to forecast traffic flow accurately. Numerous works have been introduced to address these challenges, but few have simultaneously considered these factors, resulting in limited success. In this study, a model is proposed to integrate Graph Convolutional Networks (GCN) and Bidirectional Long Short-Term Memory (BiLSTM). This model utilizes the advantages of GCN in handling spatial data and capturing dependencies in road networks, combined with BiLSTM’s capability in learning temporal dynamics. The proposed model can extract comprehensive features from various transportation data and effectively capture the spatial-temporal dependencies. By merging these features, it aims to generate more accurate and robust traffic flow predictions. This method addresses the limitations of existing methods that fail to consider spatial-temporal dependencies and multimodalities, leading to improved prediction accuracy and efficiency
BibTeX

@article{latrech2024codfi, title={CoDFi-DL: a hybrid recommender system combining enhanced collaborative and demographic filtering based on deep learning.}, author={Latrech, Jihene and Kodia, Zahra and Ben Azzouna, Nadia}, journal={Journal of Supercomputing}, volume={80}, number={1}, year={2024} }

Jihene LATRECH, Zahra Kodia, Nadia Ben Azzouna
CoDFi-DL: a hybrid recommender system combining enhanced collaborative and demographic filtering based on deep learning.

Journal of Supercomputing, 80(1), 2024

Abstract

The cold start problem has always been a major challenge for recommender systems. It arises when the system lacks rating records for new users or items. Addressing the challenge of providing personalized recommendations in the cold start scenario is crucial. This research proposes a new hybrid recommender system named CoDFi-DL which combines demographic and enhanced collaborative filtering. The demographic filtering is performed through a deep neural network (DNN) and used to solve the new user cold start problem. The enhanced collaborative filtering component of our model focuses on delivering personalized recommendations through a neighborhood-based method. The major contribution in this research is the DNN-based demographic filtering which overcomes the new user cold start problem and enhances the collaborative filtering process. Our system significantly improves the relevancy of the recommendation task and thus provides personalized recommended items to cold users. To evaluate the effectiveness of our approach, we conducted experiments on real multi-label datasets, 1M and 100K MovieLens. CoDFi-DL recommender system showed higher performance in comparison with baseline methods, achieving lower RMSE rates of 0.5710 on the 1M MovieLens dataset and 0.6127 on the 100K MovieLens dataset.
BibTeX

@inproceedings{latrech2024context, title={Context-based collaborative filtering: K-means clustering and contextual matrix factorization}, author={Latrech, Jihene and Kodia, Zahra and Azzouna, Nadia Ben}, booktitle={2024 10th International Conference on Control, Decision and Information Technologies (CoDIT)}, pages={1--5}, year={2024}, organization={IEEE} }

Jihene LATRECH, Zahra Kodia, Nadia Ben Azzouna
Context-based Collaborative Filtering: K-Means Clustering and Contextual Matrix Factorization*

In 2024 10th International Conference on Control, Decision and Information Technologies (CoDIT) (pp. 1-5). IEEE., 2024

Abstract

The rapid expansion of contextual information from smartphones and Internet of Things (IoT) devices paved the way for Context-Aware Recommendation Systems (CARS). This abundance of contextual data heralds a transformative era for traditional recommendation systems. In alignment with this trend, we propose a novel model which provides personalized recommendations based on context. Our approach uses K-means algorithm to cluster users based on contextual features. Then, the model performs collaborative filtering based on matrix factorization with enhanced contextual biases to provide relevant recommendations. We demonstrated the performance of our method through experiments conducted on the movie recommender dataset LDOS-CoMoDa. The experimental results showed the effective performance of our proposal compared to reference methods, achieving an RMSE of 0.7416 and an MAE of 0.6183.

« Précédent 1 2 3 4 5 6 … 45 Suivant »

Explaining MOOC Dropout Prediction Using ML and DL Models: An Empirical Study on the KDDCup 2015 Dataset

Abstract

Biological Knowledge-Driven Evolutionary Algorithm for Biclustering Gene Expression Data

Abstract

Personalized E-Learning Knowledge Graph-based Recommender System using Ensemble Attention Networks

Abstract

Enabling Configurable Workflows in Smart Environments with Knowledge-based Process Fragment Reuse

Abstract

Cross-network service recommendation in smart cities

Abstract

BPaaS placement over optimum cloud availability zones

Abstract

A New Multimodal and Spatio-Temporal Dataset for Traffic Control: Development, Analysis, and Potential Applications

Abstract

A Spatial-Temporal DLApproach for Traffic Flow Prediction Using Attention Fusion Method

Abstract

CoDFi-DL: a hybrid recommender system combining enhanced collaborative and demographic filtering based on deep learning.

Abstract

Context-based Collaborative Filtering: K-Means Clustering and Contextual Matrix Factorization*

Abstract