Salim Miloudi

Logo

Software Developer | AI & Machine Learning Researcher | Data Mining & Analytics Specialist

View My GitHub Profile

Software Developer focused on Data Mining and Machine Learning Applications

Welcome to my GitHub Pages portfolio! Here you will find detailed information about my qualifications, professional experience, education, technical skills, and publications.

Table of Contents

Summary of Qualifications

Professional Experience

Postdoctoral Researcher @ Space Techniques Center, Algerian Space Agency, Algeria (Jan 2022 - Sep 2024)

Computer Science Engineer @ EHS LES PINS, Algeria (May 2014 - Jun 2016)

Computer Science Instructor @ Institute of Management and Enterprise Development of Oran, Algeria (Jan 2014 - Mar 2014)

Part-time Teaching Assistant @ University of Science and Technology of Oran Mohamed Boudiaf, Algeria (Jan 2012 - June 2012)

Education

D.Eng. in Software Engineering (软件工程博士) | School of Computer Science (计算机科) | Wuhan University (武汉大学), China (Dec 2021)

Ph.D. in Information Systems Engineering | University of Science and Technology of Oran Mohamed Boudiaf, Algeria (Feb 2019)

Technical Skills

Operating Systems

Windows Linux

Programming Languages

Java Python C++

Data Science & Machine Learning

TensorFlow Keras scikit-learn pandas NumPy

Web Development

HTML5 CSS3 JavaScript Flask

Database Management Systems

Oracle MySQL PostgreSQL

Version Control

Git GitHub

Geospatial Analysis

Leafmap Leaflet Folium

Big Data

ETL PySpark

Data Visualization

Matplotlib Seaborn

Dimensionality Reduction

PCA UMAP t-SNE

Journal Articles and Conferences

A Gradient-Based Clustering for Multi-Database Mining

Publication

Multinational corporations have multiple databases distributed throughout their branches, which store millions of transactions per day. For business applications, identifying disjoint clusters of similar and relevant databases contributes to learning the common buying patterns among customers and also increases the profits by targeting potential clients in the future. This process is called clustering, which is an important unsupervised technique for big data mining. In this article, we present an effective approach to search for the optimal clustering of multiple transaction databases in a weighted undirected similarity graph. To assess the clustering quality, we use dual gradient descent to minimize a constrained quasi-convex loss function whose parameters will determine the edges needed to form the optimal database clusters in the graph. Therefore, finding the global minimum is guaranteed in a finite and short time compared with the existing non-convex objectives where all possible candidate clusterings are generated to find the ideal clustering. Moreover, our algorithm does not require specifying the number of clusters a priori and uses a disjoint-set forest data structure to maintain and keep track of the clusters as they are updated. Through a series of experiments on public data samples and precomputed similarity matrices, we show that our algorithm is more accurate and faster in practice than the existing clustering algorithms for multi-database mining.

Gradient-Based Clustering

Training and Certifications

MOOC in Spatial Data Science: The New Frontier in Analytics (Nov 2022), ESRI, Online

Deep Learning Specialization (Jul 2018), Coursera, Online

PyImageSearch Gurus Graduate (Jun 2018), PyImageSearch, Online

Deep Learning Nanodegree (May 2018), Udacity, Online