AI Engineer Full Stack | Data Scientist | Researcher | Freelance
I build and integrate AI agents, RAG systems, and intelligent automation into existing data, web, and app workflows — end-to-end, from concept to code to delivery. Traditional ML, data science, and full-stack development across healthcare, finance, and enterprise.
I am Leon Doungala, an Engineer specialized in AI/ML and Data Science based in Milan, Italy. With a Master of Science in Computer Engineering (AI specialization) and a Master's in Data Science, I specialize in developing innovative engineering solutions at the intersection of artificial intelligence and practical business applications.
As an AI/ML Engineer at People Lab (March 2024-Present), I've developed AI solutions for healthcare and corporate environments. My work includes creating a RAG-based Clinical Semantic Search Engine that processes over 160,000 unstructured medical referrals, and an advanced ATS system supporting HR recruiters across multiple companies. These systems leverage LangChain, Flask, Scikit-learn, PyTorch, and OpenAI's LLM APIs to deliver actionable insights.
Previously, as a Software Developer and Data Scientist at the Italian Stock Exchange (Euronext Group) (2022-2024), I collaborated with teams developing financial software applications, optimizing SQL DataMarts, and automating AWS data ingestion workflows. I contributed to data science initiatives applying machine learning models for risk assessment and financial forecasting, working within agile teams to deliver high-impact solutions.
My expertise spans predictive and generative AI, with particular focus on Large Language Models, Retrieval-Augmented Generation, Transfer Learning, and Vector Databases. I'm committed to leveraging these technologies to create scalable, efficient, and impactful AI systems that address real-world challenges, particularly in healthcare and financial domains.
"Artificial Intelligence and Machine Learning Techniques for Diabetes Prediction and Characterization" - Developing predictive models using ensemble learning methods to identify early diabetes markers and optimize personalized treatment pathways.
Graduated with honors, completing a cross-institutional research project on data-driven business optimization strategies implementing machine learning solutions for real-world business challenges.
"Design and Development of a Modern Cloud-Based Web Application in Java Enterprise Edition for Radio Streaming with Integrated Deep Learning Neural Networks" - Created an innovative streaming platform with artificial intelligence-powered content recommendation algorithms and personalized user experience.
End-to-end AI systems — from classical ML to production RAG pipelines. Healthcare AI, semantic search, autonomous agents, and full-stack applications.
Featured work includes a RAG-powered Clinical Search Engine processing 160K+ medical records, an AI-driven ATS system for intelligent CV matching, and my M.Sc. thesis on diabetes prediction using ensemble learning and generative AI — co-authored and published in IEEE Journal of Biomedical and Health Informatics (JBHI).
My stack spans Python, PyTorch, LangChain, OpenAI/Anthropic APIs, Flask, FastAPI, React, and cloud infrastructure on AWS and Azure. Every project follows the same principle: research-grade AI, engineered for real-world scale.
What clients say about working with me
Had a great experience working with me?