I am a data scientist with a strong academic foundation and hands-on experience across various industries. With top-tier certifications and back-end expertise, I have contributed to impactful projects that range from optimizing recommendation systems to classifying companies for strategic investments. My passion lies in uncovering data-driven insights and applying them in the most efficient way to solve real-world problems, particularly in areas like energy and health.
I also like to learn new things like developing games and websites from zero to one.
Programming
Front-End: HTML, CSS, JavaScript
Back-End: Python (NumPy, Pandas, Scikit-Learn, TensorFlow, Keras), R, PHP, C#, Java, C++
Database: Relational database structures, SQL, Dbeaver, Azure functions
Cloud Platforms: AWS, Microsoft Azure, GCP
Analysis: PowerBI, Tableau, Excel
Version Control: Git, Jenkins, Docker, CI/CD
Computer Science
Core Concepts: Database Fundamentals, Data Structures & Algorithms (DSA)
Machine Learning: Supervised & Unsupervised Learning, Reinforcement Learning
Deep Learning: Neural Networks, Natural Language Processing (NLP), Transformer Models, LLMs (e.g., Llama, GPT, Claude)
Artificial Intelligence: Sentiment Analysis, AI API's (Hugging Face, OpenAI), Automation, Fine-Tuning LLMs, RAG
Economics
Macro & Microeconomics: Economic Theory, Policy, and Market Structures
Quantitative Finance: Quantitative Methods, Volatility Evaluation
Ethics: Ethics in AI, Big Data, Finance & Economics
Investment & Portfolio Management: Investment Strategies, Risk Assessment
Mathematics
Core Concepts: Calculus (Analysis, Derivatives, Integrals), Linear Algebra, Multivariate Statistics
Probability & Statistics: Econometrics, Statistical Modeling, Time Series, Fourier Analysis
Numerical Methods: Numerical Simulations, Optimizations, Bootstrapping
Communication
Academic: Academic Reporting, Presenting, LaTeX
Interpersonal: Interpersonal Skills, Collaboration, Teamwork
Management: Project Management (Agile/Scrum, Jira), Leadership, Communication
Tools: Slack, Microsoft Teams, Microsoft Powerpoint/Word/Spreadsheets
Language
Fluent: English and Dutch
Beginner: French, German, and Spanish
October 2024 untill April 2025 in The Hague
April 2024 untill September 2024 in Amsterdam
June 2023 untill September 2023 in Amsterdam
May 2025 until April 2026
September 2020 until April 2025
GPA: 7.5/10 or 3.5/4
September 2019 until July 2020
Withdrew to follow my passion with data-driven causality and programming.
September 2012 until July 2018
AI
Mathematics for Machine Learning @ Imperial College London
Machine Learning @ University of Washington
Deep Learning @ DeepLearning.AI
Natural Language Processing @ DeepLearning.AI
Artificial Intelligence in Healthcare @ Stanford University
Ethics of AI @ Politecnico di Milano
Data Science
Azure Data Scientist Associate @ Microsoft
Applied Data Science with Python @ University of Michigan
Data Science using R @ Johns Hopkins University
SQL for Data Science @ University of California, Davis
Data Structures and Algorithms @ University of California, San Diego
Recommender Systems @ University of Minnesota
Analysis
Excel for Data Analytics and Visualization @ Macquarie University
Data Analytics @ Google
Data Analysis using PySpark @ Coursera
Data Visualization with Tableau @ Duke University
Data-Driven Decisions with Power BI @ Coursera
Finance
Investment Management with Machine Learning @ EDHEC Business School
Fintech @ University of Pennsylvania
Finance & Quantitative Modeling @ UPenn Wharton
Financial Engineering and Risk Management @ Columbia University
Financial Markets @ Yale University
Other
Project Management @ Alphabet
Aeronautical Engineering @ TU Delft
University (grade)
Professional
@ McDermott International, Ltd
RAG optimization: implemented custom evaluation metrics, tested and implemented the best chunking methods, and added metadata for improved retrieval and response quality. Integrated hybrid retrieval and reranking techniques. All of this was done in a secure environment, ensuring no data leakage.
@ Invest-NL
Evaluated and Fine-tuned LLMs: I tackled the challenge of optimizing company classification using state-of-the-art Large Language Models (LLMs). This project focused on evaluating and fine-tuning models like GPT-4o, Claude 3.5 Sonnet, Llama-3.1, and Mistral Large 2 to categorize companies in innovative sectors such as Deep Tech, Life Sciences & Health, and Agrifood. The Llama-3.1–70B-Instruct model demonstrated the highest accuracy and efficiency for the task, surpassing traditional machine learning methods. Additionally, I fine-tuned a smaller Llama-3.1–8B model using Direct Preference Optimization (DPO), achieving a boost in speed and classification accuracy. This work highlights the potential of fine-tuned models for enhancing AI-driven investment decisions. I also wrote an article about this project!
Developed Dashboards for Company Insights by Focus Area: I created comprehensive dashboards that provided teams with a clear overview of companies categorized by sector and subsector. These dashboards included detailed company descriptions and vital financial information such as growth stage, investor details, funding history, and the number of employees. This centralized system offered teams an efficient tool for tracking companies across Invest-NL’s distinct focus areas, enabling better decision-making and resource allocation.
@ Talpa eCommerce
Built a Keras Transformer-Based Recommendation System: I developed a dynamic, transformer-based recommendation engine using Keras. This system leveraged customer behavior data to provide highly personalized product recommendations. By incorporating deep learning techniques, the model continuously adapted to new data, improving the accuracy and relevance of the recommendations over time.
Improved Product Matching Using Similarity Measures: I focused on enhancing product matching through advanced similarity measures. I utilized TensorFlow with web scraping to improve image-based product similarity, applying cosine similarity metrics for targeted marketing strategies, including product-to-product and geospatial recommendations. The implementation of these similarity models led to more accurate product matches, driving improved customer engagement, higher recommendation relevance and thus higher margins.
Analyzed Customer Seasonality with Fourier Transformations: I utilized Fourier transformations to analyze large-scale customer bidding data and uncover seasonal patterns in customer behavior. By examining over two years of auction data, I identified the intervals at which customers returned to bid on similar products. This method allowed me to compute a "Trendiness Score" for each product, combining returning customer percentages, time-weighted intervals between purchases, and dominant purchase frequencies. These insights provided a data-driven approach to predict customer re-engagement and optimize product recommendations based on seasonality trends. For example, the analysis showed that, on average, customers tend to return approximately every 30 days to bid on essential items like toilet paper, making it possible to adjust the timing of product recommendations to match their purchasing cycle.
Independent
Crypto Dashboard: A real-time cryptocurrency dashboard showing price and social media sentiment analysis. It's main addition to other sites is the constant analysis of new youtube videos.
Arcade Space Shooter: A dynamic space shooter game built with vanilla JavaScript, HTML5, and CSS3, featuring progressive difficulty scaling and engaging power-up mechanics. Players navigate a spaceship through an asteroid field while managing weapon heat levels and collecting power-ups like triple shots, shields, and unlimited fire modes. The game implements a unique scoring system where both difficulty and score multipliers increase every 30 seconds, encouraging longer survival times and higher scores. Technical features include smooth animations, collision detection, local storage for high scores, and responsive controls - all built without external libraries. The project demonstrates proficiency in game state management, object-oriented programming, and browser performance optimization while delivering an engaging user experience..
Open-World Automation Survival Game: I am currently also developing a bigger game from scratch using C# and Unity, set in a post-civilization world. The game blends elements from Factorio, Minecraft, and Valheim, featuring survival, combat and automation mechanics. My goal is to create an immersive experience where players build, manage resources, and interact with dynamic in-game economies, all while navigating the challenges of a world rebuilding from collapse.
I built this website from the ground up using PHP, JavaScript, HTML, and CSS. The site is fully custom-coded, featuring dynamic functionality and a clean, navigable, and responsive design.
Harvard ML Capstone: I developed a movie recommendation system using the MovieLens 10M dataset, focusing on predicting user ratings for movies. After performing data preparation and analysis, I built three models: a naive model predicting the same rating for all movies (RMSE: 1.06), a movie effect model incorporating movie-specific averages (RMSE: 0.94), and a movie + user effect model that achieved the best performance with an RMSE of 0.87.
I have also competed in some kaggle competitions and am active on leetcode.
More projects and code can be found on my github page.
Reach out to me on Linkedin or fill out the form below!