Master Projects

Automated knowledge base

Supervisors: Heerko Groefsema, Dilek Düştegör, Alexander Lazovik
Date: 2026-04-09
Type: bachelor-internship/bachelor-project/master-project/master-internship
Description:

Job hopping is common in the current job market. As employees come and go, a business must train each new employee to follow its processes and use its systems. Onboarding programmes and knowledge bases are the usual tools for this, but maintaining and updating a knowledge base takes a large amount of work. In this project, you will investigate how the maintenance and updating of knowledge bases can be automated, taking our research group as an example. The result should be an automated knowledge base that methodically describes everything a student should know when they join our group or do a project with us.

Orchestration Framework for hybrid computing

Supervisors: Alexander Lazovik
Date: 2026-01-28
Type: master-project/master-internship
Description:

(Requirement: Availability for six months for a full-time internship at TNO; interest in cloud computing, HPC, or AI infrastructure)
In this internship at TNO, the student will contribute to the design of a model and the implementation of a tool that automatically selects the most suitable digital infrastructure for a given application. The tool will support intelligent decision-making across heterogeneous computing environments such as HPC, quantum, and emerging accelerators (e.g. neuromorphic). The internship focuses on translating application requirements (e.g. performance, cost, energy, data sensitivity) into infrastructure choices using rule-based logic, optimization methods, or AI-driven approaches. The work will be carried out in close collaboration with researchers working on digital infrastructures and AI orchestration.
The challenge of this internship is to design and implement a research-oriented prototype that automatically selects the most suitable digital infrastructure (e.g. cloud, HPC, edge, or accelerators) based on application requirements such as performance, cost, energy efficiency, and data constraints. The student will investigate how these requirements can be formalized and mapped to infrastructure capabilities using rule-based, optimization, or AI-driven methods. In this role, the student will combine analytical research with hands-on implementation, validate the approach using realistic use cases, and document the findings in a structured, research-quality manner, contributing to ongoing work on intelligent orchestration of heterogeneous computing infrastructures.
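
As a rough illustration of the rule-based option mentioned above, the sketch below filters candidate infrastructures by hard constraints and then picks the cheapest feasible one. All class names, fields, and numbers are hypothetical and chosen only for illustration; an actual prototype would need a richer requirement model and could replace the final step with an optimization or learned policy.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Requirements:
    """Hypothetical application requirements (illustrative fields only)."""
    min_tflops: float
    max_cost_per_hour: float
    max_power_kw: float
    data_must_stay_on_site: bool


@dataclass(frozen=True)
class Infrastructure:
    """Hypothetical infrastructure capability record."""
    name: str
    tflops: float
    cost_per_hour: float
    power_kw: float
    on_site: bool


def select_infrastructure(req, candidates):
    """Apply hard-constraint rules, then return the cheapest feasible candidate."""
    feasible = [
        c for c in candidates
        if c.tflops >= req.min_tflops
        and c.cost_per_hour <= req.max_cost_per_hour
        and c.power_kw <= req.max_power_kw
        and (c.on_site or not req.data_must_stay_on_site)
    ]
    if not feasible:
        return None  # no candidate satisfies every rule
    # Tie-break on power draw when costs are equal.
    return min(feasible, key=lambda c: (c.cost_per_hour, c.power_kw))


# Made-up capability figures, for demonstration only.
CANDIDATES = [
    Infrastructure("cloud", tflops=50.0, cost_per_hour=12.0, power_kw=8.0, on_site=False),
    Infrastructure("hpc", tflops=200.0, cost_per_hour=30.0, power_kw=40.0, on_site=True),
    Infrastructure("edge", tflops=2.0, cost_per_hour=1.0, power_kw=0.3, on_site=True),
]
```

Separating the feasibility rules from the ranking step keeps the two concerns independent, so the ranking could later be swapped for a multi-objective method without touching the constraints.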

Vertical Federated Learning Framework

Supervisors: Revin Alief, Dilek Düştegör, Alexander Lazovik
Date: 2026-01-28
Type: master-project/master-internship
Description:

(Requirement: Availability for six months for a full-time internship at TNO)
Big Data and Data Science (AI & ML) are increasingly popular topics because of the advantages they can bring to companies. Data analysis often runs as a long-running process or even as an always-online streaming process, and it is almost always subject to limitations imposed by users, the business perspective, the hardware, and the platforms on which the analysis runs. At TNO we are developing solutions for a vertical federated learning framework that separates concerns between local models, which analyse local data, and a central model, which learns from many local models and updates them when necessary. We have already applied horizontal federated learning in multiple domains, such as energy and industry. Vertical Federated Learning (VFL) enables multiple parties to collaboratively train a machine learning model over vertically partitioned datasets without leaking private data.
Internship Role and Responsibilities:
- Investigate and experiment with vertical federated learning approaches and apply them to the energy or industry domain, developing a scalable federated learning platform based on state-of-the-art methods.
- Evaluate real-world scenarios and benchmarking data.
- Survey the state of the art in heterogeneous edge computing and in federated learning frameworks and scenarios.
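
To make the vertical setting concrete, here is a minimal sketch in which two parties hold different feature columns for the same samples and jointly fit a logistic regression. Only partial logits and residuals cross party boundaries. The `Party` class and `train_vfl` function are hypothetical names, and the sketch assumes a trusted coordinator that holds the labels; it omits the encryption or secure aggregation that a real VFL system would need, since residuals exchanged in the clear can still leak information.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class Party:
    """Holds a private vertical slice of the feature matrix and its own weights."""

    def __init__(self, features):
        self.features = features                 # rows: samples, columns: local features
        self.weights = [0.0] * len(features[0])

    def partial_logits(self):
        # Contribution of this party's features to each sample's logit.
        return [sum(w * x for w, x in zip(self.weights, row)) for row in self.features]

    def update(self, residuals, lr):
        # Gradient of the mean log-loss with respect to the local weights only.
        n = len(self.features)
        for j in range(len(self.weights)):
            grad = sum(r * row[j] for r, row in zip(residuals, self.features)) / n
            self.weights[j] -= lr * grad


def train_vfl(parties, labels, rounds=200, lr=0.5):
    """Coordinator loop (assumed trusted and label-holding in this sketch)."""
    eps = 1e-12
    losses = []
    for _ in range(rounds):
        partials = [p.partial_logits() for p in parties]
        probs = [sigmoid(sum(parts)) for parts in zip(*partials)]
        loss = -sum(
            y * math.log(max(p, eps)) + (1 - y) * math.log(max(1 - p, eps))
            for p, y in zip(probs, labels)
        ) / len(labels)
        losses.append(loss)
        residuals = [p - y for p, y in zip(probs, labels)]
        for party in parties:
            party.update(residuals, lr)
    return losses
```

Raw feature values never leave a party in this scheme, which is the separation of concerns the description refers to; the privacy guarantees themselves would have to come from additional cryptographic or differential-privacy mechanisms.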

Building a Simulation Pipeline for Anomalous Time-Series Data in Water Networks

Supervisors: Samer Ahmed, Dilek Düştegör
Date: 2026-01-09
Type: bachelor-project/master-project/bachelor-internship/master-internship
Description:

Reliable anomaly detection in water distribution networks requires diverse and well-structured data covering both normal and faulty operating conditions. In this project, the student will design and implement an automated simulation and data-generation pipeline that produces time-series data (e.g. pressure, flow, demand) for water networks under a variety of scenarios, including leaks and sensor malfunctions. The work focuses on wrapping and orchestrating existing simulation tools (such as EPANET/WNTR), systematically varying configurations and fault parameters, and organising outputs into reproducible, machine-learning-ready datasets. The project is well suited for Bachelor or Master students with a background in computer science, data science, or AI, who are comfortable programming in Python and willing to independently learn domain-specific tools through documentation and examples. Prior knowledge of water networks is not required. References:
- DiTEC-WDN Dataset
- DiTEC-WDN: A Large-Scale Dataset of Hydraulic Scenarios across Multiple Water Distribution Networks
- LeakG3PD: A Python Generator and Simulated Water Distribution System Dataset
- EPANET
- WNTR
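
One way to vary configurations and fault parameters systematically is to enumerate a scenario grid before any simulation runs. The sketch below only builds reproducible scenario descriptions; it does not call EPANET or WNTR, and the function name, field names, and parameter values are hypothetical. Each resulting dictionary would be passed to whatever simulator wrapper the project implements.

```python
import hashlib
import itertools
import json


def build_scenarios(networks, leak_areas_m2, sensor_faults, base_seed=0):
    """Enumerate every combination of network, leak size, and sensor fault."""
    scenarios = []
    for network, leak_area, fault in itertools.product(networks, leak_areas_m2, sensor_faults):
        config = {
            "network": network,
            "leak_area_m2": leak_area,
            "sensor_fault": fault,
        }
        # Derive a stable ID from the config so reruns produce identical names.
        digest = hashlib.sha256(json.dumps(config, sort_keys=True).encode("utf-8")).hexdigest()
        scenarios.append({
            **config,
            "scenario_id": digest[:12],
            "seed": base_seed + (int(digest[:8], 16) % 10_000),
            # Ground-truth label for downstream anomaly-detection training.
            "is_anomalous": leak_area > 0 or fault != "none",
        })
    return scenarios
```

Deriving the ID and seed from the configuration itself, rather than from a running counter, means that adding new parameter values later does not rename or reseed the scenarios that already exist.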

Evaluating the Performance of vLLM and DeepSpeed for Serving LLM Inference Queries

Supervisors: Mahmoud Alasmar, Alexander Lazovik
Date: 2026-01-09
Type: master-project
Description:

The computational complexity of serving large language model (LLM) queries depends heavily on model size, sequence length, and memory access patterns. To address these challenges, several LLM inference serving frameworks have been proposed, each employing different optimization techniques to improve throughput and reduce memory overhead. vLLM and DeepSpeed are two prominent examples that take distinct approaches to efficient inference serving. vLLM proposes PagedAttention for efficient key–value (KV) cache management, whereas DeepSpeed integrates multiple optimization techniques, such as parallelism and kernel-level optimizations, for scalable inference. This project aims to systematically evaluate the end-to-end inference performance (latency, throughput, and memory footprint) of vLLM and DeepSpeed under different inference workloads. Experiments will be performed using a publicly available dataset, such as ShareGPT. The results will highlight the trade-offs between KV cache management, kernel-level optimizations, and parallelism strategies in LLM inference serving, providing insight into the conditions under which each framework is most effective. References:
- Efficient Memory Management for Large Language Model Serving with PagedAttention
- DeepSpeed Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale
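
For the latency and throughput part of the evaluation, a framework-independent summary could be computed from per-request timestamps, as sketched below. The `RequestRecord` fields and the metric names are assumptions made for illustration and are not tied to the vLLM or DeepSpeed APIs; memory footprint would have to be measured separately, for example through GPU monitoring tools.

```python
import math
from dataclasses import dataclass
from statistics import mean


@dataclass
class RequestRecord:
    """One served request (hypothetical log format)."""
    start_s: float       # time the request was sent
    end_s: float         # time the last token was received
    output_tokens: int   # number of generated tokens


def percentile(values, q):
    """Nearest-rank percentile for q in (0, 100]."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q / 100 * len(ordered)))
    return ordered[rank - 1]


def summarize(records):
    """Aggregate end-to-end latency and throughput over one benchmark run."""
    latencies = [r.end_s - r.start_s for r in records]
    # Wall-clock span of the whole run, from first send to last completion.
    wall_s = max(r.end_s for r in records) - min(r.start_s for r in records)
    return {
        "mean_latency_s": mean(latencies),
        "p50_latency_s": percentile(latencies, 50),
        "p99_latency_s": percentile(latencies, 99),
        "requests_per_s": len(records) / wall_s,
        "tokens_per_s": sum(r.output_tokens for r in records) / wall_s,
    }
```

Computing the same summary from identical log records for both frameworks keeps the comparison independent of each framework's built-in benchmarking scripts.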

Estimating Time and Resource Usage of SLURM Jobs Using RLM

Supervisors: Mahmoud Alasmar, Alexander Lazovik
Date: 2026-01-09
Type: master-project/master-internship
Description:

Efficient allocation of computational resources in high-performance computing (HPC) clusters requires accurate prediction of job runtime and resource requirements. Users often over-request CPU, memory, or time to avoid failures, which can lead to wasted resources and longer queue times. Therefore, predicting these requirements before job submission is critical for improving cluster utilization and scheduling efficiency. This project investigates how Regression Language Models (RLMs) can be used to estimate the time and resource usage of SLURM jobs based on submitted Bash scripts and job metadata. The study will use real job submission data from the Habrok HPC cluster. References:
Regression Language Models for Code
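
A regression language model can read the raw script text directly, but the requested resources are still useful as baselines and for quantifying over-requesting. The sketch below pulls a few long-form `#SBATCH` directives out of a Bash script and converts the time limit to seconds. The function names and the selected fields are hypothetical; short options such as `-t` and directives without a value are ignored in this sketch.

```python
import re

# Matches long-form directives such as "#SBATCH --time=01:00:00" or "#SBATCH --mem 8G".
SBATCH_RE = re.compile(r"^#SBATCH[ \t]+--([\w-]+)(?:=|[ \t]+)(\S+)", re.MULTILINE)


def parse_time_limit(value):
    """Convert a Slurm time string to seconds.

    Accepted forms: minutes, minutes:seconds, hours:minutes:seconds,
    days-hours, days-hours:minutes, days-hours:minutes:seconds.
    """
    days = 0
    if "-" in value:
        day_part, value = value.split("-", 1)
        days = int(day_part)
        parts = [int(p) for p in value.split(":")]
        parts += [0] * (3 - len(parts))          # pad missing minutes/seconds
        hours, minutes, seconds = parts
    else:
        parts = [int(p) for p in value.split(":")]
        if len(parts) == 1:                      # minutes
            hours, minutes, seconds = 0, parts[0], 0
        elif len(parts) == 2:                    # minutes:seconds
            hours, minutes, seconds = 0, parts[0], parts[1]
        else:                                    # hours:minutes:seconds
            hours, minutes, seconds = parts
    return ((days * 24 + hours) * 60 + minutes) * 60 + seconds


def extract_features(script):
    """Collect requested resources from a job script (illustrative subset)."""
    directives = dict(SBATCH_RE.findall(script))
    time_value = directives.get("time")
    return {
        "requested_seconds": parse_time_limit(time_value) if time_value else None,
        "cpus_per_task": int(directives.get("cpus-per-task", 1)),
        "mem": directives.get("mem"),
    }
```

Pairing these requested values with the measured usage of completed jobs gives the regression targets and a simple baseline against which a learned model can be compared.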

An Online, Continuous, Self-Adaptive Pipeline for Water Distribution Network State Estimation

Supervisors: Huy Truong, Dilek Düştegör
Date: 2026-01-09
Type: bachelor-project/master-project/master-internship
Description:

Deploying machine-learning pipelines in real-world systems is challenging due to data distribution drift and inherent instability. Conventional machine-learning models typically rely on fixed weights optimized for a specific training distribution, which leads to performance degradation when exposed to unseen and noisy data in practice. To address this limitation, this group project develops a framework that supports online learning and introduces an evaluation benchmark that more closely reflects real-world operating conditions. Specifically, the project consists of two main components:
1. A pipeline built around a pretrained Graph Neural Network (GNN) to estimate unknown hydraulic measurements from a limited set of sensors deployed across a water distribution network. This component focuses on implementing a Test-Time Training strategy that adapts model weights using only incoming test inputs.
2. A benchmarking platform that simulates real-world steady-state snapshots, incorporating hydraulic measurements such as pressure, demand, and network topology across multiple water distribution systems. The benchmark is designed to evaluate the robustness and adaptability of machine-learning pipelines under what-if analyses and out-of-distribution conditions.
References:
Test-Time Training with Self-Supervision for Generalization under Distribution Shifts.

Multivariate State Estimation in Drinking Water Distribution Networks

Supervisors: Huy Truong, Andrés Tello, Alexander Lazovik
Date: 2026-01-09
Type: bachelor-project/master-project/master-internship
Description:

Monitoring water distribution networks plays a central role in ensuring safe drinking water delivery to millions of residents in urban areas. Traditionally, this task relies on physics-based mathematical simulations; however, such models require a large number of parameters and frequent recalibration to remain consistent with sensor measurements. As an alternative, recent studies have proposed data-driven approaches based on Graph Neural Networks (GNNs), which use pressure measurements from a limited set of sensors at known locations to infer pressure values at unmonitored nodes in the network. Building on this idea, the project extends the existing univariate method to a multivariate framework that jointly estimates multiple hydraulic quantities, including pressure, demand, flow rate, head loss, and others. The candidate is expected to have a solid foundation in machine learning and proficiency in a deep-learning framework (PyTorch, TensorFlow). Reference:
Graph Neural Networks for Pressure Estimation in Water Distribution Systems.

Graph Reasoning Models

Supervisors: Huy Truong, Dilek Düştegör
Date: 2026-01-09
Type: master-project/master-internship
Description:

Graph Neural Networks (GNNs) have emerged as a promising approach to processing graph-structured systems. GNNs use a message-passing mechanism to update node features from neighborhood information. However, this mechanism often comes with several issues, notably over-smoothing, a phenomenon in which a GNN encodes similar representations for all nodes in the graph. Over-smoothing hinders scalability and constrains these models to shallow depths. This work explores a recursive approach that virtually extends the number of layers while measuring the impact of over-smoothing in this setting. The new approach is validated in the water domain. Students interested in joining this project should have a foundation in machine learning and be familiar with a deep-learning framework (PyTorch, TensorFlow). References:
- Less is More: Recursive Reasoning with Tiny Networks.
- Assessing the performances and transferability of graph neural network metamodels for water distribution systems.
- Hierarchical Reasoning Model.
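
The over-smoothing effect described in this project can be observed even without learned weights. The sketch below applies the same mean-aggregation step recursively on a small path graph and tracks how far apart the node features remain. It is a simplified stand-in: real GNN layers include learned transformations and nonlinearities, and `feature_spread` (maximum minus minimum) is an illustrative metric chosen here, not one prescribed by the project.

```python
def mean_aggregate(features, neighbors):
    """One parameter-free message-passing step: average a node with its neighbors."""
    return [
        (features[i] + sum(features[j] for j in neighbors[i])) / (1 + len(neighbors[i]))
        for i in range(len(features))
    ]


def feature_spread(features):
    """Gap between the largest and smallest node feature (0 means fully smoothed)."""
    return max(features) - min(features)


def recursive_propagation(features, neighbors, steps):
    """Reuse the same aggregation step repeatedly and record the spread after each step."""
    spreads = [feature_spread(features)]
    for _ in range(steps):
        features = mean_aggregate(features, neighbors)
        spreads.append(feature_spread(features))
    return features, spreads


# Path graph with five nodes: 0 - 1 - 2 - 3 - 4 (illustrative topology).
PATH_GRAPH = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
```

Because every step replaces a feature with an average of existing features, the spread can never grow, and on a connected graph it shrinks toward zero as the recursion deepens. Measuring how learned weights, residual connections, or other design choices slow this collapse is one way to quantify the impact of over-smoothing in a recursive setting.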

Evaluating LoRA for GNN-based model adaptation

Supervisors: Andrés Tello, Alexander Lazovik
Date: 2026-01-05
Type: bachelor-project/master-project
Description:

Foundation models have become a game-changer in several fields due to their strong generalization capabilities after some form of model adaptation, with fine-tuning being the most common approach. In this project, we aim to evaluate the effectiveness of Low-Rank Adaptation (LoRA) methods in terms of model performance, model size, and memory usage. While conventional full fine-tuning often yields high accuracy, LoRA can represent a more sustainable yet still effective alternative for model adaptation.

In this project, the student will implement a LoRA-based approach to adapt a pre-trained GNN-based model to new, unseen target datasets in the context of Water Distribution Networks (WDNs). The pre-trained model has been trained on several WDNs for pressure reconstruction, and the goal is to adapt it to make predictions on unseen WDN topologies with different operating conditions. The LoRA-based adaptation will be compared against a conventional full fine-tuning approach.

References:
- LoRA: Low-Rank Adaptation of Large Language Models.
- Graph low-rank adapters of high regularity for graph neural networks and graph transformers.
- ELoRA: Low-Rank Adaptation for Equivariant GNNs.
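
To illustrate why LoRA reduces model size and memory usage during adaptation, the sketch below wraps a frozen weight matrix with a trainable low-rank update, following the formulation in the LoRA paper (output = W x + (alpha / r) B A x, with B initialized to zero). The `LoRALinear` class is a hypothetical pure-Python illustration on a single dense layer; applying the idea to the layers of a GNN is the subject of the project itself.

```python
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


class LoRALinear:
    """Frozen dense layer plus a trainable low-rank update (illustrative only)."""

    def __init__(self, weight, rank, alpha=1.0, seed=0):
        rng = random.Random(seed)
        d_out, d_in = len(weight), len(weight[0])
        self.weight = weight                     # pre-trained weights, kept frozen
        self.scale = alpha / rank
        # A starts with small random values, B with zeros, so the update is zero at first.
        self.A = [[rng.gauss(0.0, 0.01) for _ in range(d_in)] for _ in range(rank)]
        self.B = [[0.0] * rank for _ in range(d_out)]

    def forward(self, x):
        base = matvec(self.weight, x)
        update = matvec(self.B, matvec(self.A, x))
        return [b + self.scale * u for b, u in zip(base, update)]

    def trainable_parameters(self):
        """Number of parameters in A and B; the frozen weight is not counted."""
        return len(self.A) * len(self.A[0]) + len(self.B) * len(self.B[0])
```

For a 64 x 64 layer with rank 4, the adapter trains 512 parameters instead of the 4096 that full fine-tuning would update, which is the kind of trade-off between performance, model size, and memory usage that the comparison in this project is meant to measure.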