Reinforcement Learning
MPC vs RL on the Cart-Pole: An Implementation Study
2026-06-05
Hello, I'm
Engineer · Researcher · Germany
I'm interested in mechatronics, AI, robotics, reinforcement learning, and model predictive control.
Writing
Reinforcement Learning
2026-06-05
2025-12-13
Reinforcement Learning
2024-01-06
2023-04-12
Work
Led a team of 30+ engineers to build and deploy an LLM + RAG pipeline processing unstructured mental health data in 9 weeks. Implemented RLHF feedback loops, structured output schemas, and evaluation pipelines to close test/production gaps.
Developing PPO and DQN algorithms for real-time multi-access traffic steering in 5G/Wi-Fi networks, framed as sequential decision-making under uncertainty. Building real-time telemetry pipelines on Linux servers.
Multi-sensor data fusion pipeline for real-time GPS localisation using Kalman filtering, merging inconsistent signals from multiple sources into a single reliable state estimate.