Engineering Data Solutions

👋 Welcome to my page!

I am Jonathan, a Data Engineer based in Munich, Germany. Here I share little snippets of things that I learn, find interesting, or worth discussing.

I will post mostly about topics that relate to the practice of data engineering. These might be centered around Microsoft Azure, Linux, and open source technology such as Airflow, DuckDB, or dlt. I also plan to publish posts around data governance, management, and strategy, as I firmly believe that data (engineering) projects are most successful if they are truly part of an organization’s broader business strategy.

At some point you’ll find some of my personal projects here too (once those are ready to share 😉) - from a fun facts CLI and a recipe management web server written in Go to a data analytics platform for my personal finances.

You can find my socials and recent posts below and check the “About” page in case you’d like to know more about me!

First impressions with Claude Code CLI

I’ve been curious about AI-powered development tools for a while now, and recently decided to give Anthropic’s Claude Code CLI a proper test drive. After a few hours of experimenting with it on my lab repository, I wanted to share some initial thoughts on what it’s good at and where it might fit into my workflow. What I tested I threw a couple of realistic tasks at Claude to see how it handles common development scenarios: ...

First steps with Harlequin

Today I used Harlequin for the first time. It is a SQL IDE for the terminal and the setup or installation was surprisingly easy. I tried out the tool as part of my “elt-on-github-actions” repository (more information on that will follow). I wanted to test the conditional logic in my marts models: If it is the production environment, output to blob storage and if it is not, then create a view in the duckdb database. ...

Exploring Project Nessie - a transactional catalogue over Iceberg tables

This week, I dove into Project Nessie - an open-source transactional data catalogue for Apache Iceberg tables. I’d heard about Nessie’s git-like semantics and was curious about its potential for better managing data versioning and auditability in my projects. Docker compose setup for Nessie Server and CLI To experiment locally, I leveraged Docker, conveniently supported by a guide provided by the Nessie team. Following their materials, I put together a straightforward Docker Compose file that neatly places both the Nessie server and CLI into the same Docker network. This setup greatly simplifies communication between the containers. ...

Azure Data Factory Data Flow - Oddity of the week

This week, I stumbled into an interesting puzzle while investigating an issue for a client. They had recently transitioned from System A to System B, and one of their critical metrics suddenly showed significant discrepancies. Business-wise, these numbers were expected to remain identical, so naturally, it called for some digging. I started by checking the basics: Source API: Was the new system feeding incorrect data? No issues there. Transformed Reporting Tables: Were calculations or transformations misconfigured? Again, everything seemed correct. ...

How I studied for the AZ-104 exam

I recently obtained the AZ-104 Microsoft Certified: Azure Administrator Associate certification. This is my second Azure certification after the Data Engineer Associate from last year. I am deliberately focusing on the Azure platform as I believe deep knowledge of one cloud provider is more beneficial than shallow knowledge across multiple providers. Also, all my clients over the last two years have used Azure. For this certification I changed up my study routine a bit and found it really helpful. It took me less time this time around and I also felt more confident in my abilities. Therefore, I wanted to take a quick note and reflect on the resources I used and methods I tried to follow. ...

February 25, 2025 3 min