Dirk Kemper

data scientist - managing consultant - AI enthusiast

About Me

I am an AI and machine learning enthusiast with 20+ years of experience in various fields of IT, such as software development, consulting and data science. On this blog I write about professional and personal projects in the areas of machine learning, data science and AI.

Interests
  • Data Science
  • Machine Learning
  • Artificial Intelligence
Education
  • MSc in Computer Science

    Twente University

Recent posts
NVIDIA DeepStream for highly performant video stream analytics

If, like me, you have ever experimented with applying deep learning models to video streams for object detection or scene classification, you have probably run into approaches where a Python script extracts each frame from a video file in a loop and feeds the frames to a model one at a time. In this blog post I will present a much more performant approach based on NVIDIA's DeepStream toolkit.

Jan 5, 2025

Running a Self-Hosted LLM for Email Address Classification

Recently I was asked to come up with a solution for classifying a list of tens of thousands of email addresses based on their details. Although the original request was a bit different, for this blog post I will use the example of classifying each address as coming from a personal email provider (like Gmail, Yahoo, etc.) or from a business or other institution. I will demonstrate how to use a self-hosted Zephyr-7B model with the LitServe package to perform this task very efficiently.

Jul 17, 2024

Installing Nextcloud Plugin on TrueNAS CORE With Letsencrypt Support in 2024

Nowadays it feels as if the TrueNAS CORE community is shrinking every month. I have been a user of this solution for more than 8 years now, but some aspects of the system, like plugins, have never matured to a degree where they are really usable. I have been using the Nextcloud plugin for years (after having upgraded from ownCloud), but updating the plugin, either through the designated plugin architecture or from the Nextcloud management screens, is quite often painful and tends to break the jail when there are major upgrades to key components like PHP.

Feb 27, 2024

Running a Large Language Model on Your Own Hardware

Last November ChatGPT celebrated its first anniversary, having reached a user base of 180 million active users, which makes it the fastest-adopted internet technology ever. Now that we are a year in, I'd like to explore how the open-source world has caught up with ChatGPT's success. What is the current state of affairs when comparing open-source LLMs to the closed-source, proprietary models from OpenAI and others? Is open-source technology able to match or surpass the technology OpenAI introduced in 2022?

Dec 22, 2023

A Natural Language Query Dashboard on Your SAP S4/HANA Data

Retrieval Augmented Generation (RAG) allows knowledge from external sources to be added to a Large Language Model (LLM) prompt. In this blog post I will use this approach to have an LLM retrieve data from an SAP S4/HANA system and visualize that data using Streamlit. You will learn how to build a dashboard that converts natural language questions into data visualizations using only a minimal amount of code!

Sep 21, 2023