Test failure analysis with LLM in CI pipeline

On my current project, I'm the sole test engineer in a team with several developers. We merge pull requests only when regression tests are implemented, so I sometimes find myself under pressure to handle multiple tasks at once - especially when there are failed tests. Without my input, it can be difficult to understand what a test is doing and what exactly is causing a failure: is it a bug or does a test need to be updated?

After reading many posts about AI in testing on softwaretestingweekly.com, I decided to give it a try and integrate Claude into our pipelines. The idea was to feed it test reports, grant it read-only access to the repository and the changes in a PR, and ask it to analyze everything and leave a comment with its findings.
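
As a rough illustration of the first half of that idea, here is a minimal Python sketch. It assumes the test report is a JUnit-style XML file, which is my assumption for the example rather than a detail of the actual setup. The function names `collect_failures` and `build_prompt` and the prompt wording are hypothetical. Sending the prompt to Claude and posting the PR comment are deliberately left out.

```python
import xml.etree.ElementTree as ET


def collect_failures(junit_xml: str) -> list[dict]:
    """Return one record per failed or errored test case in a JUnit-style report."""
    root = ET.fromstring(junit_xml)
    failures = []
    for case in root.iter("testcase"):
        for kind in ("failure", "error"):
            node = case.find(kind)
            if node is not None:
                failures.append({
                    "test": f'{case.get("classname", "")}.{case.get("name", "")}',
                    "message": node.get("message", ""),
                    "details": (node.text or "").strip(),
                })
                break  # one record per test case is enough
    return failures


def build_prompt(failures: list[dict], pr_diff: str) -> str:
    """Combine the failed tests and the PR diff into a single analysis request."""
    lines = [
        "You are reviewing failed regression tests for a pull request.",
        "For each failure, say whether it looks like a product bug "
        "or an outdated test, and explain why.",
        "",
        "Failed tests:",
    ]
    for failure in failures:
        lines.append(f"- {failure['test']}: {failure['message']}")
        if failure["details"]:
            lines.append(f"  {failure['details']}")
    lines += ["", "Pull request diff:", pr_diff]
    return "\n".join(lines)
```

The resulting string would then go to the model together with whatever read-only repository context the pipeline provides.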


One month later, here are my thoughts.

19.01.2026

AI Automation CI/CD Testing

My new mug

I got myself a new mug!

I saw a meme on Reddit with a mug like this and found it hilarious, because:

We use Claude Code at work.
It really does say "You're absolutely right!" almost every time.

So, I ordered one from kingitare.ee. They have an online editor where you can upload an image (including an SVG!), choose a font, and preview what the product will look like - super convenient. The mug arrived three days after I placed the order.

I'm satisfied with the overall quality, though I haven't tested it in a dishwasher yet. I had a bad experience with an expensive mug from the official Arsenal store - it lost its print after a few cycles.

Happy vibing everyone!

31.10.2025

AI Humor Personal

LLM prompt techniques from Financial Times

I stumbled upon a talk by Katie Koschland, a principal engineer at the Financial Times. She discussed how its authors use the large language models her team is developing, described the challenges they face, and shared some advice on prompt engineering.

15.02.2025

AI Review

LLM chatbot as a tool to simplify foreign language texts

I believe one of the best ways to learn a language is to use it every day. However, it can be hard to incorporate a new language into your life at the beginning. You simply don't know it well enough to chat online or read the news. This leaves you with rather boring learning materials. I've tried to solve this problem with LLM chatbots, and so far, the results are rather encouraging!

19.08.2024

AI Language Learning Learning

DuckDuckGo AI Chat

DuckDuckGo, a privacy-friendly alternative to Google and other search providers, has recently launched a new product: AI Chat. It offers anonymous access to popular AI models, including GPT-3.5, Claude 3, and the open-source Llama 3 and Mixtral. While I cannot reliably assess its privacy claims, the service lets you work with these models without registration, which is a good starting point.

I decided to compare them to ChatGPT 4o. There are many ways to do this, but I didn't aim to make a professional and thorough comparison. As a user of these tools, I wanted to see how they could handle my daily requests. Since I am learning German, sometimes I need to clarify certain words, phrases, or how to apply different cases in various situations.

The prompt was inspired by my mistake on Duolingo. To put it simply, I thought that the German "in" was equivalent to the English "to". However, it turned out that "in" can change its meaning depending on the case.

Let's see how various LLMs explained the difference.

24.06.2024

AI Language Learning Review

"Power and Progress": Important book about technologies at our doorstep

While discussions on the topic of AI have been ongoing for years, they weren't as prominent outside the circle of enthusiasts. That changed one year ago when OpenAI introduced ChatGPT in November 2022. I want to recommend a book that is relevant to the subject.

18.11.2023

AI Book Review