High-performance inference on TPUs using MaxText
The attention module is the key ingredient of a transformer layer. In this blog post we will show how to implement it from scratch in JAX alongs...
As the models we use grow larger, it becomes increasingly necessary to train machine learning models across multiple chips. In this blog po...