Hey 👋
I am a 24-year-old AI Engineer working at Aleph Alpha, Berlin 🇩🇪. I recently completed my Masters in Data Science from the University of Bath 🇬🇧, where I worked on generating scaling laws for small language models as my thesis. I am also a Kaggle Notebooks Grandmaster and an avid contributor to major open source ML projects!
For my dissertation (Jun '24–Sept '24'), I generated scaling laws for large language models and also searching for phase transitions in LLMs to look for mathematical reasoning and other emergent abilities. Under Dr. Nello Cristianini's supervision, my daily tasks consist of training LLMs with various model parameters, hyper-parameter settings, and token sizes, then recording my findings from these experiments.
Below is a brief summary of the places I have worked (including internships and short-term employment).
And below is a tabular summary of some of my Open Source contributions