LLMs
Context:
Running LLMs (large language models) locally is now possible [1] thanks to the abundance of highly parallelised compute (GPUs) at affordable prices and the advances in deep learning over the past decade.
As such, even modestly powerful consumer devices, such as my M1 MacBook Pro with 8 GB of RAM, can run a small LLM. The purpose of this post is to investigate the token speed and accuracy of a variety of LLMs on my machine.
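As a rough idea of how token speed can be measured, here is a minimal sketch using llama-cpp-python; the choice of library, the model file path, and the prompt are my assumptions for illustration, not something prescribed by this post.

```python
# Minimal sketch: measure generation speed (tokens/second) locally.
# Library choice (llama-cpp-python) and the model path are assumptions.
import time
from llama_cpp import Llama  # pip install llama-cpp-python

# Hypothetical path to a quantised GGUF model small enough for 8 GB of RAM.
llm = Llama(model_path="models/mistral-7b-instruct.Q4_K_M.gguf", n_ctx=2048)

prompt = "Explain what a large language model is in one paragraph."

start = time.perf_counter()
result = llm(prompt, max_tokens=256)
elapsed = time.perf_counter() - start

# The completion dict reports how many tokens were generated.
generated = result["usage"]["completion_tokens"]
print(f"{generated} tokens in {elapsed:.1f}s -> {generated / elapsed:.1f} tok/s")
```

Dividing generated tokens by wall-clock time gives the tokens-per-second figure compared across models later in the post.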