llama.cpp

A lightweight C++ implementation for running large language models locally, with optimizations such as quantization, on a wide range of hardware.