Today I learned about SmolLM2-135M-Instruct, a tiny LLM which quantizes down to just below 100MB... which means it can fit in a PyPI package!
Here's the first LLM plugin that includes a full model as part of the package:
llm install llm-smollm2
https://simonwillison.net/2025/Feb/7/pip-install-llm-smollm2/