Will LLaMa become a Linux of AI?
I thought it would be a few more years before I could run a GPT-3 class model on hardware that I owned. I was wrong: that future is here already.
LLaMA Notion
LLaMA Descendents
transformers/src/transformers/models/llama/modeling_llama.py at main · huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. - huggingface/transformers
https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/modeling_llama.py
Llama from scratch (or how to implement a paper without crying)
I want to provide some tips from my experience implementing a paper. I'm going to cover implementing a dramatically scaled-down version of Llama for training...
https://blog.briankitano.com/llama-from-scratch/
Even can run on raspberry Pi 4 lol and Pixel 6
Large language models are having their Stable Diffusion moment
The open release of the Stable Diffusion image generation model back in August 2022 was a key moment. I wrote how Stable Diffusion is a really big deal at the …
https://simonwillison.net/2023/Mar/11/llama/?utm_source=tldrnewsletter

Introducing LLaMA: A foundational, 65-billion-parameter language model
Today, we’re releasing our LLaMA (Large Language Model Meta AI) foundational model with a gated release. LLaMA is more efficient and competitive with previously published models of a similar size on existing benchmarks.
https://ai.facebook.com/blog/large-language-model-llama-meta-ai/

Meta unveils a new large language model that can run on a single GPU [Updated]
LLaMA-13B reportedly outperforms ChatGPT-like tech despite being 10x smaller.
https://arstechnica.com/information-technology/2023/02/chatgpt-on-your-pc-meta-unveils-new-ai-model-that-can-run-on-a-single-gpu
![Meta unveils a new large language model that can run on a single GPU [Updated]](https://www.notion.so/image/https%3A%2F%2Fcdn.arstechnica.net%2Fwp-content%2Fuploads%2F2023%2F02%2Fmeta_llm_hero_1-760x380.jpg?table=block&id=29b8737d-932a-4028-a566-437c55b73ee7&cache=v2)

Seonglae Cho
![Meta unveils a new large language model that can run on a single GPU [Updated]](https://www.notion.so/image/https%3A%2F%2Fcdn.arstechnica.net%2Fwp-content%2Fthemes%2Fars%2Fassets%2Fimg%2Fmaterial-ars-db41652381.png?table=block&id=29b8737d-932a-4028-a566-437c55b73ee7&cache=v2)