LLaMA

Creator

Created

2023 Feb 25 15:56

Editor

Edited

2025 Jan 25 20:4

Refs

facebookresearch • Updated 2023 Mar 5 12:37

Will LLaMA become the Linux of AI?

I thought it would be a few more years before I could run a GPT-3 class model on hardware that I owned. I was wrong: that future is here already.

LLaMA Notion

LLaMA Implementation

LLaMA Descendants

Stanford Alpaca

GOod at Arithmetic Tasks

transformers/src/transformers/models/llama/modeling_llama.py at main · huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. - huggingface/transformers

https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/modeling_llama.py

Llama from scratch (or how to implement a paper without crying)

I want to provide some tips from my experience implementing a paper. I'm going to cover implementing a dramatically scaled-down version of Llama for training...

https://blog.briankitano.com/llama-from-scratch/

It can even run on a Raspberry Pi 4 and a Pixel 6, lol.

Large language models are having their Stable Diffusion moment

The open release of the Stable Diffusion image generation model back in August 2022 was a key moment. I wrote how Stable Diffusion is a really big deal at the …

https://simonwillison.net/2023/Mar/11/llama/?utm_source=tldrnewsletter

Introducing LLaMA: A foundational, 65-billion-parameter language model

Today, we’re releasing our LLaMA (Large Language Model Meta AI) foundational model with a gated release. LLaMA is more efficient and competitive with previously published models of a similar size on existing benchmarks.

https://ai.facebook.com/blog/large-language-model-llama-meta-ai/

Meta unveils a new large language model that can run on a single GPU [Updated]

LLaMA-13B reportedly outperforms ChatGPT-like tech despite being 10x smaller.

https://arstechnica.com/information-technology/2023/02/chatgpt-on-your-pc-meta-unveils-new-ai-model-that-can-run-on-a-single-gpu
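
A rough way to see why the smaller LLaMA models fit on a single GPU (or a phone) is to estimate the memory needed for the weights alone. The sketch below is only back-of-the-envelope arithmetic: the parameter counts are the nominal model sizes, the 16-bit and 4-bit storage formats are assumptions, and the helper name `weight_memory_gib` is made up for this note. It ignores activations, the KV cache, and runtime overhead.

```python
# Back-of-the-envelope weight memory for a dense language model.
# Parameter counts are nominal LLaMA sizes; the bits-per-parameter values
# are assumed storage formats, not measured figures.

GIB = 1024 ** 3  # bytes per GiB


def weight_memory_gib(n_params: float, bits_per_param: int) -> float:
    """Memory needed to hold the weights alone, in GiB."""
    return n_params * bits_per_param / 8 / GIB


# Hypothetical lookup table used only for this illustration.
LLAMA_SIZES = {"7B": 7e9, "13B": 13e9, "33B": 33e9, "65B": 65e9}

for name, n_params in LLAMA_SIZES.items():
    fp16 = weight_memory_gib(n_params, 16)
    int4 = weight_memory_gib(n_params, 4)
    print(f"LLaMA-{name}: fp16 ~ {fp16:.1f} GiB, 4-bit ~ {int4:.1f} GiB")
```

Under these assumptions LLaMA-13B needs roughly 24 GiB for fp16 weights and roughly 6 GiB at 4 bits per weight, which is why quantized builds are the ones that end up on small devices.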

Backlinks

Test-time RL Neuron SAE Implementation PowerInfer Tinygrad RMS Normalization
