CommitPack
bigcode/commitpack · Datasets at Hugging Face
https://huggingface.co/datasets/bigcode/commitpack
Open and responsible development of LLMs for code
BigCode is an open scientific collaboration working on the responsible development of large language models for code.
https://www.bigcode-project.org/

The Stack: 3 TB of permissively licensed source code
Large Language Models (LLMs) play an ever-increasing role in the field of Artificial Intelligence (AI)--not only for natural language processing but also for code understanding and generation. To...
https://arxiv.org/abs/2211.15533


Seonglae Cho