The Stack: 3 TB of permissively licensed source code

Denis Kocetkov, Raymond Li, Loubna Ben Allal et al.

Action editor: Swarat Chaudhuri.

https://openreview.net/forum?id=pxpbTdUEpD

#bigcode #text2code #dataset

Large Language Models (LLMs) play an ever-increasing role in the field of Artificial Intelligence (AI), not only for natural language processing but also for code understanding and generation. To...