So that's me received the confirmation that my stuff is removed from Bigstack.

Which is good. It shows the Optout requests are being done.

Go to check again for librecasts old github account. Looks like I missed some.

*Opens new ticket*

https://huggingface.co/spaces/bigcode/in-the-stack

While I did think it is important for Software Heritage to archive code, I wish it was done Opt-in.

It would be nice to be asked and for that code to be curated. This is not curation. This is automation.

#SoftwareHeritage #BigCode

Am I in The Stack? - a Hugging Face Space by bigcode

This app lets you check if your GitHub repositories are part of the The Stack dataset. Enter your GitHub username and select the dataset version to see if your code is included. If you want your da...

It’s especially rich that the logo for #BigCode, an org that trains LLMs so is massively accelerating #climateChange, uses a sakura blossom. Sakura are suddenly blooming earlier each year due to climate change.

You might want to check if your code is used for training the #BigCode AI model:
https://huggingface.co/spaces/bigcode/in-the-stack

#BigCodeProject #FuckAI

Am I in The Stack? - a Hugging Face Space by bigcode

This app lets you check if your GitHub repositories are part of the The Stack dataset. Enter your GitHub username and select the dataset version to see if your code is included. If you want your da...

StableCode von Stability AI ist ein neu entwickeltes großes Sprachmodell (LLM) zur Unterstützung der Programmcode-Erstellung

#AI #KI #StabilityAI #StableCode #CodeGenerierung #BigCode #LLM #Programmierung #RoPE #Technologie #ZukunftDerCodierung

https://kinews24.de/stability-ai-stable-code-ein-neues-kapitel

Stability AI Stable Code: Ein neues Kapitel - KiNews24.de

Stability AI präsentiert StableCode, ein neuartiges LLM zur Code-Generierung. Basiert auf BigCode Daten und nutzt Rotary Position Embedding.

KI NEWS24

The Stack: 3 TB of permissively licensed source code

Denis Kocetkov, Raymond Li, Loubna Ben allal et al.

Action editor: Swarat Chaudhuri.

https://openreview.net/forum?id=pxpbTdUEpD

#bigcode #text2code #dataset

The Stack: 3 TB of permissively licensed source code

Large Language Models (LLMs) play an ever-increasing role in the field of Artificial Intelligence (AI)--not only for natural language processing but also for code understanding and generation. To...

OpenReview

#StarCoder: A State-of-the-Art #LLM for #Code by Hugging Face 🤗

https://huggingface.co/blog/starcoder

More about the Big Code project:
https://www.bigcode-project.org/

Find out, whether your code was used for training and opt-out, if you don't want to be "in the stack":
https://huggingface.co/spaces/bigcode/in-the-stack

#AI #ArtificialIntelligence #LLMs #DevTools #BigCode

StarCoder: A State-of-the-Art LLM for Code

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

I want to like HuggingChat open source LLM AI so much. But at least for coding it is nowhere near the same league as ChatGPT. If I would hire a new developer for my team and could conduct interviews only per keyboard, I would be impressed by ChatGPT and offer it the position. With HuggingChat I’d terminate the interview after 10 mins. Tried Java and JS. Calling APIs from any library that might do the job without importing, explanation mixing up cause and effect.. #chatgpt #huggingface #bigcode

#BigCode is an open scientific collaboration working on responsible training of large language models for coding applications.

In this organization you can find the artefacts of this collaboration:
👉 #StarCoder, a state-of-the-art language model for code,
👉 The #Stack, the largest available pretraining dataset with perimssive code, and 👉 #SantaCoder, a 1.1B parameter model for code.

#StarCoder is a 15.5B parameters language model for code trained for 1T tokens on 80+ programming languages.
It uses MQA for efficient generation, has 8,192 tokens context window and can do fill-in-the-middle.

Chat with StarCoder here: https://huggingface.co/chat/?model=bigcode/starcoder

https://huggingface.co/bigcode

HuggingChat

Making the community's best AI chat models available to everyone.

#BigCode #OpenSource

"#StarCoder is a 15.5B parameters language model for code trained for 1T tokens on 80+ programming languages. It uses MQA for efficient generation, has 8,192 tokens context window and can do fill-in-the-middle."

https://huggingface.co/bigcode

bigcode (BigCode)

Org profile for BigCode on Hugging Face, the AI community building the future.

This one has different methodologies and philosophies that try to mitigate some of the ethical issues with other similar #GenerativeAI programming systems.

Hugging Face and ServiceNow Research release StarCoder, a free alternative to code-generating #AI like GitHub's #Copilot, as part of the #BigCode project.

https://techcrunch.com/2023/05/04/hugging-face-and-servicenow-release-a-free-code-generating-model/

#MachineLearning

TechCrunch is part of the Yahoo family of brands