Today’s learning 🗒️

Instead of spending an hour building a SAS Viya Decision Tree manually, I:

→ Built it with Python scikit-learn
→ Answered workbook questions before opening SAS
→ Verified, screenshot, export PDF
→ A full SASPy automation pipeline

#datascience #scikit

sketch-a-day/2024/sketch_2024_12_06 at main · villares/sketch-a-day

One visual idea a day. Contribute to villares/sketch-a-day development by creating an account on GitHub.

GitHub
NumPy, SciPy, Scikit and Cython for Numeric Computing (20Jul-05Aug), Sat, Jul 20, 2024, 7:00 PM | Meetup

(20Jul-05Aug) NumPy, SciPy, Scikit and Cython for Numeric Computing We focus on Numeric Computing using Python derived libraries. This is the next step, now that we have c

Meetup
Notes on linear regression using scikit-learn and chapter 15 of Numerical Recipes

the best way to learn is to try to teach A convenient starting point for linear regression would be scikit-learn’s LinearRegression() function, as illustrated in MOOC’s excellent course…

Felix qui nihil debet
Didn't do it. The 3.11 runtime helped. But Python's latency with #scikit-learn is still at-least twice as high on #FreeBSD as it is on #Debian. The reasons for this are not immediately clear to me. This doesn't bode well for my argument that BSD is a good alternative to Linux for data science workflows. 😅. I guess because I've always used Bhyve + Docker + Debian for DS, I never noticed the speed issues before. I'll send a note to the FreeBSD Python mailing list. Maybe they'll know.

📝🐍 Шпаргалка з #Python з акцентом на #DataScience

Зокрема, наводяться приклади використання #JupyterNotebook, #NumPy, #SciPy, #Pandas, #Scikit, #Matplotlib, #Seaborn, #Bokeh.

The Python community is huge and diverse, in some aspects. I always like to point how professional software development is just a fraction of the huge number of ways that software is created. I remember seeing threads here o Mastodon of #Python developers who didn't understand why people used #conda (from Anaconda inc.) package management... Well, it is 2024 and there are parts of #scikit, scikit-geometry, you can't install with #pip :(( ... I wish I could have it on pip, to be honest, but large parts of the scientific community will use conda, and that's it.

It's a miracle! I've managed to make #SciKit Image tests work offline in #Gentoo!

That said:

1. Upstream includes most of the test data in sdist. Except they don't use it, and instead fetch it all from the GitHub repository. Sigh. (If you're upstream, please pretend you didn't see this and do not remove the data from sdist. Thanks.)
2. On top of that, they fetch more test data from an additional GitLab repository, and move and rename it around.
3. Yes, you've read right. They keep code on GitHub, but test data on GitLab.com (not self-hosted).

https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=0110ec020dbdb46c859e584609ab056fcafa10c6

#Python

repo/gentoo.git - Official Gentoo ebuild repository

Browse the Gentoo Git repositories

Wydarzył się cud! Udało mi się doprowadzić testy #SciKit Image do działania bez Internetu w #Gentoo!

Na marginesie:

1. Autorzy włączają większość niezbędnych danych w archiwum sdist. Tyle że ich nie używają, i zamiast tego ściągają je drugi raz, prosto z repozytorium na GitHubie. Wzdych. (Jeżeli czyta to któryś z autorów, proszę o udawanie, żeście tego nie widzieili i nie usuwanie tych plików z archiwum sdist. Dzięki.)
2. Na dodatek, pobierają dodatkowe dane dla testów z dodatkowego repozytorium na GitLabie, i przenoszą i zmieniają nazwy plików.
3. Tak, dobrze widzicie. Kod jest na GitHubie, dane dla testów na GitLab.com (nie własnej instancji).

https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=0110ec020dbdb46c859e584609ab056fcafa10c6

#Python

repo/gentoo.git - Official Gentoo ebuild repository

Browse the Gentoo Git repositories

and, if needed, modify it.

there are many python libraries out there:

#OpenCV
#SciPy
#imageio
#scikit-image (skimage)
#matplotlib
#Wand
#Pillow (PIL Fork)