Francesc Alted

@FrancescAlted@masto.social
92 Followers
53 Following
240 Posts

πŸ“’ We are pleased to announce the integration of a new stack feature in #Blosc2 πŸš€, which allows for stacking large arrays along a new axis.

Performance benchmarks show that while aligned chunks yield the best results, #Blosc2 with unaligned chunks can still outperform #NumPyβ€”a welcome discovery! πŸŽ‰

Many thanks to Luke Shaw for his excellent work on this new functionality. πŸ™

We've updated our recent blog post:
Check it out! πŸ”— https://www.blosc.org/posts/blosc2-new-concatenate/#stacking-arrays

#Python #DataScience #Performance #OpenSource

NumExpr 2.11.0 is here! πŸŽ‰

Key highlights:

πŸš€ Experimental support for free-threaded Python 3.13!
✨ Imaginary number evaluation like 1.1e1j is now fixed.
βœ… Test suite modernized to pytest for easier contributions.
🐍 Python 3.10 is now the minimum supported version.

Check out the release notes for more details!

https://github.com/pydata/numexpr/blob/master/RELEASE_NOTES.rst

#NumExpr #Python #DataScience #Performance

πŸš€ Excited to share more about Caterva2, your ultimate gateway to Blosc2/HDF5 repositories! πŸš€

Caterva2 is designed to redefine how you interact with large datasets.

Want to see it in action? πŸ€” We've just released a new introductory video showcasing Caterva2's main functionalities! 🎬

πŸ‘‰ https://ironarray.io/caterva2

#Caterva2 #Blosc2 #HDF5 #BigData #DataManagement #FreeSoftware #Python #DataScience #Tech

πŸš€ **Exciting News!** After 15 years of developing #Blosc/#Blosc2, we're thrilled to announce the beta program for Cat2Cloud! πŸŽ‰

- πŸ”„ Share complex data securely and effortlessly
- πŸ—œοΈ Access to the best compression algorithms available
- ⚑ Perform advanced computations directly in the cloud

...and more!

https://ironarray.io/cat2cloud

Join our beta program today and be among the first to experience the power of Cat2Cloud!

#DataScience #Compression #SaaS #CloudComputing #BetaProgram

Share Data Faster!⚑

ironArray SLU

<img src="/img/cat2cloud-logo2.png" width="35%" alt="Cat2Cloud Logo" center="auto" style={{

πŸ“’ πŸ”₯ Updated article on Blosc2: Compute with TB-sized datasets on your own hardware, within human timeframes!

Highlights:

πŸš€ Outperforms NumPy by 10x ~ 100x for large computations
πŸ’Ύ Maintains performance with datasets far exceeding physical memory
🐍 Integrates seamlessly with the Python data science ecosystem
πŸ’» Works both in-memory and on-disk with minimal performance differences

Read more: https://ironarray.io/blog/compute-bigger

#DataScience #BigData #Compression #HighPerformanceComputing

Compress Better, Compute Bigger | ironArray SLU

How Blosc2 can compress data better and compute bigger

Since the arrival of AI coding tools, getting matplotlib right starts to be pleasant again. The plot below just took 20 min to look this nice (even surpassing my expectations) πŸ˜€

Is it possible to compute with arrays that are 100x larger than memory and still achieve good performance? 🀯

With the new compute engine in Python-Blosc2, you can! 😊 Check out our blog post for more details: https://ironarray.io/blog/compute-bigger

Compress Better, Compute Bigger!

Compress Better, Compute Bigger | ironArray SLU

How Blosc2 can compress data better and compute bigger

Do you like to learn by example? Look at my materials for the tutorial πŸ‘¨β€πŸ« on (Python-) @Blosc2 3.0 for ongoing PyData Global 2024:

https://github.com/Blosc/Python-Blosc2-3.0-tutorial

Learn how to:

πŸ’Ύ Create large and compressed arrays in-memory and disk, and how to manage them efficiently.
πŸ’» Operate with them with complex mathematical expressions and reductions.
🐍 Efficiently create NDArrays using Python (and Numba!) functions.
☁️ Upload Blosc2 data to the cloud (via cat2.cloud) to visualize and share it.

Enjoy!

GitHub - Blosc/Python-Blosc2-3.0-tutorial: Materials for the PyData Global 2024 tutorial on Python-Blosc2 3.0.0

Materials for the PyData Global 2024 tutorial on Python-Blosc2 3.0.0 - Blosc/Python-Blosc2-3.0-tutorial

GitHub
This happened near València a few hours ago. Science is saying that catastrophes like this will be more frequent in the future due to climate change. This is why I refuse to travel as much as possible, and when necessary, only via public transportation (specially train). If we want a future, we need to put an end to CO2 emissions, SOON.

Big news! #Caterva2 enters advanced beta stage πŸŽ‰ πŸŽ‰

Caterva2 is a FOSS distributed system written in Python meant for sharing Blosc2 datasets (either native or converted on-the-fly from HDF5) among different hosts.

It follows the pub-sub paradigm, so it can publish data once and allow multiple subscribers to access it, saving time and resources. It comes with a Python API and a Web interface for easy browsing.

Learn more in https://ironarray.io/caterva2

Make Compression Better πŸ˜€
#blosc2 #ironarray

Caterva2: On-demand access to Blosc2 data repositories | ironArray SLU

[Caterva2]//github.com/ironArray/Caterva2