Is Parquet becoming the bottleneck? Why new storage formats are emerging in 2025 (Lance, Vortex, and more)

Parquet gave data lakes a common language: columnar layout, good compression, and fast scans. That still works well for classic analytics. But workloads have changed. We now mix wide scans with point lookups, handle embeddings and images, and run on S3-first stacks. On NVMe you want lots of tiny random reads. On S3 you want fewer, larger range requests. A format tuned for one world can feel chatty or slow in the other.

Databend Cloud
Why We Built Our Own SQL Parser From Scratch: A Rust Implementation Story

The story of why Databend chose to build its own SQL parser, the challenges faced, and how Rust's type system guided the solution.

Databend Cloud
Protect me from what I am. #databend #glitch #netart #art

The only way to be sure. #ripley #glitch #databend #art

Direct Memory Access Violation #databend #glitch #art #netart #cyberpunk

Ripley Glitch, 2014 databend on pixel #gif #glitch #notepad++ #databend #netart

Discover how to use SQL to write a white-box model for predicting iris categories with #Databend, without the need for model services interaction! Dive into our latest blog for a step-by-step guide on leveraging #HuggingFace datasets in #DatabendCloud.

https://www.databend.com/blog/2024-01-18-analyzing-hugging-face-datasets-with-databend/

#DataScience #MachineLearning #ETL #DataAnalysis #AI

Analyzing Hugging Face Datasets with Databend

Directly query Hugging Face datasets through Databend to simplify your data science workflow.

Databend Cloud

The future is interconnected! Connectivity cloud databases like #Databend offer seamless multi-cloud support, hybrid workloads and analysis at ultra-large scale.

An introduction to the book "Databend Systems": https://databend.systems/introduction/

Stay tuned!

Introduction

Databend Systems

If you're looking for a large #Rust codebase to test compilation, I recommend trying #Databend, an open source cloud data warehouse.

Check out our posts on compile optimization through caching, removing dependencies, and refactoring. PGO is covered too.

👇 (Links in sub-post)