@kenkousen Ken, can you point me to some Groovy references about invoking external processes and building command pipelines? How might I use Groovy the way I use Bash?
@rotnroll666 I agree that #DuckDB is great for querying small or huge CSV (and Parquet) files. I also often use #csvq to query smaller CSV files.
@mdfranz I have also observed a #duckdb Python script gradually fill available memory as it repeatedly calls the fetchmany method to read the rows of a very large Parquet file that contains a timestamp column. Change this timestamp column to an integer UNIX epoch timestamp and DuckDB reads the entire Parquet file while its memory consumption remains stable.
https://duckdb.org/docs/api/python/reference/#duckdb.DuckDBPyConnection.fetchmany
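For anyone who wants to try this, here is a rough sketch of the kind of fetchmany loop I mean. The helper names iter_batches and count_rows are my own, and the file name, column name, and batch size in the usage comment are placeholders. The comment converts the timestamp at query time with epoch_ms; I have only observed the stable memory behavior after changing the column itself, so treat the query-time variant as an untested assumption.

```python
from typing import Any, Iterator, List, Tuple


def iter_batches(result: Any, batch_size: int = 10_000) -> Iterator[List[Tuple]]:
    """Yield lists of rows from any object exposing a DB-API style fetchmany()."""
    while True:
        rows = result.fetchmany(batch_size)
        if not rows:  # an empty list signals that the result set is exhausted
            break
        yield rows


def count_rows(result: Any, batch_size: int = 10_000) -> int:
    """Consume the result batch by batch, keeping only a running count."""
    return sum(len(batch) for batch in iter_batches(result, batch_size))


# Hypothetical usage with DuckDB (file and column names are placeholders):
#   import duckdb
#   con = duckdb.connect()
#   con.execute(
#       "SELECT * REPLACE (epoch_ms(ts) AS ts) FROM read_parquet('events.parquet')"
#   )
#   print(count_rows(con))
```

Only one batch is held by the loop at a time, so any memory growth you see while it runs should come from the engine side rather than from the Python code.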