Devs and data scientists really like the public #ChicagoCrimes EDA repository of scripts, notebooks 📚 and data snapshots we created last October. This sample data and demo repository covers many tools, libraries and notebooks for parsing #LargeData:

⭐️ ⤑ https://github.com/RandomFractals/chicago-crimes

📜 ⤑ https://twitter.com/search?q=(%23ChicagoCrimes)%20(from%3ATarasNovak)&src=typed_query

#DataTools 🛠️ ...

GitHub - RandomFractals/chicago-crimes: Exploring Chicago crimes dataset with Jupyter notebooks, DuckDB, Malloy and new Panel/PyScript data and dashboard tools.

Quick demo of our new #DuckDBSqlTools VSCode extension loading and querying 7,687,725 #ChicagoCrimes recorded from 2001 through the end of November 2022 from a large 1.68 GB CSV data file in seconds. See the demo GIF at:

📰 https://github.com/RandomFractals/chicago-crimes#with-duckdb-sql-tools

#DuckDB #SqlTools #VSCode #DataTools 💎💎💎
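
For anyone who wants to try the same kind of query outside the extension, here is a minimal sketch. It assumes the `duckdb` Python package rather than the VSCode extension itself; the `chicago-crimes.csv` file name and both helper names are hypothetical. `read_csv_auto` is DuckDB's CSV reader with automatic type inference.

```python
def csv_count_sql(csv_path: str) -> str:
    """Build a DuckDB query that counts the rows of a local CSV file."""
    safe_path = csv_path.replace("'", "''")  # escape single quotes in the SQL literal
    return f"SELECT count(*) AS reports FROM read_csv_auto('{safe_path}')"


def count_rows(csv_path: str) -> int:
    """Run the count in an in-memory DuckDB instance."""
    import duckdb  # lazy import: only needed when actually querying

    con = duckdb.connect(":memory:")
    try:
        return con.execute(csv_count_sql(csv_path)).fetchone()[0]
    finally:
        con.close()


if __name__ == "__main__":
    # Hypothetical file name; point it at your own CSV snapshot.
    print(csv_count_sql("chicago-crimes.csv"))
```

Calling `count_rows(...)` on a real file needs DuckDB installed; the SQL builder alone has no dependencies.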

Our new #DuckDBSQLTools VSCode extension is almost ready for prime time.

You'll be able to load remote CSV and #parquet data files via the httpfs extension and create in-memory #DuckDB instances too.

See the demo GIF on Twitter of loading #ChicagoCrimes parquet data from a GitHub repository into memory, creating a CrimeReports table, and querying it:

https://twitter.com/TarasNovak/status/1617542770184577024

#VSCode #SQLTools / #DataTools 🔬💎💎💎...
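
The steps in that demo can be sketched roughly as follows, assuming the `duckdb` Python package instead of the extension UI. The `CrimeReports` table name comes from the post; the URL is a placeholder and the helper names are hypothetical.

```python
from typing import List


def remote_parquet_statements(url: str, table: str = "CrimeReports") -> List[str]:
    """Build the DuckDB SQL to load a remote parquet file into a table."""
    safe_url = url.replace("'", "''")  # escape single quotes in the SQL literal
    return [
        "INSTALL httpfs",  # one-time download of the extension
        "LOAD httpfs",     # enables reading http(s):// paths
        f"CREATE TABLE {table} AS SELECT * FROM read_parquet('{safe_url}')",
    ]


def load_into_memory(url: str, table: str = "CrimeReports") -> int:
    """Run the statements in an in-memory DuckDB instance; return the row count."""
    import duckdb  # lazy import: only needed when actually querying

    con = duckdb.connect(":memory:")
    try:
        for statement in remote_parquet_statements(url, table):
            con.execute(statement)
        return con.execute(f"SELECT count(*) FROM {table}").fetchone()[0]
    finally:
        con.close()


if __name__ == "__main__":
    # Placeholder URL, not the repository's actual data file.
    for statement in remote_parquet_statements("https://example.com/crimes.parquet"):
        print(statement + ";")
```

The table name is interpolated as-is, so this sketch only suits trusted, simple identifiers.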

2022 Chicago Crime Reports Malloy Fiddle App

Updated the #ChicagoCrimes #PyScript #dataApp to use a gzipped CSV (~3.25MB). The app now loads 215,551 crime reports with #pyodide in the browser in about 8 seconds total, covering the #Python runtime, data transformation with #pandas 🐼 and charting with #Altair 📊📈

https://randomfractals.github.io/chicago-crimes/apps/pyscript/

Chicago Crimes PyScript App
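
To illustrate the gzipped CSV loading step, here is a small standard-library sketch. It swaps in Python's `gzip`, `csv` and `Counter` for pandas so it runs anywhere, and it is not the app's actual code. The `Primary Type` column name and the sample rows are assumptions for illustration.

```python
import csv
import gzip
import io
from collections import Counter


def count_by_column(gz_bytes: bytes, column: str) -> Counter:
    """Decompress gzipped CSV bytes and count rows per value of `column`."""
    with gzip.open(io.BytesIO(gz_bytes), mode="rt", encoding="utf-8", newline="") as f:
        reader = csv.DictReader(f)  # first row is treated as the header
        return Counter(row[column] for row in reader)


if __name__ == "__main__":
    # Tiny made-up sample standing in for the real data file.
    sample = "Primary Type,Year\nTHEFT,2022\nBATTERY,2022\nTHEFT,2022\n"
    packed = gzip.compress(sample.encode("utf-8"))
    print(count_by_column(packed, "Primary Type").most_common())
```

In a browser app the bytes would come from a fetch of the `.csv.gz` file; here they are built in memory to keep the example self-contained.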

Running some quick data summary queries with #Malloy on a 2001-2022 #ChicagoCrimes parquet data file that is 533MB, created from a larger 1.66GB CSV data file without any compression. Query execution is fast and responsive thanks to #DuckDB and the Malloy #VSCode extension.

View those queries in action in this GIF: https://twitter.com/TarasNovak/status/1601650935402725376

#dataTools 🛠️ ...
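
One possible way to produce an uncompressed parquet file from a CSV is DuckDB's `COPY ... TO` statement. This is a sketch under that assumption; the post does not say which tool created the 533MB file, and the file and helper names here are hypothetical.

```python
def csv_to_parquet_sql(csv_path: str, parquet_path: str) -> str:
    """Build a DuckDB COPY statement that writes a CSV out as uncompressed parquet."""
    src = csv_path.replace("'", "''")      # escape single quotes in SQL literals
    dst = parquet_path.replace("'", "''")
    return (
        f"COPY (SELECT * FROM read_csv_auto('{src}')) "
        f"TO '{dst}' (FORMAT PARQUET, COMPRESSION 'uncompressed')"
    )


def convert(csv_path: str, parquet_path: str) -> None:
    """Run the conversion in an in-memory DuckDB instance."""
    import duckdb  # lazy import: only needed when actually converting

    con = duckdb.connect(":memory:")
    try:
        con.execute(csv_to_parquet_sql(csv_path, parquet_path))
    finally:
        con.close()


if __name__ == "__main__":
    # Hypothetical file names.
    print(csv_to_parquet_sql("chicago-crimes.csv", "chicago-crimes.parquet"))
```

Even without a compression codec, parquet's columnar encoding can shrink a file well below the CSV size, which fits the 1.66GB to 533MB drop mentioned above.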

Our #DataPreview 🈸 for #vscode now has over 350,000 installs. You can load large CSV files, sort & graph results with aggregate functions, and much more.

See an example of loading 48MB of #ChicagoCrimes CSV data: https://twitter.com/TarasNovak/status/1600439658810585088

Note: change the data.preview.theme setting to light. See: https://github.com/RandomFractals/vscode-data-preview#configuration

📥 https://marketplace.visualstudio.com/items?itemName=RandomFractalsInc.vscode-data-preview

#dataViz 📊📈 #dataTools 🛠️ for #dataScientists ...

chicago-crimes/chicago-crimes-malloy-composer.gif at main · RandomFractals/chicago-crimes

so cool! :)
---
RT @TarasNovak
Created a web page with #PyScript loading 2022 #ChicagoCrimes CSV data with #pandas and visualizing that data with #Altair charting lib:
https://github.com/RandomFractals/chicago-crimes/blob/main/apps/pyscript/index.html
#dataApps ...
https://twitter.com/TarasNovak/status/1597379498756173824
chicago-crimes/index.html at main · RandomFractals/chicago-crimes

chicago-crimes/chicago-crimes-altair-charts-ipynb.gif at main · RandomFractals/chicago-crimes
