That's my slides all baked and ready for @EdinbR in a couple of hours! I'll paste the repo link later 😎

#DataScience #RStats #DataEngineering

@EdinbR As promised, you can get the slides and code from my talk here: https://github.com/mikerspencer/arrow_test

As an extra special treat, @nic_crane came along too 🤩

#DataScience #RStats #DataEngineering

@mikerspencer @EdinbR @nic_crane gonna steal about a quarter of that material, as I'm presenting on Parquet in a few weeks at the GASP workshop. My typical applications have a lot of categorical variables, so I do not have to agonize over what to partition by :).
@statstas steal isn't a nice word...
@mikerspencer OK I am going to fork and PR then

@mikerspencer @EdinbR

Hi, and thanks for the talk last night. I had to leave early, but I found it all informative and clear, and a great first dive into what you can do with R and data and how you can work with them.

Also nice: on a procedural and conceptual level, it was all totally followable for someone with just some basics in Python.

Hope to make it back for more after the summer!

@mossfactory Hey Rachel, I'm really glad you enjoyed it! Great you could make it along.