1. R Scripts to Databricks: Lessons in Production Workflow with Eric Leung from Disney.
1️⃣ Don't reinvent the wheel
2️⃣ Use the best (available) tool for the job
3⃣ Budget time to get used to your tools
#PositConf2024 #RStats #PyData 🧵 2/15
1. R Scripts to Databricks: Lessons in Production Workflow with Eric Leung from Disney.
1️⃣ Don't reinvent the wheel
2️⃣ Use the best (available) tool for the job
3⃣ Budget time to get used to your tools
#PositConf2024 #RStats #PyData 🧵 2/15
2. "The Expanse - Navigating the R Package Universe" by Ben Arancibia from GSK
💬 Communication is hard. Stats people and package devs might use same term for different things (or vice versa)
❓ How do you avoid issues? Set up a culture of collaboration & values. Eg radical transparency, trust, respect, courage, commitment
Connect and check out the package!
#PositConf2024 #RStats #PyData 🧵 3/15
3. "Translating clinical guidance to actionable insights with R" by Claire Bai from COTA
{rwnavigator} 📦
Survivorship analysis connector for oncology (general clinical trial maybe?) data
(no luck finding the package in a search, might not be openly available)
#PositConf2024 #RStats #PyData 🧵 4/15
4. "Templated Analyses within R Packages for Collaborative, Reproducible Research" with Christopher Kenney (PhD candidate at Harvard)
Algorithmic redistricting
Goal: use algorithms to evaluate each state's redistricting process
Problem: social science data is really messy
Enlist smart ambitious undergrads to manually fix data
#RStats 📦 to standardize / smooth processes
https://github.com/alarm-redist/fifty-states
#PositConf2024 #RStats #PyData 🧵 5/15
5. "Why’d you load that package for?" by Luis D. Verde Arregoitia, Instituto de Ecología AC
Explain what a 📦 is for via comments
Tools to help with comments by reading DESCRIPTION and code.
Add-ons to help annotate.
https://luisdva.github.io/rstats/annotaterv020/
Consider adding them, can help a lot for comprehension.
#PositConf2024 #RStats #PyData 🧵 6/15
6. "JSquarto: Bridging JavaScript Documentation with Quarto's Power" by Richie Milano
https://github.com/Open-Science-Community-Saudi-Arabia
(I have trouble listening to recordings with issues his had, had to tune out a bit during this talk)
Project looks interesting. I may need to play with this for some js code we have for #OpenRBQM.
#PositConf2024 #RStats #PyData 🧵7/15
7. "DataPages for interactive data sharing using Quarto" by Mika Braginsky, Stanford University
FAIR: findable, accessible, interoperable, and reusable
Upload csv: easy for you, hard for users
Repo & website: hard for you, easy for users
Data page: best of both worlds
https://github.com/datapages/datapage use template there for quarto
I definitely need to check this out for #TidyTuesday!
#PositConf2024 #RStats #PyData 🧵 8/15
8. "Event Automation with Posit Connect" Kelly O'Briant from @Posit
There are times when a document is better than an app (batch/rendered sometimes > interactive)
Robotic automation (render + email, etc) w/ Connect
Chaining content together is awkward to automate in Connect
{connectapi} 📦 taken over by Connect team, and being updated. There's also a python thing.
#PositConf2024 #RStats #PyData 🧵 9/15
9. "Breaking data identities: Making a case for language-agnosticity" by Albert Rapp @rappa753
Used to be too narrowly focused on papers from his academic field. Reading papers from other areas helped him solve problems in his field.
Same rule applies to programming languages. Html & CSS for reports, javascript for web apps, python for AI & ML
https://albert-rapp.de/posts/27_language_r_guide/27_language_r_guide.html
#PositConf2024 #RStats #PyData 🧵 10/15
This blog post is a collection of resources that I find helpful for learning other programming languages from the perspective of someone who comes from the R world. Some of these resources are mine. Some are blog posts that I found online.
10. "Using the Kyber R package to connect Google Sheets, RMarkdown, GitHub, and Agenda docs for open education" by Stefanie Butland from @openscapes
https://github.com/Openscapes/kyber
#PositConf2024 #RStats #PyData 🧵 11/15
11. "Ten Simple Rules for Teaching an Introduction to R" by Ava Hoffman from the Fred Hutchinson Cancer Center
Synchronous instruction = valuable!
Intensive
Teach as team
Teach reproducibly w/ dynamic docs
Prioritize intuition over memorization.
Boost live code and lab time
Start with data wrangling
End the course with a project
Know your learners
Commit to tidyverse (or base)
Know the common pitfalls
Give & to get feedback often
#PositConf2024 #RStats #PyData 🧵 12/15
12. "Teaching and learning data science in the era of AI" by Andrew Gard, Lake Forest College
Slides are flickering, I have to close my eyes, sorry! Can't type for 5 mins...
#PositConf2024 #RStats #PyData 🧵 13/15
13. "@rainbowR - a community that supports, connects and promotes LGBTQ+ people who code in R" by @ellakay , University of Warwick
Community is >100 people
Connect - monthly Iin meeting, buddy system (next run: beginning of September)
Support - https://github.com/r-lgbtq/tidyrainbow
Promote
(sorry, had tech difficulties for a sec, but go to https://rainbowr.org)
#PositConf2024 #RStats #PyData 🧵 14/15