Thomas Leeper

@thosjleeper
Website: thomasleeper.com
Bluesky: https://bsky.app/profile/thosjleeper.bsky.social
GitHub: github.com/leeper

Some (actual) personal news: In two weeks, I'll be moving back to my home state of Minnesota.

Aging parents, young children, significant West Coast work collaborations, and the rising cost of living in London have all nudged us to change our lives.

I'll continue to work as a research scientist for Meta, albeit more remotely.

London - and Europe - have been my home for a decade. I'm sad to be leaving.

Not The Onion.

Of course there are variations. Some self-checkout setups allow you to put your bags on, but usually only all at once at the beginning of the process.

In a belt setup, you could also try to sort one bag at a time by playing an online knapsack game (sorting items in bagging order, one bag at a time, repeatedly) but that feels quite challenging.

Self-checkout at the grocery store breaks the mental model for the order in which to ring up your purchases. When someone else is scanning, you want first in first out (FIFO) order so that the first things out of your cart are the things to put at the bottom of your shopping bags.

Self-checkout puts all items in a single heap on a scale. This means you want last in, first out (LIFO) order, with the heaviest items on top of the heap so they go in the bottom of the bags.

I find this annoying.
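
The difference between the two orders can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the cart contents and both function names are hypothetical, and the "heap" here is modeled as a plain stack (a pile where the last item placed is the first one picked up).

```python
from collections import deque

# Hypothetical cart, listed heaviest first: the order you want to bag in.
cart = ["potatoes", "milk", "bread", "eggs"]


def cashier_bagging_order(scan_order):
    """Belt with a cashier: items are bagged in the order scanned (FIFO)."""
    belt = deque(scan_order)
    return [belt.popleft() for _ in range(len(belt))]


def pile_bagging_order(scan_order):
    """Single pile on the scale: the last item scanned is on top,
    so it is the first one to go into a bag (LIFO)."""
    pile = list(scan_order)
    return [pile.pop() for _ in range(len(pile))]


# With a cashier, scanning heaviest first bags heaviest first.
fifo_result = cashier_bagging_order(cart)

# With a pile, the same scan order comes back reversed...
lifo_result = pile_bagging_order(cart)

# ...so you have to scan lightest first to bag heaviest first.
lifo_fixed = pile_bagging_order(reversed(cart))
```

Here `fifo_result` and `lifo_fixed` both equal `cart`, while `lifo_result` is `cart` reversed, which is the mismatch the post describes.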

Watching it for the first time in many years, one wonders who in the writers' room for Peter Pan was like, "you know what this movie needs: some more racism."

My team at Meta (Core Data Science) is currently advertising a full-time/paid PhD internship role for this summer. Can be based in London or Tel Aviv.

Would involve working on challenging data science problems in one of our many areas of specialization (causal inference, statistics, ML, Econ/CS, etc.).

Ad here: https://www.metacareers.com/jobs/734978838226167/

Happy to answer questions.

Research Scientist Intern, Central Science, Strategy and Research

#rstats question: Does anyone know why the Posit Package Manager (https://packagemanager.rstudio.com/client/#/repos/2/overview) doesn't return the MD5sum field in `available.packages()`? CRAN usually provides it, but it is missing for all snapshot dates. cc @Posit

Long-time RStudio user. I haven't used Python in 10+ years and have never been a regular user. But now I need to do some stuff in Python.

How do #rstats users do Python? Do you integrate your Python code into RStudio/Posit? Or use a different IDE?

#rstats #openscience

we talked last week about what can go wrong with floating point numbers, so -- what can go wrong when using integers?

so far I have:

* 32 bit integers are smaller than you think (unsigned ones only go up to about 4 billion, signed ones about 2 billion!)
* overflow
* sometimes you need to switch the byte order
* ?? (maybe something about shift / bitwise operations? not sure what can go wrong with that exactly)

I'd especially love real-world examples of things that have gone wrong, if you have them!
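
A few of the items in the list above can be made concrete with a small sketch. Python integers are arbitrary precision, so this example simulates 32-bit behaviour with masking; the helper `wrap_int32` is a hypothetical name introduced for illustration, and it models two's-complement wraparound only (what overflow actually does varies by language).

```python
import struct

INT32_MAX = 2**31 - 1    # 2,147,483,647: largest signed 32-bit value
UINT32_MAX = 2**32 - 1   # 4,294,967,295: largest unsigned 32-bit value


def wrap_int32(n):
    """Reduce n modulo 2**32 and read it as a signed two's-complement value."""
    n &= 0xFFFFFFFF
    return n - 2**32 if n >= 2**31 else n


# Overflow: one past the largest signed value wraps to the smallest.
overflowed = wrap_int32(INT32_MAX + 1)

# Byte order: the same four bytes decode to different integers.
raw = struct.pack("<I", 1)                 # little-endian encoding of 1
as_little = struct.unpack("<I", raw)[0]    # read back as little-endian
as_big = struct.unpack(">I", raw)[0]       # read the same bytes as big-endian

# Shifts: a left shift can set the sign bit or push bits out of the word.
sign_flipped = wrap_int32(1 << 31)
shifted_out = (1 << 32) & 0xFFFFFFFF
```

In this sketch `overflowed` and `sign_flipped` are both -2147483648, `as_little` is 1 while `as_big` is 16777216, and `shifted_out` is 0 because the only set bit has left the 32-bit word.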

I just learned that MRAN will be shut down this year.
So I am copying all their CRAN binaries, to eventually put them on a public page.

This is for `groundhog`, but will be open to all

If you know of alternatives or have thoughts: email me
#rstats
(BTW: http://datacolada.org/100)

[100] Groundhog 2.0: Further addressing the threat R poses to reproducible research - Data Colada

About a year ago I wrote Colada[95], a post on the threat R poses to reproducible research. The core issue is the 'packages'. When using R, you can run library(some_package) and R can all of a sudden scrape a website, cluster standard errors, maybe even help you levitate. The problem is that packages get updated...