Mastodawn

hrbrmstr Apr 9, 2025

Deyan Ginev Apr 8, 2025

arXiv is freshly hiring for 3 positions:

- Software Engineer
- DevOps Software Engineer
- Software Engineer Scientist

US-only, NYC-based, hybrid/remote possible.

Share and help us build a backbone of Open Science.

https://info.arxiv.org/hiring/index.html

#openscience #arxiv #hiring

Careers at arXiv - arXiv info

hrbrmstr Jan 2, 2024

If you want to start or get better at data visualization in 2024, you should def take advantage @andykirk's few-remaining open slots in his inaugural 2024 'Fundamentals' course.

Zero pre-req’s.

https://visualisingdata.com/2023/09/fundamentals-of-data-visualisation-jan-2024/

New Course: 'Fundamentals of Data Visualisation' (Virtual, Jan 2024) - Visualising Data

I'm happy to announce details of a new public training course, the two half-day virtual 'Fundamentals of Data Visualisation' will take place online on 10-11 January 2024, 2pm to 5pm (UK) each day.

Visualising Data

hrbrmstr Dec 23, 2023

So, I don't 👀 at "follower" (terrible term) lists b/c narcissism corrupts & why repeat the mistakes of Twitter/X?

That means I don't know who from fosstodon follows this acct.

So, I'm letting said fosstodon-ers know I asked them to ban this acct since they banned my primary @hrbrmstr acct in Oct.

I wld not want to accidentally offend their sensitive sensibilities via this one, too.

Will be banning the fosstodon domain from this acct as well after a short delay.

Was nice chatting w/y'all!

hrbrmstr Mar 16, 2023

What a roller coaster of a week between manageable and ugh days.

Delighted I've got the new R WASM toy to keep me focused on something when not engaged in work or fam stuff, tho. Esp when this thing decides to not let me sleep.

I haven't spammed that R stuff here (I don't think I have). It's more "tech" than vis, but recent stuff has focused on vis.

https://github.com/hrbrmstr/webr-experiments for ref.

GitHub - hrbrmstr/webr-experiments: 🕸️ 🧪 hrbrmstr's WebR Experiments

🕸️ 🧪 hrbrmstr's WebR Experiments. Contribute to hrbrmstr/webr-experiments development by creating an account on GitHub.

GitHub

hrbrmstr Mar 14, 2023

Link to the aforementioned @observablehq notebook https://observablehq.com/d/dbb6b16326e16d00

A Week In The Life Of A GreyNoise Sensor

A look at tagged, malicious traffic. Link to the blog post featuring these Tagged Malicious Traffic Started Coming In As Soon As The Sensors Were Functional 217,852 total malicious events encoutered during the ~7.8 day sampling period. The Four Largest "Spike" Hours Had Mostly Similar Characteristics August 28 malicious traffic focused mainly on SMB exploits, and originated from the Data Communication Business Group autonomous system in Taiwan We Saw The Usual Suspects Rise To The Top Of 13,576 Ports Telnet

Observable

hrbrmstr Mar 14, 2023

In other good news, I used @observablehq for all the vis in a blog post coming out later today (will link when it's out). The provider we uses makes embedding the non-Observable branded iFrame's oddly hard (stuff comes out weird). If Observable had a white-on-black version of the bottom row branding, I likely could have used it.

I also need to provide feedback abt SVG and PNG exports not working well, but that's not happening today.

hrbrmstr Mar 14, 2023

Sunday was "good", relative to actually contracting this bugger.

Monday was…not.

Today, back to the Sunday "good" level.

Taking the small wins with serious appreciation. I cannot imagine what it was/is like for the folks who didn't/don't have "good" days during this.

hrbrmstr Mar 14, 2023

Mike Mahoney Mar 14, 2023

📣 I've got a new #preprint out, with @juliasilge , @hfrick , @topepo , plus Lucas Johnson and Colin Beier!

We give a head-to-head comparison of spatial cross-validation methods, give advice on applying spatial CV for applied modeling projects, and walk through how these techniques are implemented in the {spatialsample} #rstats package.

Preprint: https://arxiv.org/abs/2303.07334
Repo: https://github.com/cafri-labs/assessing-spatial-cv

#gischat #DataScience #rspatial

Assessing the performance of spatial cross-validation approaches for models of spatially structured data

Evaluating models fit to data with internal spatial structure requires specific cross-validation (CV) approaches, because randomly selecting assessment data may produce assessment sets that are not truly independent of data used to train the model. Many spatial CV methodologies have been proposed to address this by forcing models to extrapolate spatially when predicting the assessment set. However, to date there exists little guidance on which methods yield the most accurate estimates of model performance. We conducted simulations to compare model performance estimates produced by five common CV methods fit to spatially structured data. We found spatial CV approaches generally improved upon resubstitution and V-fold CV estimates, particularly when approaches which combined assessment sets of spatially conjunct observations with spatial exclusion buffers. To facilitate use of these techniques, we introduce the `spatialsample` package which provides tooling for performing spatial CV as part of the broader tidymodels modeling framework.

arXiv.org

hrbrmstr Mar 14, 2023

fwiw, this is clearly the bestest mastodon instance, hand's down. (or, is that supposed to be plural?)

hrbrmstr Mar 14, 2023

I 💯% had 💯% empathy for anyone who contracted covid since the start of the pandemic, regardless of “why”.

Now that I am clearly in the throes of a “who knows how” long covid experience — despite taking inanely draconian precautions — please know *you* are an incredible human, & that anything you are able to do, each day, mattersl & matters dearly. Even if said “thing” is “waking up each day”.

Hang in there.

We (which inherently means *you*) — collectively – “got this”.

rud.is	https://rud.is/
observable	https://observablehq.com/@hrbrmstr
github	https://github.com/hrbrmstr