Ethan White

@ethanwhite
1 Followers
210 Following
279 Posts
Associate Prof in Environmental Data Science. Research + software + education. @weecology & Data Carpentry co-founder. #datadon #DataScience #ecology #rstats #python #OpenScience #OpenData #OpenSource
Websitehttps://www.weecology.org
Pronounshe/him
GitHubhttps://github.com/ethanwhite/
ORCIDhttps://orcid.org/0000-0001-6728-7745

Teaching SQL today so breaking out my favorite nemonic for the order of SQL clauses:

Select - So
From - Few
Where - Workers
Group by - Go
Having - Home
Order by - On time

Which I got from https://twitter.com/statsnam/status/1149431249511075840

Students definitely identify with it at the end of the semester.

Nam Nguyen on Twitter

“@beeonaposy Mnemonic that i used when i was learning SQL years ago lol. Thought you'd enjoy: So -Select Few -From Workers -Where Go -Group by Home -Having On TIme -Order by”

Twitter
@ethanwhite @powersoffour
I think perhaps “now that you’ve worked through that can you add it to the wiki” is similar to "progress isn't permanent unless it's been committed". It's an additional step to learn, but important for record-keeping!

@ethanwhite @powersoffour so, be consistent, model the behavior, and reiterate again and again the *why* we do it this way.

If you want the students to use it, then you need to use it too. It is a great way to capture all those little pieces of knowledge about navigating university bureaucracy, experimental designs, and if you analyze other people's data, their designs too. 😉

@powersoffour @ethanwhite our lab uses a wiki too (we keep project specific and university stuff behind a login, so you don't see much on the front page).

It becomes second nature in our lab. The first learning task for our lab members is to install Linux in a VM setup to allow access using lab credentials, etc. All the directions for that are on the wiki. 😉

And then all to do lists for a project initially start on the wiki, and we record what we've worked on in a given week there too.

I cannot overstate how much time and energy we have saved by having a lab wiki where we record collective knowledge and solutions to common problems. Instead of trying to recreate something someone did two years ago it's a quick search and then often just copy-paste.

Here's ours: https://wiki.weecology.org/

Weecology Wiki

Weecology's Lab Wiki

Weecology Wiki
oh no, my job

@ct_bergstrom It turns out that those citations are ENTIRELY MADE UP.

Even though one would expect that the ACLU and Wired would've written about the NSA's collection practices, as far as I can tell, those articles never existed; nor did the Times or Smithsonian articles on Clever Hans even though it would've been natural to see such a piece in their pages.

Even the URLs are constructed to look plausible. But they're total fictions. I think.

Very creepy construction of fake knowledge.

Playing around with #OpenAI's #GPT3 text generator led me to perhaps the creepiest behavior yet. As @ct_bergstrom noted, the AI bots are able to cite the sources of their own writing.

I decided to see if it could function as a plagiarism detector. But when I put a few snippets of my own writing in, it found several uncited sources. (Attached.)

Aha! Someone must have plagiarized me! Or so I thought. But the truth was much stranger....

I have literally waited all my life for this moment.
God, search results are about to become absolute hot GARBAGE in 6 months when everyone and their mom start hooking up large language models to popular search queries and creating SEO-optimized landing pages with plausible-sounding results.