@aparrish Data* science**
*the data might be made up whole-cloth.
**the science is pretty fucken sus too
I work in biomedical informatics, and it's upsetting the number of data requests that come to me from people wanting to use LLMs to do a whole bunch of stuff for them. They don't ask for specific and targeted data points, just "yeah give me all the diagnosis codes" etc. (for basically all the domains, not just a single one). And they don't say which model they're using in their IRB protocol, so who knows whether they're sending PHI to OpenAI or whatever. It gets approved anyway.
@aparrish I don't do heavy-duty data analysis, but I have found the AI useful to write analysis and report generation programs that, in the past, I would have written myself.
I review the code of these before I run them, to confirm that they will do what I asked for.
Perhaps this isn't the sort of thing you had in mind?
@aparrish As someone who hasn't done any LLM code generation (but keeps reading about it here): I've often found it easier to reach a good solution when I had some bad code to look at (whether my own or someone else's), compared to starting with a blank page. There are only a few times when I have felt it was a mistake not to start from scratch. So as a workflow, I can understand wanting to start with something LLM-generated and improving on it myself.
But it always took me time and a lot of thinking to reach the solution I wanted, and I have *no* confidence in my ability to review and verify and take responsibility for LLM-generated code in the volumes and by the deadlines that people commonly experience these days. I very much want to think of myself as the steely-eyed operator whose stern gaze and firm hands keep the wayward minions in check and drive them productively onwards.
But I experienced plenty of "just merge it so that we can make <x> SUPER DUPER IMPORTANT sale that will definitely result in financial ruin if it doesn't go through right away" and "we can always fix it later" and "this is just a one-shot, we won't ever need this again" sort of pressure at work even before LLMs could be used for any of it, and knowing what a constant battle it was to maintain high standards for anything, I cannot imagine *myself* being able to succeed at that in today's environment.
(My comments are all about "ordinary" code. I think I would be terrified to the point of paralysis if I had to write data analysis code of the sort where anyone's implicit biases might matter to the results, never mind how the code may be written. So I'm not meaning to agree or disagree with what you said, just commenting on the "blank page" aspect.)
I believe getting some text on a blank page because you are afraid of the blank page seems like the only valid usecase.
but then, you could just put your question into google. open the first page and copy paste the text there into your document.
Then start editing to make it work for your intended text.
This is how I used to write my student essays. I used latex so I could just comment out all that was there so the page isnt blank at the start.
@aparrish I don't trust Excel with my datasets, why the hell would I trust an LLM
e: I care a lot about my work and that's why I don't currently have a job
@aparrish, it is equally mind-bending that people are adopting them for qualitative data analysis.
You cannot provide a positionality statement for an LLM.
You cannot conduct interpretivist work when one of your coders is a probabilistic mishmash of online stereotypes.
@V @hannah @aparrish it's been a huge snake oil disservice to use the same label for "type 'best peanut butter recipes are' on your keyboard and then keep on accepting the middle suggestion" as a "search engine" versus "do a web search for 'best peanut butter recipes', return the result summaries here, and summarize their summaries: A B C D go"
Slop is possible with both, but one truly is hallucination divorced from any sense of reality and not what a "language model" is good at
@jeffers00n It could also be that the incentive structure (read: "Keeping my Job") is misaligned, in that it relies on the rate of completed tasks, not their correctness?
People whose livelihoods rely on not getting fired are not incentivized to engage in the finer points of whether LLM output is actually trustworthy, if the people dictating the incentives also do not care.
The question is who gets to carry the can once this shit show collapses under its own weight.
@jeffers00n @slott56 @hannah @aparrish
I think we teach kids this in school. It's about turning in a 5-page essay, properly formatted, by the assignment's due date. Content is secondary to form and schedule.