Fred

@fredbenenson
@fredbenenson on Twitter, etc.

#llm This is a follow-up to my last transcription experiment post - I actually did a complete writeup for those who want all the details (and want to see all the code and output, some of which is hilariously bad).

I really believed that for a “simple” task like this, especially since the open models score so well on eval benchmarks lately, if gpt-3.5 could do it, one of the open models could as well, but more testing has disabused me of that notion. Only one of the largest 70B models got close in testing, and it still hallucinated output for the actual transcript.

In the end, custom chunking code for gpt-3.5-turbo-16k probably gave the best output. Lastly, I applied similar code to the Claude 2 API, which gave good output as well. (Claude 2 has 100K context, but it turns out that with the developer access I have, it kills calls at exactly 300s, and I’d need about twice that to finish the task.)
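
For anyone curious what "chunking" means here, a rough sketch is below. This is not the code from the writeup, just an illustration under my own assumptions: the transcript is split on whitespace into word-based chunks with a small overlap, and each chunk is handed to a cleanup function. The names `chunk_words`, `process_in_chunks`, and `clean_chunk` are hypothetical, the word limits are placeholders rather than tuned values, and `clean_chunk` stands in for whatever model call you would actually make.

```python
from typing import Callable, List


def chunk_words(text: str, max_words: int = 2000, overlap: int = 100) -> List[str]:
    """Split text into word-based chunks; each chunk repeats the last
    `overlap` words of the previous one so sentences cut at a boundary
    still appear whole in at least one chunk."""
    if max_words <= 0:
        raise ValueError("max_words must be positive")
    if not 0 <= overlap < max_words:
        raise ValueError("overlap must be in [0, max_words)")

    words = text.split()
    step = max_words - overlap
    chunks: List[str] = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_words]))
        # Stop once this chunk has reached the end of the text.
        if start + max_words >= len(words):
            break
    return chunks


def process_in_chunks(
    text: str,
    clean_chunk: Callable[[str], str],
    max_words: int = 2000,
    overlap: int = 100,
) -> List[str]:
    """Run a (hypothetical) per-chunk cleanup function over every chunk.
    In practice `clean_chunk` would wrap a model API call."""
    return [clean_chunk(c) for c in chunk_words(text, max_words, overlap)]


if __name__ == "__main__":
    sample = "a b c d e f g h i j"
    # str.upper stands in for the model call.
    for piece in process_in_chunks(sample, str.upper, max_words=4, overlap=1):
        print(piece)
```

Keeping each request small like this is also one way to stay under a per-call time limit, though stitching the overlapping outputs back together is a separate problem that this sketch does not attempt.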


It's ironic that this is written about Twitter / Muskrat's dumb Russian-roulette product management, but this thread also really applies to all the rough edges of the product UX on Mastodon: https://twitter.com/MosquitoCapital/status/1650830660318162946
Mosquito Capital on Twitter

“Have you wondered why everyone is freaking out about the changes to Twitter Blue? Without talking about any particulars, I'll talk about why it's worrying. Background: I was a Site Reliability Engineer at Facebook/Instagram, and I've seen a lot of product changes come and go.”

@jmsdnns Sort of! I keep forgetting to post on here lol
So who is the @dril of Mastodon?
@almodozo @profcarroll I just downloaded the main mastodon.social app for iOS and have to admit it's pretty good, this will help I hope
trying to watch the new season of Love is Blind but I'm so impatient with all the melodrama that I set the playback to 1.25x and now everyone fidgeting makes it seem like they're meth-heads
@misc It took a couple tries but this was the original prompt:
@helvetica it's truly weird and feels like the kind of thing only a robot could come up with
@helvetica long time no see !
@profcarroll Fair! It does require a lot more intentionality ...