Me every time I find a tutorial for something I need help with, only to discover it's actually a video and not a text with images:
@astro_jcm @rysiek The AI app that watches YouTube (i.e. video-to-text) or listens to podcasts (i.e. audio-to-text) should just take my money.

@paninid @astro_jcm @rysiek I already run local LLMs and am working on interfacing Audio-to-text ;) This will soon be here. Free. Opensource.

Will not likely be me that manages to put something together - but all the tech is here and omg we are many that need to just get the actual content out of these 8 min+ (needed for monetization) Youtube videos ...

@troed @paninid @astro_jcm @rysiek

Any advice or resources on building local LLMs? I'm working on a personal project of a local LLM and am always looking for ideas.

@faberfedor @paninid @astro_jcm @rysiek Well building if you mean training from scratch is way out of consumer hw league still, but just using one of the LLMs through Oobabooga works fine. I have a 12GB VRAM GPU and have used various GPTQ 13b models.

I run a local chatbot for the family matrix channel that way: https://blog.troed.se/2023/03/19/create-your-own-locally-hosted-family-ai-assistant/

What I'm working on now is which audio2text product to interface this with. For text2audio I already use Tortoise.TTS.

So the thinking here is that when I have the audio2text up I can then pipe that through a model with a large enough context window and simply ask the LLM for a summary. In chat mode, this could then also be probed with detailed questions (keeping the same context).

I've had realtime usecases in mind mostly, but that's not needed here, which means there are probably quite a few audio2text projects to go back to.

Create your own locally hosted family AI assistant

it's in my head

@troed @paninid @astro_jcm

> one of the LLMs through Oobabooga

That's a(nother) new one to me.

My idea is to train in the cloud, fine-tune it (locally?) and host it locally. The fine-tuning is going to be more personal data: notes, tweets, emails, etc. How to do updates, OTOH...<shrug>

ATM I'm building an MLOps pipeline just 'cuz. I hadn't thought about an agent UI since my initial goal didn't require a UI.

Thanks for the blog post. It'll give me something to do while my bread bakes. 🙂

@faberfedor @troed @astro_jcm I came across this framing on LinkedIn which I thought was helpful:

@paninid @troed @astro_jcm

Thanks for that. Looks like a decent way to differentiate customers for my freelance business.

@astro_jcm how can I listen to video+audio when I'm in a room with people already listening to video+audio?
@astro_jcm one of my pet peeves. I quickly move away if it's not text/images.
@astro_jcm There's a Perl script at https://github.com/obra/Youtube2Webpage that will turn instructional videos into static screenshots-plus-text webpages.
GitHub - obra/Youtube2Webpage: I learn much better from text than from videos

I learn much better from text than from videos. Contribute to obra/Youtube2Webpage development by creating an account on GitHub.

GitHub
@BenMonreal best link this evening. Thank you :)
@BenMonreal @astro_jcm anyone here who can put this in a simple website for me? Just a box where we paste the link to the video and poof there’s the transcript…
@astro_jcm no need to attack me like that, thanks ;)
@astro_jcm This is an interesting preference. Oral culture, learning and transmission (ie video) tends to be easier for the teacher to produce (ie lecture) than translating their skill in to written formats (ie books, blogs etc). They can simply show how, instead of needing to tell how.
@BlinkPopShift For me on the receiving end it's the opposite: I lack the patience to sit through a 20 min video to learn something I could've read in 2 min. Also, given the elaborate production I see in many videos, and factoring in the prep time, scripting, extra takes, editing and such, I often wonder if just writing it down wouldn't have been faster. Not trying to police how others share knowledge, of course; and I get that other people might learn better from videos.
@astro_jcm oh interesting. I hadn't thought of high production quality as a factor. I wonder if they teachers simply get more money/views in that format. Can't remember the last time I read a blog post. I tend to be a video learner and just skip to each section I want to watch.
@BlinkPopShift @astro_jcm videos make more money because they keep you on the page longer, thus the statistics used by advertisers will favor a 20 minute video over a short table of text on a webpage that show the same thing. Advertising creates incentive to waste time.
@astro_jcm @BlinkPopShift as well as fluff, the opposite is true - eg "how to replace this part in your xyz " where I'm stutter- stop-rewind-start the video as I do each step of the disassembly. Of course this is after 20 mins of "it is easy to replace this part so I'll show you how to save time & money" fluff.
@Winwaed @astro_jcm @BlinkPopShift That is what annoys me. If this video is about "How to do X" then just immediately tell me how to do X. Enough waffle!
@aroundthehills @Winwaed @astro_jcm @BlinkPopShift

They (creators) get paid for the waffle. Time watched = $$ do the incentives are misaligned
@astro_jcm @BlinkPopShift I'm with @astro_jcm on this one. Videos always feel like an atrocious waste of time to me, no matter how much I skip ahead. Lol
@BlinkPopShift @astro_jcm an alternative approach is the one taken by Stanford’s “Code in Place” which offer alternatives side by side of video, examples and/or text with images.

@BlinkPopShift @astro_jcm I suspect you're right, but "easier for the teacher" only matters when we overburden instructors to the Nth degree.

Redundancy of information across stylistic platforms - making the data available in "translations" for various learning strengths, is really necessary for a robustly educated, diverse populace.

Showing how is good. I can show you how an internal combustion
engine works, but this is of limited value when studying the chemistry of it. Words have value.

@BlinkPopShift @astro_jcm Easy for them; useless for the audience.

Who wants to constantly try to find the exact second on a video where the line of code you need to laboriously copy off the screen is shown? Who benefits from seeing a video shown linearly, one moment a time, vs. a page of text you can see all at once and skim visually to find the precise paragraph you need? Even being able to place markers at time points on a video doesn't help much, because you go to the marker, then wait 20 seconds for them to finish talking, then try to hold in your mind the exact, precise, thing they said, because programming isn't about getting the gist of a thing -- it's about an exact sequence of characters.

@astro_jcm My issue is when the tutorials you find are written for old and outdated versions of software you're trying to learn and use are completely different from the latest version where everything is changed.
@astro_jcm Best part of this videos is, that the information I searched is very often not in the video, even though the description tries to tell it is in the video.

@Nefex @astro_jcm I *loathe* click-baity video titles >.<

A youtuber had a video that claimed to be on figuring out what the right yarn for a crochet bag was.

Not so. It was a video where she crocheted a bag using a specific yarn >.<

@astro_jcm It’s the best with coding tutorials. 👨‍🍳 💋
@numbertheory @astro_jcm I bet the only reason apple allows to copy text from videos is exactly this. Coding videos are sometimes nice, but they should be an addition to the text.
@astro_jcm I second that! Why does everything have to be a video these days?
@astro_jcm ABSOFREAKINGLUTELY AHHH just let me read how to fix it and get on with my life. If it was 'how to save the world' and it begins with, HI my name is Bambi and let me start by telling you a little about myself....IM GONE!!!
@astro_jcm Me every time. I just want to read the how to, not watch it. That's just how I learn.

@astro_jcm

What I hate is how the "help" video is used for advertising, before, during, & after.

Surveillance capitalism.

The "advertisers" in Russia now know the make & model of your computer, operating system, software version, age, and that you're a novice in a particular area.

The perfect target for a scam, fraud, & microtargeted spam.

@astro_jcm Old woman here, same thing.
@astro_jcm Hey. I resemble that remark.
@astro_jcm Yes! I like watching videos ... but not really when there's something I need help with (at least most of the time--some things don't do well as text + images)
@astro_jcm i agree just a pain in my ass
@astro_jcm a video? Guess I don't really need to learn about it...
@astro_jcm You really are sub-posting me on this one, aren't you? I can 100% relate to this. If it's a video, I'll bounce as fast as possible.

@astro_jcm @aleen Q: “Why isn’t this a text document?”

A: “Just join the Discord.”

🤬🤬🤬

@astro_jcm Amen, brother! I hate video tutorials.
@astro_jcm Shaking that same fist.

@astro_jcm but how will you learn the content without the memes, a word from our sponsor, and of course you need to like and subscribe and slap that notification bell to get spicy takes like this in your inbox daily.

For real though, I’ve found that copying the video transcript into an ai summarizer works ok most of the time.

@astro_jcm I am Autistic and I have visual processing problems, or, as I like to call it, "Ooh! It's a picture of a blue! No, there's white specks... Oh! A red patch! Oh it's ... the SpaceX rocket, pre-deconstruction... of course.". ... or, A boat on the ocean, or Spiderman. See the problem?

Now, imagine that, but the pictures MOVE! *sigh*

*headtodesk*

#ActuallyAutistic #SensoryNeeds

@astro_jcm ugh. Same. I can read faster than you can talk in your video!

@astro_jcm Yep, most of the time, some text with pictures will do the job better and faster than a video.

Only a few tasks I need to learn actually require me to be shown the process step by step in video form. In those cases, a video is helpful, but they're not the majority.

@astro_jcm
LOGO REVEAL
LENS FLARE BLOOM
FADEOUT
Hey guys thanks for reading my reply to your toot! Sorry I haven't posted many replies lately. Today I'm going to tell you what I think about what you wrote, but first I want to cover a couple of personal updates and tell you about the future of my mastodon account.

@astro_jcm how the hell am I going to copy-paste from a friggin YouTube video? Was that a lower-case L or a number 1? Your screen recorder is potato so I can't say for sure.

It's almost as bad as "Quick and Easy Recipe for Falafel: Once upon a time in a land far far away, my aunt's cousin's neighbor's best friend's dog-walker's roommate's great-great-great-great-great-great-great-grandmother had a cow..."

@astro_jcm omg Yes! Why do they *do* that...?
Does anyone actually find them helpful?

Aaargg

@astro_jcm

"blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah blah ...,

Let's get started!

Blah blah..."

@astro_jcm
The "pivot to video" of the past decade was never based on efficiency or user preference. It was based on video being a more profitable medium for advertising and monetization. It's easy to ignore 100 ads surrounding a block of text, but hard to ignore a 30s preroll before a video.