At the AI Sauna hackathon today, we experimented with generating alt text for historical photos using the LLaVA-13B visual LLM running on the LUMI supercomputer. The results were surprisingly good!

Project page: https://meta.wikimedia.org/wiki/AI_Sauna/Generate_alt-texts_for_historical_images

Code/results: https://github.com/osma/llava-alttext-hkm

#AISauna #hackathon #LUMI #llava #llava13b #alttext #museum #photo #accessibility #AI #genAI

I have one image and two very different descriptions of it. The first is from #Llava13B, a local model installed on my #Mac, and the second is from #GPT. The GPT response is so long that I'm going to have to post it as threaded replies, but it shows the vast difference between the models. For this first post, the Llava13B response has been included as #AltText.

1/