Hertz' AI System That Scans for "Damage" on Rental Cars Is Turning Into an Epic Disaster

https://pawb.social/post/28818618

You mean an LLM that doesn’t have the ability to understand context fails at decisions that require context to make properly? Shocking /s
Except they are using computer vision, not an LLM
And what is processing that information?
Computer vision commonly uses convolutional neural networks on the input, which is different from the transformer neural networks used in LLMs. If you have more info indicating LLMs are used here please share
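To make the distinction concrete, here is a rough pure-Python sketch of the two core operations: a convolution slides a small kernel over neighboring pixels, while self-attention lets every token weigh every other token. The toy image, kernel, and token values are made up for illustration, and the attention uses identity Q/K/V projections as a simplification. It says nothing about what Hertz's scanner actually runs.

```python
import math

def conv2d_valid(image, kernel):
    """Slide a small kernel over a 2D grid (cross-correlation, no padding)."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j]
                for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]

def self_attention(tokens):
    """Single-head self-attention; Q, K, V are the tokens themselves (assumption)."""
    d = len(tokens[0])
    out = []
    for q in tokens:
        # Scaled dot-product score of this token against every token.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        # Numerically stable softmax over the scores.
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        weights = [w / total for w in weights]
        # Each output is a weighted mix of all token vectors.
        out.append([sum(w * v[i] for w, v in zip(weights, tokens)) for i in range(d)])
    return out

# Hypothetical 4x4 "image" with a vertical edge, and a 2x2 edge-detecting kernel.
image = [[0, 0, 1, 1] for _ in range(4)]
kernel = [[-1, 1], [-1, 1]]
edges = conv2d_valid(image, kernel)

# Hypothetical 3-token sequence of 2-dimensional embeddings.
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
mixed = self_attention(tokens)
```

The convolution only ever looks at a local 2x2 patch, whereas each row of `mixed` depends on all three tokens at once.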

If you have more info indicating LLMs are used here please share

Two seconds of research would reveal LLMs are ALL OVER COMPUTER VISION. Are convolutional networks used? Yes. Are LLMs used? Yes. And MLLMs.

Tell you what sparky: you find me a source that says ONLY CNNs are used, then you can act like a subject matter expert.

arxiv.org/abs/2311.16673

techcommunity.microsoft.com/blog/…/3927912

medium.com/…/multimodal-large-language-models-mll…

github.com/OpenGVLab/VisionLLM

chooch.com/…/how-to-integrate-large-language-mode…

Large Language Models Meet Computer Vision: A Brief Survey

Recently, the intersection of Large Language Models (LLMs) and Computer Vision (CV) has emerged as a pivotal area of research, driving significant advancements in the field of Artificial Intelligence (AI). As transformers have become the backbone of many state-of-the-art models in both Natural Language Processing (NLP) and CV, understanding their evolution and potential enhancements is crucial. This survey paper delves into the latest progressions in the domain of transformers and their subsequent successors, emphasizing their potential to revolutionize Vision Transformers (ViTs) and LLMs. This survey also presents a comparative analysis, juxtaposing the performance metrics of several leading paid and open-source LLMs, shedding light on their strengths and areas of improvement as well as a literature review on how LLMs are being used to tackle vision related tasks. Furthermore, the survey presents a comprehensive collection of datasets employed to train LLMs, offering insights into the diverse data available to achieve high performance in various pre-training and downstream tasks of LLMs. The survey is concluded by highlighting open directions in the field, suggesting potential venues for future research and development. This survey aims to underscores the profound intersection of LLMs on CV, leading to a new era of integrated and advanced AI models.

arXiv.org
I was actually referring to UVEye which was referenced in the article. I looked into UVEye and nowhere did it say it used LLMs with their computer vision. That’s why I asked if anyone had any info on them using it. The comment I replied to assumed LLMs were used but supplied no evidence. None of the links you shared have anything to do with UVEye either.

Computer vision commonly uses convolutional neural networks on the input,

Nowhere do you specify UVEye.

You could admit they’re all over, but instead double down on how I assumed lol

Except they are using computer vision, not an LLM

That’s what I initially said, referring to the article. If you have nothing to say regarding the technology in this article, that’s fine, but don’t assume that just because there is research on incorporating LLMs into computer vision, they were used in this specific case.

If you have more info indicating LLMs are used here please share

So I did. Whine about it, but they’re used in this field, if not in this particular case. You asked, I provided.