AA (@measure_plan)

three.js, tsl-webgpu, mediapipe 손 추적, kimi k2.5를 활용해 라센간(Rasengan) 블라스트 효과를 구현한 데모와 코드 공유 트윗입니다. AI/컴퓨터비전과 웹GPU를 결합한 창의적 인터랙티브 시연으로, 개발자들에게 참고할 만한 흥미로운 구현 사례입니다.

https://x.com/measure_plan/status/2038660367014891687

#threejs #webgpu #mediapipe #computervision #aidemo

AA (@measure_plan) on X

made a rasengan blast effect with threejs, tsl-webgpu, mediapipe hand tracking, kimi k2.5 live demo and code below 🌀

X (formerly Twitter)
de-embedding

Amsterdam, March 2026
Activations in the first layer of the YOLO (You Only Look Once) object detection AI model determine from which video frame to load pixels.

#urbanphotography #computationalart #generativeart #yolo #computervision #abstractstreet #amsterdam #video

Vision-language models are impressive—until you ask them something simple.
A recent study shows that state-of-the-art systems struggle with basic visual tasks like counting shapes or detecting overlaps, achieving only ~58% accuracy on average—far below human performance
So what are they actually “seeing”?
AI doesn’t perceive images the way we do. It approximates, infers, guesses. And sometimes, it fails where humans succeed instantly.
#AI #ComputerVision #AIRealityCheck

https://anhnguyen.me/2024/vlms-are-blind/

Vision Language Models Are Blind | Anh Totti Nguyen

While large language models with vision capabilities (VLMs), e.g., GPT-4o and Gemini-1.5 Pro, are powering various image-text applications and scoring high on many vision-understanding benchmarks, we find that they are surprisingly still struggling with low-level vision tasks that are easy to humans. Specifically, on BlindTest, our suite of 7 very simple tasks such as identifying

Anh Totti Nguyen | Associate Professor of Computer Science, Auburn University

AA (@measure_plan)

얼굴 사진과 링크드인용 프로필 이미지를 무료로 생성하는 앱 ‘Cursed Potato Head’가 공개되었다. 최신 컴퓨터 비전과 얼굴 인식 기술을 활용해 셀피를 전문적인 헤드샷으로 바꾸는 소비자용 AI 응용 사례로 주목된다.

https://x.com/measure_plan/status/2038278690488926266

#computervision #faceid #aiapp #headshot #product

AA (@measure_plan) on X

introducing Cursed Potato Head a free app to create profile photos and professional linkedin headshots built with SOTA computer vision and face ID tech please share your lovely selfies: https://t.co/pALWCnguiU

X (formerly Twitter)

Muhammad Rizwan Munawar (@muhammdrizwanmr)

공항에서 수천 개의 짐을 실시간으로 세어 운영 효율을 높이는 시스템을 직접 구축했다. 수하물 추적을 수동으로 처리하던 비효율을 개선하기 위해 Ultralytics의 YOLO26을 활용한 실시간 luggage counting AI를 구현한 사례다.

https://x.com/muhammdrizwanmr/status/2038101917466063142

#yolo #ultralytics #computervision #airportoperations #aiapplication

Muhammad Rizwan Munawar (@muhammdrizwanmr) on X

Real-time luggage counting at airports for smoother operations 🛄✈️ Airports handle thousands of suitcases and carry-ons every day. Tracking each bag manually is slow and error-prone, so I automated it. I built a real-time luggage counting system using @ultralytics YOLO26.

X (formerly Twitter)

fly51fly (@fly51fly)

통합 토큰화와 잠재 노이즈 제거를 위한 엔드투엔드 학습을 제안한 최신 연구입니다. CV 분야에서 토큰화와 denoising을 함께 최적화하는 접근으로, 새로운 모델 학습 방법론에 관한 기술적 발전을 다룹니다.

https://x.com/fly51fly/status/2038028760512139761

#computervision #tokenization #denoising #training #research

fly51fly (@fly51fly) on X

[CV] End-to-End Training for Unified Tokenization and Latent Denoising S Duggal, X Bai, Z Wu, R Zhang… [MIT & Adobe] (2026) https://t.co/40jwy5OVEW

X (formerly Twitter)

Alexander Dobrindt und das Bundesinnenministerium forcieren den Einsatz von Edge-KI zur Echtzeit-Gesichtserkennung an Bahnhöfen.

Computer-Vision-Modelle analysieren Videostreams lokal auf verdächtige Bewegungsmuster und scannen auf Waffen, während biometrische Daten kontinuierlich mit Polizeiregistern abgeglichen werden.

#ComputerVision #EdgeAI #Biometrie #Datenschutz #News
https://www.all-ai.de/news/news26/bahnhof-ueberwachung-ki

So sollen deutsche Bahnhöfe bald überwacht werden

Ein neuer politischer Vorstoß plant tiefgreifende technologische Änderungen an Verkehrsknotenpunkten. Diese KI-Modelle kommen zum Einsatz.

All-AI.de

AA (@measure_plan)

Three.js, MediaPipe, 디자인 실험 코드를 모아둔 프로젝트 모음이 공개되었습니다. 25개 이상의 프로젝트가 이미 있으며, 앞으로도 계속 추가될 예정이라고 밝혔습니다. 웹 기반 3D/비전 기술을 활용한 창의적 AI·인터랙티브 실험 사례로 볼 수 있습니다.

https://x.com/measure_plan/status/2037969073846309147

#threejs #mediapipe #webgl #computervision #opensource

AA (@measure_plan) on X

i post the code for my threejs - mediapipe - design experiments here. there's 25+ projects so far with more to come: https://t.co/szuzm1c0qf

X (formerly Twitter)

🗞️ NEWS: Meow.camera — a playful name hiding a serious tool in the AI camera space — has landed with notable community interest.

🔍 ANALYSIS: The 228 score signals real traction. As vision AI matures, tools like this show how quickly specialized use cases are emerging. Worth watching for privacy implications.

📎 Source: [meow.camera]

#AI #OpenSource #DevTools #ComputerVision #PrivacyTools

🗞️ NEWS: Meow.camera — a playful name hiding a serious tool in the AI camera space — has landed with notable community interest.

🔍 ANALYSIS: The 228 score signals real traction. As vision AI matures, tools like this show how quickly specialized use cases are emerging. Worth watching for privacy implications.

📎 Source: [meow.camera]

#AI #OpenSource #DevTools #ComputerVision #PrivacyTools