merve (@mervenoyann)

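Meta appears to have released Sapiens2, a family of high-resolution models trained on 1 billion human images. It reports SOTA results on pose estimation, body-part segmentation, surface normals, and pointmaps, comes in sizes from 0.1B to 5B parameters, and supports 1024×768 and 4K resolutions.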

https://x.com/mervenoyann/status/2054187884417102319

#meta #sapiens2 #computervision #opensource #foundationmodel

merve (@mervenoyann) on X

Meta silently dropped Sapiens2 last week 🔥 a family of high-res models trained on 1B human images > for pose estimation, body-part segmentation, surface normals, pointmaps (sota) > 6 sizes: 0.1B → 5B params (all ViT patch 16) > high-res: 1024×768 and 4K
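
To put the "ViT patch 16" detail in perspective, the arithmetic below counts how many patches a 1024×768 input produces. This is a back-of-the-envelope sketch, not Sapiens2 code: it assumes non-overlapping 16×16 patches and ignores any extra tokens, and `patch_grid` is a hypothetical helper name. The post does not say which of the two dimensions is the height; the total is the same either way.

```python
def patch_grid(height, width, patch=16):
    """Return (rows, cols, total) for non-overlapping square patches."""
    if height % patch or width % patch:
        raise ValueError("image size must be divisible by the patch size")
    rows, cols = height // patch, width // patch
    return rows, cols, rows * cols


if __name__ == "__main__":
    print(patch_grid(1024, 768))  # (64, 48, 3072)
```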

Project Manager for Data Science -- Arnaout Lab @UCSF

jobRxiv
Geniatech launches Renesas RZ/V2N, RZ/V2H, and RZ/V2L OSM Size-M/L system-on-modules

Geniatech has introduced three OSM system-on-modules powered by Renesas RZ/V2N/V2H/V2L Cortex-A55/M33 microprocessors, namely the OSM Size-M (45x35mm) SOM-V2N-OSM, plus the OSM Size-L (45x45mm) SOM-V2H-OSM and SOM-V2L-OSM modules, all designed for Edge AI and computer vision applications.

Geniatech SOM-V2N-OSM specifications:
SoC – Renesas RZ/V2N
  CPU – Quad-core Arm Cortex-A55 @ 1.8 GHz, Arm Cortex-M33 @ 200 MHz
  GPU – Arm Mali-G31 3D graphics engine (GE3D) with OpenGL ES 3.2 and OpenCL 2.0 FP
  VPU – Encode & decode
    H.264 – Up to 1920×1080 @ 60 fps (Renesas specs, but SOMDEVICES also mentions up to 4K @ 30 FPS)
    H.265 – Up to 3840×2160 @ 30 fps
  AI accelerator – DRP-AI3 up to 4 dense TOPS / 15 sparse TOPS
System Memory – 8GB LPDDR4x RAM
Storage – 64GB eMMC flash
476 LGA contacts with
  Display – 4-lane MIPI-DSI
  Camera – 2x 4-lane MIPI CSI-2
  Audio – 2x I²S
  Networking – 2x Gigabit Ethernet

CNX Software - Embedded Systems News

XGRIDS (@XGRIDS_OFFICIAL)

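XGRIDS showcases the Lixel K2, which it says delivers photorealistic color, denser point clouds, and sharper detail. The post stresses that, thanks to the LixelUpSample algorithm, even a QR code inside the point cloud can be scanned.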

https://x.com/XGRIDS_OFFICIAL/status/2053867609792106771

#lixelk2 #pointcloud #spatialintelligence #3dscan #computervision

XGRIDS (@XGRIDS_OFFICIAL) on X

Scan Challenge 🎈 Spot it in the video and scan it! Lixel K2 delivers photo-realistic color, denser point clouds, and sharper detail with LixelUpSample™ Algorithm— even making the QR code scannable from the point cloud. Tell us what surprise you found 👀 #SpatialIntelligence

ICYMI 👉 Faster pipelines, smarter inference, and sharper playback.

How our multimedia engineering team helped shape GStreamer 1.28 with hardware acceleration, zero-copy improvements, HDR and color support, AI integration, and key codec, RTP, and WebRTC fixes: http://www.collabora.com/news-and-blog/news-and-events/16-contributors-cross-stack-improvements-collabora-work-gstreamer-128.html

#GStreamer #AIInference #ComputerVision #EdgeAI

16 contributors, cross-stack improvements: Collabora's work on GStreamer 1.28

Our multimedia engineering team delivered major improvements to GStreamer 1.28 including hardware acceleration and zero-copy pipelines, HDR and color support for Wayland, and more.

Collabora | Open Source Consulting
How a multi-head CNN with position embeddings achieved 100% accuracy on fixed-length CAPTCHA OCR without using CRNNs or CTC loss. https://hackernoon.com/building-a-fixed-length-captcha-ocr-model-with-multi-head-classification #computervision
Building a Fixed-Length CAPTCHA OCR Model With Multi-Head Classification | HackerNoon
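
The multi-head idea in the headline can be sketched roughly as follows: one classifier head per character position, a training loss that sums the per-position cross-entropies, and decoding by taking each head's argmax, so no CTC alignment is needed. This is a minimal pure-Python illustration, not the article's model. The alphabet, the fixed length of 4, and all function names are hypothetical, and the CNN backbone and position embeddings are omitted (the per-head logits are passed in directly).

```python
import math

ALPHABET = "0123456789"  # hypothetical character set
CAPTCHA_LEN = 4          # hypothetical fixed CAPTCHA length


def softmax(logits):
    """Numerically stable softmax over one head's logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def multi_head_loss(head_logits, target):
    """Sum of cross-entropy losses, one term per character position."""
    assert len(head_logits) == len(target) == CAPTCHA_LEN
    loss = 0.0
    for logits, ch in zip(head_logits, target):
        probs = softmax(logits)
        loss -= math.log(probs[ALPHABET.index(ch)])
    return loss


def decode(head_logits):
    """Each head independently picks its most likely character."""
    return "".join(
        ALPHABET[max(range(len(logits)), key=logits.__getitem__)]
        for logits in head_logits
    )


def one_hot_logits(ch, strength=5.0):
    """Stand-in for a head's output: confident logits for one character."""
    return [strength if c == ch else 0.0 for c in ALPHABET]


if __name__ == "__main__":
    logits = [one_hot_logits(c) for c in "4071"]
    print(decode(logits))
    print(multi_head_loss(logits, "4071") < multi_head_loss(logits, "4072"))
```

Because the output length is fixed, each head only has to answer "which character is at position i", which is what makes the simpler per-position loss workable in this setting.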

Ultralytics (@ultralytics)

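At the Embedded Vision Summit, Ultralytics will present the latest Vision AI advances with live demos and cover how to build and deploy production-ready computer vision models for use across industries.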

https://x.com/ultralytics/status/2053532082391880010

#visionai #computervision #embeddedvision #ai #models

Ultralytics (@ultralytics) on X

Embedded Vision Summit - see you tomorrow! 🚀 Meet the team and experience live demos showcasing the latest advancements in Vision AI. Discover how to build and deploy production-ready computer vision models that drive efficiency and innovation across industries, from

Trump's Border Spending Spurs Boom in AI-Infused Surveillance

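The Trump administration's increased border security budget is fueling rapid growth in the AI-based surveillance technology industry. Several companies are showcasing surveillance systems that use AI to distinguish people from animals and to flag whether someone is carrying a weapon in real time, and these systems are being paired with hardware such as drones and sensors for use at the border. The trend is seen as a case where AI demonstrates practical, immediate value in the security sector.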

https://www.wsj.com/tech/trumps-border-spending-spurs-boom-in-ai-infused-surveillance-4714521b

#ai #surveillance #bordersecurity #drones #computervision

How to use OpenCV in Python, Make Your Hand Invisible Using OpenCV Magic Effect

While generative AI dominates the current landscape, the foundational principles of computer vision remain the bedrock of real-time spatial computing in 2026. This classic OpenCV implementation demons...

📺 Watch here: https://www.youtube.com/watch?v=hATXgqsfiJo

#OpenCV #ComputerVision #PythonProgramming

Python OpenCV Project 🔥 Make Invisible Hand | Computer Vision Magic Trick

CFP: “Computational Approaches to Art” in Computational Humanities Research. A sign that the debate is moving from “Can we use AI in art history?” toward “How does computation reshape what art history actually is?” #DigitalArtHistory #AI #ComputerVision #VisualCulture

CHR Special Issue: Computational Approaches to Art

Lin Du, UCLA. Deadline: Jun 30, 2026