Prithiv Sakthi (@prithivMLmods)

Map-Anything v1 데모가 Hugging Face Spaces에 공개되었습니다. 다중 이미지와 비디오를 이용해 3D 재구성, 깊이 추정, 노멀 맵 생성, 인터랙티브 측정을 수행하는 범용 3D 재구성 모델로, Gradio와 Rerun이 통합되었습니다.

https://x.com/prithivMLmods/status/2035055111358357957

#huggingface #3dreconstruction #gradio #computervision #opensource

Prithiv Sakthi (@prithivMLmods) on X

Map-Anything v1 (Universal Feed-Forward Metric 3D Reconstruction) demo is now available on Hugging Face Spaces. Built with @Gradio and integrated with @rerundotio , it performs multi-image and video-based 3D reconstruction, depth, normal map, and interactive measurements.

X (formerly Twitter)

Just published free word embeddings that beat the original word2vec.

66.5% on Google analogies vs 61%
Trained on 1/3 the data. Wikipedia, Gutenberg, arXiv, Stack Exchange, government docs. No web scrapes. Everything DFSG-compliant, GPL-3.0 licensed.

One GPU, four days, 107MB download.

https://huggingface.co/hackersgame/Free_Language_Embeddings

#NLP #OpenSource #FreeSoftware #AI #huggingface

hackersgame/Free_Language_Embeddings · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Sudo su (@sudoingX)

한 독립 연구자가 개인 자금으로 GPU를 빌려 Hugging Face에 29개 모델을 공개했고, GLM-4.7을 맥북에서 돌릴 수 있게 압축하고 Nemotron Super를 출시 직후 양자화했다는 내용이다. 소규모 개인 연구자가 빠르게 고품질 오픈소스 모델을 배포하는 사례로 주목된다.

https://x.com/sudoingX/status/2034903929105141831

#huggingface #opensource #quantization #llm #macbook

Sudo su (@sudoingX) on X

this guy has 29 models on huggingface at page 2 ranking. no lab behind him. no sponsorship. $2,000 from his own pocket on GPU rentals. he compressed GLM-4.7 to run on a MacBook and quantized Nemotron Super the week it dropped. all public. all free. nvidia is a trillion dollar

X (formerly Twitter)

Brie Wensleydale (@SlipperyGem)

Fibo-Edit-RMBG가 한 달 전에 공개되어 Hugging Face에서 이용 가능하다고 알립니다. 관련 커스텀 노드도 존재하지만 별점이 없어 주의가 필요하다는 경고가 있으며, 작성자는 배경 제거(RMBG) 워크플로우에 더 나은 모델을 적용할 의향이 있음을 언급합니다.

https://x.com/SlipperyGem/status/2034319825233924583

#rmbg #huggingface #fiboeditrmbg #opensource

Brie Wensleydale🧀🐭 (@SlipperyGem) on X

Fibo-Edit-RMBG has been out a month, you can try it out on HuggingFaces. Also, found a custom node for it, but it has ZERO stars, so proceed with caution. I have many workflows that need a good RMBG model, and I don't mind switching to a better one. https://t.co/BpcJaMhiR3

X (formerly Twitter)

merve (@mervenoyann)

Allen Institute for AI(allenai)가 공개한 MolmoPoint-8B 및 MolmoPoint-GUI-8B의 웹 데모가 Hugging Face Spaces에 올라와 체험 가능하다는 알림입니다. 제공된 링크를 통해 MolmoPoint 모델의 동작과 GUI 기반 인터페이스를 직접 확인하고 실험해볼 수 있습니다.

https://x.com/mervenoyann/status/2034346787243172120

#huggingface #allenai #molmopoint #demo #8b

merve (@mervenoyann) on X

they also have demos 🤩 https://t.co/z6mh8rvjJ1 https://t.co/t7f8efLTEp 🙌🏼

X (formerly Twitter)

Baidu Inc. (@Baidu_Inc)

리소스 모음: Qianfan-OCR 관련 논문(arXiv) 링크와 Baidu Qianfan 플랫폼 모델 페이지, HuggingFace에 공개된 Qianfan-OCR 모델 페이지, 및 GitHub 레포지토리 링크 등 참고 자료가 제공됨.

https://x.com/Baidu_Inc/status/2034265155253723256

#qianfanocr #arxiv #huggingface #github

Baidu Inc. (@Baidu_Inc)

배포 정보: 4B 파라미터 Qianfan-OCR이 단일 GPU 서빙 가능. W8A8 양자화 적용 시 단일 NVIDIA A100에서 1.024 페이지/초 처리. 단일 vLLM 인스턴스만으로 동작해 다단계 오케스트레이션이 필요없음. Baidu Qianfan 플랫폼에 배포되었고 가중치는 HuggingFace에 공개됨.

https://x.com/Baidu_Inc/status/2034265152267415770

#qianfanocr #quantization #vllm #huggingface

Jeff Boudier (@jeffboudier)

GTC 기조연설에서 젠슨은 Hugging Face를 소개하며 NVIDIA AI가 공개한 새로운 오픈 모델, 데이터셋, 블로그를 강조했습니다. 대표 발표로는 추론형 LLM인 Nemotron 3 Super 120A12B와 헬스케어 로보틱스용 데이터셋인 Open-H-Embodiment 등이 포함되어 있습니다.

https://x.com/jeffboudier/status/2033959279510884631

#nvidia #huggingface #nemotron #datasets #healthcare

Jeff Boudier 🤗 (@jeffboudier) on X

💚🤗💚 Jensen showing @huggingface during GTC keynote, where @NVIDIAAI dropped amazing new open models, datasets and blogs! Some of my favorites, links in comments: 🧠 Nemotron 3 Super 120A12B - Reasoning LLM 🏥 Open-H-Embodiment - Healthcare Robotics Dataset 🩻

X (formerly Twitter)

The entire stack is #opensource: dots.ocr model from #HuggingFace, vLLM for inference, #FastAPI proxy with parallel rendering + streaming. Total model size ~12GB, runs comfortably on any 24GB+ GPU.

Vision LLMs are making traditional OCR engines obsolete. No templates, no preprocessing rules, no layout config — just send an image, get structured text back. 🎯

Hands on with Adobe Firefly: Finally an image generator that can be used in school

Generative AI is a multimodal technology, with applications in text, image, video, audio, and code. Unfortunately, up until now, the actual usefulness of GAI in schools has been limited by technical and practical barriers. ChatGPT, for example, is easy to access but problematic in the classroom due to its obscure terms and conditions and dubious privacy and data storage. There are also ethical concerns with its construction, the bias in the output, and the potential to generate inappropriate […]

https://leonfurze.com/2023/09/18/hands-on-with-adobe-firefly-finally-an-image-generator-that-can-be-used-in-school/