Black Latents | Latent Diffusion is a Gradio application that uses RAVE-Latent Diffusion models to generate audio items from Black Latents, a RAVE V2 VAE trained on the Black Plastics series.
Play around with the demo here: https://huggingface.co/spaces/martstilde/black-latents-latent-diffusion-demo
LaDiR (Latent Diffusion Reasoner) combines a VAE with a latent diffusion model to improve the reasoning ability of LLMs. Thanks to its structured latent space and iterative refinement, LaDiR improves accuracy, diversity, and interpretability on math and planning benchmarks. #AI #LLM #MachineLearning #NLP #LatentDiffusion #TríTuệNhânTạo #MôHìnhNgônNgữ
https://www.reddit.com/r/singularity/comments/1o2vc7x/ladir_latent_diffusion_enhances_llms_for_text/
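For readers unfamiliar with the pattern the LaDiR post describes (encode into a latent space, refine iteratively there, then decode), here is a toy sketch in plain Python. It is not LaDiR's method or code: `DATA`, `encode`, `decode`, `predict_clean`, and `sample` are hypothetical names invented for illustration, the "denoiser" is a nearest-prototype lookup rather than a trained network, and the step size is a simple stand-in for a real noise schedule.

```python
import random

# Hypothetical toy dataset in "data space" (an assumption for illustration).
DATA = [[2.0, 0.0], [0.0, 2.0], [-2.0, -2.0]]

def encode(x):
    # Stand-in for a VAE encoder: a fixed invertible linear map to a 2-D latent.
    return [0.5 * (x[0] + x[1]), 0.5 * (x[0] - x[1])]

def decode(z):
    # Stand-in for a VAE decoder: the exact inverse of encode.
    return [z[0] + z[1], z[0] - z[1]]

# Latent codes of the dataset; these play the role of what a denoiser has learned.
PROTOTYPES = [encode(x) for x in DATA]

def predict_clean(z):
    # Stand-in for a learned denoiser: guess the nearest known clean latent.
    return min(PROTOTYPES, key=lambda p: (p[0] - z[0]) ** 2 + (p[1] - z[1]) ** 2)

def sample(steps=20, seed=0):
    rng = random.Random(seed)
    # Start from Gaussian noise in latent space.
    z = [rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)]
    for t in range(steps):
        target = predict_clean(z)
        # Move a growing fraction of the way toward the current clean estimate;
        # the fraction reaches 1.0 on the final step.
        alpha = 1.0 / (steps - t)
        z = [zi + alpha * (ti - zi) for zi, ti in zip(z, target)]
    # Map the refined latent back to data space.
    return decode(z)

if __name__ == "__main__":
    print(sample(steps=20, seed=0))
```

Different seeds start from different noise and can land on different dataset items, which loosely mirrors how sampling in a latent space yields diverse outputs.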
#Bolt3D claims to revolutionize 3D scene generation by directly creating renderable 3D representations from one or more images. It achieves unprecedented speed and quality without requiring computationally expensive optimization or augmentation steps.
https://arxiv.org/abs/2503.14445v1
#ComputerVision #VirtualReality #3DModeling #GoogleResearch #LatentDiffusion #FeedForwardModels
We present a latent diffusion model for fast feed-forward 3D scene generation. Given one or more images, our model Bolt3D directly samples a 3D scene representation in less than seven seconds on a single GPU. We achieve this by leveraging powerful and scalable existing 2D diffusion network architectures to produce consistent high-fidelity 3D scene representations. To train this model, we create a large-scale multiview-consistent dataset of 3D geometry and appearance by applying state-of-the-art dense 3D reconstruction techniques to existing multiview image datasets. Compared to prior multiview generative models that require per-scene optimization for 3D reconstruction, Bolt3D reduces the inference cost by a factor of up to 300 times.