Web Agent: Browser-native agent · profiles · tools

Web Agent은 브라우저 내에서 직접 실행되는 오픈소스 AI 에이전트로, 별도의 설치 없이 프로필별 격리된 작업공간과 지속적인 메모리 기능을 제공합니다. Node.js 런타임을 WebContainers 위에서 구동하며, 파일 조작, 세션 관리, 자동화, 웹 검색 등 다양한 내장 도구와 스킬을 지원해 복잡한 작업 흐름을 브라우저에서 바로 처리할 수 있습니다. API 키는 로컬에 암호화 저장되고, 사용자 상태는 서버에 저장되지 않아 개인정보 보호 측면에서도 유리합니다. 개발자와 파워 유저 모두를 위한 직관적이면서도 강력한 기능을 갖춘 브라우저 네이티브 AI 에이전트입니다.

https://github.com/nikola66/web-agent

#webagent #browsernative #aiagent #webcontainers #automation

GitHub - nikola66/web-agent: Browser-native agent · profiles · tools — by aratech | Zero installs, isolated, secured and self-evolving agent.

Browser-native agent · profiles · tools — by aratech | Zero installs, isolated, secured and self-evolving agent. - nikola66/web-agent

GitHub

merve (@mervenoyann)

Allen Institute for AI(AI2)가 웹 브라우저 사용과 클릭 작업을 수행하는 MolmoWeb을 공개했다. 4B와 8B 모델 및 관련 데이터셋을 함께 제공하며, Apache 2.0 라이선스로 배포해 오픈소스 웹 에이전트 연구와 개발에 중요한 진전이다.

https://x.com/mervenoyann/status/2036474373654118777

#allenai #molmoweb #opensource #webagent #llm

merve (@mervenoyann) on X

AI2 @allen_ai released MolmoWeb: 4B and 8B browser use (clicking) models as well as their datasets 🔥 all under Apache 2.0 license as always 💗

X (formerly Twitter)

AshutoshShrivastava (@ai_for_success)

rtrvr가 Rover를 출시했습니다. Rover는 웹사이트에 임베드 가능한 세계 최초의 웹 에이전트로, 자연어로 양식 작성, 체크아웃 흐름 탐색, 다중 단계 워크플로우 실행 등을 수행합니다. 핵심은 Rover가 완전한 DOM 네이티브 방식으로 작동해 화면 기반 비전(vision)에 의존하지 않는다는 점입니다.

https://x.com/ai_for_success/status/2022517215548026916

#rtrvr #rover #webagent #dom

AshutoshShrivastava (@ai_for_success) on X

rtrvr just launched Rover, the world’s first embeddable web agent that completes tasks on your site. It fills forms, navigates checkout flows, and executes multi step workflows using natural language. This is possible because Rover is fully DOM native and does not use any vision

X (formerly Twitter)

AI in overdrive

I asked my health insurance #webagent
"My daughter lost her insurance card"

The #AI #agent replied: "Condolences with your loss. We understand a lot is coming at you in this sorrowful time. To help you as good as possible we'd like to know for which insurance you want to report a decease.". One of the options is the death of a pet.

Not shown was that I got the same reply after responding "No one has died. My daughter lost her insurance card".

#funny

AI in the browser is getting a little too bold.

Fuji-Web lets you type a task and then just… does it.
Clicks the buttons, fills the forms, scrolls like it owns the place - and explains its moves like a junior dev trying to sound confident.

Runs inside your browser.
Powered by your OpenAI/Anthropic key.
Open-source and already acting like your digital intern.

Workflows + knowledge base coming soon.

https://github.com/normal-computing/fuji-web

#AI #opensource #automation #webagent #techlife #devlife

"Các tác nhân AI đang đối mặt với thách thức phát hiện bot khi hoạt động trên các trang web thực tế. Làm thế nào để các nhà phát triển xử lý vấn đề này? Các giải pháp như "humanization" hay chạy trong phiên trình duyệt thực sự của người dùng có hiệu quả? #AI #BotDetection #WebAgent #TríTuệNhânTạo #PhátHiệnBot"

https://www.reddit.com/r/LocalLLaMA/comments/1o1zlt0/how_are_production_ai_agents_dealing_with_bot/

#WebAgent Series by #Alibaba #TongyiLab - Advanced Web Navigation Models #AI #LLM #opensource 🌐

🚀 #WebSailor achieves 12% on BrowseComp-en, 30.1% on BrowseComp-zh & 55.4% on #GAIA benchmarks

🧵 👇

#WebAgent wird aktiv 🌐 Durch #ProjectMariner kann der AI Mode Aufgaben im Web erledigen, etwa Tickets vergleichen oder Restaurantreservierungen vorschlagen.

Zukunft der #GoogleSuche 🔮 Google plant, die besten Funktionen des AI Mode langfristig in das Herzstück der Suche zu integrieren – das ist erst der Anfang.

👉 https://eicker.TV #Technik #Medien #Politik #Wirtschafthttps://eicker.BE/ratung (2/2)

eicker.TV ▹ Video News: Tech, Media, Politik, Kurzvideos

eicker.TV liefert tagtäglich die wichtigsten Tech News als Video News auf TikTok, YouTube Shorts, Instagram Reels. Alle Nachrichten: eicker.news

eicker.BEratung

Google's Project Mariner is here! 🤖 This AI agent browses the web for you, saving time on research & more. 🚀

How can Softsasi help? We integrate AI solutions to boost your business!
👉 softsasi.com

#AI #Google #ProjectMariner #WebAgent #Softsasi

Google DeepMind and the University of Tokyo Researchers Introduce WebAgent: An LLM-Driven Agent that can Complete the Tasks on Real Websites Following Natural Language Instructions

https://www.marktechpost.com/2023/07/29/google-deepmind-and-the-university-of-tokyo-researchers-introduce-webagent-an-llm-driven-agent-that-can-complete-the-tasks-on-real-websites-following-natural-language-instructions/

"Thorough research shows that linking task planning with HTML summary in specialized language models is crucial for task performance, increasing the success rate on real-world online navigation by over 50%." -- #AneeshTickoo

For more see: https://arxiv.org/abs/2307.12856

#gen_ai #api360 #webAgent

Google DeepMind and the University of Tokyo Researchers Introduce WebAgent: An LLM-Driven Agent that can Complete the Tasks on Real Websites Following Natural Language Instructions

Several natural language activities, including arithmetic, common sense, logical reasoning, question-and-answer tasks, text production, and even interactive decision-making tasks, may be solved using large language models (LLM). By utilizing the ability of HTML comprehension and multi-step reasoning, LLMs have recently shown excellent success in autonomous web navigation, where the agents control computers or browse the internet to satisfy the given natural language instructions through the sequence of computer actions. The absence of a preset action space, the lengthier HTML observations compared to simulators, and the lack of HTML domain knowledge in LLMs have all negatively impacted web navigation on real-world

MarkTechPost