Retail pricing has long combined data, experience, and instinct – but today’s market volatility demands a faster, smarter approach. https://www.zyte.com/blog/how-price-extraction-is-fuelling-insights-for-modern-retailers?utm_campaign=blog-2025-catchup&utm_activity=ORS&utm_medium=social&utm_source=mastodon

#webscraping #webdata #data #web

How price extraction is fuelling insights for modern retailers

Retail pricing has long combined data, experience, and instinct – but today’s market volatility demands a faster, smarter approach.

Zyte

🦊 Anti-detect browser Camoufox

#firefox #networking #scraping #fingerprint #webscraping #playwright #antidetect #antidetect_browser

A stealthy, minimalistic, custom build of Firefox for web scraping, robust fingerprint injection & anti-bot evasion.

Highlights

- Invisible to anti-bot systems 🎭
Page agent is hidden from JavaScript inspection.

- Fingerprint injection & rotation (without JS injection!)
- All navigator properties (device, OS, hardware, browser, etc.) ;
- Screen size, resolution, window, & viewport properties;
- Geolocation, timezone, locale, & Intl spoofing;
- WebRTC IP spoofing at the protocol level;
- Voices, speech playback rate, etc.

- Anti Graphical fingerprinting
- WebGL parameters, supported extensions, context attributes, & shader precision formats;
- Font spoofing & anti-fingerprinting.

- Human-like mouse movement
- Blocks & circumvents ads
- No CSS animations

https://github.com/daijro/camoufox

Discover how AI and LLMs are enhancing web scraping with smarter crawling, fuzzy data extraction, automated spider generation, and intelligent QA. https://www.zyte.com/blog/four-sweet-spots-for-ai-in-web-scraping?utm_campaign=blog-2025-catchup&utm_activity=ORS&utm_medium=social&utm_source=mastodon

#webscraping #webdata #data #web

Four sweet spots for AI in web scraping

Discover how AI and LLMs are enhancing web scraping with smarter crawling, fuzzy data extraction, automated spider generation, and intelligent QA.

Zyte

Nicolò Boschi (@nicoloboschi)

Apify의 스크래핑 기능이 AI 에이전트에게 더 풍부한 컨텍스트를 제공할 수 있다고 언급하며, 오픈소스 장기 기억(long-term memory)을 추가하려면 관련 벤치마크를 확인해보라고 제안한다. AI 에이전트, 데이터 수집, 메모리 평가에 관심 있는 개발자에게 유용한 참고 트윗이다.

https://x.com/nicoloboschi/status/2038954037110820992

#apify #aiagents #webscraping #longtermmemory #benchmarks

Nicolò Boschi (@nicoloboschi) on X

@moritzkremb Apify's scraping capabilities can significantly enhance the context available to AI agents. To add open source long term memory, it is worth checking the benchmarks. https://t.co/c5F9bMmgdi

X (formerly Twitter)

Moritz Kremb (@moritzkremb)

Apify가 AI 에이전트를 위한 강력한 도구로 소개되었습니다. Google Maps, Facebook, LinkedIn, Instagram 등 수천 개의 사전 구축 스크래퍼를 제공하며, CLI를 통해 OpenClaw, Claude Code, Hermes 같은 에이전트가 쉽게 연동할 수 있다는 내용입니다.

https://x.com/moritzkremb/status/2038930737567502667

#apify #aiagents #webscraping #cli #automation

Moritz Kremb (@moritzkremb) on X

Apify is one of the most powerful tools for your AI agent. It gives you access to thousands of pre-built scrapers: · Google Maps · Facebook · LinkedIn · Instagram ... you name it With the CLI it's easier than ever to have your OpenClaw, Claude Code or Hermes agent set this up

X (formerly Twitter)

Scaling your business’ web data gathering – acquiring, monitoring and storing a growing amount of data from a growing number of sources over time – requires holistic planning. https://www.zyte.com/blog/ten-building-blocks-to-scale-web-scraping?utm_campaign=blog-2025-catchup&utm_activity=ORS&utm_medium=social&utm_source=mastodon

#webscraping #webdata #data #web

From script to system: 10 building blocks to scale web scraping

Scaling your business’ web data gathering – acquiring, monitoring and storing a growing amount of data from a growing number of sources over time – requires holistic planning.

Zyte

As the web continues to evolve, Zyte API is evolving right alongside it—adding powerful new features and refinements designed to make data extraction smarter, faster, and more adaptable than ever. https://www.zyte.com/blog/new-in-zyte-scroll-control-lower-costs-and-more?utm_campaign=blog-2025-catchup&utm_activity=ORS&utm_medium=social&utm_source=mastodon

#webscraping #webdata #data #web

New in Zyte: Scroll Control, Lower Costs, and More

As the web continues to evolve, Zyte API is evolving right alongside it—adding powerful new features and refinements designed to make data extraction smarter, faster, and more adaptable than ever.

Zyte

Agent Reach는 AI 에이전트에 트위터·유튜브·레딧·깃허브·소셜 플랫폼 등을 한 줄 명령으로 바로 연결해 주는 오픈소스 툴킷입니다. yt-dlp·bird·gh·Jina Reader 등 기존 도구를 플러그인처럼 연결하고 agent-reach doctor로 상태 진단, 로컬 쿠키 저장·안전모드·dry-run 등 보안 옵션을 제공하며 대부분 무료(서버 프록시 비용 별도), Claude/OpenClaw/Cursor 등과 호환됩니다.

https://github.com/Panniantong/Agent-Reach

#ai #agents #opensource #webscraping #privacy

GitHub - Panniantong/Agent-Reach: Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees.

Give your AI agent eyes to see the entire internet. Read & search Twitter, Reddit, YouTube, GitHub, Bilibili, XiaoHongShu — one CLI, zero API fees. - Panniantong/Agent-Reach

GitHub
Quo Vadis, Crawlers? Progress and what’s next on safeguarding our infrastructure

One year ago, the Wikimedia Foundation reported a significant increase in bot traffic to the Wikimedia projects, largely coming from crawlers who extract content to train generative AI systems. We …

Diff

What does the future hold for the tool some describe as “the gift that revolutionised web scraping”? https://www.zyte.com/blog/the-future-of-scrapy?utm_campaign=blog-2025-catchup&utm_activity=ORS&utm_medium=social&utm_source=mastodon

#webscraping #webdata #data #web

The future of Scrapy: Smarter, faster and ready for AI-powered scraping

What does the future hold for the tool some describe as “the gift that revolutionised web scraping”?

Zyte