NO SOUP FOR YOU!

Playwright + Ollama
BEAUTIFUL DATA

Build a self-auditing data pipeline that keeps my MariaDB in perfect sync.

Full workflow: https://dufospy.com/artificial-intelligence/data-mining-web-scraping-playwright-ollama

#Beautifulsoup #Playwright #data #scraping

@playwrightweb

@JamieWitter

@ollama

Scraping without proxies is like driving without a GPS - you'll get lost in the "real" web 🚀 #webautomation #scraping

https://medium.com/code-your-own-path/why-proxies-are-essential-for-web-scraping-and-automation-c63a1eaa7bee

Why Proxies Are Essential For Web Scraping And Automation

One of the biggest mistakes people make in web scraping is assuming that code is the hard part.

Medium

Scrapy's Spider classes are overengineered for most projects, restricting efficiency with unnecessary complexity 💡 #webdev #scraping

https://medium.com/@aleksei.aleinikov.gr/mastering-web-scraping-using-scrapy-on-python-to-extract-data-255953b5ae3b

Mastering Web Scraping: Using Scrapy on Python to Extract Data

Today, we embark on an exciting journey into the world of web scraping, armed with one of the most powerful and reliable tools in Python’s…

Medium

OpenAI violated Canadian privacy laws, federal and provincial watchdogs say

Commissioners from four of Canada’s privacy watchdogs have found that OpenAI violated Canadian privacy laws while developing and training its early models of ChatGPT.

Philippe Dufresne, Canada’s privacy commissioner, was joined by his provincial counterparts from British Columbia, Alberta, and Québec to announce the findings of a joint investigation into the tech giant. The investigation examined how OpenAI sourced training data for its early GPT-3.5 and GPT-4 models, which included scraped content from publicly accessible internet sources like social media and blog posts, licensed third-party sources like media outlets and stock image vendors, and user interactions with ChatGPT.

Dufresne noted that all four regulators found OpenAI had violated various federal and provincial privacy laws, including the federal Personal Information Protection and Electronic Documents Act (PIPEDA), and its provincial counterparts in Alberta, BC, and Québec. 

Read more at BetaKit

#Alberta #BritishColumbia #consent #OPC #scraping

Today's commercial generative AI models have been developed in violation of the General Data Protection Regulation (GDPR) and the Intellectual Property Law (Ley de Propiedad Intelectual, LPI). The technology companies never asked for consent. That is why we speak of DATA THEFT.

#AI #genAI #generativeAI #data #datos #robo #robodedatos #theft #stolen #illegal #technology #bigtech #author #scraping

#Nvidia & co are a bit like the #Monsanto of cyberspace. genAI favours the very wealthy, so independent creative companies will be unable to compete in the so-called "AI" era, that is, the #scraping era:

AI doesn't produce data ex nihilo.

Creative companies will be forced to sell their #data to content brokers for cheap, since there's no competition. As a result, creative companies will no longer be creative: R&D is expensive and slow, so they'll use Nvidia & co products to sprint faster. #fashion

We're being scraped by AliyunSecBot/Aliyun from Alibaba. It's already blocked; the firewall will lift the block shortly and that's that... they're scraping our local Wikipedia instance #bot #scraping #ddos

I've open-sourced my B2B prospecting pipeline 🧲
Lead Scraper Pro v2.1
→ 6 sources scraped in parallel
→ Deduplication + automatic email enrichment
→ 1 clean CSV for Mailchimp / Lemlist / CRM
Node.js + Playwright · MIT License · Free
⭐ github.com/molokoloco/lead-scraper-pro
#B2B #OpenSource #Scraping #LeadGen

⚖️ Munich Regional Court (Landgericht München), judgment of 29.09.2023, 11 O 1884-22: Public profile data need not be protected against collection by third parties. #Scraping #Soziale #Netzwerke #Schadensersatz #teamdatenschutz #dsgvoportal https://www.dsgvo-portal.de/gerichtsentscheidungen/2023-09-29-LGM-11-O-1884-22-Scraping-Soziale-Netzwerke-Schadensersatz-2709.php
Public profile data need not be protected against collection by third parties. | 24.04.2026 | dsgvo-portal.de

No GDPR damages for scraping of publicly accessible data without a demonstrable violation, damage, and causation.

Compliance Essentials GmbH

Online advertising: Deficient implementation of data subject rights?

Online advertising is increasingly leading to data protection complaints, in particular because automated methods such as the scraping of personal data are being used ever more frequently. However, many companies that use such methods lack the necessary processes or the (...)
https://www.dr-datenschutz.de/onlinewerbung-mangelhafte-umsetzung-von-betroffenenrechten/

#Betroffenenrechte #E-Mail-Werbung #Informationspflicht #Marketing #Scraping

Online advertising: Deficient implementation of data subject rights?

Online advertising is coming under closer data protection scrutiny; scraping and the safeguarding of data subject rights in particular are examined in light of current guidance from the HmbBfDI.

Dr. Datenschutz