norden.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Moin! Dies ist die Mastodon-Instanz für Nordlichter, Schnacker und alles dazwischen. Folge dem Leuchtturm.

Administered by:

Server stats:

3.4K
active users

#scraping

12 posts5 participants1 post today
Frontend Dogma<p>Meet LLMs.txt, a Proposed Standard for AI Website Content Crawling, by @searchengineland.bsky.social:</p><p><a href="https://searchengineland.com/llms-txt-proposed-standard-453676" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">searchengineland.com/llms-txt-</span><span class="invisible">proposed-standard-453676</span></a></p><p><a href="https://mas.to/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a> <a href="https://mas.to/tags/crawling" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>crawling</span></a> <a href="https://mas.to/tags/scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scraping</span></a> <a href="https://mas.to/tags/robotstxt" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>robotstxt</span></a></p>
@reiver ⊼ (Charles) :batman:<p>5/</p><p>For example, if software request data from a web-site, and the web-site returns HTML, but parts of the HTML has semantics marked up with a machine-legible format such as microformats, microdata, RDFa, etc, then it is NOT scraping.</p><p>(microformats, microdata, RDFa, etc, are machine-legible format, designed to express semantics to machines.)</p><p><a href="https://mastodon.social/tags/Scraper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraper</span></a> <a href="https://mastodon.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://mastodon.social/tags/WebScraper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraper</span></a> <a href="https://mastodon.social/tags/WebScraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraping</span></a></p>
@reiver ⊼ (Charles) :batman:<p>4/</p><p>For example, if software request data from a web-site, and the web-site returns HTML, but that HTML contains a &lt;script&gt; tag with JSON-LD in it, and the software consumes that JSON-LD, then it is NOT scraping.</p><p>(JSON-LD is a machine-legible format, designed to express semantics to machines.)</p><p><a href="https://mastodon.social/tags/Scraper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraper</span></a> <a href="https://mastodon.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://mastodon.social/tags/WebScraper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraper</span></a> <a href="https://mastodon.social/tags/WebScraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraping</span></a></p>
@reiver ⊼ (Charles) :batman:<p>3/</p><p>For example, if software request data from a web-site, and the web-site returns JSON, XML, or some other machine-legible format, then it is NOT scraping.</p><p><a href="https://mastodon.social/tags/Scraper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraper</span></a> <a href="https://mastodon.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://mastodon.social/tags/WebScraper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraper</span></a> <a href="https://mastodon.social/tags/WebScraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraping</span></a></p>
@reiver ⊼ (Charles) :batman:<p>2/</p><p>Scraping (as in Web Scraping) is the act of extracting data from HTML web-pages where the data is NOT machine-legible.</p><p>If the data, even in an HTML web-page, is in a machine-legible format, then it is NOT scraping.</p><p><a href="https://mastodon.social/tags/Scraper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraper</span></a> <a href="https://mastodon.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://mastodon.social/tags/WebScraper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraper</span></a> <a href="https://mastodon.social/tags/WebScraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraping</span></a></p>
@reiver ⊼ (Charles) :batman:<p>1/</p><p>I am understanding when a non-technical person uses the noun "scraper" (as in "web scraper") or the verb "scrape" in a way that isn't accurate.</p><p>But, I am surprised when what seems to be a technical person uses the word "scraper", "scrape", or "scraping" inaccurately — either claiming things that are NOT scrapers to be scrapers, or claiming that acts that are NOT scraping are scraping.</p><p>...</p><p><a href="https://mastodon.social/tags/Scraper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraper</span></a> <a href="https://mastodon.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://mastodon.social/tags/WebScraper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraper</span></a> <a href="https://mastodon.social/tags/WebScraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebScraping</span></a></p>
DSGVO-Portal<p>⚖️ Oberlandesgericht Düsseldorf, Urteil vom 14.03.2025, 16 U 157-24: Schadensersatzanspruch wegen "Scraping". <a href="https://social.tchncs.de/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://social.tchncs.de/tags/Schadensersatz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Schadensersatz</span></a> <a href="https://social.tchncs.de/tags/Datenschutzversto%C3%9F" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Datenschutzverstoß</span></a> <a href="https://social.tchncs.de/tags/Immaterieller" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Immaterieller</span></a> <a href="https://social.tchncs.de/tags/Schaden" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Schaden</span></a> <a href="https://social.tchncs.de/tags/teamdatenschutz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>teamdatenschutz</span></a> <a href="https://social.tchncs.de/tags/dsgvoportal" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>dsgvoportal</span></a> <a href="https://www.dsgvo-portal.de/gerichtsentscheidungen/2025-03-14-OLGDUS-16-U-157-24-Scraping-Schadensersatz-Datenschutzverstoß-Immaterieller-Schaden-2204.php" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">dsgvo-portal.de/gerichtsentsch</span><span class="invisible">eidungen/2025-03-14-OLGDUS-16-U-157-24-Scraping-Schadensersatz-Datenschutzverstoß-Immaterieller-Schaden-2204.php</span></a></p>
DSGVO-Portal<p>⚖️ Landgericht Darmstadt, Urteil vom 09.10.2024, 13 O 227-23: Keine Schadensersatzpflicht bei unbegründeter Missbrauchsbefürchtung und bloßer Belästigung nach Scraping-Ereignis.. <a href="https://social.tchncs.de/tags/Schadensersatz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Schadensersatz</span></a> <a href="https://social.tchncs.de/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://social.tchncs.de/tags/Identifikation" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Identifikation</span></a> <a href="https://social.tchncs.de/tags/Personenbezogene" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Personenbezogene</span></a> <a href="https://social.tchncs.de/tags/Daten" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Daten</span></a> <a href="https://social.tchncs.de/tags/teamdatenschutz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>teamdatenschutz</span></a> <a href="https://social.tchncs.de/tags/dsgvoportal" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>dsgvoportal</span></a> <a href="https://www.dsgvo-portal.de/gerichtsentscheidungen/2024-10-09-LGDAM-13-O-227-23-Schadensersatz-Scraping-Identifikation-Personenbezogene-Daten-2199.php" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">dsgvo-portal.de/gerichtsentsch</span><span class="invisible">eidungen/2024-10-09-LGDAM-13-O-227-23-Schadensersatz-Scraping-Identifikation-Personenbezogene-Daten-2199.php</span></a></p>
DSGVO-Portal<p>⚖️ LG Mönchengladbach, Urteil vom 21.11.2024, 3 O 391-23: Kein Schadensersatz bei fehlendem Kausalzusammenhang und fehlendem Nachweis eines gegenwärtigen Schadens. <a href="https://social.tchncs.de/tags/Schadensersatz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Schadensersatz</span></a> <a href="https://social.tchncs.de/tags/Soziale" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Soziale</span></a> <a href="https://social.tchncs.de/tags/Netzwerke" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Netzwerke</span></a> <a href="https://social.tchncs.de/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://social.tchncs.de/tags/Immaterieller" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Immaterieller</span></a> <a href="https://social.tchncs.de/tags/Schaden" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Schaden</span></a> <a href="https://social.tchncs.de/tags/teamdatenschutz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>teamdatenschutz</span></a> <a href="https://social.tchncs.de/tags/dsgvoportal" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>dsgvoportal</span></a> <a href="https://www.dsgvo-portal.de/gerichtsentscheidungen/2024-11-21-LGMöG-3-O-391-23-Schadensersatz-Soziale-Netzwerke-Scraping-Immaterieller-Schaden-2192.php" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">dsgvo-portal.de/gerichtsentsch</span><span class="invisible">eidungen/2024-11-21-LGMöG-3-O-391-23-Schadensersatz-Soziale-Netzwerke-Scraping-Immaterieller-Schaden-2192.php</span></a></p>
DSGVO-Portal<p>⚖️ Oberlandesgericht Dresden, Urteil vom 10.12.2024, 4 U 653-24: Ansprüche wegen des Verlustes der Kontrolle über personenbezogene Daten. <a href="https://social.tchncs.de/tags/Soziale" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Soziale</span></a> <a href="https://social.tchncs.de/tags/Netzwerke" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Netzwerke</span></a> <a href="https://social.tchncs.de/tags/Schadensersatz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Schadensersatz</span></a> <a href="https://social.tchncs.de/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://social.tchncs.de/tags/Unterlassungsanspruch" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Unterlassungsanspruch</span></a> <a href="https://social.tchncs.de/tags/teamdatenschutz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>teamdatenschutz</span></a> <a href="https://social.tchncs.de/tags/dsgvoportal" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>dsgvoportal</span></a> <a href="https://www.dsgvo-portal.de/gerichtsentscheidungen/2024-12-10-OLGDD-4-U-653-24-Soziale-Netzwerke-Schadensersatz-Scraping-Unterlassungsanspruch-2187.php" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">dsgvo-portal.de/gerichtsentsch</span><span class="invisible">eidungen/2024-12-10-OLGDD-4-U-653-24-Soziale-Netzwerke-Schadensersatz-Scraping-Unterlassungsanspruch-2187.php</span></a></p>
DSGVO-Portal<p>⚖️ Oberlandesgericht Stuttgart, Urteil vom 04.12.2024, 4 U 97-24: Keine Ansprüche nach DSGVO bei ungewissem Zeitpunkt eines Scraping-Vorfalls. <a href="https://social.tchncs.de/tags/Schadensersatz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Schadensersatz</span></a> <a href="https://social.tchncs.de/tags/Soziale" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Soziale</span></a> <a href="https://social.tchncs.de/tags/Netzwerke" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Netzwerke</span></a> <a href="https://social.tchncs.de/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://social.tchncs.de/tags/teamdatenschutz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>teamdatenschutz</span></a> <a href="https://social.tchncs.de/tags/dsgvoportal" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>dsgvoportal</span></a> <a href="https://www.dsgvo-portal.de/gerichtsentscheidungen/2024-12-04-OLGSTUT-4-U-97-24-Schadensersatz-Soziale-Netzwerke-Scraping-2185.php" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">dsgvo-portal.de/gerichtsentsch</span><span class="invisible">eidungen/2024-12-04-OLGSTUT-4-U-97-24-Schadensersatz-Soziale-Netzwerke-Scraping-2185.php</span></a></p>
Jonathan Bailey<p>Last week, Wikimedia reported that AI bots saturated their available bandwidth. Here's why the bad bots are getting so much worse...</p><p><a href="https://www.plagiarismtoday.com/2025/04/10/the-battle-against-the-bots/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">plagiarismtoday.com/2025/04/10</span><span class="invisible">/the-battle-against-the-bots/</span></a></p><p><a href="https://mastodon.world/tags/Copyright" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Copyright</span></a> <a href="https://mastodon.world/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://mastodon.world/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://mastodon.world/tags/ArtificialIntelligence" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ArtificialIntelligence</span></a></p>
Lukas Mezger<p>1. Vereinbarung geschlossener KI-Systeme: die eigenen Daten bleiben hier in einem "Silo".<br>2. <a href="https://social.tchncs.de/tags/Anonymisierung" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Anonymisierung</span></a> der eigenen Daten: aufwändig und meistens letztlich nicht wirksam genug (verbleibende Rest-Informationen können de-anonymisiert werden).<br>3. Schutz der eigenen Daten vor KI-<a href="https://social.tchncs.de/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> durch <a href="https://social.tchncs.de/tags/Wasserzeichen" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Wasserzeichen</span></a> und Widerspruchs-Hinweise – leider nur sehr eingeschränkt wirksam.</p><p>Die Frage, die sich jede Organisation stellen sollte: Wie schütze ich meine IP vor "datenhungrigen" KI-Anbietern?</p>
Martinus Hoevenaar<p>Had to adjust my .htaccess file today, because a SEO company had their bot trying to scrape my site. It didn't get further than the index-page, but it was comparable to a small DDoS, as in 5700 hits per minute. <br>Now let's hope the adjustment helps.<br>If it doesn't then their domain will be added to the firewall. And if they continue, I'll ask my lawyer to send a cease &amp; desist. But for now: let's hope those motherfuckers stay away.</p><p><a href="https://mastodon.art/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a> <a href="https://mastodon.art/tags/bots" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>bots</span></a> <a href="https://mastodon.art/tags/seo" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>seo</span></a> <a href="https://mastodon.art/tags/ddos" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ddos</span></a> <a href="https://mastodon.art/tags/scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scraping</span></a> <a href="https://mastodon.art/tags/internet" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>internet</span></a></p>
DSGVO-Portal<p>⚖️ Oberlandesgericht München, Beschluss vom 09.01.2025, 7 W 1979-24 e: Streitwertbeschwerde in einem "Scraping"-Verfahren. <a href="https://social.tchncs.de/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://social.tchncs.de/tags/Streitwert" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Streitwert</span></a> <a href="https://social.tchncs.de/tags/Immaterieller" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Immaterieller</span></a> <a href="https://social.tchncs.de/tags/Schaden" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Schaden</span></a> <a href="https://social.tchncs.de/tags/teamdatenschutz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>teamdatenschutz</span></a> <a href="https://social.tchncs.de/tags/dsgvoportal" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>dsgvoportal</span></a> <a href="https://www.dsgvo-portal.de/gerichtsentscheidungen/2025-01-09-OLGM-7-W-Scraping-Streitwert-Immaterieller-Schaden-2181.php" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">dsgvo-portal.de/gerichtsentsch</span><span class="invisible">eidungen/2025-01-09-OLGM-7-W-Scraping-Streitwert-Immaterieller-Schaden-2181.php</span></a></p>
DSGVO-Portal<p>⚖️ Landgericht Lüneburg, Urteil vom 24.01.2025, 15 O 104-23: Schadensersatz bei Scraping von personenbezogenen Daten. <a href="https://social.tchncs.de/tags/Soziale" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Soziale</span></a> <a href="https://social.tchncs.de/tags/Netzwerke" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Netzwerke</span></a> <a href="https://social.tchncs.de/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a> <a href="https://social.tchncs.de/tags/Schadensersatz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Schadensersatz</span></a> <a href="https://social.tchncs.de/tags/Personenbezogene" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Personenbezogene</span></a> <a href="https://social.tchncs.de/tags/Daten" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Daten</span></a> <a href="https://social.tchncs.de/tags/teamdatenschutz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>teamdatenschutz</span></a> <a href="https://social.tchncs.de/tags/dsgvoportal" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>dsgvoportal</span></a> <a href="https://www.dsgvo-portal.de/gerichtsentscheidungen/2025-01-24-LGLÜ-15-O-104-23-Soziale-Netzwerke-Scraping-Schadensersatz-Personenbezogene-Daten-2179.php" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">dsgvo-portal.de/gerichtsentsch</span><span class="invisible">eidungen/2025-01-24-LGLÜ-15-O-104-23-Soziale-Netzwerke-Scraping-Schadensersatz-Personenbezogene-Daten-2179.php</span></a></p>
Dr. Datenschutz<p>KI-Verordnung: verbotene Praktiken und die DSGVO</p><p>Die Umsetzung der KI-Verordnung wird von der Europäischen Kommission vorangetrieben. Ein Schwerpunkt liegt dabei auf den Praktiken, die nach der KI-Verordnung verboten sind. Zum Teil bietet auch die DSGVO bereits Schutz vor solchen Praktiken. Der Beitrag geht hierauf anhand eines Beispiels genauer e(...)<br><a href="https://www.dr-datenschutz.de/ki-verordnung-verbotene-praktiken-und-die-dsgvo/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">dr-datenschutz.de/ki-verordnun</span><span class="invisible">g-verbotene-praktiken-und-die-dsgvo/</span></a></p><p><a href="https://mastodon.social/tags/BiometrischeDaten" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BiometrischeDaten</span></a> <a href="https://mastodon.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Scraping</span></a></p>
ResearchBuzz: Firehose<p>The Markup: A Guide on How to Legally Web Scrape EU Data. “At The Markup, some of our data journalists recently had questions about the legal risks involved in scraping websites hosted in the European Union. We conducted our own research to answer this question, and offer a summary of what we learned below. Our goal is to help other journalists, researchers, and advocates come up with a […]</p><p><a href="https://rbfirehose.com/2025/04/06/the-markup-a-guide-on-how-to-legally-web-scrape-eu-data/" class="" rel="nofollow noopener noreferrer" target="_blank">https://rbfirehose.com/2025/04/06/the-markup-a-guide-on-how-to-legally-web-scrape-eu-data/</a></p>
Lowtide<p>Unpopular opinion: copyright was also violated when somebody put a didigtized Studio Ghibli image on the internet. To begin with.<br><a href="https://fairmove.net/tags/scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scraping</span></a> <a href="https://fairmove.net/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://fairmove.net/tags/ghibli" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ghibli</span></a></p>
Frontend Dogma<p>Web Scraping With Cheerio in 2025, by @apify.bsky.social:</p><p><a href="https://blog.apify.com/web-scraping-with-cheerio/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.apify.com/web-scraping-wi</span><span class="invisible">th-cheerio/</span></a></p><p><a href="https://mas.to/tags/guides" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>guides</span></a> <a href="https://mas.to/tags/scraping" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>scraping</span></a> <a href="https://mas.to/tags/tooling" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>tooling</span></a></p>