Phoronix Birthday: 20 years of great Linux content - let us donate for a good birthday celebration!
https://lemmy.world/post/16220304

Phoronix Birthday: 20 years of great Linux content - let us donate for a good birthday celebration! - Lemmy.World
Repost from:
https://libreddit.nl/r/linux_gaming/comments/1d8qi81/phoronix_birthday_20_years_of_great_linux_content/
[https://libreddit.nl/r/linux_gaming/comments/1d8qi81/phoronix_birthday_20_years_of_great_linux_content/]
> He really seemed downbeat in his announcements regarding the birthday. He
really puts a lot of work into the site but having a niche audience of tech
literate users is probably the worst place to be with ad sales tanking as they
do. If anybody is using adblockers, it’s us and people are cheap. I really hope
the guy has a nice birthday and gets lots of love and donations. The phoronix
content is always great and I’ve been a long time reader.
Claude AI crawlers effectively DDOSing Linux Mint forums
https://lemmy.world/post/14767952
Blocking AI crawlers with Caddy - Lemmy.World
I was reading the reddit thread on Claude AI crawlers effectively DDOSing Linux
Mint forums
https://libreddit.lunar.icu/r/linux/comments/1ceco4f/claude_ai_name_and_shame/
[https://libreddit.lunar.icu/r/linux/comments/1ceco4f/claude_ai_name_and_shame/]
and I wanted to block all ai crawlers from my selfhosted stuff. I don’t trust
crawlers to respect the Robots.txt but you can get one here:
https://darkvisitors.com/ [https://darkvisitors.com/] Since I use Caddy as a
Server, I generated a directive that blocks them based on their useragent. The
content of the regex basically comes from darkvisitors. Sidenote - there is a
module for blocking crawlers as well, but it seemed overkill for me
https://github.com/Xumeiquer/nobots [https://github.com/Xumeiquer/nobots] For
anybody who is interested, here is the block_ai_crawlers.conf I wrote.
(blockAiCrawlers) { @blockAiCrawlers { header_regexp User-Agent
"(?i)(Bytespider|CCBot|Diffbot|FacebookBot|Google-Extended|GPTBot|omgili|anthropic-ai|Claude-Web|ClaudeBot|cohere-ai)"
} handle @blockAiCrawlers { abort } } # Usage: # 1. Place this file next to your
Caddyfile # 2. Edit your Caddyfile as in the example below # # # import
block_ai_crawlers.conf # # www.mywebsite.com [http://www.mywebsite.com] { #
import blockAiCrawlers # reverse_proxy * localhost:3000 # } #
How to pirate a font - Lemmy.World
I have bought a font with a really shitty license agreement and I have a couple
of questions. 1. Are they able to detect it if I use the font in a commercial
product online by crawling my website? 2. How can I best share the font with the
community? (I am afraid of metadata in the font files, which is tied to my
payment account etc) 3. How can I remove the DSIG and other metadata from the
ttf file while keeping it usable? To my (and possibly your) surprise, I didn’t
find any free downloads of the font online.