"Computer, what's new?" - why search for news and filter what's (not) interesting, if OpenClaw can do that with my "hfnews" skill.

Enjoy: https://clawhub.ai/huberteff/hfnews

#ai #clawbot #openclaw #hfnews #stopwords #news #agenticai

hfnews — ClawHub

Fetch and filter IT and Cybersecurity news from multiple sources, excluding politics, sports, and other unwanted topics based on a configurable blacklist.

ClawHub

Still slowly working on the #keyword extraction and storage in #json file.

I didn't manage to enable #stopwords in #keyBERT, so the plan is to filter the keywords in additional step.

The "#WordPress" or "plugin" keyword for each plugin would be useless.

The next step that follows will be to find #synonyms for each keyword... and I think I'll be using NLTK #wordNet in #python. Not sure if it's the best option.

12/

#Portuguese #stopwords
# https://gist.githubusercontent.com/alopes/5358189/raw/2107d809cca6b83ce3d8e04dbd9463283025284f/stopwords.txt

from random import sample

def setup():
global palavras
size(800, 800)
palavras = load_strings('stopwords.txt')
no_loop()
text_size(40)
fill(0)

def draw():
for i in range(5):
escolhas = sample(palavras, 5)
linha = ' '.join(escolhas)
text(linha, 100, 100 + i * 40)

def key_pressed():
redraw()

#Processing #Python #py5 #genuary #genuary28

This stopword list was built by Gerard Salton and Chris Buckley for the experimental SMART information retrieval system at Cornell University. This stopword list is generally considered to be on the larger side and so when it is used, some implementations edit it so that it is better suited for a given domain and audience while others use this stopword list as it stands. This wordlist is 571 words in length.

http://www.lextek.com/manuals/onix/stopwords2.html

#English #stopwords #wordlist #talika_nimi #nimi_ilo #Inli_pasila

Stopword List 2

Onix Full Text Indexing, Search & Retrieval Toolkit Manual

List of English Stop Words

Stop Words are words which do not contain important significance to be used in Search Queries. Usually these words are filtered out from search queries because they return vast amount of unnecessar…

XPO6