"Computer, what's new?" - why search for news and filter what's (not) interesting, if OpenClaw can do that with my "hfnews" skill.
"Computer, what's new?" - why search for news and filter what's (not) interesting, if OpenClaw can do that with my "hfnews" skill.
Still slowly working on the #keyword extraction and storage in #json file.
I didn't manage to enable #stopwords in #keyBERT, so the plan is to filter the keywords in additional step.
The "#WordPress" or "plugin" keyword for each plugin would be useless.
The next step that follows will be to find #synonyms for each keyword... and I think I'll be using NLTK #wordNet in #python. Not sure if it's the best option.
12/
#Portuguese #stopwords
# https://gist.githubusercontent.com/alopes/5358189/raw/2107d809cca6b83ce3d8e04dbd9463283025284f/stopwords.txt
from random import sample
def setup():
global palavras
size(800, 800)
palavras = load_strings('stopwords.txt')
no_loop()
text_size(40)
fill(0)
def draw():
for i in range(5):
escolhas = sample(palavras, 5)
linha = ' '.join(escolhas)
text(linha, 100, 100 + i * 40)
def key_pressed():
redraw()
This stopword list was built by Gerard Salton and Chris Buckley for the experimental SMART information retrieval system at Cornell University. This stopword list is generally considered to be on the larger side and so when it is used, some implementations edit it so that it is better suited for a given domain and audience while others use this stopword list as it stands. This wordlist is 571 words in length.
http://www.lextek.com/manuals/onix/stopwords2.html
#English #stopwords #wordlist #talika_nimi #nimi_ilo #Inli_pasila
List of English Stop Words
http://xpo6.com/list-of-english-stop-words/
#English #StopWords #MostCommonWords #nimi_ilo #nimi_pasila #wordlist #talika_nimi #Inli
Fox Stop list for General Text
Christopher Fox
http://qualquant.org/wp-content/uploads/cda/Fox%20Stop%20list%20for%20General%20Text.pdf