@phpmacher #RSS nutzen: Sicher. Immer noch populär. Ob als Eingangsformat für andere Medien oder direkte Nutzung durch Menschen.
Gibt diverse Systeme die lokal aggregieren und filtern, auch self-hostable. Teilweise über die eingebetteten #Metadaten, teilweise mit #Regex gegen andere Inhaltsfelder.
Der Schritt von da zu einer Art #Bayesfilter oder den NN wie sie #rspamd zur Klassifizierung nutzt oder sogar einem SLM/kleineren #LLM auf #Ollama, ggf. mit #TPU / #APU Support sollte überschaubar groß sein.
Eins der Probleme wird aber die zunehmend schlechte Feedqualität was Tags/Metadaten und der notorische (wenn gleich sehr verständliche) Hang zur Nicht-Auslieferung des Volltext im Feed sein. Da müsste man ggf auf Verdacht das Original fetchen, ggf. als zweite Stufe.

Und natürlich ist ein #Recommender-System was deutlich anderes als ein mehr oder minder ausgefeilter #Classifier. Aber lassen wir das...

Facial Recognition with webcam

I kind of want to do a crazy project like an AI powered game for my cat. I want my program to recognize when my cat takes some kind of action, and do -

Things I Tried

'Consistent Multiclass Algorithms for Complex Metrics and Constraints', by Harikrishna Narasimhan et al.

http://jmlr.org/papers/v25/22-1137.html

#multiclass #classifier #classification

Consistent Multiclass Algorithms for Complex Metrics and Constraints

'Non-splitting Neyman-Pearson Classifiers', by Jingming Wang, Lucy Xia, Zhigang Bao, Xin Tong.

http://jmlr.org/papers/v25/22-0795.html

#classifiers #classifier #classification

Non-splitting Neyman-Pearson Classifiers

'Regimes of No Gain in Multi-class Active Learning', by Gan Yuan, Yunfan Zhao, Samory Kpotufe.

http://jmlr.org/papers/v25/23-0234.html

#classifier #classification #classes

Regimes of No Gain in Multi-class Active Learning

https://huggingface.co/datasets/mofosyne/stupidfilter

Blast from the past! Here's a dataset from the StupidFilter project around 2007 by Gabriel Ortiz and Paul Starr.

It was an effort to collate all the messages in social media at the time an rank it by stupidity to try and train a filter to filter out stupidity.

However I think the project somewhat fell though as it wasn't able to fully detect all the stupidity in youtube and other site with the technology at the time...

#ai #stupidfilter #LLMs #classifier

mofosyne/stupidfilter · Datasets at Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

How would you go about creating a filter that blocks posts about things that people hate?

I've thought I could build a text classifier, but it could be hard to train since I'd need to guess whether or not the author hates the thing they are posting about.

I wouldn't want it to become a filter for all current events news, but I suspect that's what it would become.

#fediverse #mastodon #machineLearning #tfidf #classification #socialMedia #classifier #textAnalysis #programming #tech #technology

'Multi-class Probabilistic Bounds for Majority Vote Classifiers with Partially Labeled Data', by Vasilii Feofanov, Emilie Devijver, Massih-Reza Amini.

http://jmlr.org/papers/v25/23-0121.html

#classifiers #classifier #labeling

Multi-class Probabilistic Bounds for Majority Vote Classifiers with Partially Labeled Data

'On the Learnability of Out-of-distribution Detection', by Zhen Fang, Yixuan Li, Feng Liu, Bo Han, Jie Lu.

http://jmlr.org/papers/v25/23-1257.html

#learnability #classifier #detection

On the Learnability of Out-of-distribution Detection

'A Multilabel Classification Framework for Approximate Nearest Neighbor Search', by Ville Hyvönen, Elias Jääsaari, Teemu Roos.

http://jmlr.org/papers/v25/23-0286.html

#classification #classifiers #classifier

A Multilabel Classification Framework for Approximate Nearest Neighbor Search