Heard about this: https://github.com/numenta/nupic in a podcast, it sounded very interesting. If I got it correctly, you could train (unsupervised) two models on two different corpus and resulting "semantic fingerprints" would still be comparable. I'm wondering what replicating some of the word embedding art with HTM would lead to.
This podcast btw: https://www.oreilly.com/ideas/natural-language-analysis-using-hierarchical-temporal-memory and it didn't mention numenta/nupic -- what I meant was that I found out about HTM through there, and then googled HTM and found an OSS project in Python.
I should probably start using hashtags so here goes: #nlp #word2vec #htm
@vhf With hashtags dead even on the birdsite, have to admit I'm enjoying using them here at on LinkedIn (via their new "trends" feature).

@Michael_Spencer On the birdsite I think they are either geographic or global. Here they are instance-local in the same sense as the federated timeline, which means they should be much less noisy and much more tailored to an instance's average interest.

So yeah, agreed, they seem to be, on mastodon, a new and exciting variation on the hashtag theme.

@vhf Great insights vhf, thanks for the info. I didn't know here they were only instance-local, that's a bit odd.

@Michael_Spencer Everything is a bit odd when you compare federated to centralized (or even distributed).

Having them as "global" hashtags would require each instance to maintain a full index "hashtag->toot". So, all hashtags, all hashtagged toots. Would take a bunch of space and quite some CPU over time. It would also require all hashtagged toot to be broadcasted to (every) other instance, so network-intensive as well.

That's not how a federation operates. :)

@vhf Btw, do you know how this instance is comparing to others instances in terms of size of users? I attempted to dive into a dense one but I no longer know if that's the case.
@Michael_Spencer The instance you're on, you mean? Looks like reasonably sized: https://instances.mastodon.xyz/
@vhf Thanks for the link, looks like I might have to migrate to the Cloud. :laughing: