• 2 Posts
  • 21 Comments
Joined 23 days ago
cake
Cake day: August 28th, 2025

help-circle
  • Some software solutions exist, e.g. War and Peace by Tolstoy can be downloaded with metadata, ids are assigned to all characters and when one character tells something to another, this is highlighted as “x speaks to y”, and you can run a community detection algorithms on this data. I think in the paper they’ve been mentioning some proprietary software. I suspect detecting who speaks to whom is even harder.

    Also, some form of crowd sourcing probably should be possible. At least collecting scans is possible on wikisource and wikimedia commons.

    Probably AI language models should be pretty good in distinguishing between linguistic ambiguities.

    I dream for a time when such reports as in OP post will be a matter of work for an hour or two — because data will be already collected and clean.
















  • Probably there’s too small userbase for continuous subscribable block lists. I think I’ve seen some people sharing their blocklists on anonymous imageboards. When you can import/export block lists as plain text, probably it is enough.

    Consider checking syntax where it’s already implemented and adding this import/export functionality.

    A step forward could’ve been adding some logic: e.g. some keywords are filters for posts, some for comments, some for community/instance name. Maybe complex filtering can make federated “global” feed much more satisfying.