Pretty fucking bold of these scientists to scrape one of my copyrighted photographs off the internet and then re-release it uncredited under a Creative Commons license because they used it for training data for an algorithm.
@Gremriel @nero @alexwild The problem with asking on a project like that is that you need like, thousands upon thousands of pictures in order to constitute even a "small" dataset. They don't even have time to curate these things (a lot of porn ends up in them too), because it's not feasible for a small research team to go through each one and check.
When they want to monetize these things, though, they really ought to spend the time and money to source the images ethically; they keep skipping that step.