Jesus — the legality of the poorly-policed Common Crawl training data is WAY worse than I thought, check this out:
When Lapine used it to scan the LAION database, she found an image of her own face. She was able to trace this image back to photographs taken by a doctor when she was undergoing treatment for a rare genetic condition. The photographs were taken as part of her clinical documentation, and she signed documents that restricted their use to her medical file alone. The doctor involved died in 2018. Somehow, these private medical images ended up online, then in Common Crawl’s archive and LAION’s dataset.Surely this is a straight-up violation of patient confidentiality laws?! This is appalling. LAION’s FAQs are useless regarding this; as Lapine isn’t in the EU, they can’t even use GDPR to request its removal, and even if they were, these medical images don’t contain enough data to qualify under LAION’s rules.
it’s coming up to new specs time… some amazing Michael Caine style here
More retro specs action