This approach raises questions about compliance with global regulations like GDPR, which safeguard against data misuse without explicit permission. The legal implications of using such data for AI training, including social media posts and copyrighted works, remain uncertain, resulting in lawsuits and calls for stricter regulations. The processing of this data to prevent adverse effects on AI systems is also a concern, as workers sorting through vast amounts of training data often face long hours and challenging working conditions.
Gannett, the largest newspaper publisher in the US, has filed a legal suit against Alphabet and Google, alleging monopolistic practices in the digital advertising market related to their AI technology. Additionally, Google’s AI search beta and similar products have faced criticism, with claims of being “plagiarism engines” and causing reduced traffic to websites.
In a similar vein, Twitter and Reddit have implemented significant changes to prevent unrestricted data harvesting, which has sparked a backlash from their user communities due to negative impacts on core user experiences.