Category «Intellectual Property»

Hundreds of thousands of videos from news publishers like The New York Times and Vox were used to train AI models

Nieman Lab – YouTube channels from major news publishers and creators were in video data sets used by Microsoft, Meta, Snap, Runway AI, and Bytedance: “Last month, The Atlantic dropped the latest investigation in its ongoing series on generative AI training data sets. Staff writer Alex Reisner found that at least 15 million YouTube videos …

Subjects: AI, Copyright, Education, Intellectual Property, Internet, Knowledge Management, Legal Research

How AI Browsers Sneak Past Blockers and Paywalls

Columbia Jopurnalism Review: “Last week, OpenAI released Atlas, which joins a growing wave of AI browsers, including Perplexity’s Comet and Microsoft’s Copilot mode in Edge, that aim to transform how people interact with the Web. These AI browsers differ from Chrome or Safari in that they have “agentic capabilities,” or tools designed to execute complex, …

Subjects: AI, Copyright, Intellectual Property, Internet, Knowledge Management, Legal Research, Search Engines

I’m drowning in AI features I never asked for and I absolutely hate it

MakeUseOf: “At first, all of this AI stuff felt exciting. I was curious to try everything (I was actually one of the few naive people who thought the Rabbit R1 was a good product before it eventually launched), and for a while, it felt useful.But over time, I realized AI isn’t just affecting smartphones; it’s …

Subjects: AI, E-Mail, E-Records, Intellectual Property, Internet, Knowledge Management, Legal Research, Microsoft, Search Engines

From Googlebot to GPTBot: who’s crawling your site in 2025

Cloudflare: “A new category, AI crawlers, has emerged in recent years. These bots collect data from across the web to train AI models, improving tools and experiences, but also raising issues around content rights, unauthorized use, and infrastructure overload. We aimed to confirm the growth of both search and AI crawlers, examine specific AI crawlers, …

Subjects: AI, Copyright, E-Records, Intellectual Property, Internet, Knowledge Management, Legal Research, Social Media

The Interactive GenAI Legal Hallucination Tracker

“Coming Soon: The Interactive GenAI Legal Hallucination Tracker — Sneak Peek Today! August 10, 2025 by Jenny Wondracek – “If you follow me on LinkedIn or spoke with me at AALL, you’ve probably seen me teasing this project like it’s the season finale of a legal tech drama. Well, the wait is (almost) over — …

Subjects: AI, Copyright, Government Documents, Intellectual Property, Internet, Knowledge Management, Legal Research, Search Engines

Sloppy AI defenses take cybersecurity back to the 1990s, researchers say

SCWorld: LAS VEGA: “Just as it had at BSides Las Vegas earlier in the week, the risks of artificial intelligence dominated the Black Hat USA 2025 security conference on Aug. 6 and 7. We couldn’t see all the AI-related talks, but we did catch three of the most promising ones, plus an off-site panel discussion …

Subjects: Cybercrime, Cybersecurity, E-Records, Intellectual Property, Internet, Knowledge Management, Legal Research, Microsoft, Search Engines

OpenAI offers 20 million user chats in ChatGPT lawsuit. NYT wants 120 million.

Ars Technica: “OpenAI is preparing to raise what could be its final defense to stop The New York Times from digging through a spectacularly broad range of ChatGPT logs to hunt for any copyright-infringing outputs that could become the most damning evidence in the hotly watched case. In a joint letter (PDF) Thursday, both sides …

Subjects: AI, Copyright, Intellectual Property, Internet, Knowledge Management, Legal Research, Search Engines

Perplexity is using stealth, undeclared crawlers to evade website no-crawl directives

Cloudflare: “We are observing stealth crawling behavior from Perplexity, an AI-powered answer engine. Although Perplexity initially crawls from their declared user agent, when they are presented with a network block, they appear to obscure their crawling identity in an attempt to circumvent the website’s preferences. We see continued evidence that Perplexity is repeatedly modifying their …

Subjects: AI, Copyright, Intellectual Property, Internet, Knowledge Management, Search Engines

Left-leaning Third Way maps counter plan to Trump’s AI agenda

Semafor – [reg. required to read all articles] “Trump on Wednesday [July 23, 2025] unveiled his long-awaited AI Action Plan, a national strategy accompanied by a trio of executive orders meant to replace his predecessor’s safety-first directives. The new roadmap focuses on deregulation and industry growth to better compete against China. In response, left-center policy …

Subjects: AI, Congress, Copyright, E-Records, Government Documents, Intellectual Property, Internet, Knowledge Management, Legal Research

What is AI Reading – Report by Muck Rack

Muck Rack Complete Report – Snipped from Executive Summary • Citations affect responses: Simply enabling or disabling the ability for AI to search the web drastically modifies responses, indicating that the systems are truly basing their responses on the cited works. • Journalism and earned media are important drivers: More than 95% of links cited …

Subjects: AI, Copyright, E-Commerce, E-Government, Education, Government Documents, Intellectual Property, Internet, Knowledge Management, Legal Research, Search Engines, Social Media

Generative Artificial Intelligence and Copyright Law

Generative Artificial Intelligence and Copyright Law CRS Legal Sidebar – LSB10922, 7/18/25 – “Innovations in artificial intelligence (AI) have raised several new questions in the field of copyright law. Generative AI programs—such as Open AI’s DALL-E and ChatGPT programs, Stability AI’s Stable Diffusion program, and Midjourney’s self-titled program—are able to generate new images, texts, and …

Subjects: Copyright, Courts, E-Government, Government Documents, Intellectual Property, Legal Research

Judges Don’t Know What AI’s Book Piracy Means

Follow up to Anthropic destroyed millions of print books to build its AI model and Copyrighted books to train AI? Fair. Storing them? See also The Atlantic – no paywall – Can AI companies keep stealing books to train their models? What Two Judicial Rulings Mean for the Future of Generative AI – “Should tech …

Subjects: AI, Copyright, Courts, Government Documents, Intellectual Property, Internet, Knowledge Management, Legal Research, Libraries