Category «Search Engines»

New web crawler launched by Meta last month is quietly scraping the internet for AI training data

Fortune [no paywall]: “Meta has quietly unleashed a new web crawler to scour the internet and collect data en masse to feed its AI model. The crawler, named the Meta External Agent, was launched last month according to three firms that track web scrapers and bots across the web. The automated bot essentially copies, or …

Subjects: AI, Copyright, E-Commerce, E-Records, Intellectual Property, Internet, Knowledge Management, Legal Research, Privacy, Search Engines

Invisible Rulers – What Really Drives Online Content

Mark Scott, Digital Bridge, Politico: “After years of tracking online disinformation, propaganda and other digital nastiness, Renée diResta sees patterns where others see chaos. In her new book, “Invisible Rulers: The People Who Turn Lies into Reality,” the former Stanford University researcher tries to parse together a theory about why, seemingly out of the blue, …

Subjects: Civil Liberties, E-Commerce, Internet, Knowledge Management, Legal Research, Search Engines, Social Media

Don’t trust Google for customer service numbers. It might be a scam.

Washington Post [unpaywalled]: “Scams just keep popping up when you Google. On Monday, I found what appeared to be impostors of customer service for Delta and Coinbase, the cryptocurrency company, in the “People also ask” section high up in Google. A group of people experienced in Google’s intricacies also said this week that it took …

Subjects: Cybercrime, Cybersecurity, Economy, Financial System, Search Engines, Transportation

Rejecting Dogmas Around AI, User Privacy, and Tech Policy

Via LLRX – Rejecting Dogmas Around AI, User Privacy, and Tech Policy – The Markup’s Ross Teixeira had a virtual discussion with Jonathan Frankle, Chief Scientist at DataBricks, about the the ethics of companies using customer data to train models, the growing trend of integrating AI models into our personal devices and lives, and how people can …

Subjects: AI, Internet, Knowledge Management, Legal Research, Legislation, Privacy, Search Engines

OpenTheBooks.com – Every Dime. Online. In Real Time.

“At OpenTheBooks.com, we work hard to capture and post all disclosed spending at every level of government – federal, state, and local. In 2022, we filed 50,000 Freedom of Information Act (FOIA) requests and captured 25 million public employee pension and salary records. We also broke open the California state checkbook for the first time …

Subjects: Freedom of Information, Government Documents, Internet, Legal Research, Search Engines

Exploring Goodreads Data: An Analysis of 10 Million Books

Ammar Alyousfi’s Blog: “Goodreads is one of the largest book websites on the internet. It has data about millions and millions of books from different genres and in many languages. It’s hard not to find a book on Goodreads whether it’s published hundreds of years ago or just a few days ago. Today, I present …

Subjects: Education, Internet, Knowledge Management, Libraries, Search Engines

NationalPublicData.com Hack Exposes a Nation’s Data

Krebs on Security: “A great many readers this month reported receiving alerts that their Social Security Number, name, address and other personal information were exposed in a breach at a little-known but aptly-named consumer data broker called NationalPublicData.com. This post examines what we know about a breach that has exposed hundreds of millions of consumer …

Subjects: Cybercrime, Cybersecurity, E-Records, Financial System, Internet, Legal Research, Privacy, Search Engines

Microsoft Tweaks Fine Print To Warn Everyone Not To Take Its AI Seriously

The Register – “Microsoft is notifying users that its AI services should not be taken too seriously, echoing prior service-specific disclaimers – an update to the IT giant’s Service Agreement, which takes effect on September 30, 2024, Redmond has declared that its Assistive AI isn’t suitable for matters of consequence. “AI services are not designed, …

Subjects: AI, Knowledge Management, Legal Research, Microsoft, Search Engines

The new Google AI Overview layout is a small win for publishers

Mashable: “Google’s AI Overviews got off to a rocky start, but it hasn’t deterred the tech giant from charging ahead with foisting AI-generated summaries upon your search results, like it or not. On Thursday Google announced new updates to AI Overviews, some of which might make publishers a little happier. As of today, Google is …

Subjects: AI, Internet, Knowledge Management, Legal Research, Search Engines