Accurate, Focused Research on Law, Technology and Knowledge Discovery Since 2002

Daily Archives: July 11, 2023

Build, Access, Analyze: Introducing ARCH

Internet Archive Blogs: “We are excited to announce the public availability of ARCH (Archives Research Compute Hub), a new research and education service that helps users easily build, access, and analyze digital collections computationally at scale. ARCH represents a combination of the Internet Archive’s experience supporting computational research for more than a decade by providing large-scale data to researchers and dataset-oriented service integrations like ARS (Archive-it Research Services) and a collaboration with the Archives Unleashed project of the University of Waterloo and York University. Development of ARCH was generously supported by the Mellon Foundation…ARCH helps users easily conduct and support computational research with digital collections at scale – e.g., text and data mining, data science, digital scholarship, machine learning, and more. Users can build custom research collections relevant to a wide range of subjects, generate and access research-ready datasets from collections, and analyze those datasets. In line with best practices in reproducibility, ARCH supports open publication and preservation of user-generated datasets. ARCH is currently optimized for working with tens of thousands of web archive collections, covering a broad range of subjects, events, and timeframes, and the platform is actively expanding to include digitized text and image collections. ARCH also works with various portions of the overall Wayback Machine global web archive totaling 50+ PB going back to 1996, representing an extensive archive of contemporary history and communication…”

A Categorical Archive of ChatGPT Failures

A Categorical Archive of ChatGPT Failures. Ali Borji. Quintic AI. April 5, 2023: “Large language models have been demonstrated to be valuable in different fields. ChatGPT, developed by OpenAI, has been trained using massive amounts of data and simulates human conversation by comprehending context and generating appropriate responses. It has garnered significant attention due to…”

TikTok: Technology Overview and Issues

CRS Report – TikTok: Technology Overview and Issues, Updated June 30, 2023: “TikTok is a globally popular video-sharing smartphone application (app) owned by ByteDance Ltd., a privately held company headquartered in Beijing, China. It is under increasing scrutiny by the U.S. government as a potential privacy and security risk to U.S. citizens. This is because…”

Why is climate denial still thriving online?

DW: “Record global temperatures on July 3 kicked off the hottest week ever recorded as intense heat waves gripped the planet. Climate scientist Friederike Otto, of London’s Grantham Institute for Climate Change and the Environment, called the heat “a death sentence for people and ecosystems.” Yet, the next day, a political journalist in the United…”

Google hit with lawsuit alleging it stole data from millions of users to train AI tools

CNN: “…The complaint alleges that Google “has been secretly stealing everything ever created and shared on the internet by hundreds of millions of Americans” and using this data to train its AI products, such as its chatbot Bard. The complaint also claims Google has taken “virtually the entirety of our digital footprint,” including “creative and…”

Google Isn’t Grad School

The Atlantic [free link]: “Having so much information at our fingertips is useful but seductive, easily fooling us into thinking we know more than we do…The overconfidence of people laboring under the illusion of explanatory depth can lead to the spread of misinformation. As researchers have shown, when a person’s confidence is highest though their…”

Google’s Searchbot Could Put Me Out of a Job

The Atlantic [free link]: “Google Search, like the rest of the internet, is pivoting to generative AI. The first step is Search Generative Experience, an experimental tool currently available as a public beta. Instead of sending you off to other corners of the web, more search results appear within Google. Sort of like ChatGPT, it…”

AP investigation into ethics practices of Supreme Court justices

“An Associated Press examination of the ethics practices of the U.S. Supreme Court relied on documents obtained from more than 100 public records requests to public colleges, universities and other institutions that have hosted the justices over the past decade. Here’s a look at how the reporting was done: To conduct its review, the AP…”