Accurate, Focused Research on Law, Technology and Knowledge Discovery Since 2002

Category Archives: Search Engines

Wikimedia Commons is now 10 years old – huge resource for free educational media

News release: “Wikimedia Commons is turning 10 years old [September 7, 2014] — will you help celebrate? We’re asking everyone to join the Wikimedia community by sharing a freely licensed image with the world. Wikimedia Commons is one of the world’s largest resources of freely licensed educational media. It is the central repository of the majority of illustrations for ...

Links, languages and semantics – Paper

Links, languages and semantics: linked data approaches in The European Library and Europeana. Valentine Charles, Nuno Freire, Antoine Isaac. Submitted on: 8/11/2014, IFLA 2014. “The European Library and Europeana have both an extensive experience in aggregating metadata for bibliographical records or digital resources from the cultural heritage institutions of Europe. For both of them meeting the challenges offered ...

From Street View to the new mapping tool, Cartographer, get a glimpse of how Google Maps works

Google Maps Blog: “With Google Maps by your side, you have a co-pilot for everything from turn-by-turn directions, to discovering new restaurants to deciding which hiking trails to climb next. This is possible in large part because Google Maps includes information from thousands of authoritative sources as varied as the U.S. Geological Survey, the Ordnance Survey of ...

Google – Making of Maps: Reaching a milestone

Google Official Blog: “When you head out your door, you’ve got directions in your pocket—whether you’re driving to your aunt’s place in the mountains, cycling to a new biergarten or taking the train downtown. For Google Maps to get you there, it needs to be a digital mirror of the real world. But the real ...

Investigative Report – NSA created ‘Google-like’ search engine – shared access with other agencies

“Data available through ICREACH appears to be primarily derived from surveillance of foreigners’ communications, and planning documents show that it draws on a variety of different sources of data maintained by the NSA. Though one 2010 internal paper clearly calls it “the ICREACH database,” a U.S. official familiar with the system disputed that, telling The Intercept that while “it ...

Google’s fact-checking bots build vast knowledge bank – New Scientist

Hal Hodson, 20 August 2014, New Scientist – The search giant is automatically building Knowledge Vault, a massive database that could give us unprecedented access to the world’s facts. “GOOGLE is building the largest store of knowledge in human history – and it’s doing so without any human help. Instead, Knowledge Vault autonomously gathers and merges information from across ...

Comparing Google Consumer Surveys to Existing Probability and Non-Probability Based Internet Surveys

“This study compares the responses of a probability based Internet panel, a non-probability based Internet panel and Google Consumer Surveys against several media consumption and health benchmarks. The Consumer Surveys results were found to be more accurate than both the probability and non-probability based Internet panels in three separate measures: average absolute error (distance from ...

Google Earth expands to the Moon and Mars – Outside

Outside News from the Field: “Google couldn’t celebrate Curiosity’s second anniversary on Mars (in Earth years) with just a doodle. Instead, the California-based gods of the Internet have released two new maps to explore using the Google Earth application—on Mars and the Moon. They were assembled using images taken by various spacecraft as well as data on each body’s elevation ...

Visualizing language usage in New York Times news coverage throughout its history

Chronicle – Tracking New York Times Language Usage Over Time, Alexis Lloyd: “News publishing is an inherently ephemeral act. A big story will consume public attention for a day, or a month or a year only to fade from memory as quickly as it erupted. But news coverage, aggregated over time, can provide a fascinating “first ...

Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing

Research at Google – “Mesa is a highly scalable analytic data warehousing system that stores critical measurement data related to Google’s Internet advertising business. Mesa is designed to satisfy a complex and challenging set of user and systems requirements, including near real-time data ingestion and queryability, as well as high availability, reliability, fault tolerance, and ...

New York Times Launches Enhanced Archive Search and @NYTArchives Twitter Account

PressRun: “Today The New York Times launches search on its interactive digital archive: TimesMachine. With this newly-developed search technology, users can now use both free text and subject headings from the Times Index to search the 11,298,320 Times articles published across 46,592 issues between September 18, 1851 and December 31, 1980. Unlike previous iterations of search ...

Google News Publisher Center

Google News Blog: “If you are a news publisher, your website has probably evolved and changed over time. Until now, when you made changes to the structure of your site, we might not have discovered them unless you told us. And that meant they might not have shown up in Google News, which in turn ...