Accurate, Focused Research on Law, Technology and Knowledge Discovery Since 2002

Category Archives: Search Engines

Treasure Map: The NSA Breach of Telekom and Other German Firms

Spiegel Online – Andy Müller-Maguhn, Laura Poitras, Marcel Rosenbach and Michael Sontheimer: [Treasure Map] “is the mandate for a massive raid on the digital world. It aims to map the Internet, and not just the large traffic channels, such as telecommunications cables. It also seeks to identify the devices across which our data flows, so-called routers. Furthermore, every single end device that is connected to the Internet somewhere in the world — every smartphone, tablet and computer — is to be made visible. Such a map doesn’t just reveal one treasure. There are millions of them. The breathtaking mission is described in a Treasure Map presentation from the documents of the former intelligence service employee Edward Snowden which SPIEGEL has seen. It instructs analysts to ‘map the entire Internet — Any device, anywhere, all the time.’ Treasure Map allows for the creation of an ‘interactive map of the global Internet’ in ‘near real-time,’ the document notes. Employees of the so-called ‘FiveEyes’ intelligence agencies from Great Britain, Canada, Australia and New Zealand, which cooperate closely with the American agency NSA, can install and use the program on their own computers. One can imagine it as a kind of Google Earth for global data traffic, a bird’s eye view of the planet’s digital arteries.”

FindTheBest Launches Agency Spending Topic

Nina Quattrocchi - FindTheBest – A Research Engine: “The government awards trillions of taxpayer dollars every year in contracts, grants, and loans, and it’s nearly impossible to keep track of where that money is going. Thousands of new businesses are created in the US each year, and it’s hard for those young companies to know where to …”

Wikimedia Commons is now 10 years old – huge resource for free educational media

News release: “Wikimedia Commons is turning 10 years old [September 7, 2014] — will you help celebrate? We’re asking everyone to join the Wikimedia community by sharing a freely licensed image with the world. Wikimedia Commons is one of the world’s largest resources of freely licensed educational media. It is the central repository of the majority of illustrations for …”

Links, languages and semantics – Paper

Links, languages and semantics: linked data approaches in The European Library and Europeana. Valentine Charles, Nuno Freire, Antoine Isaac. Submitted on: 8/11/2014, IFLA 2014. “The European Library and Europeana have both an extensive experience in aggregating metadata for bibliographical records or digital resources from the cultural heritage institutions of Europe. For both of them meeting the challenges offered …”

From Street View to the new mapping tool, Cartographer, get a glimpse of how Google Maps works

Google Maps Blog: “With Google Maps by your side, you have a co-pilot for everything from turn-by-turn directions, to discovering new restaurants to deciding which hiking trails to climb next. This is possible in large part because Google Maps includes information from thousands of authoritative sources as varied as the U.S. Geological Survey, the Ordnance Survey of …”

Google – Making of Maps: Reaching a milestone

Google Official Blog: “When you head out your door, you’ve got directions in your pocket—whether you’re driving to your aunt’s place in the mountains, cycling to a new biergarten or taking the train downtown. For Google Maps to get you there, it needs to be a digital mirror of the real world. But the real …”

Investigative Report – NSA created ‘Google-like’ search engine – shared access with other agencies

“Data available through ICREACH appears to be primarily derived from surveillance of foreigners’ communications, and planning documents show that it draws on a variety of different sources of data maintained by the NSA. Though one 2010 internal paper clearly calls it ‘the ICREACH database,’ a U.S. official familiar with the system disputed that, telling The Intercept that while ‘it …’”

Google’s fact-checking bots build vast knowledge bank – New Scientist

Hal Hodson, 20 August 2014, New Scientist - The search giant is automatically building Knowledge Vault, a massive database that could give us unprecedented access to the world’s facts: “GOOGLE is building the largest store of knowledge in human history – and it’s doing so without any human help. Instead, Knowledge Vault autonomously gathers and merges information from across …”

Comparing Google Consumer Surveys to Existing Probability and Non-Probability Based Internet Surveys

“This study compares the responses of a probability based Internet panel, a non-probability based Internet panel and Google Consumer Surveys against several media consumption and health benchmarks. The Consumer Surveys results were found to be more accurate than both the probability and non-probability based Internet panels in three separate measures: average absolute error (distance from …”

Google Earth expands to the Moon and Mars – Outside

Outside News from the Field: “Google couldn’t celebrate Curiosity’s second anniversary on Mars (in Earth years) with just a doodle. Instead, the California-based gods of the Internet have released two new maps to explore using the Google Earth application—on Mars and the Moon. They were assembled using images taken by various spacecraft as well as data on each body’s elevation …”

Visualizing language usage in New York Times news coverage throughout its history

Chronicle – Tracking New York Times Language Usage Over Time, Alexis Lloyd: “News publishing is an inherently ephemeral act. A big story will consume public attention for a day, or a month or a year only to fade from memory as quickly as it erupted. But news coverage, aggregated over time, can provide a fascinating ‘first …’”

Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing

Research at Google – “Mesa is a highly scalable analytic data warehousing system that stores critical measurement data related to Google’s Internet advertising business. Mesa is designed to satisfy a complex and challenging set of user and systems requirements, including near real-time data ingestion and queryability, as well as high availability, reliability, fault tolerance, and …”