Accurate, Focused Research on Law, Technology and Knowledge Discovery Since 2002

Category Archives: Search Engines

Batea – The clinical browser data mining project

November 17, 2015 – “DocGraph publicly released Batea, a browser extension that tracks clinical reference URLs visited by medical students when they study. Batea was built by DocGraph with support from the Robert Wood Johnson Foundation (RWJF). Medical students across the country are encouraged to download the Batea extension for use on their personal computers. Browsing histories will be aggregated monthly and shared with WikiProject Medicine to help direct future improvements to Wikipedia medical articles. According to a 2014 study, Wikipedia is the single leading source of medical information for patients and healthcare professionals. Wikipedia’s 25,000 medical articles receive more than 200 million views per month and its 8,000 pharmacology articles receive more than 40 million views per month….DocGraph ( is an organization that works to create, maintain, and improve open healthcare datasets.  It aims to grow the open health data movement and build a community of data scientists, journalists, and clinical enterprises who use open data to understand and help evolve the healthcare system.”

Google releases report – search quality evaluation process

The human interface – search quality evaluation guideline report by Google, November 12, 2015 – 160 pages guide – “As a Search Quality evaluator, you will work on many different types of rating projects. The General Guidelines primarily cover Page Quality (PQ) rating and Needs Met( NM) rating; however, the concepts are also important forContinue Reading

Google – European privacy requests for search removals

Google Transparency Report [snipped] – “In a May 2014 ruling, Google Spain v AEPD and Mario Costeja González, the Court of Justice of the European Union found that individuals have the right to ask search engines like Google to remove certain results about them. The court decided that search engines must assess each individual’s requestContinue Reading

Medical Subject Headings (MeSH) RDF Linked Data (beta)

“The National Library of Medicine (NLM) is now offering a beta version of the Medical Subject Headings (MeSH®) data in RDF (Resource Description Framework). RDF is a well-known standard for representing structured data on the Web. Systems that use RDF are often called Linked Data because of RDF emphasis on well-described links between resources. DuringContinue Reading

Google – Updated Search Quality Rating Guidelines

Google Webmaster Central Blog: “Developing algorithmic changes to search involves a process of experimentation. Part of that experimentation is having evaluators—people who assess the quality of Google’s search results—give us feedback on our experiments. Ratings from evaluators do not determine individual site rankings, but are used help us understand our experiments. The evaluators base theirContinue Reading

Google People Operations highlights effective teamwork

“A group of us in Google’s People Operations (what we call HR) set out to answer this question using data and rigorous analysis: What makes a Google team effective? We shared our research earlier today with the Associated Press, and we’re sharing the findings here, as well. Over two years we conducted 200+ interviews withContinue Reading

GPO launches next generation of guide to US government

“The U.S. Government Publishing Office (GPO) launched an updated and redesigned version of Ben’s Guide to the U.S. Government. The educational website is named after one of our Nation’s most influential Founding Fathers, Benjamin Franklin.  The site is full of educational content on the workings of the U.S. Government and U.S. history. The updated siteContinue Reading

Google app now understands you a little better

Google Inside Search: “…the Google app is starting to truly understand the meaning of what you’re asking. We can now break down a query to understand the semantics of each piece….so we can get at the intent behind the entire question. That lets us traverse the Knowledge Graph much more reliably to find the rightContinue Reading

Find Open Access Dissertations and Theses

“PQDT Open provides the full text of open access dissertations and theses free of charge. You can quickly and easily locate dissertations and theses relevant to your discipline, and view the complete text in PDF format. Open Access Publishing – The authors of these dissertations and theses have opted to publish as open access. OpenContinue Reading

What does the Internet of Things mean for Search?

WT VOX: “The smartphone revolutionised personal computing, creating a computer that most of us have within arm’s reach 24 hours a day. The Internet of Things promises to revolutionise computing again, by connecting and collecting data from everything we live in, drive in, eat in, sleep in and work in. In just a year orContinue Reading

Exposing the Hidden Web: An Analysis of Third-Party HTTP Requests on One Million Websites

Exposing the Hidden Web: An Analysis of Third-Party – HTTP Requests on One Million Websites. International Journal of Communication, October 2015. Timothy Libert. “This article provides a quantitative analysis of privacy compromising mechanisms on one million popular websites. Findings indicate that nearly nine in ten websites leak user data to parties of which the userContinue Reading

Twiangulate powers journalism, non-profits and PR

Twiangulate: ‘Find experts and insiders. Discover which accounts are followed by key people in a scribe swarm, gossip empire, artful sphere or judicious gaggle. Understand clout. Determine which influentials and celebs follow an account using our “reach” search. Reveal a group’s favorites. Which accounts are most followed by members of a group, for example, theContinue Reading