Accurate, Focused Research on Law, Technology and Knowledge Discovery Since 2002

Unrestricted Text and Data Mining with allofPLOS

PLOS news release: “…With more than 200,000 fully Open Access research articles available for content mining, PLOS can help advance the discussion and application of content mining through real-world experiences. Through our API we provide article text and meta-data in a single XML file format according to the Journal Article Tag Suite (JATS), the National Information Standards Organization (NISO) standard tag suite for archiving and exchanging journal article content. The new allofPLOS project is a step forward in providing researchers easier opportunities for new discovery and illumination of non-obvious connections between data, research articles and fields of study. With allofPLOS, in addition to the content of every PLOS article (excluding Figures or Supplemental Data) provided in JATS XML format, the XML parsing tools are provided. By including tags, content and parsing tools together, we hope to simplify and streamline the process for those wanting to experiment with content mining and TDM tools. With content mining, scientists, educators, policymakers and others can identify and map patterns and trends across millions of articles, extract the information they want, and gain new insights to advance research. TDM results can be shared as a new research article or as a database for others to use…Visit the PLOS Text and Data Mining page to download the PLOS research article corpus and XML parsing tools, and stay tuned to this space for upcoming stories of how researchers are using these tools. Download one of the HowOpenIsIt?®  Open Access Spectrum guides to see where various permissions for machine readability fall on the Open Access continuum.”

Sorry, comments are closed for this post.