The Future of Research Communications and e-Scholarship

Biotea: RDFizing PubMed Central in support for the paper as an interface to the Web of Data

Authors:

Leyla Garcia
Casey McLaughlin
Alexander Garcia

Background

The World Wide Web has become a dissemination platform for scientific and non-scientific publications. However, most of the information remains locked up in discrete documents that are not always interconnected or machine-readable. The connectivity tissue provided by RDF technology has not yet been widely used to support the generation of self-describing, machine-readable documents.

Results

In this paper, we present our approach to the generation of self-describing machine-readable scholarly documents. We understand the scientific document as an entry point and interface to the Web of Data. We have semantically processed the full-text, open-access subset of PubMed Central. Our RDF model and resulting dataset make extensive use of existing ontologies and semantic enrichment services. We expose our model, services, prototype, and datasets athttp://biotea.idiginfo.org/ webcite

Conclusions

The semantic processing of biomedical literature presented in this paper embeds documents within the Web of Data and facilitates the execution of concept-based queries against the entire digital library. Our approach delivers a flexible and adaptable set of tools for metadata enrichment and semantic processing of biomedical documents. Our model delivers a semantically rich and highly interconnected dataset with self-describing content so that software can make effective use of it.

 

Journal: Journal of Biomedical Semantics 2013

Archive: https://www.force11.org/node/6361

Leave a Comment

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Related Articles

Membership

Join the FORCE11 community and take part in our groups, conference, summer school, post on FORCE11, and attend other events.

Membership