VIVO Sharing Data for Research Discovery Mike Conlon

  • Slides: 19
Download presentation
VIVO: Sharing Data for Research Discovery Mike Conlon University of Florida mconlon@ufl. edu

VIVO: Sharing Data for Research Discovery Mike Conlon University of Florida mconlon@ufl. edu

Public, structured linked data about investigators interests, activities and accomplishments, and tools to use

Public, structured linked data about investigators interests, activities and accomplishments, and tools to use that data to advance science

VIVO Searchlight

VIVO Searchlight

Data Production

Data Production

Producing Data

Producing Data

process. Org<-function(uri){ x<-xml. Parse(uri) u<-NULL name<xml. Value(get. Node. Set(x, "//rdfs: label")[[1]] ) subs<get. Node.

process. Org<-function(uri){ x<-xml. Parse(uri) u<-NULL name<xml. Value(get. Node. Set(x, "//rdfs: label")[[1]] ) subs<get. Node. Set(x, "//j. 1: has. Sub. Organization") if(length(subs)==0) list(name=name, subs=NULL) else { for(i in 1: length(subs)){ sub. uri<get. URI(xml. Attrs(subs[[i]])["resource"]) u<-c(u, process. Org(sub. uri)) } list(name=name, subs=u) } } VIVO produces human and machine readable formats Software reads RDF from VIVO and displays

Data Sharing Photograph by J. G. Park. Flicker. com Photograph by Ell Brown Flicker.

Data Sharing Photograph by J. G. Park. Flicker. com Photograph by Ell Brown Flicker. com

A Web of Data – The Semantic Web Information is stored using the Resource

A Web of Data – The Semantic Web Information is stored using the Resource Description Framework (RDF) as subjectpredicate-object “triples” Jane Smith Subject Dept. of Genetics professor in has affiliation with author of Predicate Book chapter College of Medicine Genetics Institute Journal article Book Object

Data The Archive Role of the • Archive Collate data, final semantics, ready for

Data The Archive Role of the • Archive Collate data, final semantics, ready for consumption

Institutions record activities, interests, accomplishments

Institutions record activities, interests, accomplishments

Data, Tools and Scientists

Data, Tools and Scientists

Data Consumption Photograph by Scoop. Media. Flicker. com Photograph by Janet Tarbox. Flicker. com

Data Consumption Photograph by Scoop. Media. Flicker. com Photograph by Janet Tarbox. Flicker. com

A Consumption Scenario Find all faculty members whose genetic work is implicated in breast

A Consumption Scenario Find all faculty members whose genetic work is implicated in breast cancer VIVO will store information about faculty and associate to genes. Diseaseome associates genes to diseases. Query resolves across VIVO and data sources it links to.

Data Reasoning Data integration continues to be a serious bottleneck for the expectations of

Data Reasoning Data integration continues to be a serious bottleneck for the expectations of increased productivity in the pharmaceutical and biotechnology domain. “Linked Life Data” integrates common public datasets that describe the relationships between gene, protein, interaction, pathway, target, drug, disease and patient and currently consist of more than 5 billion RDF statements. The dataset interconnects more than 20 complete data sources and helps to understand the “bigger picture” of a research problem by linking previously unrelated data from heterogeneous knowledge. From the Lar. KC (Large Knowledge Collider) http: //www. larkc. eu/overview/

http: //vivo. ufl. edu/individual/mconlon

http: //vivo. ufl. edu/individual/mconlon