VIVO Sharing Data for Research Discovery Mike Conlon
- Slides: 19
VIVO: Sharing Data for Research Discovery. Mike Conlon, University of Florida, mconlon@ufl.edu
Public, structured linked data about investigators' interests, activities and accomplishments, and tools to use that data to advance science
VIVO Searchlight
Data Production
Producing Data
library(XML)    # xmlParse, xmlValue, getNodeSet, xmlAttrs
library(RCurl)  # getURI

# Recursively walk an organization's RDF and return its name and sub-organizations
processOrg <- function(uri) {
  x <- xmlParse(uri)
  u <- NULL
  name <- xmlValue(getNodeSet(x, "//rdfs:label")[[1]])
  subs <- getNodeSet(x, "//j.1:hasSubOrganization")
  if (length(subs) == 0) list(name = name, subs = NULL)
  else {
    for (i in 1:length(subs)) {
      # Fetch the RDF for each sub-organization and recurse into it
      sub.uri <- getURI(xmlAttrs(subs[[i]])["resource"])
      u <- c(u, processOrg(sub.uri))
    }
    list(name = name, subs = u)
  }
}

VIVO produces human and machine readable formats
Software reads RDF from VIVO and displays
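The recursive walk in the R function above can also be sketched in Python with only the standard library. This is a minimal illustration, not the deck's code: the namespace URI standing in for the `j.1` prefix, the `example.org` URIs, and the in-memory `DOCS` table (used in place of fetching RDF from a VIVO server) are all assumptions, and the result is nested rather than concatenated as in the R version.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"
# Hypothetical namespace standing in for the "j.1" prefix in the R code.
VIVO_NS = "http://example.org/vivo#"


def rdf_doc(label, sub_uris=()):
    """Build a tiny RDF/XML document for one organization (sample data)."""
    subs = "".join(
        f'<v:hasSubOrganization rdf:resource="{u}"/>' for u in sub_uris
    )
    return (
        f'<rdf:RDF xmlns:rdf="{RDF}" xmlns:rdfs="{RDFS}" xmlns:v="{VIVO_NS}">'
        f"<rdf:Description><rdfs:label>{label}</rdfs:label>{subs}"
        "</rdf:Description></rdf:RDF>"
    )


BASE = "http://example.org/org/"
# Invented sample documents keyed by URI; a real client would fetch these.
DOCS = {
    BASE + "univ": rdf_doc("University", [BASE + "med", BASE + "eng"]),
    BASE + "med": rdf_doc("College of Medicine", [BASE + "genetics"]),
    BASE + "eng": rdf_doc("College of Engineering"),
    BASE + "genetics": rdf_doc("Dept. of Genetics"),
}


def process_org(uri, fetch=DOCS.__getitem__):
    """Return {'name': ..., 'subs': [...]} for the organization at `uri`."""
    root = ET.fromstring(fetch(uri))
    name = root.find(f".//{{{RDFS}}}label").text
    # Recurse into every sub-organization referenced by rdf:resource.
    subs = [
        process_org(node.attrib[f"{{{RDF}}}resource"], fetch)
        for node in root.iter(f"{{{VIVO_NS}}}hasSubOrganization")
    ]
    return {"name": name, "subs": subs or None}


tree = process_org(BASE + "univ")
```

Passing a different `fetch` function (for example, one that downloads RDF over HTTP) would let the same traversal run against live data.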
Data Sharing
Photograph by J. G. Park, Flickr.com. Photograph by Ell Brown, Flickr.com.
A Web of Data – The Semantic Web
Information is stored using the Resource Description Framework (RDF) as subject-predicate-object "triples".
[Diagram: subject "Jane Smith"; predicates "professor in", "has affiliation with", "author of"; objects "Dept. of Genetics", "College of Medicine", "Genetics Institute", "Journal article", "Book chapter", "Book"]
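To make the triple model concrete, here is a small Python sketch that stores the slide's example as subject-predicate-object tuples and filters them by pattern. The pairing of each predicate with particular objects is an assumption (the slide's diagram did not survive extraction), and `Triple` and `match` are illustrative names, not part of VIVO or any RDF library.

```python
from typing import NamedTuple, Optional


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# Assumed pairing of the slide's predicates and objects.
TRIPLES = [
    Triple("Jane Smith", "professor in", "Dept. of Genetics"),
    Triple("Jane Smith", "has affiliation with", "College of Medicine"),
    Triple("Jane Smith", "has affiliation with", "Genetics Institute"),
    Triple("Jane Smith", "author of", "Journal article"),
    Triple("Jane Smith", "author of", "Book chapter"),
    Triple("Jane Smith", "author of", "Book"),
]


def match(triples, subject: Optional[str] = None,
          predicate: Optional[str] = None, obj: Optional[str] = None):
    """Return triples matching the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (subject is None or t.subject == subject)
        and (predicate is None or t.predicate == predicate)
        and (obj is None or t.obj == obj)
    ]


authored = [t.obj for t in match(TRIPLES, predicate="author of")]
affiliations = [t.obj for t in match(TRIPLES, predicate="has affiliation with")]
```

A real RDF store would identify subjects, predicates and objects by URI rather than by plain strings, but the pattern-matching idea is the same.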
The Role of the Data Archive
• Archive: collate data, final semantics, ready for consumption
Institutions record activities, interests, accomplishments
Data, Tools and Scientists
Data Consumption
Photograph by Scoop. Media, Flickr.com. Photograph by Janet Tarbox, Flickr.com.
A Consumption Scenario
Find all faculty members whose genetic work is implicated in breast cancer.
VIVO will store information about faculty and associate them with genes. Diseasome associates genes with diseases. The query resolves across VIVO and the data sources it links to.
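The scenario above amounts to a join across two sets of triples: one linking faculty to genes and one linking genes to diseases. The Python sketch below illustrates that join on invented sample data. The faculty names, the predicate strings, and the `faculty_for_disease` helper are all hypothetical; a real deployment would more likely run a SPARQL query over the linked sources.

```python
# Hypothetical VIVO-style triples: faculty member -> gene studied.
VIVO_TRIPLES = [
    ("Jane Smith", "researches gene", "BRCA1"),
    ("Alan Jones", "researches gene", "CFTR"),
    ("Maria Lopez", "researches gene", "BRCA2"),
]

# Hypothetical Diseasome-style triples: gene -> associated disease.
DISEASE_TRIPLES = [
    ("BRCA1", "associated with", "breast cancer"),
    ("BRCA2", "associated with", "breast cancer"),
    ("CFTR", "associated with", "cystic fibrosis"),
]


def faculty_for_disease(disease, vivo_triples, disease_triples):
    """Join the two sources on the shared gene identifier."""
    # Step 1: genes that the disease dataset links to the disease.
    genes = {
        gene for gene, pred, dis in disease_triples
        if pred == "associated with" and dis == disease
    }
    # Step 2: faculty whose recorded work involves any of those genes.
    return sorted({
        person for person, pred, gene in vivo_triples
        if pred == "researches gene" and gene in genes
    })


result = faculty_for_disease("breast cancer", VIVO_TRIPLES, DISEASE_TRIPLES)
```

The join works only because both sources refer to a gene by the same identifier, which is the role shared URIs play in linked data.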
Data Reasoning
Data integration continues to be a serious bottleneck for the expectations of increased productivity in the pharmaceutical and biotechnology domain. "Linked Life Data" integrates common public datasets that describe the relationships between gene, protein, interaction, pathway, target, drug, disease and patient and currently consist of more than 5 billion RDF statements. The dataset interconnects more than 20 complete data sources and helps to understand the "bigger picture" of a research problem by linking previously unrelated data from heterogeneous knowledge.
From the LarKC (Large Knowledge Collider): http://www.larkc.eu/overview/
http://vivo.ufl.edu/individual/mconlon