Published by Stanley Hamilton. Modified over 9 years ago.
1
The Web-Enabled Research Commons: Applications, Goals, and Trends Thinh Nguyen October 2009
2
Use Case #1 NeuroCommons Project: Science Commons project using Semantic Web to link massive amounts of data
4
27,266 papers 4,563 papers 41,985 papers 10,365 papers 128,437 papers
5
NeuronDB, BAMS, Literature, Homologene, SWAN, Entrez Gene, Gene Ontology, Mammalian Phenotype, PDSPki, BrainPharm, AlzGene, Antibodies, PubChem, MESH, Reactome, Allen Brain Atlas (credit: W3C HCLS)
7
From Web page links (the WWW) to making computers understand linkages
8
receptor (http://ontology.foo.org/receptor) is located in Cell membrane: directed, contextual links
9
“URI” (unique names for things on the web)
receptor: http://ontology.foo.org/receptor
is located in: http://ontology.foo.org/is_located_in
Cell membrane: http://ontology.foo.org/compartment
10
receptor is located in Cell membrane; channel is located in Cell membrane; neuron has Cell membrane
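The three statements on this slide can be sketched as subject, predicate, object triples. This is a minimal illustration, not the NeuroCommons implementation: the receptor, compartment, and is_located_in URIs come from the slides, while the channel, neuron, and has URIs and the `subjects` helper are assumed here for the example.

```python
BASE = "http://ontology.foo.org/"

# URIs that appear on the slides.
RECEPTOR = BASE + "receptor"
COMPARTMENT = BASE + "compartment"  # the cell membrane
IS_LOCATED_IN = BASE + "is_located_in"

# URIs assumed for illustration only; the slides do not give them.
CHANNEL = BASE + "channel"
NEURON = BASE + "neuron"
HAS = BASE + "has"

# Each link is directed: (subject, predicate, object).
triples = [
    (RECEPTOR, IS_LOCATED_IN, COMPARTMENT),
    (CHANNEL, IS_LOCATED_IN, COMPARTMENT),
    (NEURON, HAS, COMPARTMENT),
]


def subjects(predicate, obj):
    """Return every subject linked to obj by predicate, in stored order."""
    return [s for (s, p, o) in triples if p == predicate and o == obj]


# Everything recorded as located in the cell membrane.
located_in_membrane = subjects(IS_LOCATED_IN, COMPARTMENT)
```

Because the predicate is itself a named thing, a program can ask "what is located in the cell membrane?" without parsing any prose.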
11
Using the web to integrate data and databases: Cell membrane, whether a source calls it “compartment”, “container”, or “doohickey”, gets one shared name, http://ontology.foo.org/compartment
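A rough sketch of why a shared URI helps integration: records that use different local labels can be merged once each label is mapped to the same URI. The three labels and the URI are from the slide; the two databases, their records, and the `integrate` function are hypothetical.

```python
COMPARTMENT_URI = "http://ontology.foo.org/compartment"

# Local labels from the slide, all mapped to the one shared URI.
label_to_uri = {
    "compartment": COMPARTMENT_URI,
    "container": COMPARTMENT_URI,
    "doohickey": COMPARTMENT_URI,
}

# Two hypothetical databases describing the same thing under different labels.
database_a = [{"label": "compartment", "source": "A", "fact": "contains receptors"}]
database_b = [{"label": "doohickey", "source": "B", "fact": "contains channels"}]


def integrate(*databases):
    """Group (source, fact) pairs from all databases under the shared URI."""
    merged = {}
    for database in databases:
        for record in database:
            uri = label_to_uri[record["label"]]
            merged.setdefault(uri, []).append((record["source"], record["fact"]))
    return merged


merged = integrate(database_a, database_b)
```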
13
better answers through better formats:
Mesh: Pyramidal Neurons; Pubmed: Journal Articles; Entrez Gene: Genes; GO: Signal Transduction

prefix go: <http://purl.org/obo/owl/GO#>
prefix rdfs: <…df-schema#>
prefix owl: <http://www.w…>
prefix mesh: <…mmons/record/mesh/>
prefix sc: <…>
prefix ro: <http://www.obofoundry.o…>

select ?genename ?processname
where
{ graph <http://purl.org/commons/hcls/pu…>
    { ?paper ?p mesh:D017966.
      ?article sc:identified_by_pmid ?paper.
      ?gene sc:describes_gene_or_gene_product_mentioned_by ?article.
    }
  graph <….org/commons/hcls/goa>
    { ?protein rdfs:subClassOf ?res.
      ?res owl:onProperty ro:has_function.
      ?res owl:someValuesFrom ?res2.
      ?res2 owl:onProperty ro:realized_as.
      ?res2 owl:someValuesFrom ?process.
      graph <http://purl.org/commons/hcls/2007…>
        { {?process … go:GO_0007166}
          union
          {?process rdfs:subClassOf go:GO_0007166 } }
      ?protein rdfs:subClassOf ?parent.
      ?parent owl:equivalentClass ?res3.
      ?res3 owl:hasValue ?gene.
    }
  graph <http://purl.org/…>
    { ?gene rdfs:label ?genename }
  graph <…purl.org/commons/hcls/20070416>
    { ?process rdfs:label ?processname }
}
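The query chains four sources: a MeSH term selects papers, papers lead to genes, and genes are kept only if they are tied to a given GO process. The sketch below imitates that chain with plain dictionaries. It is not SPARQL and not the NeuroCommons data: the identifiers D017966 and GO_0007166 are taken from the query, but every paper, gene, label, and the function name are made up for illustration.

```python
# Hypothetical stand-ins for the four sources; none of these records are real.
papers_by_mesh_term = {"D017966": ["pmid:1", "pmid:2"]}  # MeSH -> PubMed
genes_by_paper = {"pmid:1": ["gene:A"], "pmid:2": ["gene:A", "gene:B"]}  # PubMed -> Entrez Gene
processes_by_gene = {"gene:A": ["GO_0007166"], "gene:B": ["GO_0000001"]}  # gene -> GO process
gene_labels = {"gene:A": "Gene A", "gene:B": "Gene B"}


def genes_for_term_and_process(mesh_term, process):
    """Follow MeSH term -> papers -> genes, keeping genes tied to the process."""
    names = set()
    for paper in papers_by_mesh_term.get(mesh_term, []):
        for gene in genes_by_paper.get(paper, []):
            if process in processes_by_gene.get(gene, []):
                names.add(gene_labels[gene])
    return sorted(names)


matching_genes = genes_for_term_and_process("D017966", "GO_0007166")
```

Each hop works only because the sources share identifiers, which is the point of publishing the data in a common format.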
14
Reformat what we already have; reformat into a commons, not a closed system; get the materials into the emerging research web
15
What data sharing protocol (legal and policy) best enables use of Web technology?
16
“Licensing” Archetypes
Public Domain: No restrictions on use or distribution, no contracts, copyright waived.
Community Licenses: Standard “open access” licenses, a range of rights, some rights reserved, available to all.
Private Licenses: Custom agreements, vary by institution, privately negotiated, may be offered only to some.
17
Goals
Interoperable: data from many sources can be combined without restriction
Reusable: data can be repurposed into new and interesting contexts
Administrative Burden: low transaction costs and administrative costs over time
Legal Certainty: users can rely on legal usability of the data
Community Norms: consistent with community expectations and usages
18
Interoperability
Public Domain ****
–Can be combined with other data sources with ease
Community Licenses *** / **
–Depends on type of license: share-alike or copyleft are unsuitable, but attribution-only licenses are less problematic
Private Licenses * / **
–Depends on restrictions, but not scalable; permutations too large
19
Reusable
Public Domain ****
–No restrictions on subsequent use
Community Licenses ***
–Depends on license, but some licenses such as NC / ND can be restrictive
Private Licenses **
–Depends on license, but typically restrictive
20
Administrative Burden
Public Domain ****
–No paperwork or legal review needed
Community Licenses ***
–Little paperwork, but some legal review needed (attribution stacking issues)
Private Licenses *
–Large amounts of paperwork, frequent legal review needed
21
Legal Certainty
Public Domain **** / ***
–Clear rights; generally irrevocable (copyright should be addressed)
Community Licenses ***
–Generally credible, good track record with open access and open source licenses
Private Licenses **
–Must be considered individually; few private licenses tested by time
22
Community Norms
Public Domain ***
–Traditional method for scientific data sharing (citation)
Community Licenses ***
–Relatively new, but familiar to computer scientists and open source community (attribution)
Private Licenses **
–Tendency to emphasize private / individual interests rather than community norms
23
Overall Grade
Public Domain ***
–Easiest and least restrictive form of sharing
Community Licenses **
–Can be used to implement community expectations, but can be burdensome / restrictive
Private Licenses *
–High transaction costs, burdensome, unpredictable
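As a rough cross-check of the ordering, the star ratings from the five goal slides can be averaged per archetype. The slides do not say the overall grade was computed this way; the averaging, and treating a split rating such as "*** / **" as the midpoint of its two values, are assumptions made only for this sketch.

```python
from statistics import mean

# Star counts transcribed from the goal slides; a split rating lists both values.
ratings = {
    "Public Domain": {
        "Interoperability": [4],
        "Reusable": [4],
        "Administrative Burden": [4],
        "Legal Certainty": [4, 3],
        "Community Norms": [3],
    },
    "Community Licenses": {
        "Interoperability": [3, 2],
        "Reusable": [3],
        "Administrative Burden": [3],
        "Legal Certainty": [3],
        "Community Norms": [3],
    },
    "Private Licenses": {
        "Interoperability": [1, 2],
        "Reusable": [2],
        "Administrative Burden": [1],
        "Legal Certainty": [2],
        "Community Norms": [2],
    },
}


def average_stars(criteria):
    """Average across goals, taking the midpoint of any split rating first."""
    return mean(mean(stars) for stars in criteria.values())


# Archetypes ordered from highest to lowest average rating.
ranking = sorted(ratings, key=lambda name: average_stars(ratings[name]), reverse=True)
```

Under these assumptions the averages order the archetypes the same way as the overall grades: Public Domain, then Community Licenses, then Private Licenses.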
24
Convergence
25
CC0
Released by Creative Commons in 2009
Result of a 3-year policy exploration process
Not a license but a waiver of copyright
26
Why is it needed?
“Borderline” copyright
European sui generis database rights
Varying legal standards for copyright protection in different countries
27
CC0 [deed]
28
CC0
Waiver of copyright
Waiver of sui generis database rights
Waiver of “neighboring rights”
Does not affect trademarks or patents
Only affects rights of person making assertion
29
Use Case #2
Coordination and Sustainability of International Mouse Informatics Resources (CASIMIR) (EU Project)
Commentary in Letter to Nature (Sept 2009) recommends PD and use of CC0 for sharing mouse genomic data
Recommendations endorsed by scientists, NIH representatives, Jackson Labs, and editors of top scientific journals
30
Use Case #3
Personal Genome Project: personalized medicine project from the George Church lab
Adopted CC0 to release sequence and medical data collected from volunteers
31
Summary
Solving some bioinformatics problems requires the ability to integrate massive quantities of data from diverse sources
Public Domain sharing best fits this need
CC0 waiver can be used to enrich the public domain and provide clarity
32
Thank You
Thinh Nguyen (thinh@creativecommons.org)
On the Web: http://www.sciencecommons.org