INTERNATIONAL POLAR YEAR 2007-08 P.A. Berkman - Science 301:1669 (19 September 2003)

Slides:



Advertisements
Similar presentations
THREE1 E-Commerce. THREE 2 §Pentagons Arpanet was first attempt to use computer networks to share knowledge electronically §National Science Foundation.
Advertisements

The New online Scar Map Catalogue Australian Antarctic Data Centre Australian Antarctic Division.
SERACHING SKILLS TRAINING MAY NOT BE ENOUGH Experiences from information Competency course B. Niedźwiedzka, K. Czabanowska Information Studies Department.
Lecture # 7. Topics Storage Techniques of Bits Storage Techniques of Bits Mass Storage Mass Storage Disk System Performance Disk System Performance File.
1 University of Palestine Information technology college Electronic Document Management System Technologies Electronic Document Management System Technologies.
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
© Tefko Saracevic, Rutgers University1 digital libraries and human information behavior Tefko Saracevic, Ph.D. School of Communication, Information and.
Predecessor to the Database: Traditional File Processing Records are stored in files. Programs are customized to process the data.
CPSC 695 Future of GIS Marina L. Gavrilova. The future of GIS.
Political and legal Barriers to Data Availability World Water Forum, Istanbul Andrew Allan Centre for Water Law, Policy and Science.
© Tefko Saracevic, Rutgers University1 digital libraries and human information behavior Tefko Saracevic, Ph.D. School of Communication, Information and.
Anne R. Kenney SCLD Annual Conference April 24-26, 2006 The Sum of its Parts: Consolidated Storage, Management, and Delivery Services.
1 Information Retrieval and Extraction 資訊檢索與擷取 Chia-Hui Chang, Assistant Professor Dept. of Computer Science & Information Engineering National Central.
1 CS 502: Computing Methods for Digital Libraries Lecture 27 Preservation.
© Tefko Saracevic, Rutgers University1 digital libraries and human information behavior Tefko Saracevic, Ph.D. School of Communication, Information and.
Introduction to Communication Research
Unit 17: Communication Technology1 Communication Technology What is it? Provides a transfer of knowledge to people all over the world. Provides a transfer.
Text Mining: Finding Nuggets in Mountains of Textual Data Jochen Dijrre, Peter Gerstl, Roland Seiffert Presented by Drew DeHaas.
Chapter 3 Computer Science and the Foundation of Knowledge Model
Databases & Data Warehouses Chapter 3 Database Processing.
Global Discovery: Turning Vision into Reality Presented by Abe Lederman, President and CTO Deep Web Technologies, LLC Symposium: Global Discovery on the.
Data Mining Techniques
Digital Library Architecture and Technology
Chapter 1: Business Intelligence and its Impacts
1 Peter Fox Data Science – ITEC/CSCI/ERTH Week 1, August 31, 2010 History of Data and Information, Data Science, Current Challenges.
The Fundamentals of Preserving Knowledge Assets Pacific Neighborhood Consortium 2010 Catherine Quinlan, Dean of the USC Libraries USC's Dual Approach.
Introduction to the Electronic Geophysical Year, (eGY)
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
N ew Security Approaches Biometric Technologies are Coming of Age ANIL KUMAR GUPTA & SUMIT KUMAR CHOUDHARY.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
I.T MEDIA MAISRUL www.roelsite.yolasite.com
Satish Ramanan April 16, AGENDA Context Why - Integrate Search with BI? How - do we get there? - Tool Strategy What - is in it for me ? - Outcomes.
Semantic Learning Instructor: Professor Cercone Razieh Niazi.
Interactive Science Publishing: A Joint OSA-NLM Project Michael J. Ackerman National Library of Medicine.
Similar Document Retrieval and Analysis in Information Retrieval System based on correlation method for full text indexing.
MULTIMEDIA DATABASES -Define data -Define databases.
Science Teaching & Instructional Technology By: Asma, Melissa & Susan.
Thomson Reuters ISI (Information Sciences Institute) Azam Raoofi, Head of Indexing & Education Departments, Kowsar Editorial Meeting, Sep 19 th 2013.
MTA SZTAKI Department of Distributed Systems The problems of persistent identifiers in the context of the National Digital Data Archives of Hungary András.
Building the Mother of all Collections: the future of the National Library’s discovery services Warwick Cathro Assistant Director-General, Innovation National.
Antarctic Data Management Lee Belbin Manager, Australian Antarctic Data Centre Chairman, Joint Committee on Antarctic Data Management.
BAA - Big Mechanism using SIRA Technology Chuck Rehberg CTO at Trigent Software and Chief Scientist at Semantic Insights™
Graph Data Analytics Arka Mukherjee, Ph.D. Global IDs Resolving Complexity at an Enterprise Scale.
The European Heritage Network HEREIN
Indexing of Tables and Figures: Scientists’ Reaction Carol Tenopir University of Tennessee web.utk.edu/~tenopir/
WEB MINING. In recent years the growth of the World Wide Web exceeded all expectations. Today there are several billions of HTML documents, pictures and.
IPY Education and Outreach 21 July 2005 The eGY Opportunity: Education and Outreach Emily CoBabe-Ammann eGY_Team.
Applying the Inherent Structure of Digital Records Paul Arthur Berkman, Ph.D. Research Professor, University of California Santa Barbara Chief Executive.
National Research Council Of the National Academies
DOE Data Management Plan Requirements
EGY The Electronic Geophysical Year Generic Power Point slides for use by eGY participants in developing presentations
Storage Devices.
Ed Kearns National Climatic Data Center Asheville, NC.
By: Cathrine Moyo.   We have been applying traditional approaches to a new problem, and we have not been motivated to change the ways we do things,
Publishing and Visualizing Large-Scale Semantically-enabled Earth Science Resources on the Web Benno Lee 1 Sumit Purohit 2
Rich Environments for Active Learning R. Scott Grabinger University of Colorado at Denver Chapter 23 Section III. Soft Technologies: Instructional and.
Deep Indexing in ProQuest Health and Medical Databases.
SVBIT SUBJECT:- Operating System TOPICS:- File Management
DIGITAL INFORMATION SOURCES, RESOURCES AND E-LEARNING : SCOPE AND CHARACTERISTICS.
Open Data Discussions in Japan and DRR
Computer Note.
Research Methods for Computer Science
3.5 Databases Relationships.
® T i e r I T Dig in to your information
EDS Discovery Health & EBSCO eBooks Workflow Optimization
ONCE IN A HUNDRED GENERATIONS
TDM=Text Mining “automated processing of large amounts of structured digital textual content for purposes of information retrieval, extraction, interpretation.
Marine Mammal Commission Digital Library of International Environmental and Ecosystem.
Search and Retrieval in a Virtual World
Information Retrieval and Web Design
Presentation transcript:

INTERNATIONAL POLAR YEAR P.A. Berkman - Science 301:1669 (19 September 2003)

How can digital technologies enhance information management and knowledge discovery in our global society?

CLAY PAPYRUS PAPER DIGITAL TIME (years before present) INFORMATION TRANSPORT INFORMATION INTEGRATION INFORMATION VOLUME STONE HISTORY OF INFORMATION THRESHOLDS INFORMATION ERAS © 2005 EvREsearch LTD FUTURE

EFFECTIVELY INFINITE INFORMATION More than 20,000 petabytes of digital information is stored in various media in our world every year – and the rate is growing exponentially 240 terabytes/year 627,000 terabytes/year 427,000 terabytes/year Print Media Magnetic (Server) Media Optical (CD –DVD) Media Data Source: Film Media 83 terabytes/year 1,066,000 terabytes/year Magnetic (PC and Tape) Media

INSTANTANEOUS INFORMATION Data Source: Speed of computer microprocessors is increasing exponentially.

THE CHALLENGE OF OUR DIGITAL ERA Information Resources Relational Schema RESPOND TO THE FLOOD OF INFORMATION >80% <20%

INTEGRATION IS PROPORTIONAL TO GRANULARITY Permutations 3 15  p i =n!/p i !(n-p i )! 255 A model of the exponentially increasing volume of metadata by simply doubling the number of information granules (data), each of which is half its original size, while the volume of metadata per granule is fixed independent of granule size. CONCEPT OF INTEGRATION

OBSERVATIONS Information has content, context and structure. The notion of “unstructured” really means information that is “unmanaged” with conventional technologies (i.e., metadata, markup and databases). Databases, metadata and markup provide control for managing digital information, but they are not convenient. This is why less than 20% of the available digital records are managed with these technologies (i.e., conventional technologies are not scaleable). Search engines are extremely convenient, but provide limited control for managing digital information (i.e., long lists of ranked results conceal relationships within and between digital records). The search engine problem is that accessing more information does not equal more knowledge. We already have effectively infinite and instantaneous access to digital information. The challenge is no longer access, but being able to objectively integrate information based on user-defined criteria independent of scale to discover knowledge.

BORROMEAN RINGS Three interlinked circles that represent inseparable parts of the whole. Remove any one ring and the other two fall apart. Because of this property, Borromean Rings have been used as a symbol of unity in many fields. THE PHYSICS OF INFORMATION Information has three indivisible ingredients – content, context and structure.Information has three indivisible ingredients – content, context and structure. The ability to automatically utilize the inherent structure of information is the threshold in information management from hardcopy to digital media.The ability to automatically utilize the inherent structure of information is the threshold in information management from hardcopy to digital media. © 2005 EvREsearch LTD EvREsearch©

GRANULARITY MODULE INDEX MODULE INTEGRATION MODULE AGGREGATION MODULE DIGITAL RECORDS Digital Integration System TM – DIGIN ® Operates by first utilizing the inherent structure or patterns in digital information (text, binary data, or other symbolisms) without requiring an a priori understanding of the content (i.e., content agnostic); Automatically creates, comprehensively indexes and objectively integrates sets of information (i.e., automated granularity); Uniquely generates objective relational schema that automatically turn qualitative information into quantitative information without markup, metadata or databases; and, Provides both convenience and control to manage all types of digital information (“structured” as well as “unstructured”) independent of scale.

INTERNATIONAL APPLICATIONS After Invited Presentation at the Scientific Committee on Antarctic Research (SCAR) Meeting in Tokyo, Japan - July 2000 "I really appreciate the work you have put in the with the [Antarctic Treaty] database. I use it almost weekly. It is a brilliant resource that should be used by SCAR, COMNAP, CEP etc. In fact, all output from Treaty groups should be in such a system." Dr. Lee Belbin Convenor, Joint Committee on Antarctic Data Management Scientific Committee on Antarctic Research (SCAR) Council of Managers of National Antarctic Programs (COMNAP)

FOR FURTHER READING:

“Knowledge is the common wealth of humanity.” Adama Samassekou Convener of the United Nations World Summit on the Information Society

RECOMMENDATION eGY acknowledges that information management with digital technologies still is in its infancy in comparison to the paper, papyrus, clay and stone media that each had been used for millennia to transfer knowledge in our civilization. eGY identifies that conventional technologies with metadata, markup and databases do scale to the increasing volume of digital information that is being produced. eGY recognizes that information traditionally has been managed based on its content and context. eGY further recognizes that all information has inherent structure, which has not been possible to utilize for information management purposes with hardcopy media. eGY supports innovations that seek to utilize the inherent structure of digital information to automate information management and facilitate discovery of objective relationships within and between information resources.