CODE (Committee on Digital Environment), July 26, 2000, Rice University
THE NET OF THE 21st CENTURY: Concepts across the Interspace
  Bruce Schatz
  CANIS Laboratory
  Graduate School of Library and Information Science
  University of Illinois at Urbana-Champaign
THE THIRD WAVE OF NET EVOLUTION
  PACKETS, OBJECTS, CONCEPTS
from Objects to Concepts, from Syntax to Semantics
  Infrastructure is Interaction with Abstraction
  Internet is packet transmission across computers
  Interspace is concept navigation across repositories
  CONCEPT SPACES
LEVELS OF INDEXES
  Technology, Engineering, Electrical
  FORMAL (manual), INFORMAL (automatic)
  IEEE, communities, groups, individuals
THE DISTRIBUTED WORLD
  Community Repositories in the Interspace
  Every Person performs Every Role
    USER: request
    LIBRARIAN: reference
    INDEXER: classify
    PUBLISHER: quality
    AUTHOR: generate
CROSS-OVERS IN SEMANTIC INDEXING
COMPUTING CONCEPTS
  '92: 4,000 (molecular biology)
  '93: 40,000 (molecular biology)
  '95: 400,000 (electrical engineering)
  '96: 4,000,000 (engineering)
  '98: 40,000,000 (medicine)
SIMULATING A NEW WORLD
  Obtain discipline-scale collection
    MEDLINE from NLM, 10M bibliographic abstracts
    human classification: Medical Subject Headings
  Partition discipline into Community Repositories
    4 core terms per abstract for MeSH classification
    32K nodes with core terms (classification tree)
    Community is all abstracts classified by core term
    40M abstracts containing 280M concepts
    concept spaces took 2 days on NCSA Origin 2000
  Simulating World of Medical Communities
    10K repositories with > 1K abstracts (1K w/ > 10K)
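The partition step on the SIMULATING A NEW WORLD slide (a community is all abstracts classified by a core term) can be sketched roughly as follows. This is a minimal illustration, not the system's actual code: the function name, the abstract IDs, and the toy headings are hypothetical. Because an abstract is placed under every one of its core terms, 10M abstracts with about 4 core terms each would give about 10M x 4 = 40M placements, which is consistent with the 40M figure on the slide.

```python
from collections import defaultdict

def partition_by_core_term(abstracts):
    """Group abstract IDs into one repository per core term.

    `abstracts` maps an abstract ID to its list of core terms. An abstract
    with several core terms is placed in several repositories.
    """
    repositories = defaultdict(set)
    for abstract_id, core_terms in abstracts.items():
        for term in core_terms:
            repositories[term].add(abstract_id)
    return dict(repositories)

# Toy data: hypothetical IDs and headings, for illustration only.
abstracts = {
    "a1": ["Arthritis, Rheumatoid", "Analgesics"],
    "a2": ["Analgesics", "Gastrointestinal Hemorrhage"],
    "a3": ["Arthritis, Rheumatoid"],
}
repos = partition_by_core_term(abstracts)

# Total placements counts an abstract once per repository it appears in.
placements = sum(len(ids) for ids in repos.values())
```

In this toy run three abstracts yield five placements across three repositories, the same kind of multiplication that turns a collection into a larger set of overlapping community repositories.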
INTERSPACE NAVIGATION
  Semantic Indexes for Community Repositories
  Navigating Abstractions within Repository
    concept space
    category map
  Interactive browsing by Community experts
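The slides do not say how a concept space is computed. As a rough sketch only, assume it can be approximated by counting how often two concepts appear in the same abstract, so that navigating from a concept means listing its most frequent co-occurring concepts. The function names and the toy documents below are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations

def build_concept_space(documents):
    """Count, for each concept, how often every other concept co-occurs with it.

    `documents` is a list of concept lists, one per abstract.
    """
    cooccur = defaultdict(Counter)
    for concepts in documents:
        # Each unordered pair of distinct concepts in a document counts once.
        for a, b in combinations(sorted(set(concepts)), 2):
            cooccur[a][b] += 1
            cooccur[b][a] += 1
    return dict(cooccur)

def related(space, concept, k=3):
    """Return up to k concepts that co-occur most often with `concept`."""
    return [c for c, _ in space.get(concept, Counter()).most_common(k)]

# Toy documents: hypothetical concept lists, for illustration only.
docs = [
    ["arthritis", "analgesic", "pain"],
    ["analgesic", "bleeding"],
    ["arthritis", "analgesic"],
]
space = build_concept_space(docs)
top = related(space, "arthritis", k=1)
```

Raw counts are the simplest possible weighting. A real index would likely normalize for term frequency, which this sketch leaves out.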
Interspace Remote Access Client
Navigation in MEDSPACE
  For a patient with Rheumatoid Arthritis
  Find a drug that reduces the pain (analgesic)
  but does not cause stomach (gastrointestinal) bleeding
  Choose Domain
Concept Search
Concept Navigation
Retrieve Document
Navigate Document
Retrieve Document
Category Map
Category Navigation
Concept Navigation
SWITCHING
  In the Interspace...
    each Community maintains its own repository
  Switching is navigating Across repositories
    use your vocabulary to search another specialty
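One way to picture switching (using your vocabulary to search another specialty) is to carry a term's related concepts from its home repository into the target repository and rank target concepts by how much they overlap. The sketch below is only an illustration under that assumption. The slides do not describe the actual switching algorithm, and the function name and the toy concept spaces are hypothetical.

```python
def switch_concept(term, source_space, target_space, k=3):
    """Map `term` from the source vocabulary to concepts in the target space.

    Each space maps a concept to the set of concepts related to it.
    Target concepts are scored by overlap with the term's source context.
    """
    context = set(source_space.get(term, ())) | {term}
    scores = {}
    for concept, neighbours in target_space.items():
        overlap = len(context & (set(neighbours) | {concept}))
        if overlap:
            scores[concept] = overlap
    # Highest overlap first; ties broken alphabetically for stable output.
    return sorted(scores, key=lambda c: (-scores[c], c))[:k]

# Toy concept spaces for two specialties: hypothetical, for illustration only.
rheumatology = {"joint pain": {"analgesic", "inflammation"}}
gastroenterology = {
    "gastrointestinal bleeding": {"analgesic", "ulcer"},
    "ulcer": {"antacid"},
    "liver disease": {"hepatitis"},
}
switched = switch_concept("joint pain", rheumatology, gastroenterology)
```

Here "joint pain" is not a concept in the second repository, but its related concept "analgesic" is shared, so the switch lands on "gastrointestinal bleeding".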
Medicine Session
Categories and Concepts
Concept Switching
Document Retrieval
Support for Scholarship
  Analysis (Browsing), Synthesis (Sharing)
  Special (Community), Public (Library)
  Within (Navigating), Across (Linking)
Community Systems
  browse and share all the knowledge of a community
  Formal                               Informal
  data (database management)           results (electronic mail)
  literature (information retrieval)   news (bulletin boards)
  knowledge (hypertext annotations)
Worm Community System
  WCS Information
    Literature: BIOSIS, MEDLINE, newsletters, meetings
    Data: Genes, Maps, Sequences, strains, cells
  WCS Functionality
    Browsing: search, navigation
    Filtering: selection, analysis
    Sharing: linking, publishing
  WCS: 250 users at 50 labs across Internet (1991)
WCS Molecular
WCS Cellular
Rice Interspace
  Gather the Information Sources
    each (college, class, student) has repository
    each (department, lab, faculty) has repository
  Generate the Community Repositories
    text documents with articles and annotations
    specialty datatypes: databases and motifs
  Construct the Analysis Environment
    federated concept switching across sites
    type-dependent parsing for text/data interlinks
Rice Student Testbed
  Class by Class Propagation (College by College)
    from Special Community Repositories
  Similar to Illinois Digital Library Testbed
    UIUC College of Engineering (Rice scale)
    NSF DLI, , then University Library
  Federated Search using Document Structure
    Full-text Journals used in classes and labs
Rice Faculty Testbed
  Department by Department Propagation
    from Public Library Repositories
  Similar to UIC Medical Center Testbed
    pilot Surgery residents; plan Medical Center
  Concept Spaces for Community Repositories
    All Medical Literature (MEDLINE, BIOSIS)
    All Medical Records (Cerner narratives)
THE NET OF THE 21st CENTURY
  Beyond Objects to Concepts
  Beyond Search to Analysis
  Problem Solving via Cross-Correlating
  Multimedia Information across the Net
  Every community has its own special library
  Every community does semantic indexing
  The Interspace is true Cyberspace