CS 431 The Semester in Elevator Speak Carl Lagoze – Cornell University May 5, 2004.

Slides:



Advertisements
Similar presentations
Fast Data at Massive Scale Lessons Learned at Facebook Bobby Johnson.
Advertisements

Data Mining and the Web Susan Dumais Microsoft Research KDD97 Panel - Aug 17, 1997.
A study of teachers and researchers practices with digital documents - grey or not Céline Bourasseau Cédric Dumas Ecole des Mines de Nantes Nantes.
GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
C Introduction to the Geostat project Session on User needs (Geostat workshop in Bled 1-3 october 2008) Lars H. Backer
ICT 2010: "Global Information Structures for Science & Cultural heritage: The Interoperability Challenge" Networking Session Coordination Action on Digital.
Background Chronopolis Goals Data Grid supporting a Long-term Preservation Service Data Migration Data Migration to next generation technologies Trust.
Symposium on Digital Curation in the Era of Big Data: Career Opportunities and Educational Requirements: A Data Scientist Perspective Dr. Vicki Lynn Ferrini.
CNI 2003/Herlocker, Jung, and Webster1 Collaborative Filtering: Possibilities for Digital Libraries Jon Herlocker Janet Webster Seikyung Jung Oregon State.
Supporting Further and Higher Education Building the UK National Information Environment - Lessons from the Past and Pointers To the Future Norman Wiseman.
The current state of Metadata - as far as we understand it - Peter Wittenburg The Language Archive - Max Planck Institute CLARIN Research Infrastructure.
Object Re-Use and Exchange Mellon Retreat, Nassau Inn, Princeton, NJ, March Herbert Van de Sompel, Carl Lagoze The OAI Object Re-Use & Exchange.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
1 Knowledge, Action and Systems Some emerging foundational issues in Computing … Can Information Studies Help? Eric Yu Faculty of Information Studies University.
Third-generation information architecture November 4, 2008.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
CS 431 Architecture of Web Information Systems Spring 2004.
Evolution of NBII Search-Based Technologies Oct 24, 2002 Donna Roy USGS Center for Biological Informatics.
Integration and Insight Aren’t Simple Enough Laura Haas IBM Distinguished Engineer Director, Computer Science Almaden Research Center.
SING* and ToNC * Scientific Foundations for Internet’s Next Generation Sirin Tekinay Program Director Theoretical Foundations Communication Research National.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
January 2013 CDMI: An Introduction. Big Data Complexity Volume Speed “Big Data” refers to datasets whose size is beyond the ability of typical tools to.
Catherine C. Marshall Akshay Kulkarni.  Explores practices associated with ◦ Collaborative Authoring ◦ Reference Use ◦ Informal Creation of Personal.
Proposition: Digital Collections Are Easier to Find and Use through DLF Aquifer’s American Social History Online Katherine Kott, Aquifer Director Library.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
November 2003 Presented to “Commercializing RDF” Semantic Software Solutions for Enterprise Web Management International World Wide Web Conference 2004.
The role of Parthenos for CLARIN ERIC Steven Krauwer CLARIN ERIC Executive Director 1.
LIS 506 (Fall 2006) LIS 506 Information Technology Week 11: Digital Libraries & Institutional Repositories.
Future role of DMR in Cyber Infrastructure D. Ceperley NCSA, University of Illinois Urbana-Champaign N.B. All views expressed are my own.
Information Retrieval in Libraries: Silos and (Tentative) Solutions Daniel Hickey 15 Sept 2010.
Web Scale Discovery Service Vs Federated Search NIKESH NARAYANAN
Characterizing the Web CSCI 572: Information Retrieval and Search Engines Summer 2011.
Fedora Content Models for the National Science Digital Library Data Repository Fedora User’s Group Meeting Copenhagen, September 28, 2005 Carl Lagoze Cornell.
Mehdi Ghayoumi Kent State University Computer Science Department Summer 2015 Exposition on Cyber Infrastructure and Big Data.
The DiVA System: Current Status and Ongoing Development Uwe Klosa Electronic Publishing Centre, Uppsala University, Sweden Eva Müller.
XML (with a bias towards query language issues) A boring research topic? A new frontier? A means to keep standards people busy? Prepared by S. Abiteboul.
Information Management LIS /1/99 Martha Richardson.
Connecting different ethnomusicological archives with ethnoArc Maurice Mengel Music Archive of the Ethnological Museum, National Museum in Berlin (EMEM)
Scientific Data and Electronic Publishing Renze Brandsma, Head, Digital Production Centre University of Amsterdam Maarten Hoogerwerf, Project Manager,
HIGHER ED ACTION AGENDA ON COPYRIGHT be knowledgeable resources for their communities, sources of accurate and current information about copyright aggressively.
ORGANIZATIONS AT THE MARGINS: PROSPECTS AND NEW DIRECTIONS Deanna B. Marcum July 20, 2002.
CS315-Web Search & Data Mining. A Semester in 50 minutes or less The Web History Key technologies and developments Its future Information Retrieval (IR)
Big Heads July 10, 2009 Next Generation Technical Services Rethinking Library Technical Services for the University of California.
The Library of the Future: Embedded in E-Science Presentation to conference “Women in Science” Alexandria, October 23-24, 2007 Carol A. Mandel Dean, Division.
Cyberinfrastructure What is it? Russ Hobby Internet2 Joint Techs, 18 July 2007.
OWL Representing Information Using the Web Ontology Language.
Co-funded by the European Union Semantic CMS Community Content and Knowledge Management From free text input to automatic entity enrichment Copyright IKS.
Introduction to the Semantic Web and Linked Data
Breakout # 1 – Data Collecting and Making It Available Data definition “ Any information that [environmental] researchers need to accomplish their tasks”
Strategies for subject navigation of linked Web sites using RDF topic maps Carol Jean Godby Devon Smith OCLC Online Computer Library Center Knowledge Technologies.
Scalable Hybrid Keyword Search on Distributed Database Jungkee Kim Florida State University Community Grids Laboratory, Indiana University Workshop on.
1 Not So Strange Bedfellows: Information Standards For Librarians AND Publishers November 6, 2015.
E-resource with no wall and no firewalls Dr.H.S.Siddamallaiah Principal Library and Information officer (Rtd) NIMHANS, Bangalore.
PLANETS - DP sustainability 1 PLANETS Workshop on Sustainable Models for Digital Preservation Nov Brussels Carlos Oliveira (Deputy Head of Unit)
Cyberinfrastructure: Many Things to Many People Russ Hobby Program Manager Internet2.
The Multilingual Web – Where Are We? Next Generation Localisation Josef van Genabith, CNGL & NCLT, DCU.
Taming the Big Data in Computational Chemistry #euroCRIS2015 Barcelona 9-11-XI-2015 Carles Bo ICIQ (BIST) -
SAPIR Search in Audio-Visual Content using P2P Information Retrival For more information visit: Support.
Technology, Scalability, Metadata (or not) Research Challenges for Digital Libraries Carl Lagoze Cornell Information Science.
Powered by Microsoft Azure, Auctori Is the Next Generation in Multilingual, Global, Search Engine Optimized Web Content Management Systems MICROSOFT AZURE.
Axis AI Solves Challenges of Complex Data Extraction and Document Classification through Advanced Natural Language Processing and Machine Learning MICROSOFT.
1 Abdul Waheed Khan Communication and Information Sector UNESCO Building Knowledge Societies.
1 CS 430: Information Discovery Lecture 26 Architecture of Information Retrieval Systems 1.
”Smart Containers” Charles F. Vardeman II, Da Huo, Michelle Cheatham, James Sweet, and Jaroslaw Nabrzyski
The New Now: Institutional Repositories and Academia Institutional Repository USM April 17, 2015 Marilyn Billings Scholarly Communication Librarian.
Agenda’s for Preservation Research Micah Altman MIT Libraries Prepared for SAA Research Forum Atlanta August 2016.
MINING DEEP KNOWLEDGE FROM SCIENTIFIC NETWORKS
SCALABLE OPEN ACCESS Hussein Suleman
Australian and New Zealand Metadata Working Group
Presentation transcript:

CS 431 The Semester in Elevator Speak Carl Lagoze – Cornell University May 5, 2004

Libraries as a model Elevator Speak –Tim Berners-Lee didn’t invent information. Libraries have a centuries long tradition of information organization. We need to learn from that tradition but rethink it in the networked environment. The research –Coordination of physical and digital information –Machine learning from organized corpora –Balancing human and machine effort

Metadata Elevator Speak –Metadata is both a plague and a cure. In many cases it is necessary, but too much thinking about it relies on human input. Non-expert humans just don’t do it well The research –Automatic generation of metadata from document context –Automatic generation of metadata for non-textual resources from related text

Tools and Standards Elevator Speak –The entire XML stack provides a suite of tools and standards that enrich our ability to process semi- structured data. However, considerable work remains to make this suite as efficient and robust as established relational technology Research Areas –Bridging the gap between fully structured and unstructured data –Overcoming the complexity problem

Semantic Web Elevator Speak –Despite the almost overwhelming hype, the work coming out of the semantic web initiative provide an important foundation for modeling and manipulating distributed semi-structured information. Research Areas –Efficient storage and querying of semi-structured information –Bridging the gap between XML standards and the semantic web community

Web-Scale Information Discovery Elevator Speak –The use of link structure and document context has dramatically advanced our ability to find and rank information at a massive scale Research Areas –Customization of search results based on user profiles, role, geographic location, etc. –Incorporating the deep web –Introducing the dimension of time in web analysis

Preservation Elevator Speak –Despite years of research in preservation of digital content it remains a difficult, expensive, and unresolved problem Research Areas –Integrating information theory and preservation –Economic models of preservation

Scholarly Publishing Elevator Speak –We are in the midst of massive changes. It is not yet clear who are the losers and winners or how the technical/social/economic solutions will shake out. Research Areas –P2P and scholarly publishing –Scholarly publishing networks –Bibliometrics

Digital Rights Management Elevator Speak –Another issue, like scholarly publishing, that is on the front lines of the battle between the old (physical) and new (digital) worlds. Who “wins” has a much to do with politics and money as it does with technology Research Areas –Fair use and DRM –Web-scale DRM infrastructure –Business models for a digital society

The Big Elevator Speak As “code” infiltrates our social, political, cultural, and economic lives its not just good old computer science any more. We can work to create the most optimal algorithms and engineer the best systems. But, their effect on our lives requires an awareness of social context, human behavior, and ethics.