Web Science and Web Archive L3S Wolfgang Nejdl L3S Research Center Hannover, Germany.

Slides:



Advertisements
Similar presentations
WDL Technical Architecture Working Group (TAWG) June 2010 Achievements and Recommendations Co-chaired by Noha Adly, Bibliotheca Alexandrina Babak Hamidzadeh,
Advertisements

GMD German National Research Center for Information Technology Darmstadt University of Technology Perspectives and Priorities for Digital Libraries Research.
Social Sciences Collections & Research: a new content-based team Gillian Ridgley, Ian Cooke, Jerry Jenkins.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Libraries for Future Generations Martha Anderson Director National Digital Information Infrastructure and Preservation Program The Library of Congress.
Panel: What Changes With Digital? Web Archiving ARL Forum 2009 Tracy Seneca – California Digital Library.
1 What is the Internet Archive We are a Digital Library Mission Statement: Universal access to human knowledge Founded in 1996 by Brewster Kahle in San.
Digital Preservation Sustainability on the EU Policy Level Elevator Pitches.
Cultural Content and Digital Heritage Bernard Smith European Commission INFSO/D2.
Sandra McIntyre Program Director. OVERVIEW Analysis.
PoliWeb project (PEPS'14) Geraldine Castel CEMRA, Université Stendhal, France Genoveva Vargas-Solar CNRS, LIG-LAFMIA, France Towards a cloud infrastructure.
The National Digital Stewardship Alliance: Community, Content, Commitment.
History of English Language Assessment Archives in context and as context Database structure ISAAR (CPF) Online Archival Sustainability.
Ethics and Information in the Digital Age Rafael Capurro University of Applied Sciences, Germany LIDA 2001, Dubrovnik, Croatia, May, 2001.
Collect/connect The future of library collections and collection management Libraries Australia Adelaide, 27 October 2011 Caroline Brazier, Director of.
The Subject Librarian's Role in Building Digital Collections: Where Information Management and Subject Expertise Meet Ruth Vondracek Oregon State University.
What is the Internet? The Internet is a computer network connecting millions of computers all over the world It has no central control - works through.
digital libraries internationally projects, applications, research in many countries © Tefko Saracevic Rutgers University
Araba Dawson-Andoh 122 A Alden Library
AERO Meeting | September 24, 2009 EthicShare: Building an Inter-Institutional Scholarly Research Community Kate McCready Cecily Marcus.
WHAT DOES THE ACADEMIC LIBRARY COMMUNITY WANT FROM CROSSREF? James G. Neal CrossRef Annual Member Meeting 25 September 2002.
1 Advanced Archive-It Application Training: Archiving Social Networking and Social Media Sites.
Web Archives, IDEAL, and PBL Overview Edward A. Fox Digital Library Research Laboratory Dept. of Computer Science Virginia Tech Blacksburg, VA, USA 21.
CS598CXZ Course Summary ChengXiang Zhai Department of Computer Science University of Illinois, Urbana-Champaign.
ALEXANDRIA Temporal Retrieval, Exploration and Analytics in Web Archives Wolfgang Nejdl L3S Research Center Hannover, Germany.
How to Face the Challenges of Web Archiving? The experiences of a small library on the edge. Chloe Martin, Internet Memory Catherine Ryan, National Library.
Web Capture team Office of strategic initiatives February 27, 2006 Selecting Content from the Web: Challenges and Experiences of the Library of Congress.
Building Scalable Web Archives Florent Carpentier, Leïla Medjkoune Internet Memory Foundation IIPC GA, Paris, May 2014.
The Jared Dunn, Will Kent, and Martin Wolske Graduate School of Library and Information Science, University of Illinois at Urbana-Champaign, USA A New.
Technology in Action Alan Evans Kendall Martin Mary Anne Poatsy Twelfth Edition.
Science Research: Journey to 10,000 Sources Presented by: Abe Lederman, President and Founder Deep Web Technologies, Inc. Special Libraries Association.
STIM Sloan-Stanford Network for the History of Technology.
The role of Parthenos for CLARIN ERIC Steven Krauwer CLARIN ERIC Executive Director 1.
CODATA Data Archiving Activities CODATA Data Archiving Activities Bill Anderson Co-chair CODATA Data Preservation Task Group ERPANET/CODATA International.
The Global Economic Crisis and its Impact on Higher Education: Challenges and Opportunities Washington DC, April 16, 2009 Sabine U. O’Hara Executive Director.
The New Digital World and the Transformation of Information and Libraries Patricia L. Thibodeau Associate Dean Library Services & Archives Oct. 26, 2011.
 Universities are fundamentally about two things: education and research.  You need to understand the process of academic research to succeed in Higher.
EUscreen: Examining An Aggregator ’ s Role in Digital Preservation Samantha Losben Digital Preservation - Final Project December 15, 2010.
Teachers’ and Advisors’ Conference 30 April 2015 The Leeds Curriculum - a voyage of discovery Karen Llewellyn and Caroline Campbell.
Internet Skills The World Wide Web (Web) consists of billions of interconnected pages of information from a wide variety of sources. In this section: Web.
The Knowledge Exchange Presentation to CNI April 2005 Bas Cordewener, SURF Sigrun Eckelmann, DFG Norman Wiseman, JISC.
The Information Challenge Exponential growth of resources New researchers with new needs Multiple communication options New expectations and opportunities.
Information Sources and Classification. Where does Information Come From?                  
Existing web archives and scholarly uses of web archives Jane Winters (Institute of Historical Research, University of London) RESAW seminar, Aarhus University,
The Library of Congress Martha Anderson Program Officer, NDIIPP Office of Strategic Initiatives Library of Congress April 2005 LC Perspective : Preservation.
Digital Archives – Preservation, dissemination, promotion and fruition of the Common Archival Heritage Silvestre Lacerda Deputy Director of the Directorate.
10/07/2008 Semantic Web Technologies & Higher Education.
ANIE IE Research Workshop Objectives towards a Curriculum Development University of Pretoria July 4-5, 2011 Rafael Capurro International Center for Information.
Building Knowledge Societies Abdul Waheed Khan Assistant Director-General for Communication and Information Durban ::: 19 August 2007 E-Learning: Universities.
World Wide Web Library 150 Week 8. The Web The World Wide Web is one part of the Internet. No one controls the web Diverse kinds of services accessed.
CyberInfrastructure for Network Analysis Importance of, contributions by network analysis Transformation of NA Support needed for NA.
ARD Prasad Indian Statistical Institute, Bangalore.
Building an Infrastructure for Digital Humanities: Issues and Considerations Peter Zhou 周欣平 University of California, Berkeley October 8, 2009.
Metadata Extraction & Web Archives: Automating the Record Creation Process Abbie Grotke / Gina Jones /
 Law School graduates and professional practice.  Canadian Law Societies and their influence on the Law School curriculum.  Composition of law school.
S YMPOSIUM ON B IAS AND D IVERSITY IN IR (aka L IVING K NOWLEDGE S UMMER S CHOOL ) LivingKnowledge Consortium ESSIR Summer School 2011 August 31, 2011,
Search and Access Technologies for Large Scale Web Archives Joseph JaJa, Sangchul Song, and Mike Smorul Institute for Advanced Computer Studies Department.
The Boston TV News Digital Library: Partners WGBH Media Library and Archives (WGBH) Northeast Historic Film (NHF) Boston Public Library (BPL)
‘ “A frontal attack on professionalism, standards and scholarship”?. Democratising archives and the production of knowledge Dr Andrew Flinn, Department.
Collaboration & Learning GlobalSchoolNet ePals IEARN / World Youth News Teacher's Guide to International Collaboration on the Internet Center for Innovation.
Preliminary Findings Baseline Assessment of Scientists’ Data Sharing Practices Carol Tenopir, University of Tennessee
SSE3 Knowledge mangement concepts 1. Agenda What is knowledge management Classification of knowledge Knowledge management process Common/shared information.
The National Digital Stewardship Alliance: Stewardship, Collaboration, Inclusiveness, Exchange.
Strategies for archiving the Danish web space Bjarne Andersen Head of Digital Resources State and University Library, Aarhus
Perspectives on Information Course Introduction January 25, 2016.
Library Web Portals: Reinventing Libraries for the Future
Objectives, activities, and results of the database Lituanistika
Web archives as a research subject
Presentation transcript:

Web Science and Web Archive L3S Wolfgang Nejdl L3S Research Center Hannover, Germany

Hannover

Computer Science and interdisciplinary research on all aspects of the Web Internet: Communication and Networks Information: Accessing information and knowledge on and through the Web Community: Supporting communities and groups on the Web, for research, education, production and entertainment Society: Requirements (technological, social, legal) for the Web Selected projects Web L3S LivingKnowledge: Diversity, opinion and bias on the Web CUbRIK: Searching by computers and humans Glocal: Event-based Search for Networked Media Privacy and clinical research Arcomem: Social Web & Archiving ForgetIT: Concise Preservation via Managed Forgetting

Spam Attack on Copts Gun running from Sudan Are we loosing the past of the web?

Library of Congress In April 2010 LoC and Twitter signed an agreement to archive all tweets since 2006 January 2013: It is clear that technology to allow for scholarship access to large data sets is lagging behind technology for creating and distributing such data. The Library is now pursuing partnerships to allow some limited access capability in reading rooms. German National Library Based on a law of June 22, 2006, the GNL should collect, enrich, catalog and archive Web publications Internet Archive Archiving the Web (3 Petabyte) since 1996 Access possible through the URL National Archives in Denmark, Portugal, etc. Relevant L3S Web Archiving: LiWA, ARCOMEM, ForgetIT Web Search: PHAROS, CUBRIK Web Analysis: EUMSSI ERC Advanced Grant: ALEXANDRIA (2014 – 2018, 2.5 Mill. Euro) Cooperations German National Library, British Library, Internet Archive, Rutgers University, et al

ERC Grant ALEXANDRIA: Temporal Information Retrieval, Exploration and Analytics in Web Archives

ALEXANDRIA Test Beds Temporal Wikipedia English, German, Italian Wikipedia with all revisions Links to news archives (NYTimes, Times, Zeit) and web content Entity extraction and evolution, time and entity aware retrieval Academic Web Archive Academic content in Germany and UK BibSonomy and FreeSearch/DBLP data Time-aware entity extraction and linking, collaborative exploration and analytics Politics on the Web Political web sites: German and UK Web content (together with the British Library, German National Library and Internet Archive), Stanford US collections, new crawls, blogs, social media Social stream aggregation, collaborative analytics, as well as the other research questions

Web Observatory and eHumanities Multidisciplinary Research Questions: How to decide which Web content to capture, in order to enable relevant analysis by the eHumanities? How to document the selection and collection process? How can combining distributed Web Observatories help to cover multiple perspectives, disciplines and tasks (for selection)? How does the Web influence collective and individual remembering and language? How to systematically capture Web evolution and the evolution of observed processes and social realities? What are relevant multidisciplinary methods for a comprehensive analysis of Web content and the (changing) social realities reflected by it? How to deal with legal, commercial and privacy aspects of Web Archiving? Collective remembering & collective memory in the Web Age „Web Memory / Archive“ Web as reflection of social processes and practices, language, culture „Web (Archive) as Memory“ Web Observatory with focus on eHumanities „Web Gedächtnis“