Science & Technology Centers Program Center for Science of Information Bryn Mawr Howard MIT Princeton Purdue Stanford Texas A&M UC Berkeley UC San Diego.

Slides:



Advertisements
Similar presentations
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
Advertisements

Breakout Discussion 1 Facilitators Karen Butler-Purry - TAMU Kevin Tomsovic - UTK.
Presentation at WebEx Meeting June 15,  Context  Challenge  Anticipated Outcomes  Framework  Timeline & Guidance  Comment and Questions.
Broader Impacts: Meaningful Links between Research and Societal Benefits October 23, 2014 Martin Storksdieck I Center for Research on Lifelong STEM Learning.
Science & Technology Centers Program National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC.
Science & Technology Centers Program National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC.
CSoI Center Update– March 1, 2011 Bob Brown, Managing Director Center for Science of Information soihub.org Center Staffing We are pleased to announce.
Background Chronopolis Goals Data Grid supporting a Long-term Preservation Service Data Migration Data Migration to next generation technologies Trust.
Database Theory: Back to the Future Victor Vianu UC San Diego / INRIA.
ActionWebs : Education and Outreach ActionWebs Kickoff Meeting December 17, 2009 │Berkeley, California Education Overview Kristen Gates, Ed.D University.
Broadening Participation  Tom Brady – Potential Activities for ’07 Budget  Sally O’Conner – Cyber Tools for Undergraduate Education.
1 Cyberinfrastructure Framework for 21st Century Science & Engineering (CF21) IRNC Kick-Off Workshop July 13,
Computational Science Education Programs CASC Meeting October 4,2012 Steven I. Gordon
MS DB Proposal Scott Canaan B. Thomas Golisano College of Computing & Information Sciences.
Advanced Topics COMP163: Database Management Systems University of the Pacific December 9, 2008.
SING* and ToNC * Scientific Foundations for Internet’s Next Generation Sirin Tekinay Program Director Theoretical Foundations Communication Research National.
Distributed Intelligence, Lifelong Learning, & Innovative Media: Foundations for Graduate Education Sharon Derry, University of Wisconsin-Madison Gerhard.
Klassificering af Inf. Systemer Baseret på: Luis M. Camarinha-Matos & Hamideh Afsarmanesh: Collaborative networks: a new scientific discipline.
© 2013 Association for Computing Machinery Honeywell Introduction to the ACM Digital Library January 16, 2013 Honeywell Introduction to the ACM Digital.
The Influence of the University/College/Department Mission How your university and department’s missions influence your engineering degree requirements.
SKA-cba-ase NSF and Science of Design Avogadro Scale Engineering Center for Bits & Atoms November 18-19, 2003 Kamal Abdali Computing & Communication.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
1 Building National Cyberinfrastructure Alan Blatecky Office of Cyberinfrastructure EPSCoR Meeting May 21,
Company LOGO Broader Impacts Sherita Moses-Whitlow 07/09/09.
National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC Berkeley UC San Diego UIUC Biology Thrust.
Science & Technology Centers Program National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC.
Science & Technology Centers Program Center for Science of Information Bryn Mawr Howard MIT Princeton Purdue Stanford Texas A&M UC Berkeley UC San Diego.
Designing the Microbial Research Commons: An International Symposium Overview National Academy of Sciences Washington, DC October 8-9, 2009 Cathy H. Wu.
Data Curation Education and Biological Information Specialists DigCCurr 2007 Chapel Hill, April 20, 2007 P. Bryan Heidorn, Carole L. Palmer, Melissa H.
Partnerships and Broadening Participation Dr. Nathaniel G. Pitts Director, Office of Integrative Activities May 18, 2004 Center.
Science & Technology Centers Program Center for Science of Information National Science Foundation Science & Technology Centers Program Bryn Mawr Howard.
Ensemble Computing in the National Science Digital Library (NSDL)
Marcus Bellamy Alun Jones Session 6: Knowledge & Collaboration Networks.
COLUMBIA UNIVERSITY Department of Electrical Engineering The Fu Foundation School of Engineering and Applied Science IN THE CITY OF NEW YORK Signal and.
National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC Berkeley UC San Diego UIUC Biology Thrust.
Science & Technology Centers Program National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC.
Science & Technology Centers Program Center for Science of Information Bryn Mawr Howard MIT Princeton Purdue Stanford Texas A&M UC Berkeley UC San Diego.
Chris DeWald Science Instructional Coordinator Montana Office of Public Instruction.
Pascucci-1 Valerio Pascucci Director, CEDMAV Professor, SCI Institute & School of Computing Laboratory Fellow, PNNL Massive Data Management, Analysis,
Science & Technology Centers Program National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC.
Science & Technology Centers Program National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC.
Science & Technology Centers Program Center for Science of Information Bryn Mawr Howard MIT Princeton Purdue Stanford Texas A&M UC Berkeley UC San Diego.
Information Theory in an Industrial Research Lab Marcelo J. Weinberger Information Theory Research Group Hewlett-Packard Laboratories – Advanced Studies.
What is CDR? – A Few Examples Water Resources in a Changing Climate – Idaho Climate Change Large CD consortia — not the case that everyone works on everything.
Emerging Frontiers of Science of Information Management Objectives, Structure, and Operation.
“A Library outranks any other one thing a community can do to benefit its people.” --Andrew Carnegie.
2 nd G2G Technical Seminar – May 2013 FORECASTS ON RESEARCH IN ENGINEERING DESIGN THEORY, TOOLS AND METHODOLOGY AND ITS IMPACT ON PRODUCT DESIGN AND ENGINEERING.
NOVA Evaluation Report Presented by: Dr. Dennis Sunal.
NSF Peer Review: Panelist Perspective QEM Biology Workshop; 10/21/05 Dr. Mildred Huff Ofosu Asst. Vice President; Sponsored Programs & Research; Morgan.
National Research Council Of the National Academies
Course, Curriculum, and Laboratory Improvement (CCLI) Transforming Undergraduate Education in Science, Technology, Engineering and Mathematics PROGRAM.
1 Why is Digital Curation Important for Workforce and Economic Development? Alan Blatecky Office of Cyberinfrastructure Symposium on Digital Curation in.
2013 SCHOOL OF INFORMATION THEORY PURDUE UNIVERSITY – CENTER FOR SCIENCE OF INFORMATION.
Science & Technology Centers Program National Science Foundation Science & Technology Centers Program Bryn Mawr Howard MIT Princeton Purdue Stanford UC.
Science & Technology Centers Program Center for Science of Information Bryn Mawr Howard MIT Princeton Purdue Stanford Texas A&M UC Berkeley UC San Diego.
Science & Technology Centers Program Center for Science of Information Bryn Mawr Howard MIT Princeton Purdue Stanford Texas A&M UC Berkeley UC San Diego.
Sub-fields of computer science. Sub-fields of computer science.
Intellectual Merit & Broader Impact Statements August 2016
Center for Science of Information
Center for Science of Information
Emerging Frontiers of Science Information
CSoI Update From Barbara: Center-wide post-docs
Emerging Frontiers of Science of Information
Reflections on Lessons Learned: The Center for Innovative Learning Technologies Roy Pea Stanford University AERA 2003 Chicago, Illinois.
Intellectual Merit & Broader Impact Statements August 2018
Intellectual Merit & Broader Impact Statements August 2017
Gulf States Math Alliance 2019 Conference
BCoN Data Integration Workshop, University of Kansas, Feb 13-14, 2018
S-STEM (NSF ) NSF Scholarships for Science, Technology, Engineering, & Mathematics Information Materials 6 Welcome! This is the seventh in a series.
Intellectual Merit & Broader Impact Statements August 2019
Presentation transcript:

Science & Technology Centers Program Center for Science of Information Bryn Mawr Howard MIT Princeton Purdue Stanford Texas A&M UC Berkeley UC San Diego UIUC Center for Science of Information Purdue, October National Science Foundation/Science & Technology Centers Program

Science & Technology Centers Program 1.Science of Information 2.Center Mission & Center Team 3.Research –Structural Information –Learnable Information: Knowledge Extraction –Information Theoretic Approach to Life Sciences 2

Science & Technology Centers Program The Information Revolution started in 1948, with the publication of: A Mathematical Theory of Communication. The digital age began. Claude Shannon: Shannon information quantifies the extent to which a recipient of data can reduce its statistical uncertainty. “semantic aspects of communication are irrelevant...” Objective: Reproducing reliably data: Fundamental Limits for Storage and Communication. Applications Enabler/Driver: CD, iPod, DVD, video games, Internet, Facebook, WiFi, mobile, Google,.. Design Driver: universal data compression, voiceband modems, CDMA, multiantenna, discrete denoising, space-time codes, cryptography, distortion theory approach to Big Data. 3

Science & Technology Centers Program 4 Claude Shannon laid the foundation of information theory, demonstrating that problems of data transmission and compression can be precisely modeled formulated, and analyzed. Science of information builds on Shannon’s principles to address key challenges in understanding information that nowadays is not only communicated but also acquired, curated, organized, aggregated, managed, processed, suitably abstracted and represented, analyzed, inferred, valued, secured, and used in various scientific, engineering, and socio-economic processes..

Science & Technology Centers Program 5 Extend Information Theory to meet new challenges in biology, economics, data sciences, knowledge extraction, … Understand new aspects of information (embedded) in structure, time, space, and semantics, and dynamic information, limited resources, complexity, representation-invariant information, and cooperation & dependency. Center’s Theme for the Next Five Years: DataInformation Knowledge Framing the Foundation

Science & Technology Centers Program 1.Science of Information 2.Center Mission & Center Team 3.Research –Structural Information –Learnable Information: Knowledge Extraction 6

Science & Technology Centers Program Advance science and technology through a new quantitative understanding of the representation, communication and processing of information in biological, physical, social and engineering systems. Some Specific Center’s Goals: RESEARCH define core theoretical principles governing transfer of information, develop metrics and methods for information, apply to problems in physical and social sciences, and engineering, offer a venue for multi-disciplinary long-term collaborations, EDUCATION AND DIVERSITY explore effective ways to educate students, train the next generation of researchers, broaden participation of underrepresented groups, KNOWLEDGE TRANSFER transfer advances in research to industry & broader community. 7

Science & Technology Centers Program Bryn Mawr College: D. Kumar Howard University: C. Liu, L. Burge MIT: P. Shor (co-PI), M. Sudan Purdue University (lead): W. Szpankowski (PI) Princeton University: S. Verdu (co-PI) Stanford University: A. Goldsmith (co-PI) Texas A&M: P.R. Kumar University of California, Berkeley: Bin Yu (co-PI) University of California, San Diego: S. Subramaniam UIUC: O. Milenkovic. Bin Yu, U.C. Berkeley Sergio Verdú, Princeton Peter Shor, MIT Andrea Goldsmith, Stanford Wojciech Szpankowski, Purdue 8 R. Aguilar, M. Atallah, S. Datta, A. Grama, A. Mathur, J. Neville, D. Ramkrishna, L. Si, V. Rego, A. Qi, M. Ward, D. Xu, C. Liu, L. Burge, S. Aaronson, N. Lynch, R. Rivest, Y. Polyanskiy, W. Bialek, S. Kulkarni, C. Sims, G. Bejerano, T. Cover, A. Ozgur, T. Weissman, V. Anantharam, J. Gallant, T. Courtade, M. Mahoney, D. Tse,T.Coleman, Y. Baryshnikov, M. Raginsky, N. Santhanam.

Science & Technology Centers Program Nobel Prize (Economics): Chris Sims National Academies (NAS/NAE) – Bialek, Cover, Datta, Lynch, Kumar, Ramkrishna, Rice, Rivest, Shor, Sims, Verdu, Yu. Turing award winner -- Rivest. Shannon award winners -- Cover and Verdu. Nevanlinna Prize (outstanding contributions in Mathematical Aspects of Information Sciences) – Sudan and Shor. Richard W. Hamming Medal – Cover and Verdu. Humboldt Research Award – Szpankowski. Swartz Prize in Neuroscience – W. Bialek. 9

Science & Technology Centers Program 10 Research Thrusts: 1. Life Sciences 2. Communication 3. Knowledge Management (Data Analysis) S. Subramaniam A. Grama David Tse T. Weissman S. Kulkarni J. Neville CSoI MISSION :Advance science and technology through a new quantitative understanding of the representation, communication and processing of information in biological, physical, social and engineering systems. RESEARCH MISSION : Create a shared intellectual space, integral to the Center’s activities, providing a collaborative research environment that crosses disciplinary and institutional boundaries.

Science & Technology Centers Program 11

Science & Technology Centers Program 12 D. Kumar M. Ward B. Ladd Integrate cutting-edge, multidisciplinary research and education efforts across the center to advance the training and diversity of the work force 1.Summer Schools, Purdue, 2011&2013, Stanford, 2012, UCSD, Center Wide Fellows ( Courtade, Ma, Wang, Kamath, Javanmard) 3.Information Frontiers Curriculum & Learning HUB (from data to information to knowledge) 4.Intro to Science Information ( BMC, Howard, Purdue, GWU) 5.Student Workshop & Student Teams, NSF TUES Grant, CSoI Curriculum, Module and Courses, Faculty Training Workshop, Channels Scholars Program 10.Supplement REU and Professional Development 11.Recruitment of US Citizens 12.Collaborations w/ Minority Serving Programs ( U. Hawaii, UTEP) K. Andronicos

Science & Technology Centers Program 1.Science of Information 2.Center Mission & Center Team 3.Research –Structural Information –Learnable Information: Knowledge Extraction 13

Science & Technology Centers Program Structure: Measures are needed for quantifying information embodied in structures (e.g., information in material structures, nanostructures, biomolecules, gene regulatory networks, protein networks, social networks, financial transactions). [F. Brooks, JACM, 2003] Szpankowski, Choi, Manger : Information contained in unlabeled graphs & universal graphical compression. Grama & Subramaniam : quantifying role of noise and incomplete data, identifying conserved structures, finding orthologies in biological network reconstruction. Neville : Outlining characteristics (e.g., weak dependence) sufficient for network models to be well-defined in the limit. Yu & Qi: Finding distributions of latent structures in social networks. Szpankowski, Baryshnikov, & Duda: structure of Markov fields and optimal compression.

Science & Technology Centers Program 15

Science & Technology Centers Program 16

Science & Technology Centers Program 1.Science of Information 2.Center Mission 3.Integrated Research –Structural Information –Learnable Information: Models for Querying Systems 17

Science & Technology Centers Program 18 Learnable Information (BigData): Data driven science focuses on extracting information from data. How much information can actually be extracted from a given data repository? Information Theory of Big Data? Big data domain exhibits certain properties: Large (peta and exa scale) Noisy (high rate of false positives and negatives) Multiscale (interaction at different levels of abstractions) Dynamic (temporal and spatial changes) Heterogeneous (high variability over space and time Distributed (collected and stored at distributed locations) Elastic (flexibility to data model and clustering capabilities) Complex dependencies (structural, long term) High dimensional Ad-hoc solutions do not work at scale! ``Big data has arrived but big insights have not..’’ Financial Times, J. Hartford.

Science & Technology Centers Program 19 Recommendation systems make suggestions based on prior information: Recommendations are usually lists indexed by likelihood. on server Prior search history Processor Distributed Data Original (large) database is compressed Compressed version can be stored at several locations Queries about original database can be answered reliably from a compressed version of it. Databases may be compressed for the purpose of answering queries of the form: “Is there an entry similar to y in the database?”. Data is often processed for purposes other than reproduction of the original data: (new goal: reliably answer queries rather than reproduce data!) Courtade, Weissman, IEEE Trans. Information Theory, 2013.