. gov Toward Digital Government: The Case of Government Statistics Gary Marchionini University of North Carolina at Chapel Hill www.ils.unc.edu/govstat.

Slides:



Advertisements
Similar presentations
Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Advertisements

The SDMX Registry Model April 2, 2009 Arofan Gregory Open Data Foundation.
Geoscience Information Network Stephen M Richard Arizona Geological Survey National Geothermal Data System.
DELOS Highlights COSTANTINO THANOS ITALIAN NATIONAL RESEARCH COUNCIL.
SEVENPRO – STREP KEG seminar, Prague, 8/November/2007 © SEVENPRO Consortium SEVENPRO – Semantic Virtual Engineering Environment for Product.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Metadata for the SKN: Philosophy, Progress, and Future Directions Sheila Denn, Dan Gillman, Carol Hert, Jung Sun Oh, and Cristina Pattuelli.
Information and Business Work
1 Introduction to XML. XML eXtensible implies that users define tag content Markup implies it is a coded document Language implies it is a metalanguage.
Issues in the Transfer of Help Tools to Government Agencies: The Example of the Statistical Interactive Glossary (SIG) Stephanie W. Haas School of Information.
Open Statistics: Envisioning a Statistical Knowledge Network Ben Shneiderman Founding Director ( ), Human-Computer Interaction.
Update and Thoughts on Directions for Metadata Work Carol Hert March 17, 2003.
The Statistical Knowledge Network: Glossary and Metadata at the EIA Stephanie W. Haas & Sheila O. Denn The GovStat Project NSF.
Architecture & Data Management of XML-Based Digital Video Library System Jacky C.K. Ma Michael R. Lyu.
Columbia University Dept of Computer Science Center for Research on Info Access University of So. Calif Information Sciences Institute (ISI)
“Reverse Engineering” Statistical Metadata through User Studies Carol A. Hert Syracuse University January 23, 2003.
The GovStat Project ils.unc.edu/govstat Integration of Data and Interfaces to Enhance Human Understanding of Government Statistics: Toward the National.
Bieber et al., NJIT © Slide 1 Digital Library Integration Masters Project and Masters Thesis Summer and Fall 2005 CIS 786 / CIS Fall.
Distributed Collaborations Using Network Mobile Agents Anand Tripathi, Tanvir Ahmed, Vineet Kakani and Shremattie Jaman Department of computer science.
Metadata for the SKN: Philosophy, Progress, and Future Directions Sheila Denn, Dan Gillman, Carol Hert, Jung Sun Oh, and Cristina Pattuelli.
Knowledge Portals and Knowledge Management Tools
 MODERN DATABASE MANAGEMENT SYSTEMS OVERVIEW BY ENGINEER BILAL AHMAD
Overview of Search Engines
Tool support for Enterprise Architecture in System Architect Architecture Practitioners Conference, Brussels David Harrison Senior Consultant, Popkin.
Metadata: Its Functions in Knowledge Representation for Digital Collections 1 Summary.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
MDC Open Information Model West Virginia University CS486 Presentation Feb 18, 2000 Lijian Liu (OIM:
Aurora: A Conceptual Model for Web-content Adaptation to Support the Universal Accessibility of Web-based Services Anita W. Huang, Neel Sundaresan Presented.
Module Title? DBMS Introduction to Database Management System.
Distributed Access to Data Resources: Metadata Experiences from the NESSTAR Project Simon Musgrave Data Archive, University of Essex.
Using Taxonomies Effectively in the Organization v. 2.0 KnowledgeNets 2001 Vivian Bliss Microsoft Knowledge Network Group
Using ISO/IEC to Help with Metadata Management Problems Graeme Oakley Australian Bureau of Statistics.
1 Distributed Agents for User-Friendly Access of Digital Libraries DAFFODIL Effective Support for Using Digital Libraries Norbert Fuhr University of Duisburg-Essen,
Fusion GPS Externalization Pilot Training 3/1/2011 Lydia M. Naylor Research Lead.
10/18/2015 NORTEL NETWORKS CONFIDENTIAL – FOR TRAINING PURPOSES ONLY Global Documentation Evolution System Overview and End-to-End Process Training.
Using Taxonomies Effectively in the Organization KMWorld 2000 Mike Crandall Microsoft Information Services
February 17, 1999Open Forum on Metadata Registries 1 Census Corporate Statistical Metadata Registry By Martin V. Appel Daniel W. Gillman Samuel N. Highsmith,
FP WIKT '081 Marek Skokan, Ján Hreňo Semantic integration of governmental services in the Access-eGov project Faculty of Economics.
Metadata Architecture at StatCan MSIS 2008 Luxembourg, April 7-9, 2008 Karen Doherty Director General Informatics Branch Statistics Canada.
1 Schema Registries Steven Hughes, Lou Reich, Dan Crichton NASA 21 October 2015.
System models l Abstract descriptions of systems whose requirements are being analysed.
IST Programme - Key Action III Semantic Web Technologies in IST Key Action III (Multimedia Content and Tools) Hans-Georg Stork CEC DG INFSO/D5
August 2005 TMCOps TMC Operator Requirements and Position Descriptions Phase 2 Interactive Tool Project Presentation.
OWL Representing Information Using the Web Ontology Language.
Mercury – A Service Oriented Web-based system for finding and retrieving Biogeochemical, Ecological and other land- based data National Aeronautics and.
Ch- 8. Class Diagrams Class diagrams are the most common diagram found in modeling object- oriented systems. Class diagrams are important not only for.
Eurostat SDMX and Global Standardisation Marco Pellegrino Eurostat, Statistical Office of the European Union Bangkok,
SDMX IT Tools Introduction
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
2.An overview of SDMX (What is SDMX? Part I) 1 Edward Cook Eurostat Unit B5: “Central data and metadata services” SDMX Basics course, October 2015.
Distributed Data Analysis & Dissemination System (D-DADS ) Special Interest Group on Data Integration June 2000.
Fire Emissions Network Sept. 4, 2002 A white paper for the development of a NSF Digital Government Program proposal Stefan Falke Washington University.
DANIELA KOLAROVA INSTITUTE OF INFORMATION TECHNOLOGIES, BAS Multimedia Semantics and the Semantic Web.
David Herring NOAA Climate Program Office May 28, 2013 NOAA Climate.gov A brief overview and highlights of what’s new.
Collaborative Query Previews in Digital Libraries Lin Fu, Dion Goh, Schubert Foo Division of Information Studies School of Communication and Information.
Statistical Data and Metadata Exchange SDMX Metadata Common Vocabulary Status of project and issues ( ) Marco Pellegrino Eurostat
Towards a Statistical Knowledge Network Ben Shneiderman & Catherine Plaisant University of Maryland at College Park Gary Marchionini, Stephanie Haas &
IPDA Architecture Project International Planetary Data Alliance IPDA Architecture Project Report.
1 Open Discovery Space Overview Argiris Tzikopoulos, Ellinogermaniki Agogi Open Discovery Space [CIP-ICT-PSP ][elearning] A socially-powered and.
International Planetary Data Alliance Registry Project Update September 16, 2011.
Geospatial metadata Prof. Wenwen Li School of Geographical Sciences and Urban Planning 5644 Coor Hall
IPDA Registry Definitions Project Dan Crichton Pedro Osuna Alain Sarkissian.
Semantic Web Technologies Readings discussion Research presentations Projects & Papers discussions.
Lecture #11: Ontology Engineering Dr. Bhavani Thuraisingham
Datamining : Refers to extracting or mining knowledge from large amounts of data Applications : Market Analysis Fraud Detection Customer Retention Production.
2. An overview of SDMX (What is SDMX? Part I)
2. An overview of SDMX (What is SDMX? Part I)
United Nations Statistics Division
Malte Dreyer – Matthias Razum
“What Everyone Calls It”
Presentation transcript:

. gov Toward Digital Government: The Case of Government Statistics Gary Marchionini University of North Carolina at Chapel Hill NSF Grants EIA and EIA Principal Investigators: Gary Marchionini, Stephanie Haas, Ben Shneiderman, Catherine Plaisant, and Carol Hert

. gov Digital Government: Leveraging IT Government information dissemination –Websites –Other publications (no mass ings yet) Transactions –Registrations –Census, regulatory filings –Taxes Policy making –E-voting –E-rules Our work focuses on statistical information and agencies as many important decisions by policy makers and citizens depend on statistics

. gov Preliminary Work Human needs –Interviews (agencies, public) –Transaction log analysis – content analysis System development and testing –Novel interfaces –Information architecture –Usability studies

. gov Focus on Tables Table browser –Java applet –DTD for tables (DC and DDI influence) –XML protocol –Mapping metadata elements to interface control mechanisms –Piping data from large databases to applet –User studies Metadata to aid understanding

. gov Statistical Knowledge Network Create SKN prototype with agency partners Integration –Horizontal integration across federal agencies (BLS, EIA, NCHS, Census, SSA, NASS) –Vertical integration from local/state Focus on non-specialists –Help crucial –Metadata drives help User interfaces are the intermediaries to link people and data Find what you need, understand what you find

. gov Data Flow agency data with integrated metadata agency with multiple metadata repositories agency backend data and metadata Distributed Public Intermediary: variable/concept level, XML-based incorporating ISO and DDI providing java-based statistical literacy tools to user interfaces Statistical Ontology firewall Domain Experts End User Communities Domain Ontologies I n t e r f a c e s U s e r end user end users: interact with data from information/concept perspective, not agency perspective membrane end user end user end user end user

. gov Statistical Knowledge Network Architecture Agencies SKN Registry Actions Contribute Find Display Annotate Understand Manipulate Collaborate ….. …………. Objects Actions Private Work Space Objects Actions Private Work Space Objects Actions Private Work Space OntologyRules & Constraints SKN Consortium …... gov Objects Reports metadata Tables metadata People metadata Glossary Annotations

. gov Interface Prototypes: Find, Display, Understand; Leverage Metadata, Glossary, Ontology Relation Browser Mulitlayered help: treemaps, video help Animated Glossary Contextualizer PairTrees Spatial audio for maps Missing Data

. gov Use Case Scenarios to Guide Design Based on discussions with agency partners 20 scenarios 4 detailed with in depth resources located Used to ground ongoing work

. gov Relation Browser++ displaying all webpages EIA

. gov RB++ with Cursor Over Residential Sector

. gov RB++ showing ‘hous’ typed in title field

. gov Multi-layered interfaces 1 level 3 levels of growing complexity map+table +filters map+table +filters +scatterplot map+table +filters +scatterplot

. gov Animated Demonstration Features

. gov Script Guidelines Base the script on a live demonstration (never on a written description) –Focus on tasks (not tours of widgets or conceptual overviews) –Act out the interaction (with minimum description) then describe results in context of task –Start with a tour of main screen components (orient and introduce vocabulary) 5-10 sec. max –Plan a linear sequences made of very short autonomous chunks (15-60 sec.) Map the chunks to existing online documentation Show text title at beginning of each chunk Carefully synchronize voice and visual (hard when alone) Provide duration and file size for individual chunk

. gov Interactive Glossary Development Tools Provide foundation for content development Separate content development from presentation development Reduce overall development time Maximize reuse of existing elements Create multiple presentations from a single content development effort

. gov Animation Template

. gov Content Foundation Template (SIG) Question initial motivation Answer overview, definition Process explanation, equation Example Result statistic, answer Review summary, interpretation

. gov Animation Template Consistent display and interaction for all animations Presents animation and explanatory text simultaneously Navigate (forward and back) through animation segments Complete review of text at any time

. gov Animation Template Three pieces: text, animations, template Text is tagged with content section tags in a separate text file Animation consists of segments in individual animation files Text and animation segments coordinated by placement in template

. gov ontology Semantic level Classes Relationships Constraint rules DTD/XML Schema Structural level Elements Attributes Datatypes SKN Ontology DTD / XML Schema Interface Tools Statistical Interactive Glossary (SIG) Ontology Applications  Knowledge organization  Content and terminology control  Data integration  Query support  Automatic classification support  Reasoning mechanism  Others modeling implementation

. gov unit aged unit aged unit married couples living together, with husband or wife aged 65 or older age SSA household Domain knowledge Operational knowledge estimate poverty estimate poverty benefit Census Bureau FIFARS earning salary wage income family distribution

. gov Project DTD Investigate DDI and ISO Leverage DDI and data cubes Markup a set of objects –Tables –Reports/press releases Use markup to build added value search (find what you need) and help (understand what you find) support into interfaces

. gov The Basic Structure entDscr_1: description of an entity within the marked up document docDscr : description of the markup-what is being marked-up, who marked it up, etc. entDscr_2: description of an entity within the marked up document varDscr_1: description of each variable within an entity, study group or document stdygrpDscr: describes the “group” to which an entity or document belongs such as a survey program nCubeDscr: used when entity is an aggregated table fileDscr: descripes physical file structures for nCubes varDscr_2: description of each variable within an entity, study group or document

. gov One Example of How the DTD Helps The DTD can help bring the “expert knowledge” to the less expert user and bring relevant information together by enabling searching via variables as well as subjects/keywords

. gov Median income, by age, 2001 age persons Age or older

. gov Discovering Metadata Hybrid machine learning approach –Crawl website –Create term document matrices –Use k-means clustering with small K to fit on screen in RB++ –Revise Use structure in the existing sites to train a classifier For small n of concepts, classify site

. gov Combining Machine Learning and Dynamic Interfaces What should these topics be, and how do we know if we’ve found the right names for them?

. gov Combining Machine Learning and Dynamic Interfaces How do we assign thousands of documents to their respective topics?

. gov Initial, Unstructured Approach doc

. gov Initial, Unstructured Approach doc

. gov Initial, Unstructured Approach doc This approach yielded intuitively coherent clusters. But the clusters fall at too fine a level of granularity, while also wasting large portions of the data. Clustering Based on Word Distributions

. gov New Approach, Semi-Supervised

. gov New Approach, Semi-Supervised doc

. gov New Approach, Semi-Supervised doc This approach capitalizes on the agencies’ efforts and expertise, and so far seems to yield superior results. However, the amount of training data is very sparse, and the observed categories have high correlation in some cases. Our current work addresses these tuning issues.

. gov State Statistical Office USDA / NASS State Cooperative Agency (Dept. of Agriculture,etc.) Farmers & Producers Statistical Consumers Supply data to agencies Obtain data from agencies Collection agents Vertical Integration: Agriculture

. gov Multiple Research Threads for the SKN Interfaces Metadata and Ontology Multi-leveled help Automatic slicing and dicing User needs and user testing Cross agency cooperation See