Enhance, Zone, Label, OCR, Text/Bitmaps, Database, Full Text Index/Search, Index/Search by Word, ROI, Pattern.

Presentation transcript:

Pipeline diagram labels: Enhance, Zone, Label, OCR; Text/Bitmaps; Database; Full Text Index/Search; Index/Search by Word, ROI, Pattern; Binarize, Register, Waypointing, Scan, Crop; Intelligent Indexing; Intelligent Browsing; Visual Index/Search by Zone, Label, ROI

Line detection and local refinement for document zoning
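The slide names the technique but not the algorithm. As a rough illustration only, the sketch below detects ruled horizontal lines in a binarized page with a simple row projection profile and treats the bands between them as zones. The function names, the `min_fill` threshold, and the 1 = ink convention are all assumptions made for this example. The "local refinement" step is not implemented here.

```python
from typing import List, Tuple

Image = List[List[int]]  # binarized page: 1 = ink, 0 = background (assumed convention)


def find_horizontal_lines(image: Image, min_fill: float = 0.8) -> List[Tuple[int, int]]:
    """Return (top, bottom) row ranges whose ink coverage is at least min_fill."""
    if not image:
        return []
    width = len(image[0])
    lines: List[Tuple[int, int]] = []
    start = None
    for y, row in enumerate(image):
        is_line = width > 0 and sum(row) / width >= min_fill
        if is_line and start is None:
            start = y                      # a ruled line begins on this row
        elif not is_line and start is not None:
            lines.append((start, y - 1))   # the ruled line ended on the previous row
            start = None
    if start is not None:
        lines.append((start, len(image) - 1))
    return lines


def zones_between_lines(height: int, lines: List[Tuple[int, int]]) -> List[Tuple[int, int]]:
    """Return the row bands that lie between consecutive detected lines."""
    zones: List[Tuple[int, int]] = []
    top = 0
    for line_top, line_bottom in lines:
        if line_top > top:
            zones.append((top, line_top - 1))
        top = line_bottom + 1
    if top < height:
        zones.append((top, height - 1))
    return zones
```

Vertical lines can be found the same way by running the profile over columns. A real page would also need skew correction first, which the Register stage of the pipeline presumably covers.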

A solved problem? Sort of … depending on resolution, clarity, noise, etc.

Definitely NOT a solved problem: cursive, off-line handwriting recognition is an area of active research.

Focus on key fields:
– Record type
– Place
– Date range
– Repository
– Film number
OCR/Index the text
* David Ouimette, Jake Gehring

To facilitate collection analysis and indexing prep...

Agent “looking over the shoulder” of the indexer:
– Name glyphs + ground truth = training set for recognition algorithms
– Agreement/disagreement with the other indexer weights the prior/conditional probabilities
– “Remembers” image patterns and characters from previously entered names to prompt the indexer and refine the probabilities and training set
– Exploits contextual constraints from surrounding data
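One way to read the prior/conditional-probability bullet is as a Bayes update over candidate readings of a name glyph. The sketch below is an assumption about how such an agent might combine evidence, not the presenters' method: `prior_from_history` builds a smoothed prior from names the indexer has already entered, and `update_posterior` multiplies it by a recognizer likelihood. Both function names and the sample names are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, List


def prior_from_history(entered: Iterable[str], candidates: List[str]) -> Dict[str, float]:
    """Add-one smoothed prior over candidates, based on previously entered names."""
    counts = Counter(name for name in entered if name in candidates)
    total = sum(counts.values()) + len(candidates)
    return {name: (counts[name] + 1) / total for name in candidates}


def update_posterior(prior: Dict[str, float], likelihood: Dict[str, float]) -> Dict[str, float]:
    """Bayes rule: the posterior is proportional to prior times likelihood."""
    unnormalized = {name: p * likelihood.get(name, 0.0) for name, p in prior.items()}
    total = sum(unnormalized.values())
    if total == 0:
        return dict(prior)  # no usable evidence: fall back to the prior
    return {name: value / total for name, value in unnormalized.items()}


# Hypothetical example: the indexer has typed "Smith" twice and "Smyth" once.
prior = prior_from_history(["Smith", "Smith", "Smyth"], ["Smith", "Smyth"])
# Hypothetical recognizer scores for the glyph currently on screen.
posterior = update_posterior(prior, {"Smith": 0.3, "Smyth": 0.9})
best_guess = max(posterior, key=posterior.get)
```

Agreement or disagreement between two indexers could feed the same update as an additional likelihood term. How to weight that evidence is left open here, as it is in the slides.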

– Use automated waypointing for light to fine-grained, hierarchical, frame-to-field-level snap-click browsing
– Capture and “remember” the frames and fields the user is browsing. Ask: “5 other users have an interest in this name. Would you like to know who they are?”
– “Amazon Browsing”: “Users who looked at this also looked at the following frames/fields/names …”
– Hyperlinking of fields to source data or related information in this or other collections
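The "users who looked at this also looked at" idea can be approximated with plain co-occurrence counts over browsing sessions. The sketch below is a minimal illustration under that assumption. The session format, the frame identifiers, and both function names are hypothetical, and a production recommender would normally also correct for item popularity.

```python
from collections import Counter, defaultdict
from itertools import combinations
from typing import Dict, Iterable, List


def build_cooccurrence(sessions: Iterable[List[str]]) -> Dict[str, Counter]:
    """Count how often two frames/fields were viewed within the same session."""
    co: Dict[str, Counter] = defaultdict(Counter)
    for items in sessions:
        # set() so that repeated views within one session count only once
        for a, b in combinations(sorted(set(items)), 2):
            co[a][b] += 1
            co[b][a] += 1
    return co


def also_viewed(co: Dict[str, Counter], item: str, top_n: int = 3) -> List[str]:
    """Return the items most often co-viewed with `item`; ties break by name."""
    neighbours = co.get(item, Counter())
    ranked = sorted(neighbours.items(), key=lambda kv: (-kv[1], kv[0]))
    return [other for other, _ in ranked[:top_n]]


# Hypothetical browsing sessions, each a list of frame identifiers.
sessions = [["frame1", "frame2", "frame3"], ["frame1", "frame2"], ["frame2", "frame4"]]
co = build_cooccurrence(sessions)
suggestions = also_viewed(co, "frame1")
```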

Header:
– Name fields
– Place fields
– Date fields
Regions of Interest
Partial Search: Words, Bitmaps, Regions of Interest
Logical Groupings
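For the word case of partial search, one simple reading is a field-tagged inverted index queried by substring. The sketch below assumes that reading. The record layout, field names, and sample values are invented for illustration, and searching bitmaps or regions of interest would need image matching that is not shown.

```python
from collections import defaultdict
from typing import Dict, Set, Tuple

Index = Dict[Tuple[str, str], Set[int]]


def build_field_index(records: Dict[int, Dict[str, str]]) -> Index:
    """Map (field, lowercased word) to the set of record ids containing that word."""
    index: Index = defaultdict(set)
    for rec_id, fields in records.items():
        for field, value in fields.items():
            for word in value.lower().split():
                index[(field, word)].add(rec_id)
    return index


def partial_search(index: Index, field: str, fragment: str) -> Set[int]:
    """Return ids of records whose `field` holds a word containing `fragment`."""
    fragment = fragment.lower()
    hits: Set[int] = set()
    for (indexed_field, word), ids in index.items():
        if indexed_field == field and fragment in word:
            hits |= ids
    return hits


# Hypothetical records keyed by frame number.
records = {
    1: {"name": "John Example", "place": "Springfield Ohio"},
    2: {"name": "Johanna Sample", "place": "Riverton"},
}
index = build_field_index(records)
matches = partial_search(index, "name", "joh")
```

The linear scan over index keys is fine for a sketch. A larger collection would typically use an n-gram or suffix index so that fragments can be looked up directly.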

Longitudinal Search and Visualization: Time, Space

If hindsight is 20-20, what about foresight?

User IDs, Passwords, Interfaces, Security, Software, Data Models, Sharing, Data, Complexity, Collaboration
There is a great need to simplify and unify technologies, algorithms and data.
Complexity, Simplify

– The busy mother of 3
– A turning-the-heart experience
– Rewarding: don’t go away empty-handed
– Pac-Man genealogist
– The “20-minute genealogist”

– Common Data Model
– Record Linkage and Merging
– Standardized collaboration model
– Closer Human + Computer coupling
– Genealogy and Source Data
Of all of these, perhaps the most unifying concept of all is that this is the Lord's work to link and unite families in His temples.