TANGO (RPI, June 2009)
George Nagy, Mukkai Krishnamoorthy, Sharad Seth, Raghav Padmanabhan, Ramana C. Jandhyala, Sean Kelley, Max Muthalathu, William Silversmith

Presentation transcript:


June 15, 2009, TANGO Progress Report

Slide 2: Completed Stuff
- WNT (Piyushee, MS May 2008)
- TAT (Raghav, MS May 2009)
- Pubs:
  - ICPR08, WNT, PJ & GN, Dec
  - ICPR08, QBT, RP & GN, Dec
  - MKM09, Tessellations, RJ, RP, MK, GN, SS, WS, July 2009
  - GREC09, TAT results, RP, RP, MK, GN, SS, WS, July 2009

Slide 3: Software
- TAT (demo)
- EX2XY, XY2EX (Ramana)
- OO2XY, XY2OO (Sean, in progress)
- XY2LN (SS, MK)
- XY2WN (Bill)
- TAT stat analysis (RB & GN, in progress)

Slide 4: Partial grammar for X-Y trees (MK & SS)

Example table header cells:
  Employment Status: Unemployed | Employed
  Education: High School or Less | College (BS/BA | Graduate Degree) | High School or Less | College (BS/BA | Graduate Degree)

SXY = { c [ c c ] c [ c { c [ c c ] } c { c [ c c ] } ] }

Grammar G1 for parsing all layout-equivalent tessellations of this kind is:
  S := A
  A := { B }
  B := c [ X ] B | c [ X ]
  X := c X | A X | A | c
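The grammar G1 above is small enough to check by hand with a recursive-descent recognizer. The sketch below is an illustration, not the project's parser: the function names are made up, tokens are assumed to be separated by spaces, and the example string is written with the closing brace that rule A requires. It only answers whether a string is derivable from S.

```python
class ParseError(ValueError):
    """Raised when the token stream does not match grammar G1."""


def peek(tokens, i):
    """Return the token at position i, or None past the end."""
    return tokens[i] if i < len(tokens) else None


def expect(tokens, i, symbol):
    """Consume one expected token and return the next position."""
    if peek(tokens, i) != symbol:
        raise ParseError(f"expected {symbol!r} at token {i}")
    return i + 1


def parse_a(tokens, i):
    # A := { B }
    i = expect(tokens, i, "{")
    i = parse_b(tokens, i)
    return expect(tokens, i, "}")


def parse_b(tokens, i):
    # B := c [ X ] B | c [ X ]   (one or more "c [ X ]" groups)
    while True:
        i = expect(tokens, i, "c")
        i = expect(tokens, i, "[")
        i = parse_x(tokens, i)
        i = expect(tokens, i, "]")
        if peek(tokens, i) != "c":
            return i


def parse_x(tokens, i):
    # X := c X | A X | A | c     (one or more items: a cell or a nested A)
    count = 0
    while True:
        symbol = peek(tokens, i)
        if symbol == "c":
            i += 1
        elif symbol == "{":
            i = parse_a(tokens, i)
        elif count == 0:
            raise ParseError(f"expected 'c' or '{{' at token {i}")
        else:
            return i
        count += 1


def accepts(text):
    """True if the space-separated string is derivable from S := A."""
    tokens = text.split()
    try:
        return parse_a(tokens, 0) == len(tokens)
    except ParseError:
        return False


# Example string from the slide, with the closing brace assumed.
SXY = "{ c [ c c ] c [ c { c [ c c ] } c { c [ c c ] } ] }"
print(accepts(SXY))
```

The loops in `parse_b` and `parse_x` replace the right-recursive rules for B and X; this is an implementation choice and accepts the same strings.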

Slide 5: A’ and A’’ table formats
[Figure: example tables in A’, A’’, and hybrid formats]

Slide 6: Appearance-based distance (WS?)
- Each table cell is described by a vector: width, type size, typeface, indent, justification, alpha/num, color, #_of_chars, …
- Compute differences between horizontally and vertically adjacent cells.
- From the resulting “gradient map”, determine row header, column header, and delta cell regions.
(Show GN’s Excel example)
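The gradient-map idea above can be sketched on a toy grid. Everything specific here is an assumption made for illustration: cells are dictionaries with three made-up features, the distance is a simple count of differing features (numeric features such as width would need a scaled difference instead), and the header boundary is taken to be the row or column boundary with the largest total difference.

```python
def cell_distance(a, b):
    """Count the features on which two cell descriptions differ."""
    return sum(1 for key in a if a[key] != b[key])


def gradient_maps(grid):
    """Return (horizontal, vertical) maps of adjacent-cell differences."""
    rows, cols = len(grid), len(grid[0])
    horizontal = [
        [cell_distance(grid[r][c], grid[r][c + 1]) for c in range(cols - 1)]
        for r in range(rows)
    ]
    vertical = [
        [cell_distance(grid[r][c], grid[r + 1][c]) for c in range(cols)]
        for r in range(rows - 1)
    ]
    return horizontal, vertical


def strongest_boundaries(grid):
    """Index of the row boundary and column boundary with the largest total change.

    Boundary k lies between row (or column) k and k + 1.
    """
    horizontal, vertical = gradient_maps(grid)
    row_scores = [sum(line) for line in vertical]
    col_scores = [
        sum(row[c] for row in horizontal) for c in range(len(horizontal[0]))
    ]
    return row_scores.index(max(row_scores)), col_scores.index(max(col_scores))


# Hypothetical 3x3 table: one column-header row, one row-header column.
header = {"bold": True, "numeric": False, "justify": "center"}
stub = {"bold": False, "numeric": False, "justify": "left"}
data = {"bold": False, "numeric": True, "justify": "right"}
grid = [
    [header, header, header],
    [stub, data, data],
    [stub, data, data],
]

horizontal, vertical = gradient_maps(grid)
print(horizontal)
print(vertical)
print(strongest_boundaries(grid))
```

On this toy grid the largest changes fall just below the first row and just right of the first column, which is where the column header and row header end.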

Slide 7: Prediction of TAT-time
Multiple regression of interaction time from:
- Size of table (#cols, #rows, or #cells)
- Number of aggregates
- Number of footnotes
- Number of units
- Other?
(GN has tried it with 20 tables – have Excel ‘GN_Data_Analysis’)
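As a rough sketch of what such a multiple regression involves, the code below fits an ordinary least-squares model by solving the normal equations in plain Python. The data are synthetic: the times are generated from made-up coefficients so the fit can be checked, and they are not the measurements from the 20 tables mentioned above. The choice of three predictors (cells, aggregates, footnotes) is also only an assumption.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("predictors are linearly dependent")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][k] * x[k] for k in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def fit_linear_model(features, times):
    """Least-squares coefficients [intercept, b1, b2, ...] via normal equations."""
    rows = [[1.0] + [float(v) for v in f] for f in features]
    p = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * t for r, t in zip(rows, times)) for i in range(p)]
    return solve_linear_system(xtx, xty)


def predict(coeffs, feature_row):
    """Predicted interaction time for one table."""
    return coeffs[0] + sum(c * x for c, x in zip(coeffs[1:], feature_row))


# Synthetic tables: (number of cells, number of aggregates, number of footnotes).
features = [
    (20, 0, 0), (40, 1, 0), (60, 0, 2), (80, 2, 1),
    (100, 1, 3), (30, 3, 0), (50, 0, 1), (90, 2, 2),
]
# Made-up generating model, used only so the fit can be verified.
times = [10 + 0.5 * c + 4 * a + 6 * f for c, a, f in features]

coeffs = fit_linear_model(features, times)
print([round(v, 3) for v in coeffs])
print(round(predict(coeffs, (70, 1, 1)), 3))
```

With real timing data the fit would not be exact, and residuals and the significance of each predictor would need to be examined before trusting the model.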

Slide 8: Table similarity
May be useful to determine similar edit sequences.
- Tree distance between X-Y representations (symmetry?)
- Edit distance between linear P-notation for X-Y trees
- Metric for parse sequences??
- Tree distance between Wang category forests? (new)
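Of the candidates above, the edit distance between linear notations is the simplest to sketch. The code below computes the standard Levenshtein distance over tokens. It assumes the linear notation resembles the space-separated bracket strings used for X-Y trees on the grammar slide, and that insertions, deletions, and substitutions all cost 1; neither assumption comes from the slides.

```python
def edit_distance(a, b):
    """Levenshtein distance between two token sequences (unit costs)."""
    previous = list(range(len(b) + 1))  # distance from empty prefix of a
    for i, token_a in enumerate(a, start=1):
        current = [i]
        for j, token_b in enumerate(b, start=1):
            substitution = previous[j - 1] + (token_a != token_b)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


def xy_string_distance(s, t):
    """Edit distance between two space-separated X-Y tree strings."""
    return edit_distance(s.split(), t.split())


# Hypothetical linearizations: the second has one extra cell in its group.
two_cells = "{ c [ c c ] }"
three_cells = "{ c [ c c c ] }"
print(xy_string_distance(two_cells, three_cells))
```

With unit costs this distance is symmetric and satisfies the triangle inequality, so it is a metric on the strings. Whether it tracks similarity of the underlying tables is a separate question.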

Slide 9: Learning ???
- Retain edit sequences from TAT
- Make X-Y tree from each imported but not edited table
- Find distance of X-Y tree from new table to all previous
- Execute edit sequences of nearest neighbor(s)
- Check algorithmically if resulting X-Y tree corresponds to correct WN
- Check visually if table corresponding to resulting X-Y tree is equivalent to original table. If not, edit
- Concatenate further edit and associate with X-Y tree of new table, then add to reference set
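The retrieval part of this loop (find the nearest stored X-Y tree, reuse its edit sequence, then store the new pair) can be sketched as follows. This is a guess at the data flow only. The edit names are hypothetical, X-Y trees are assumed to be space-separated strings, and the distance is a stand-in built on `difflib.SequenceMatcher` from the Python standard library rather than any measure chosen by the project. The algorithmic and visual checks are not modeled.

```python
from difflib import SequenceMatcher


def xy_distance(s, t):
    """Stand-in dissimilarity in [0, 1] between two X-Y tree strings."""
    return 1.0 - SequenceMatcher(None, s.split(), t.split()).ratio()


def nearest_edits(new_xy, reference_set):
    """Return the edit sequence stored with the closest reference X-Y tree."""
    if not reference_set:
        return []
    _, edits = min(reference_set, key=lambda pair: xy_distance(new_xy, pair[0]))
    return list(edits)


def add_to_reference(reference_set, new_xy, applied_edits, further_edits):
    """Store the new tree with the reused edits plus any manual corrections."""
    reference_set.append((new_xy, list(applied_edits) + list(further_edits)))


# Hypothetical reference set: (X-Y string of imported table, edit sequence).
reference_set = [
    ("{ c [ c c ] }", ["merge_header_cells"]),
    ("{ c [ c c ] c [ c c ] }", ["split_row", "delete_footnote"]),
]

new_table = "{ c [ c c c ] }"
suggested = nearest_edits(new_table, reference_set)
print(suggested)

# Suppose the visual check fails and the user adds one more edit.
add_to_reference(reference_set, new_table, suggested, ["fix_units"])
print(len(reference_set))
```

Using several nearest neighbors, as the slide allows, would mean sorting the reference set by distance and trying each edit sequence in turn until one passes the checks.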

Slide 10: Discussion Items
- Lists & Ordering
- XML format and verification
- Augmentations (spotting and processing)
- Open Office
- Table ontology
- XY tree to WN via lexical parse (checks?)
- Use of parse trees for XY2WN
- Learning?
- Overall TANGO evaluation for final report
- Critique draft slides for GREC and MKM
- Tools: RPI: OO, VBA, Matlab, Python; BYU: ??
- Other RPI projects: PERFECT, CERVITOR, CAVIAR

Slide 11: Survival Plans
- NSF TANGO Final Report!
- New NSF proposal (Maria)
- Other possible sponsors?
- Confs
- Archival Journals
- Collaborators
- Demos and dissemination
- Next visit