Electronic Thesis And Dissertation Database Errors Luke Schmader Ryan Mestre Client: Zhiwu Xie CS4624 5/6/2014.

Slides:



Advertisements
Similar presentations
EPrints 3 Technical Overview EPrints 3 Briefing 8 th December 2006, London.
Advertisements

Proprietary and Confidential ProQuest Information & Learning ETD Administrator The Online Solution.
Page 1 of 12 To the ETS - Correspondence Online Training Course Welcome The Correspondence functionality is an online service that enables clients to receive.
For Details Visit : or For any Help Contact the Librarian EBSCOhost 2.0.
CC SQL Utilities.
© 2012 Entrinsik, Inc. Informer Administration Exploring the system menu and functions PRESENTER: Jason Vorenkamp| Informer Software Engineer| March 2012.
Test Automation: Coded UI Test
Sybase PowerBuilder Applications Modernization. 11 October About the Company Founded in 2002 Unites high-level information technology and organization.
JobTracker™ A Job Tracking System for Architects & Engineers Produced by LA Solutions.
A CMS for PhD Theses Oleg Burlaca, Constantin Gaindric, Svetlana Cojocaru Institute of Mathematics and Computer Science Oleg Burlaca, Constantin Gaindric,
A web application for browsing research papers By: Rhea Dookeran 09’
FALL 2011 JACKIE STAPLETON, LIAISON LIBRARIAN MARTHA LAUZON, LIBRARY ASSOCIATE RefWorks: The Basics Introduction to Refworks workshop Pre-learning assessment.
SOFTWARE PRESENTATION ODMS (OPEN SOURCE DOCUMENT MANAGEMENT SYSTEM)
3rd Annual Plex/2E Worldwide Users Conference Page based on Title Slide from Slide Layout palette. Design is cacorp Title text for Title or Divider.
Humboldt University: A workflow model for digital theses and dissertations ETD A workflow model for digital theses and dissertations Developments.
Chapter 12: ADO.NET and ASP.NET Programming with Microsoft Visual Basic.NET, Second Edition.
Reference and Instruction Automated Statistics Gathering and Reporting System Members: Patrick Chen (pyc7) Soo-Yung Cho (sc444) Gregg Herlacher (gah24)
Reference Manager Making your life easier! Updated September 2007.
FIRST COURSE Creating Web Pages with Microsoft Office 2007.
Refworks Presented by Margaret Clark, Reference Librarian FSU College of Law Library September 20, 2005.
Anthony Atkins Digital Library and Archives VirginiaTech ETD Technology for Implementers Presented March 22, 2001 at the 4th International.
CSC 318 WEB APPLICATION DEVELOPMENT.  Introduction to Server Scripting language  Client VS Server  Introduction to PHP  PHP Files and Syntax  Function.
Joel Bapaga on Web Design Strategies Technologies Commercial Value.
Session 5: Working with MySQL iNET Academy Open Source Web Development.
MySQL GUI Administration Tools Rob Donahue Manager, Distributed Systems Development May 7th, 2001 Rob Donahue Manager, Distributed Systems Development.
EDUCATION YOU CAN TRUST ® Windows SharePoint Services Course Review Review provided by: DNS Computing Services, LLC
Version 18 Upgrade: Web OPAC. Version 18 Upgrade: Web OPAC Customization 2 All of the information in this document is the property of Ex Libris Ltd. It.
OCLC Online Computer Library Center CONTENTdm ® Digital Collection Management Software Ron Gardner, OCLC Digital Services Consultant ICOLC Meeting April.
1 PHP and MySQL. 2 Topics  Querying Data with PHP  User-Driven Querying  Writing Data with PHP and MySQL PHP and MySQL.
Using Visual Basic 6.0 to Create Web-Based Database Applications
Implement Online Dissertations to Support Global Learning Environment Presented by Kathy Downes Sha Li Zhang Susan Matveyeva Information Technology Conference.
Online Autonomous Citation Management for CiteSeer CSE598B Course Project By Huajing Li.
Electronic Thesis and Dissertation Database Errors Ryan Mestre Luke Schmader Client: Zhiwu Xie Blacksburg March 3, 2014 Virginia Tech CS 4624.
HTML. Principle of Programming  Interface with PC 2 English Japanese Chinese Machine Code Compiler / Interpreter C++ Perl Assembler Machine Code.
Writing macros and programs for Voyager cataloging Kathryn Lybarger ELUNA 2013 May 3, #ELUNA2013.
Discovering Computers Fundamentals Fifth Edition Chapter 9 Database Management.
ETD Software: Toward the Future with Retrospective Hindsight Gail McMillan Digital Library and Archives, Virginia Tech ETD 2008: 10th International Symposium.
DB-based DAQ monitoring and Physics analysis tools Emiliano Barbuto European Emulsion Group (LNGS May 2003)
 Chapter 14 – Security Engineering 1 Chapter 12 Dependability and Security Specification 1.
CERN - IT Department CH-1211 Genève 23 Switzerland t DB Development Tools Benthic SQL Developer Application Express WLCG Service Reliability.
Keywords Searching and Analysis System Member Student ID Role 刘亮 Liu Liang System Analyst 顾子俊 Gu Zijun Developer 杜菡菡 Du Hanhan
About These Slides This slide set is designed to be used with the OMA sample application It is recommended you follow the steps outlined in the “Preparing.
Tutorial Contents Text Cut/Paste & Format OhioLINK ETD Home UT ETD Home ETD Entry Form UT Grad School University Libraries ETD Guide OhioLINK Electronic.
6 th Annual Focus Users’ Conference 6 th Annual Focus Users’ Conference Import Testing Data Presented by: Adrian Ruiz Presented by: Adrian Ruiz.
Client: Paul Mather Virginia Tech CS4624, Blacksburg May 1, 2014 By Nathanael Bice, Scott Brink & Adam Piorkowski.
Copyright © 2006 Pilothouse Consulting Inc. All rights reserved. Search Overview Search Features: WSS and Office Search Architecture Content Sources and.
1 EndNote X2 Your Bibliographic Management Tool 29 September 2009 Humanities and Social Sciences Resource Teams.
EndNote: The Next Steps Rebecca Starkey Reference Librarian The Joseph Regenstein Library
ETD-db: Workflow, the Short Story Edward A. Fox and Gail McMillan Virginia Tech Newcomers’ ETD 2009 University.
7 February 2008Dietrich Beck A LabVIEW Interface to ELOG ELOG Possible solution paths Status Integration into – CS ? – Domain Management System?
Chapter 5 Introduction To Form Builder. Lesson A Objectives  Display Forms Builder forms in a Web browser  Use a data block form to view, insert, update,
Syllabus Management System Matt Bernstein, Paul Capelli, Jared Segal.
Invitation to Computer Science 6 th Edition Chapter 10 The Tower of Babel.
8 th Semester, Batch 2009 Department Of Computer Science SSUET.
Electronic Theses and Dissertations: The bepress Approach Ben Hermalin Interim Dean, Haas School of Business, UC Berkeley & Co-Founder, bepress.
April 20022/CS/3XAPP 1 Database Design Anatomy of an application John Wordsworth Department of Computer Science The University of Reading
CHAPTER 7 LESSON C Creating Database Reports. Lesson C Objectives  Display image data in a report  Manually create queries and data links  Create summary.
Session 11: Cookies, Sessions ans Security iNET Academy Open Source Web Development.
Managing ETDs with Associated Complex Digital Objects Gabrielle V. Michalek Director, Scholarly Publishing, Archives and Data Services Carnegie Mellon.
Introduction to Business Information Systems by Mark Huber, Craig Piercy, Patrick McKeown, and James Norrie Tech Guide D: The Details of SQL, Data Modelling,
Digitalcommons.unl.edu Archiving Department Records.
GNU EPrints 2 Overview Christopher Gutteridge 19 th October 2002 CERN. Geneva, Switzerland.
L.A.M.P. İlker Korkmaz & Kaya Oğuz CS 350. Why cover a lecture on LAMP? ● Job Opportunities – There are many hosting companies offering LAMP as a web.
Architecture Review 10/11/2004
Fundamental of Databases
Easily retrieve data from the Baan database
Managing ETDs with Associated Complex Digital Objects
How Can I Download My Transactions Directly Into Quicken
EndNote What is EndNote? EndNote Library, how to manage?
…and web frameworks in general
Presentation transcript:

Electronic Thesis And Dissertation Database Errors Luke Schmader Ryan Mestre Client: Zhiwu Xie CS4624 5/6/2014

Intro The Electronic Thesis and Dissertations mission: –“To preserve and provide access to the research and scholarship of Virginia Tech's graduate students” You can Browse through the theses on the library’s scholar page –Each record provides the type of document, author, URN, Title, Degree, Department, Advisory Committee, Keywords, Date of Defense, Availability, Abstract, and a link to the Document(s)

The Problem The ETD database filled with many different errors Parts of many papers are incorrectly displayed All of this must be manually fixed

The Mission 1.Find all the errors within the ETD database 2.Figure out patterns associated with each type of error 3.Make deletions and corrections if possible

Initial Setup Set up a SQL server in Linux along with Apache and phpMyAdmin Would allow us to query records with errors –See how errors displayed in database

Starting the Project Used a web browser to open an XML version of the database Take note of each error and figure out the pattern within the database Fix the error programmatically

Errors Errors were found to be mostly control characters –Characters with an ASCII value of 0-31 The errors were in many different contexts within the database Spanish characters were being displayed as � due to XML encoding issue

The Solution Created a program that automatically finds and corrects all errors in ETD database Accepts error ridden XML, outputs usable and bug free XML Removes need for manual fixes of database

Design Hard code all error patterns into custom text file parser Replace Error cases with corresponding fix Output correct XML and Log File

The Expansion Client wanted a more versatile program Adjustments that make the program work for many circumstances and file types Needs… Configurability

Design Configuration file allows user to specify errors he wants to fix Toggle setting to auto fix according to Valid XML or other patterns Easy to use command line interface

Demo

Questions?