Public Meeting on Data Dissemination Request for Information
Office of the Chief Information Officer
September 24, 2009

Agenda
- Introduction
- Overview
  - Data Dissemination Objectives
  - Current Limitations
- Key Objectives of RFI
- Questions

Federal Data Dissemination Objectives
- Transparent
  - Promote accountability
  - Provide information for citizens on what their government is doing
- Participatory
  - Agencies encouraged to provide citizens opportunities to participate in policy making
  - Agencies encouraged to solicit ideas from citizens on how to improve those opportunities
- Collaborative
  - The public builds innovative tools to enable collaboration across and at all levels of government and private industry

Current USPTO Data Delivery Options
- USPTO has over 150 distinct data products comprising
  - Bulk data sets
  - Interactive optical disc products
  - Statistical reports
  - Details of these data sets are in Attachment 1 – Current Data Sets
- Free access to raw data sets through Data.gov
  - Patent Bibliographic Data for Grants and Applications
  - Trademark Application Text and Images
  - Trademark Trial and Appeal Board data
  - Trademark Assignment data

Current USPTO Data Delivery Options, continued
- Online search systems designed for single queries, including:
  - AppFT/AIW and PatFT/PIW – Patent Grants and Applications
  - Public/Private PAIR – Patent file wrapper and status information
  - PSIPS – Patent Sequencing Data
  - AOTW – Patent and Trademark Assignments
  - TTAB – Trademark Trial and Appeal Board
  - TARR – Trademark Applications and Registrations
  - TDR – Trademark File Wrapper
  - TESS – Trademark Electronic Search System
- These systems were not designed for large numbers of hits or for bulk queries and deliveries, and they are directly linked to examiners' systems.

Key Requests from IP Community
- More Data – All of it!
- Unrestricted Access – as much as I can consume
- Flexible Delivery Options – Bulk
- Consistent Formatting – Useful data format
- Public Access to Data at no cost
- Fairly Distributed

Data Dissemination Limitations
- Hardware & network
  - Industry standard average lifespan is 5-7 years
  - Almost all the network switches were purchased in [...]
  - [...]% of our servers are 7 years old or more
  - Beyond the 7 year lifespan: no support for problems, no replacement parts, and growing failures
  - Incredible growth in the number of Examiners has maxed out our systems
- Software
  - Legacy software running on aging and outdated technologies
  - Unmanaged growth and change
  - Poor documentation of configurations and interconnectivity
  - Deficient coding standards
- Data structure – public and private data are integrated, making a preliminary extract necessary
- Data volume – approx. 10,000 patent grants and published applications each week PLUS metadata
There is a plan to fix this over the next six years – USPTO wants to accelerate Data Dissemination in the public interest

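The "preliminary extract" mentioned under Data structure can be pictured with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Record` type, its field names, the `is_public` flag, and the `PUBLIC_FIELDS` list are hypothetical and do not describe any actual USPTO system or schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Record:
    """Hypothetical mixed record holding both public and private fields."""
    application_number: str  # hypothetical identifier
    title: str
    is_public: bool          # hypothetical flag marking a published record
    examiner_notes: str      # hypothetical private field, never released


# Hypothetical whitelist of fields that may leave the internal system.
PUBLIC_FIELDS = ("application_number", "title")


def extract_public(records):
    """Keep only records flagged public, and only their whitelisted fields."""
    return [
        {name: getattr(record, name) for name in PUBLIC_FIELDS}
        for record in records
        if record.is_public
    ]
```

A whitelist of fields is used here rather than a blacklist, so that a private field added later is withheld by default.
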
Data Dissemination Limitations: Data not yet available in bulk
- PALM Meta Data of Published Applications or Patents
  - Transaction History
  - Patent Term Adjustment
  - Foreign Priority
  - Attorney Address
  - Continuity Data
  - Publication Info
- IFW Meta Data associated with the Documents
- IFW Image Data
- All Fee Information for Published Applications
- Trademark retrospective file of cropped images/logos

Overview: High Level Parts
- Data extraction (at the USPTO)
  - Data scattered through many USPTO systems and data stores
  - Private and public data mixed
  - Systems tied to examiner production (must separate)
  - Have to honor Federal IT security standards
- Data Processing/Packaging (at the USPTO)
  - Need a smaller footprint of data than there is today
  - Image and text in the same package
  - Various formats today; USPTO needs to embrace open standard formats (e.g., international standards, XML, PDF, & OpenDoc)
- Data Hosting (Anywhere)
  - Outside of USPTO
  - Equal & Fair (public data distributed quickly to all before enhanced data)
  - Bulk data ('checksummed' as-is) for free

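As an illustration of what 'checksummed' as-is bulk data allows, a recipient can confirm that a downloaded file matches the digest published alongside it. The Python sketch below is a minimal example under stated assumptions: the slides do not name a checksum algorithm, so SHA-256 is an arbitrary choice here, and the function names `sha256_of_file` and `is_unaltered` are hypothetical.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks
    so that large bulk files do not have to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_unaltered(path, published_digest):
    """True if the local copy matches the digest published with the data set.

    The published digest is assumed to be a hex string; surrounding
    whitespace and letter case are ignored.
    """
    return sha256_of_file(path) == published_digest.strip().lower()
```

A redistributor hosting the data outside of USPTO could publish the same digest next to each file, letting the public check that the free copy is byte-for-byte the original.
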
Key Objectives of RFI
- Market Research
- No Cost Contract to the Federal Government (USPTO)
- Fund development of new infrastructure
- Work with USPTO OCIO – OCIO will own the system, including all data rights and associated code
- Fund ongoing operations; access to new data
- Data format recommendations
- Redistribute the data as-is from USPTO to the public for free
- Add value to the data and sell it, provide access tools, etc.

RFI Responses
- Written responses only
- 20 page limit
- Electronic submission only
- Send with responses to this box by 2:00 pm, EST on October 15, 2009:
- RFI modification to be posted on FBO with Q&A

Question and Answer Document
All of the questions submitted to USPTO prior to today's meeting are provided in the handout along with the agency's responses.

Q&A
Questions?