DCL Announces Technology Breakthrough at the USPTO: Over 1 Million Pages Each Month Automatically Transformed into Searchable, Minable Content
DCL TO CO-PRESENT WITH USPTO AND CGI FEDERAL AT THE BALISAGE CONFERENCE IN AUGUST 2015
Fresh Meadows, NY, June 25, 2015 – Data Conversion Laboratory (DCL), the US Patent and Trademark Office (USPTO), and CGI Federal will team up at 4:00 pm on Tuesday, August 11, 2015, to present a case study titled “Implementing a System at US Patent and Trademark Office to Fully Automate the Conversion of Filing Documents to XML” at the Balisage Conference in Bethesda, Maryland.
The co-presenters will describe an automated system designed to ingest a wide range of incoming document images delivered to the USPTO and convert them to XML without human intervention. The co-presenters are Terrel Morris, Supervisory Program Manager for the USPTO; Amit Khare, Director of Consulting for CGI Federal; and Mark Gross, President of DCL. Together they bring extensive experience in working with patent documents and analyzing complex information, and all three have worked directly on the conversion engagement for the USPTO.
“The USPTO is in the process of modernizing all of its patent examination tools, and the data created in this project allows us to leverage business intelligence to improve the quality of our work,” explained the USPTO’s Mr. Morris. “The system, automatically delivering over 1,000,000 XML pages every month, has transformed the USPTO community since the inception of the program,” said Mark Gross, DCL’s President.
While digitizing content into XML is not new, doing so in a fully automated process changes the economic dynamics, making it feasible to digitize large content collections and large information flows, like those coming into the USPTO. The process at the USPTO allows more precise and faster review of information and automates many technical aspects of examination, such as mapping claims and presenting analytic reports that patent examiners can use to facilitate their work. These enhancements allow patent examiners to focus more of their time on the real job of examining patents.
Using the USPTO experience as a guide, the presenters will also discuss how this kind of automated processing can be extended to other government agencies and organizations that need to review and process large document flows. As data collections grow larger, conventional conversion techniques, however efficient they may be, are not economically feasible. An automated system can pre-process incoming page images to improve the results of optical character recognition (OCR), convert the extracted information to XML, and tag it with structure to achieve the necessary accuracy and precision.
Balisage: The Markup Conference 2015 will be held August 11-14, 2015, at the Bethesda North Marriott Hotel & Conference Center, outside of Washington, D.C. The conference is an annual event devoted to the theory and practice of descriptive markup and related technologies that structure and manage information. For more information or to register, please visit www.balisage.net.