Data Conversion/Transformation Services

High-accuracy converted content down to the character level

Data Conversion: The Building Blocks for Digital Content

DCL prepares digital content for digital distribution. We transform and convert all types of information from all major word processing, typesetting, PDF, and other document formats (as well as paper) into structured formats, including XML, S1000D, DITA, proprietary schemas, and others. The most complex technical documentation typically characterized by elaborate tables, equations, cross-referencing, special characters, footnotes, and specialized imaging requirements. We support most foreign languages, including Latin-based and double-byte characters. 

DCL's proven automated conversion method produces the highest-quality consistently tagged data, while our 24/7 online document tracking system keeps clients informed throughout the entire process. Our extensive US-based project management team and multi-level QA means our clients do not expend internal resources to fix data or content after conversion.

Robust Digitization and Optical Character Recognition (OCR)

Optical character recognition (OCR) is the electronic conversion and identification of scanned images, such as those from paper documents, into searchable, digital text that is readable in a PDF format. DCL’s experts can help you determine if OCR is right for your legacy materials project.

DCL is fully equipped to handle all types of scanning projects including large-scale as well as the many types of paper-based projects that require special or white-glove handling to scan, such as old and rare materials. We are experienced with both destructive and non-destructive materials. 

DCL provides service at any level depending on the accuracy rate you require.

  • Light: OCR without clean-up (accuracy level generally of 95%)

  • Medium: OCR with errors cleanup (accuracy level is around 98-99%)

  • Full: OCR including full proofreading - character level comparison (99.995% accurate)

  • Automated: Large volume projects can be automated - system picks up and processes the file then delivers the results (99.995% accurate) 

Tools: OCR, Computer Vision, sophisticated algorithms, text analysis, automated QC software

Conversion management services that include strategic planning, independent quality assurance, automation, and more

Whether your objective is to handle a conversion project in-house or you know you need outside support, DCL's conversion management services ensure your product runs smoothly. As your full service provider, vendor manager, or support for in-house conversion projects, DCL’s expert team helps determine the correct approach, coordinate internal or outsourced resources, manage resources throughout the project, perform independent QA services pre and/or post conversion, and more. 

DCL ensures your project is completed on-time, on-budget, and within the highest level of accuracy possible.

  • Conversion Planning

  • Feasibility Analysis

  • Content Reuse Analysis

  • Workflow Analysis

  • Project Management

  • Vendor Management

  • Quality Assurance

  • Software Development

  • Full Lifecycle Consulting

  • Project Development and Planning

  • Data Evaluation and Specification

  • Business Case Analysis and Report

  • Format Structuring

  • Vendor Selection

  • Training

  • Automation Options

DCL Conversion Model

DCL Conversion Hub

In order to make the conversion process as fast and flexible as possible, DCL developed the Hub-and-Spoke software architecture in which all data is normalized in the DCL Conversion Hub. The DCL Hub is an XML-based superset format that standardizes all files and breaks them down into their key elements. The files are then filtered into their target format.

The DCL Guarantee

DCL is recognized for its outstanding customer service and average conversion-accuracy rates of 99.9%. Our QA checkpoints ensure the desired level of accuracy is achieved through automated and manual processes.

Optical Character Recognition (OCR) FAQ

What is OCR?


OCR stands for Optical Character Recognition, the process converting digital images (usually scanned documents) into machine searchable texts.




How does OCR work?


OCR requires digital images, so the first stage is usually scanning papers or books. These images are run through OCR programs that output the text data into desired formats, such as PDF, XML, S1000D, DITA, or proprietary schemas.




What is OCR used for?


OCR has broad applications since it can be used in any field to covert paper documents or digital images to text that machines can interpret. Many businesses or organizations use OCR to digitize legacy paper archives so they are accessible and searchable. OCR can replace manual data entry, with AI doing the tedious work of reading and inputting the data instead of human workers. Advances in AI and machine learning have made OCR more accurate and versatile than ever.




How accurate is OCR?


OCR generally has an accuracy of between 95-99.995%, depending on the scope and scale of the project.




Is OCR considered AI?


Artificial Intelligence is often used in modern OCR to improve the accuracy of character recognition, especially with handwritten texts, and to ensure accuracy and quality control.




Can OCR be used to digitize old or rare materials?


Yes, non-destructive white-glove OCR processes can safely digitize delicate or fragile books and papers.





Markup Language Expertise

  • XML

  • HTML

  • NISO JATS

  • MathML

  • S1000D

  • ePUB/MOBI

  • SCORM

  • DITA & DITA Specializations

  • DITA Learning & Training

  • DITA for Publishers

  • Bookshelf

I have been very happy over the years with the work done for our company by DCL, but my latest situation just put it over the top for me. The willingness of the staff to redo work based on updated material that they had already processed and to do it so quickly and graciously was beyond expectations. I could not be more pleased!

Markets Served

DCL's OCR services are ideal for all industries that require digitized and structured content created from static, flat files or printed materials.

Group_3x.png
Shield_3x.png
Cash_3x.png
Library_3x.png
Medicine_3x.png
Scales_3x.png
Graduation_3x.png
Settings_3x.png
Book_3x.png

INDUSTRY MEMBERSHIPS

DCL is proud to play an active role in various industry associations and working groups. Interested in having a DCL content expert serve on your organization?

Contact us!

Stay up to date with DCL!

Learn about product updates, get company news, and receive our monthly newsletter.

  • DCL LinkedIn
  • DCL Twitter
  • DCL YouTube

61-18 190th Street, Suite 205

Fresh Meadows, NY 11365

+1 718.357.8700

info@dclab.com

HOME  /  INDUSTRIES  /   SOLUTIONS  /  SERVICES  /  RESOURCES /  ABOUT  /  CONTACT  /  PRIVACY  /  TERMS OF USE

© 2020 Copyright Data Conversion Laboratory, All Rights Reserved.