Entity Extraction

Extracting and transforming content into structured markup

Better Content Structure Delivers Better Business

Extract free-form text from textual and form-based documents then generate target XML schema. Complex algorithms, NLP engines and other techniques are then applied to analyze the unstructured text from documents with wide variations in format and quality, and accurately structure the data.

Tools: DCL Reformer, data mining, entity extraction, NLP engines, custom algorithms

The importance of having a plan and process for content QA

Analysis

The first step in DCL Markup Check is content analysis, which quickly identifies areas for investigation. Organizations recognize immediate benefits including

Intelligent Data Capture

Intelligent character recognition identifies and extracts alphanumeric characters, words, and sentences from both handwritten and printed documents. Using AI and Deep Learning, data and content are automatically identified and extracted for use in your business systems.

Content Examples

Some examples of content to be extracted from forms or hard copies include handwriting, content inserted in forms (PDF or print), variable content in identification cards.

Intelligent data capture

Expertise Across all Formats

  • DITA

  • XML

  • HTML, HTML5

  • PubMed JATS

  • MathML

  • NLM XML

  • NISO STS

  • Bookshelf

  • EPUB/MOBI

  • S1000D

  • SGML

  • MS Word

  • and more

Trustpilot_ratings_5star-RGB.png

I have been very happy over the years with the work done for our company by DCL, but my latest situation just put it over the top for me. The willingness of the staff to redo work based on updated material that they had already processed and to do it so quickly and graciously was beyond expectations. I could not be more pleased!

Markets Served

With sophisticated tools and workflow automation DCL solves complex data-centric business challenges and delivers structured content that empowers your business. With subject matter expertise across many industries, DCL structures the world's content to make it consumable for people and machines.

Group_3x.png
Shield_3x.png
Cash_3x.png
Library_3x.png
Medicine_3x.png
Scales_3x.png
Graduation_3x.png
Settings_3x.png
Book_3x.png

INDUSTRY MEMBERSHIPS

DCL is proud to play an active role in various industry associations and working groups. Interested in having a DCL content expert serve on your organization?

Contact us!

  • DCL LinkedIn
  • DCL Twitter
  • DCL YouTube

61-18 190th Street, Suite 205

Fresh Meadows, NY 11365

+1 718.357.8700

info@dclab.com

HOME  /  INDUSTRIES  /   SOLUTIONS  /  SERVICES  /  RESOURCES /  ABOUT  /  CONTACT  /  PRIVACY  /  TERMS OF USE

© 2020 Copyright Data Conversion Laboratory, All Rights Reserved.