Technology: What’s under the hood?

How we’ve trained our AI to fully understand documents

Breakthrough computer vision and natural language processing for document understanding

Pre-trained on 55 million industrial documents, including invoices, receipts, packing lists, shipping labels, and forms

The pre-trained AI requires no additional training when deployed. Deployments can be as fast as 1 day.

Supercharge your document workflows today

Photon provides a universal data capture service, with breakthrough deep learning, image classification, object recognition, image processing, and natural language processing algorithms under the hood.

Photon's proprietary machine learning models have been trained on the largest datasets of industrial documents in the world.

The models get smarter over time. Photon returns structured data from any image, document, or scan by understanding the type, spatial layout, format, and data types of each field.

Read the API Documentation

Computer vision pipeline

Image quality triaging
Automatic thresholding
Boundary detection
Cropping regions of interest
Document classification
Structured layout comprehension
Object detection
Barcode scanning
Image transformations, auto-rotations
De-skew, affine transformations
Printed vs handwritten text classification
Zonal analysis
Cascaded, semi-supervised deep learning model
Pooling
Ensembling
Text capture
Signature validation
Cascaded case escalation
Structured text and barcode data output
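
As an illustration of one early step in the list above, automatic thresholding can be sketched with Otsu's method, which picks the gray level that best separates dark ink from a light background. This is a minimal sketch in plain Python: Otsu's method is a common choice for this step, but the page does not say which algorithm Photon uses, and the function names otsu_threshold and binarize are hypothetical.

```python
def otsu_threshold(pixels):
    """Return the gray level t (0..255) that maximizes between-class variance.

    `pixels` is a flat list of 8-bit grayscale values. Values <= t form one
    class and values > t form the other.
    """
    hist = [0] * 256
    for p in pixels:
        hist[p] += 1

    total = len(pixels)
    sum_all = sum(level * count for level, count in enumerate(hist))

    best_t, best_var = 0, -1.0
    weight_low = 0  # number of pixels with value <= t
    sum_low = 0     # sum of those pixel values
    for t in range(256):
        weight_low += hist[t]
        sum_low += t * hist[t]
        if weight_low == 0:
            continue  # no pixels in the low class yet
        weight_high = total - weight_low
        if weight_high == 0:
            break  # every pixel is already in the low class
        mean_low = sum_low / weight_low
        mean_high = (sum_all - sum_low) / weight_high
        between_var = weight_low * weight_high * (mean_low - mean_high) ** 2
        if between_var > best_var:
            best_t, best_var = t, between_var
    return best_t


def binarize(pixels, threshold):
    """Map each pixel to 1 if it is brighter than the threshold, else 0."""
    return [1 if p > threshold else 0 for p in pixels]


# Toy "scan": dark ink around gray level 10, light paper around 200.
scan = [10] * 50 + [200] * 50
t = otsu_threshold(scan)
mask = binarize(scan, t)
```

On a real scan the histogram is noisier, so the chosen threshold falls between the ink and paper peaks rather than on one of them.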

Natural language processing pipeline

Constituency parsing
Semantic parsing
Bidirectional Recurrent Neural Networks (LSTMs)
Word vector space model similarity matching
Structured field matching
Data type validation
Domain-specific dictionary lookups
Spell checking
Selective case escalation
Relation extraction
Name matching
Address validation
Tracking number lookups
UPC, SKU, item lookups
Invoice, PO, receivables, ASN matching
Database queries
Exception handling and reconciliation
Data type transformations; Extract, Transform, Load (ETL)
API integrations with WMS, ERP, IMS, DB
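
Three of the steps listed above (domain-specific dictionary lookups, spell checking, and data type validation) can be sketched with the Python standard library alone. This is an illustrative sketch, not Photon's implementation: the CARRIERS vocabulary, the function names, the similarity cutoff, and the accepted date formats are all assumptions. Returning None stands in for escalating a case for review.

```python
import difflib
import re
from datetime import datetime
from decimal import Decimal

# Hypothetical domain dictionary; a real system would load its own vocabulary.
CARRIERS = ["FedEx", "UPS", "DHL", "USPS"]


def lookup_term(raw, vocabulary, cutoff=0.6):
    """Snap a possibly misread string to the closest vocabulary entry.

    Returns the canonical entry, or None when nothing is similar enough.
    """
    canonical = {entry.lower(): entry for entry in vocabulary}
    matches = difflib.get_close_matches(
        raw.strip().lower(), list(canonical), n=1, cutoff=cutoff
    )
    return canonical[matches[0]] if matches else None


def parse_amount(raw):
    """Parse a currency string such as '$1,234.50' into a Decimal, else None."""
    cleaned = raw.strip().lstrip("$").replace(",", "")
    if not re.fullmatch(r"\d+(\.\d{1,2})?", cleaned):
        return None
    return Decimal(cleaned)


def parse_date(raw, formats=("%Y-%m-%d", "%m/%d/%Y")):
    """Try each assumed date format in turn; return a date, or None."""
    for fmt in formats:
        try:
            return datetime.strptime(raw.strip(), fmt).date()
        except ValueError:
            continue
    return None


carrier = lookup_term("Fedx", CARRIERS)   # misread text snapped to "FedEx"
amount = parse_amount("12O.00")           # letter O instead of zero: None
```

String similarity is a simple stand-in here. The word vector similarity matching named in the list would compare embeddings instead of characters.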

The Intelligent Document Processing solution for modern businesses

Pre-extraction:
Performs image pre-processing to improve the quality of the scanned document, captures data, and indexes and classifies the documents into categories
Extraction:
Captures relevant data leveraging Natural Language Processing for further processing
Post-extraction:
Validates the extracted data with the help of business logic, validation rules, and enterprise databases
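
The post-extraction stage can be pictured as a set of checks run against each extracted record. The sketch below is a hedged illustration only: the invoice field names, the KNOWN_POS dictionary standing in for an enterprise database, and the two rules themselves are assumptions chosen for the example.

```python
from decimal import Decimal

# Stand-in for an enterprise database of open purchase orders (hypothetical).
KNOWN_POS = {"PO-1001": Decimal("150.00")}


def validate_invoice(invoice, known_pos):
    """Apply simple business rules; return a list of exceptions found.

    An empty list means the invoice passed every rule.
    """
    problems = []

    # Rule 1: line items must add up to the stated total.
    line_total = sum(
        (item["qty"] * item["unit_price"] for item in invoice["lines"]),
        Decimal("0"),
    )
    if line_total != invoice["total"]:
        problems.append(
            f"line items sum to {line_total}, but total is {invoice['total']}"
        )

    # Rule 2: the purchase order must exist and agree on the amount.
    po = invoice.get("po_number")
    if po not in known_pos:
        problems.append(f"unknown purchase order {po}")
    elif known_pos[po] != invoice["total"]:
        problems.append(f"total differs from amount on {po}")

    return problems


good = {
    "po_number": "PO-1001",
    "total": Decimal("150.00"),
    "lines": [{"qty": 3, "unit_price": Decimal("50.00")}],
}
bad = {
    "po_number": "PO-9999",
    "total": Decimal("140.00"),
    "lines": [{"qty": 3, "unit_price": Decimal("50.00")}],
}
```

Records that come back with a non-empty list would go to exception handling and reconciliation rather than straight into downstream systems.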
