Step 1: scanning and uploading an image containing an MRZ to the OCR engine
First, provide us with a PDF or a picture of the identity document, which must contain an MRZ. You can send it from either a web application or a mobile application. The MRZ OCR solution then automatically deskews and crops the image and converts it into a usable format.
Step 2: image to TXT using OCR
Once the PDF or picture has been received and read, it is converted into a TXT file. All the text in the document is extracted, including the MRZ (see the bottom left of the image).
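To illustrate how the MRZ can be picked out of the full extracted text, here is a minimal Python sketch. It is not Klippa's implementation: the function name find_mrz_lines is hypothetical, and the heuristic (keep lines of 30, 36 or 44 characters that contain only A-Z, 0-9 and the "<" separator) is an assumption based on the MRZ formats described in the FAQ further down. The sample lines are modelled on the ICAO 9303 specimen passport.

```python
import re

# Characters allowed in an MRZ: A-Z, 0-9 and the separator "<".
MRZ_CHARS = re.compile(r"[A-Z0-9<]+")
# Line lengths of the three MRZ types (Type 1, Type 2, Type 3).
MRZ_LENGTHS = {30, 36, 44}


def find_mrz_lines(ocr_text: str) -> list:
    """Return the lines of OCR output that look like MRZ lines (heuristic)."""
    candidates = []
    for raw in ocr_text.splitlines():
        line = raw.strip()
        if len(line) in MRZ_LENGTHS and MRZ_CHARS.fullmatch(line):
            candidates.append(line)
    return candidates


# Hypothetical OCR output: ordinary text followed by a two-line MRZ.
sample_text = "\n".join([
    "PASSPORT",
    "Surname: ERIKSSON",
    "P<UTOERIKSSON<<ANNA<MARIA".ljust(44, "<"),
    "L898902C36UTO7408122F1204159ZE184226B<<<<<10",
])

mrz_lines = find_mrz_lines(sample_text)
print(mrz_lines)
```

A length-and-character filter like this can produce false positives on other all-caps lines of exactly the same length, so in practice it would be combined with the checksum validation described in the FAQ.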
Step 3: getting JSON output from the API
In the final step of MRZ OCR, the Klippa Parser takes the TXT file produced in the previous step and converts it into structured JSON, using machine learning and MRZ parsing. The JSON is then returned as output from the API. From this point on, the MRZ data can easily be processed into your database. Additional data, such as the signature and the photo on the passport, can optionally be recognized and extracted from the identity document.
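As a rough illustration of what "MRZ text to structured JSON" means, the Python sketch below splits a Type 3 (passport) MRZ into named fields by character position, following the ICAO 9303 layout. The function name parse_td3 and the JSON keys are hypothetical and are not the actual Klippa API response; consult the API documentation for the real output format. The sample MRZ is modelled on the ICAO 9303 specimen passport.

```python
import json


def parse_td3(line1: str, line2: str) -> dict:
    """Split a two-line, 44-character (Type 3) MRZ into named fields."""
    if len(line1) != 44 or len(line2) != 44:
        raise ValueError("a Type 3 MRZ has two lines of 44 characters")
    # The name field holds SURNAME<<GIVEN<NAMES, padded with "<".
    surname, _, given = line1[5:44].partition("<<")
    return {
        "document_type": line1[0:2].replace("<", ""),
        "issuing_country": line1[2:5],
        "surname": surname.replace("<", " ").strip(),
        "given_names": given.replace("<", " ").strip(),
        "document_number": line2[0:9].replace("<", ""),
        "nationality": line2[10:13],
        "date_of_birth": line2[13:19],  # YYMMDD
        "gender": line2[20],
        "expiry_date": line2[21:27],    # YYMMDD
        "optional_data": line2[28:42].replace("<", ""),
    }


line1 = "P<UTOERIKSSON<<ANNA<MARIA".ljust(44, "<")
line2 = "L898902C36UTO7408122F1204159ZE184226B<<<<<10"

parsed = parse_td3(line1, line2)
print(json.dumps(parsed, indent=2))
```

The check digits at positions 10, 20, 28, 43 and 44 of the second line are deliberately skipped here; they are used for validation rather than returned as data.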
What are the benefits of MRZ scanning?
Automatic recognition of errors, fraud and duplicates.
Spend less on processing of passports and other identity documents by using MRZ OCR.
Scan and process MRZ strings within seconds. Simplify the onboarding process and KYC of your company.
Prevent manual data entry errors with our high-quality MRZ data extraction solution.
Which fields can be extracted?
An MRZ contains the name, document number, nationality, date of birth, gender and expiration date of the document. Sometimes there is optional additional information. When an MRZ is read, these fields are extracted. Optionally, our MRZ OCR solution can also extract signatures and pictures.
Schedule a free online demonstration
Get a clear view of how Klippa can help make your MRZ processing more efficient. A demo takes just 30 minutes.
Frequently Asked Questions
What does MRZ OCR cost?
What is the MRZ code?
What types of MRZ are there?
Can Klippa process all types of MRZ with OCR?
What are the use cases of MRZ OCR?
How does Klippa check the validity of the MRZ?
Is Klippa MRZ processing GDPR compliant?
Is there documentation for the API?
The pricing structure for MRZ scanning recognition depends on the monthly document volumes. Both pay per use and monthly licenses are available. Get in contact with our solution architects to get an exact quote.
An MRZ is a coded zone that is used on identity documents like passports, ID cards and sometimes even driving licenses. MRZ stands for machine-readable zone. It is an international standard (ICAO 9303) developed by governments, customs offices and airlines. It is a way of securely compacting identity document information into a coded string of characters that a computer system or smartphone camera can read. Nowadays almost all passports have an MRZ.
An MRZ consists of two or three strings of characters, checksum numbers and separators on the front of passports or the back of ID cards. The strings are printed in a special font called OCR-B, which was developed specifically to make it easier for computers to read the text and to prevent reading errors as much as possible. The checksum numbers are used to validate the data extracted from the MRZ and are a safety measure against fraud.
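The checksum numbers follow the check digit scheme defined in ICAO 9303: each character is converted to a number (0-9 keep their value, A-Z count as 10-35, the "<" separator counts as 0), multiplied by the repeating weights 7, 3, 1, and the sum is taken modulo 10. The Python sketch below shows that calculation. The function names are hypothetical and this is an illustration of the standard, not Klippa's code; the sample values come from the ICAO 9303 specimen passport.

```python
def mrz_char_value(ch: str) -> int:
    """Numeric value of one MRZ character under ICAO 9303."""
    if "0" <= ch <= "9":
        return int(ch)
    if "A" <= ch <= "Z":
        return ord(ch) - ord("A") + 10  # A=10 ... Z=35
    if ch == "<":
        return 0
    raise ValueError(f"character not allowed in an MRZ: {ch!r}")


def check_digit(field: str) -> int:
    """Check digit of an MRZ field: weighted sum (7, 3, 1) modulo 10."""
    weights = (7, 3, 1)
    total = sum(mrz_char_value(ch) * weights[i % 3] for i, ch in enumerate(field))
    return total % 10


print(check_digit("L898902C3"))  # document number -> 6
print(check_digit("740812"))     # date of birth   -> 2
print(check_digit("120415"))     # expiry date     -> 9
```

If a single character is misread during OCR, the recomputed digit will usually no longer match the one printed in the MRZ, which is what makes the checksum useful for catching reading errors and tampering.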
There are three types of MRZ, called MRZ 1, MRZ 2 and MRZ 3 and referred to below as Type 1, Type 2 and Type 3. Passports are usually fitted with Type 3 and ID cards with Type 1. The Type 3 MRZ consists of two lines of 44 characters each and contains the name, passport number, nationality, date of birth, sex and expiration date. Sometimes additional data is added. The Type 2 format is not commonly used and spans two lines of 36 characters each. The Type 1 format spans three lines of 30 characters each. In all three types, only the characters A-Z and 0-9 and the separator are used. Klippa’s MRZ OCR works on all types of MRZ.
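Because each type has a fixed number of lines and a fixed line length, the type can be determined from the shape of the MRZ alone. A minimal Python sketch, with a hypothetical function name and lookup table:

```python
# (number of lines, characters per line) -> MRZ type
MRZ_FORMATS = {
    (3, 30): "Type 1",
    (2, 36): "Type 2",
    (2, 44): "Type 3",
}


def detect_mrz_type(lines):
    """Return "Type 1", "Type 2" or "Type 3", or None if the shape is unknown."""
    lengths = {len(line) for line in lines}
    if len(lengths) != 1:
        # No lines at all, or lines of differing lengths.
        return None
    return MRZ_FORMATS.get((len(lines), lengths.pop()))


print(detect_mrz_type(["<" * 44, "<" * 44]))            # Type 3
print(detect_mrz_type(["<" * 30, "<" * 30, "<" * 30]))  # Type 1
print(detect_mrz_type(["<" * 44, "<" * 36]))            # None
```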
Yes, Klippa can extract and validate all three types of MRZ strings on passports, ID cards and other travel documents.
Any use case where large numbers of identity documents have to be processed to extract information such as the name, date of birth, nationality, document number, expiration date and more can benefit from MRZ OCR. These are often cases involving identity validation, customer onboarding or KYC.
MRZ strings are available in three structures: Type 1, Type 2 and Type 3. Klippa only accepts MRZs that have the correct structure, font and length to ensure validity. In addition to the structure, we validate the content of the MRZ by checking that it consists only of A-Z, 0-9 and separators. MRZ strings contain checksums to validate the extracted data. Klippa uses these checksums to ensure that the MRZ is valid and accurate.
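The checks described above (length, allowed characters, checksums) can be sketched for a Type 3 MRZ as follows. This is an illustration based on the ICAO 9303 layout, not Klippa's actual validation logic, and the function names are hypothetical. It does not cover the font check, and the specimen data comes from the ICAO 9303 sample passport.

```python
import re

# Exactly 44 characters, each A-Z, 0-9 or the "<" separator.
TD3_LINE = re.compile(r"[A-Z0-9<]{44}")


def _check_digit(field: str) -> int:
    """ICAO 9303 check digit: values weighted 7, 3, 1, summed, modulo 10."""
    def value(ch: str) -> int:
        if ch == "<":
            return 0
        return int(ch) if ch.isdigit() else ord(ch) - 55  # A=10 ... Z=35
    return sum(value(ch) * (7, 3, 1)[i % 3] for i, ch in enumerate(field)) % 10


def validate_td3(line1: str, line2: str) -> list:
    """Return a list of problems; an empty list means every check passed."""
    problems = []
    for number, line in enumerate((line1, line2), start=1):
        if not TD3_LINE.fullmatch(line):
            problems.append(f"line {number}: must be 44 characters of A-Z, 0-9 or '<'")
    if problems:
        return problems
    checks = {
        "document number": (line2[0:9], line2[9]),
        "date of birth": (line2[13:19], line2[19]),
        "expiry date": (line2[21:27], line2[27]),
        "optional data": (line2[28:42], line2[42]),
        # The composite digit covers positions 1-10, 14-20 and 22-43.
        "composite": (line2[0:10] + line2[13:20] + line2[21:43], line2[43]),
    }
    for name, (field, printed) in checks.items():
        # An unused optional field may carry "<" instead of a check digit.
        if name == "optional data" and printed == "<" and set(field) == {"<"}:
            continue
        if str(_check_digit(field)) != printed:
            problems.append(f"{name}: check digit mismatch")
    return problems


line1 = "P<UTOERIKSSON<<ANNA<MARIA".ljust(44, "<")
line2 = "L898902C36UTO7408122F1204159ZE184226B<<<<<10"
tampered = line2[:18] + "3" + line2[19:]  # date of birth changed to 740813

print(validate_td3(line1, line2))     # []
print(validate_td3(line1, tampered))  # date of birth and composite mismatches
```

Returning a list of problems instead of a single true or false value makes it easier to report which field failed, for example to ask the user for a sharper photo.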
All the services that we offer, including MRZ OCR, are fully GDPR compliant. For processing, we only use ISO certified servers within the EU, and a data processor agreement is in place. We do not store any of your data or your customers’ data.
Additionally, Klippa provides an option to mask sensitive data. You can read more about that on our data masking blog.
Please feel free to ask all your questions.
Get in touch by mail, phone or chat!
+31 50 2111631