From 2d6f6839ae1db8aa4dbaebfe1ce89ac2c5863f52 Mon Sep 17 00:00:00 2001
From: "Su, Xiaomeng"
Date: Tue, 25 Mar 2025 09:06:28 +0000
Subject: [PATCH] ocr_usermanual_20250311

Reviewed-by: Pruthi, Vineet
Co-authored-by: Su, Xiaomeng
Co-committed-by: Su, Xiaomeng
---
 docs/ocr/umn/ALL_META.TXT.json |  6 +++---
 docs/ocr/umn/CLASS.TXT.json    |  4 ++--
 docs/ocr/umn/ocr_01_0002.html  |  2 +-
 docs/ocr/umn/ocr_01_0006.html  | 10 ++++++----
 docs/ocr/umn/ocr_01_0028.html  |  3 +++
 docs/ocr/umn/ocr_01_0078.html  |  2 +-
 docs/ocr/umn/ocr_01_0089.html  |  2 +-
 docs/ocr/umn/ocr_01_0153.html  |  2 +-
 8 files changed, 18 insertions(+), 13 deletions(-)

diff --git a/docs/ocr/umn/ALL_META.TXT.json b/docs/ocr/umn/ALL_META.TXT.json
index 0640a43c5..ce7a95917 100644
--- a/docs/ocr/umn/ALL_META.TXT.json
+++ b/docs/ocr/umn/ALL_META.TXT.json
@@ -43,9 +43,9 @@
 "node_id":"ocr_01_0006.xml",
 "product_code":"ocr",
 "code":"3",
- "des":"There are various factors, such as technology and cost, that limit the performance of OCR services. The system-level constraints are the most significant limitations that",
+ "des":"There are various factors, such as technology and cost, that limit the performance of OCR services. There are system-wide constraints that affect all services, as well as",
 "doc_type":"usermanual",
- "kw":"Constraints and Limitations,User Guide",
+ "kw":"Notes and Constraints,User Guide",
 "search_title":"",
 "metedata":[
 {
@@ -53,7 +53,7 @@
 "documenttype":"usermanual"
 }
 ],
- "title":"Constraints and Limitations",
+ "title":"Notes and Constraints",
 "githuburl":""
 },
 {
diff --git a/docs/ocr/umn/CLASS.TXT.json b/docs/ocr/umn/CLASS.TXT.json
index 1757c93c2..6ac3adceb 100644
--- a/docs/ocr/umn/CLASS.TXT.json
+++ b/docs/ocr/umn/CLASS.TXT.json
@@ -18,9 +18,9 @@
 "code":"2"
 },
 {
- "desc":"There are various factors, such as technology and cost, that limit the performance of OCR services. The system-level constraints are the most significant limitations that",
+ "desc":"There are various factors, such as technology and cost, that limit the performance of OCR services. There are system-wide constraints that affect all services, as well as",
 "product_code":"ocr",
- "title":"Constraints and Limitations",
+ "title":"Notes and Constraints",
 "uri":"ocr_01_0006.html",
 "doc_type":"usermanual",
 "p_code":"",
diff --git a/docs/ocr/umn/ocr_01_0002.html b/docs/ocr/umn/ocr_01_0002.html
index 1433ae4f1..c76153572 100644
--- a/docs/ocr/umn/ocr_01_0002.html
+++ b/docs/ocr/umn/ocr_01_0002.html
@@ -6,7 +6,7 @@

Before You Start

You must have programming capabilities and be familiar with Java, Python, iOS, Android, and Node.js.

To use OCR, call APIs to detect and extract text from images or scanned documents, convert the text into an editable JSON format, and enter the results into business systems by coding or save them in formats such as TXT or Excel.
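
The last step of that workflow (turning the JSON result into a TXT or spreadsheet-friendly file) could look roughly like the sketch below. The response shape used here, a `result.words_block_list` array whose items carry a `words` string, is an assumption made for illustration and should be checked against the API reference for the service you call. The helper names are hypothetical, and CSV is used as a stand-in that Excel can open.

```python
import csv
import json
from pathlib import Path
from typing import List


def extract_lines(response_json: str) -> List[str]:
    """Pull the recognized text lines out of an OCR JSON response.

    Assumption: the response looks like
    {"result": {"words_block_list": [{"words": "..."}, ...]}}.
    """
    data = json.loads(response_json)
    blocks = data.get("result", {}).get("words_block_list", [])
    return [block["words"] for block in blocks if "words" in block]


def save_as_txt(lines: List[str], path: str) -> None:
    """Write one recognized line per row of a plain-text file."""
    Path(path).write_text("\n".join(lines), encoding="utf-8")


def save_as_csv(lines: List[str], path: str) -> None:
    """Write the lines to a CSV file that spreadsheet tools can open."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["line_no", "text"])
        for number, text in enumerate(lines, start=1):
            writer.writerow([number, text])
```

A caller would pass the raw response body to `extract_lines` and then hand the resulting list to either writer.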

-

OCR Capabilities

  • General OCR

    Detects and extracts text from images in any format, including tables and documents, and adapts to a range of different layouts and table formats.

    +

    OCR Capabilities

    • General OCR

      Detects and extracts text from images in any format, including tables, documents, certificates, receipts, and forms, and adapts to a range of different layouts and table formats.

    Using OCR for the First Time

    If you are a first-time user, the following sections are a good place to start:

diff --git a/docs/ocr/umn/ocr_01_0006.html b/docs/ocr/umn/ocr_01_0006.html
index 911f75eda..d3cca1a5f 100644
--- a/docs/ocr/umn/ocr_01_0006.html
+++ b/docs/ocr/umn/ocr_01_0006.html
@@ -1,10 +1,12 @@
-

    Constraints and Limitations

    -

    There are various factors, such as technology and cost, that limit the performance of OCR services. The system-level constraints are the most significant limitations that affect all sub-services. In addition to these system-level constraints, each sub-service also has its own independent limitations.

    -

    General Table OCR

    • Only images in PNG, JPG, JPEG, BMP, or TIFF format can be recognized.
    • No side of the image can be smaller than 15 or larger than 8,192 pixels.
    • The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that the entire table and its surrounding area are included in the image.
    • An image can be rotated to any angle.
    • Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted table lines cannot be recognized.
    • English and Chinese are supported but support for traditional Chinese characters is limited.
    +

    Notes and Constraints

    +

    There are various factors, such as technology and cost, that limit the performance of OCR services. There are system-wide constraints that affect all services, as well as service-level constraints that affect individual services only.

    +

    General Table OCR

    • Only images in PNG, JPG, JPEG, BMP, or TIFF format can be recognized.
    • No side of the image can be smaller than 15 or larger than 8,192 pixels.
    • The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that the entire table and its surrounding area are included in the image.
    • An image can be rotated to any angle.
    • Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted table lines cannot be recognized.
    • English and Chinese are both supported, but the support for traditional Chinese characters is limited.
    -

    General Text OCR

    • Only images in PNG, JPG, JPEG, BMP, GIF, TIFF, WebP, PCX, ICO, or PSD format can be recognized.
    • No side of the image can be smaller than 15 or larger than 8,192 pixels.
    • The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that all text and its surrounding area are included in the image.
    • An image can be rotated to any angle.
    • Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted text cannot be recognized.
    • Supported languages: Chinese, English, some traditional Chinese, Malay, Ukrainian, Hindi, Russian, Vietnamese, Indonesian, Thai, Arabic, German, Latin, French, Italian, Spanish, Portuguese, Romanian, Polish, Amharic, Japanese, Korean, Turkish, Norwegian, Danish, and Swedish.
    +

    General Text OCR

    • Only images in PNG, JPG, JPEG, BMP, GIF, TIFF, WebP, PCX, ICO, PSD, or PDF format can be recognized.
    • No side of the image can be smaller than 15 or larger than 8,192 pixels.
    • The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that all text and its surrounding area are included in the image.
    • An image can be rotated to any angle.
    • Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted text cannot be recognized.
    • Supported languages: Chinese, English, some traditional Chinese, Malay, Ukrainian, Hindi, Russian, Vietnamese, Indonesian, Thai, Arabic, German, Latin, French, Italian, Spanish, Portuguese, Romanian, Polish, Amharic, Japanese, Korean, Turkish, Norwegian, Danish, Swedish, Khmer, and Hebrew.
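
The format and size constraints listed for General Table OCR and General Text OCR can be checked on the client before a request is sent. The following is a minimal sketch, not part of the service: the function name and the service keys are hypothetical, the format is inferred from the file extension (an assumption, since the lists above name formats rather than extensions), and the pixel dimensions are expected to come from whatever image library you already use.

```python
from pathlib import Path
from typing import List

# Side-length limits quoted in the constraints above.
MIN_SIDE_PX = 15
MAX_SIDE_PX = 8192

# Formats per service, copied from the constraint lists above.
SUPPORTED_FORMATS = {
    "general_table": {"png", "jpg", "jpeg", "bmp", "tiff"},
    "general_text": {
        "png", "jpg", "jpeg", "bmp", "gif", "tiff",
        "webp", "pcx", "ico", "psd", "pdf",
    },
}


def check_image(service: str, filename: str, width: int, height: int) -> List[str]:
    """Return a list of constraint violations; an empty list means none found."""
    problems = []
    extension = Path(filename).suffix.lower().lstrip(".")
    if extension not in SUPPORTED_FORMATS[service]:
        problems.append(f"format '{extension}' is not supported by {service}")
    for name, side in (("width", width), ("height", height)):
        if side < MIN_SIDE_PX or side > MAX_SIDE_PX:
            problems.append(
                f"{name} {side}px is outside {MIN_SIDE_PX}-{MAX_SIDE_PX}px"
            )
    return problems
```

For example, `check_image("general_table", "scan.gif", 1000, 800)` reports one problem, because GIF appears only in the General Text OCR list.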
    +
    +

    Smart Document Recognizer

    • English and Chinese are both supported, but the support for traditional Chinese characters is limited.
    • Only images in PNG, JPG, JPEG, BMP, GIF, TIFF, WebP, PCX, ICO or PSD format and PDF files can be recognized. PDF files can only be recognized one page at a time, but you can use the pdf_page_number parameter to specify which page you want to recognize.
    • No side of the image can be smaller than 15 or larger than 8,192 pixels.
    • The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that all text and its surrounding area are included in the image.
    • An image can be rotated to any angle.
    • For more accurate recognition results, keep the number of characters on a single page to 1,800 or fewer.
    • Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted text cannot be analyzed.
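
Because a PDF is recognized one page at a time, a request for a PDF needs to say which page is wanted. The sketch below shows one way a request body might be assembled. Only the `pdf_page_number` parameter name comes from the text above; the `data` field holding Base64-encoded file content and the helper name are assumptions for illustration, so check the API reference for the actual request schema.

```python
import base64
import json
from typing import Optional


def build_request_body(file_bytes: bytes, pdf_page_number: Optional[int] = None) -> str:
    """Build a JSON request body for a single image or a single PDF page.

    Assumption: the file content is sent Base64-encoded in a field named "data".
    """
    body = {"data": base64.b64encode(file_bytes).decode("ascii")}
    if pdf_page_number is not None:
        # Only one page is recognized per request, so exactly one number is sent.
        body["pdf_page_number"] = pdf_page_number
    return json.dumps(body)
```

To process a multi-page PDF under this scheme, a caller would issue one request per page and combine the results afterwards.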
diff --git a/docs/ocr/umn/ocr_01_0028.html b/docs/ocr/umn/ocr_01_0028.html
index eabdee5d5..b9766d86a 100644
--- a/docs/ocr/umn/ocr_01_0028.html
+++ b/docs/ocr/umn/ocr_01_0028.html
@@ -3,9 +3,12 @@

    Functions

    Function Description

    • General Table

      Detects and extracts text, along with its row and column locations, from images of tables in various formats, as well as the text areas outside tables. It is used to store information from documents and reports as structured data.

    • General Text

      Detects and extracts text and its location from images and converts the results into structured data.

      +
    • Smart Document Recognizer

      Recognizes text, analyzes layout, extracts key-value pairs, and identifies tables in documents of various formats, such as certificates, receipts, and forms, and then converts the results into a structured JSON format.

    Use Cases

    • Digitalizing paper documents

      Automatically detects and extracts text, signatures, and seals from document images and converts them into structured data for faster review.

      +
    • Expense review

      +

      Automatically recognizes and digitally inputs employees' invoices, reducing labor costs and enhancing efficiency.

diff --git a/docs/ocr/umn/ocr_01_0078.html b/docs/ocr/umn/ocr_01_0078.html
index 3722ee9c7..c03dba690 100644
--- a/docs/ocr/umn/ocr_01_0078.html
+++ b/docs/ocr/umn/ocr_01_0078.html
@@ -2,7 +2,7 @@

    Is It Possible to Call the OCR Service From a Different Region Than OBS Resources?

    Cross-region OBS is not supported, and the OBS region must match the region of the service being called.

    -

    OBS resources with public read authorization can be accessed over the Internet and therefore support cross-region calls. Although this is convenient, it risks leaking sensitive information, such as personal data. You are advised to use the OCR and OBS services in the same region to avoid this risk.

    +

    OBS resources with public read authorization can be accessed over the Internet and therefore support cross-region calls. Although this is convenient, it risks leaking sensitive information, such as personal data. You are advised to use the OCR and OBS services in the same region to avoid this risk.
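
A client can guard against accidental cross-region calls by comparing the region of the OBS object URL with the region of the OCR endpoint before sending a request. The sketch below assumes host names of the form `<bucket>.obs.<region>.<domain>` and `ocr.<region>.<domain>`. That pattern, the `example.com` domain, and the helper names are hypothetical, so adapt the parsing to the endpoints you actually use.

```python
from typing import Optional
from urllib.parse import urlparse


def region_from_host(host: str, service: str) -> Optional[str]:
    """Return the label that follows the service name in a host name.

    Assumption: the region is the DNS label directly after "obs" or "ocr".
    """
    labels = host.lower().split(".")
    if service in labels:
        index = labels.index(service)
        if index + 1 < len(labels):
            return labels[index + 1]
    return None


def same_region(obs_url: str, ocr_endpoint: str) -> bool:
    """True only when both URLs expose a region and the two regions match."""
    obs_region = region_from_host(urlparse(obs_url).hostname or "", "obs")
    ocr_region = region_from_host(urlparse(ocr_endpoint).hostname or "", "ocr")
    return obs_region is not None and obs_region == ocr_region
```

Returning `False` when a region cannot be determined is a deliberately cautious choice: an unrecognized host name is treated the same way as a mismatch.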