forked from docs/doc-exports
ocr_usermanual_20250311
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com> Co-authored-by: Su, Xiaomeng <suxiaomeng1@huawei.com> Co-committed-by: Su, Xiaomeng <suxiaomeng1@huawei.com>
This commit is contained in:
@ -43,9 +43,9 @@
|
||||
"node_id":"ocr_01_0006.xml",
|
||||
"product_code":"ocr",
|
||||
"code":"3",
|
||||
"des":"There are various factors, such as technology and cost, that limit the performance of OCR services. The system-level constraints are the most significant limitations that",
|
||||
"des":"There are various factors, such as technology and cost, that limit the performance of OCR services. There are system-wide constraints that affect all services, as well as",
|
||||
"doc_type":"usermanual",
|
||||
"kw":"Constraints and Limitations,User Guide",
|
||||
"kw":"Notes and Constraints,User Guide",
|
||||
"search_title":"",
|
||||
"metedata":[
|
||||
{
|
||||
@ -53,7 +53,7 @@
|
||||
"documenttype":"usermanual"
|
||||
}
|
||||
],
|
||||
"title":"Constraints and Limitations",
|
||||
"title":"Notes and Constraints",
|
||||
"githuburl":""
|
||||
},
|
||||
{
|
||||
|
||||
@ -18,9 +18,9 @@
|
||||
"code":"2"
|
||||
},
|
||||
{
|
||||
"desc":"There are various factors, such as technology and cost, that limit the performance of OCR services. The system-level constraints are the most significant limitations that",
|
||||
"desc":"There are various factors, such as technology and cost, that limit the performance of OCR services. There are system-wide constraints that affect all services, as well as",
|
||||
"product_code":"ocr",
|
||||
"title":"Constraints and Limitations",
|
||||
"title":"Notes and Constraints",
|
||||
"uri":"ocr_01_0006.html",
|
||||
"doc_type":"usermanual",
|
||||
"p_code":"",
|
||||
|
||||
@ -6,7 +6,7 @@
|
||||
<div class="section" id="ocr_01_0002__section128491110112715"><h4 class="sectiontitle">Before You Start</h4><p id="ocr_01_0002__p17124194712712">You must have programming capabilities and be familiar with Java, Python, iOS, Android, and Node.js.</p>
|
||||
<p id="ocr_01_0002__p1612464719274">To use OCR, call APIs to detect and extract text from images or scanned documents, convert the text into an editable JSON format, and enter the results into business systems by coding or save them in formats such as TXT or Excel.</p>
|
||||
</div>
|
||||
<div class="section" id="ocr_01_0002__section37251930334"><h4 class="sectiontitle">OCR Capabilities</h4><ul id="ocr_01_0002__ul12323924352"><li id="ocr_01_0002__li10323225357">General OCR<p id="ocr_01_0002__p1049095193518"><a name="ocr_01_0002__li10323225357"></a><a name="li10323225357"></a>Detects and extracts text from images in any format, including tables and documents, and adapts to a range of different layouts and table formats.</p>
|
||||
<div class="section" id="ocr_01_0002__section37251930334"><h4 class="sectiontitle">OCR Capabilities</h4><ul id="ocr_01_0002__ul12323924352"><li id="ocr_01_0002__li10323225357">General OCR<p id="ocr_01_0002__p1049095193518"><a name="ocr_01_0002__li10323225357"></a><a name="li10323225357"></a>Detects and extracts text from images in any format, including tables, documents, certificates, receipts, and forms, and adapts to a range of different layouts and table formats.</p>
|
||||
</li></ul>
|
||||
</div>
|
||||
<div class="section" id="ocr_01_0002__section1556592243711"><h4 class="sectiontitle">Using OCR for the First Time</h4><p id="ocr_01_0002__p927414793720">If you are a first-time user, the following sections are a good place to start:</p>
|
||||
|
||||
@ -1,10 +1,12 @@
|
||||
<a name="ocr_01_0006"></a><a name="ocr_01_0006"></a>
|
||||
|
||||
<h1 class="topictitle1">Constraints and Limitations</h1>
|
||||
<div id="body0000001751918129"><p id="ocr_01_0006__p552814518411">There are various factors, such as technology and cost, that limit the performance of OCR services. The system-level constraints are the most significant limitations that affect all sub-services. In addition to these system-level constraints, each sub-service also has its own independent limitations.</p>
|
||||
<div class="section" id="ocr_01_0006__section1835213451444"><h4 class="sectiontitle">General Table OCR</h4><ul id="ocr_01_0006__ul45289459414"><li id="ocr_01_0006__li15528124513413">Only images in PNG, JPG, JPEG, BMP, or TIFF format can be recognized.</li><li id="ocr_01_0006__li052816457413">No side of the image can be smaller than 15 or larger than 8,192 pixels.</li><li id="ocr_01_0006__li252864513412">The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that the entire table and its surrounding area are included in the image.</li><li id="ocr_01_0006__li1052816459412">An image can be rotated to any angle.</li><li id="ocr_01_0006__li25285450419">Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted table lines cannot be recognized.</li><li id="ocr_01_0006__li75282452041">English and Chinese are supported but support for traditional Chinese characters is limited.</li></ul>
|
||||
<h1 class="topictitle1">Notes and Constraints</h1>
|
||||
<div id="body0000001751918129"><p id="ocr_01_0006__p552814518411">There are various factors, such as technology and cost, that limit the performance of OCR services. There are system-wide constraints that affect all services, as well as service-level constraints that affect individual services only.</p>
|
||||
<div class="section" id="ocr_01_0006__section1835213451444"><h4 class="sectiontitle">General Table OCR</h4><ul id="ocr_01_0006__ul45289459414"><li id="ocr_01_0006__li15528124513413">Only images in PNG, JPG, JPEG, BMP, or TIFF format can be recognized.</li><li id="ocr_01_0006__li052816457413">No side of the image can be smaller than 15 or larger than 8,192 pixels.</li><li id="ocr_01_0006__li252864513412">The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that the entire table and its surrounding area are included in the image.</li><li id="ocr_01_0006__li1052816459412">An image can be rotated to any angle.</li><li id="ocr_01_0006__li25285450419">Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted table lines cannot be recognized.</li><li id="ocr_01_0006__li75282452041">English and Chinese are both supported, but the support for traditional Chinese characters is limited.</li></ul>
|
||||
</div>
|
||||
<div class="section" id="ocr_01_0006__section1735714458418"><h4 class="sectiontitle">General Text OCR</h4><ul id="ocr_01_0006__ul1352874516418"><li id="ocr_01_0006__li352815456411">Only images in PNG, JPG, JPEG, BMP, GIF, TIFF, WebP, PCX, ICO, or PSD format can be recognized.</li><li id="ocr_01_0006__li17528164515418">No side of the image can be smaller than 15 or larger than 8,192 pixels.</li><li id="ocr_01_0006__li352814451247">The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that all text and its surrounding area are included in the image.</li><li id="ocr_01_0006__li452812456417">An image can be rotated to any angle.</li><li id="ocr_01_0006__li185284451744">Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted text cannot be recognized.</li><li id="ocr_01_0006__li175288454412">Supported languages: Chinese, English, some traditional Chinese, Malay, Ukrainian, Hindi, Russian, Vietnamese, Indonesian, Thai, Arabic, German, Latin, French, Italian, Spanish, Portuguese, Romanian, Polish Amharic, Japanese, Korean, Turkish, Norwegian, Danish, and Swedish.</li></ul>
|
||||
<div class="section" id="ocr_01_0006__section1735714458418"><h4 class="sectiontitle">General Text OCR</h4><ul id="ocr_01_0006__ul1352874516418"><li id="ocr_01_0006__li352815456411">Only images in PNG, JPG, JPEG, BMP, GIF, TIFF, WebP, PCX, ICO, PSD, or PDF format can be recognized.</li><li id="ocr_01_0006__li17528164515418">No side of the image can be smaller than 15 or larger than 8,192 pixels.</li><li id="ocr_01_0006__li352814451247">The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that all text and its surrounding area are included in the image.</li><li id="ocr_01_0006__li452812456417">An image can be rotated to any angle.</li><li id="ocr_01_0006__li185284451744">Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted text cannot be recognized.</li><li id="ocr_01_0006__li175288454412">Supported languages: Chinese, English, some traditional Chinese, Malay, Ukrainian, Hindi, Russian, Vietnamese, Indonesian, Thai, Arabic, German, Latin, French, Italian, Spanish, Portuguese, Romanian, Polish Amharic, Japanese, Korean, Turkish, Norwegian, Danish, Swedish, Khmer, and Hebrew.</li></ul>
|
||||
</div>
|
||||
<div class="section" id="ocr_01_0006__section12870144482912"><h4 class="sectiontitle">Smart Document Recognizer</h4><ul id="ocr_01_0006__ul7622153124420"><li id="ocr_01_0006__li86221318446">English and Chinese are both supported, but the support for traditional Chinese characters is limited.</li><li id="ocr_01_0006__li136239319446">Only images in PNG, JPG, JPEG, BMP, GIF, TIFF, WebP, PCX, ICO or PSD format and PDF files can be recognized. PDF files can only be recognized one page at a time, but you can use the <strong id="ocr_01_0006__b154021851778">pdf_page_number</strong> parameter to specify which page you want to recognize.</li><li id="ocr_01_0006__li11623193134410">No side of the image can be smaller than 15 or larger than 8,192 pixels.</li><li id="ocr_01_0006__li1362313104411">The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that all text and its surrounding area are included in the image.</li><li id="ocr_01_0006__li15623133117442">An image can be rotated to any angle.</li><li id="ocr_01_0006__li1862313316443">For more accurate recognition results, the number of characters on a single page must be limited to 1,800 or less.</li><li id="ocr_01_0006__li46238319446">Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted text cannot be analyzed.</li></ul>
|
||||
</div>
|
||||
</div>
|
||||
|
||||
|
||||
@ -3,9 +3,12 @@
|
||||
<h1 class="topictitle1">Functions</h1>
|
||||
<div id="body0000001752569953"><div class="section" id="ocr_01_0028__section1138891517217"><h4 class="sectiontitle">Function Description</h4><ul id="ocr_01_0028__ul1694384719219"><li id="ocr_01_0028__li29431747428">General Table<p id="ocr_01_0028__p1094419474219"><a name="ocr_01_0028__li29431747428"></a><a name="li29431747428"></a>Detects and extracts text and their row and column locations from images of tables in various formats, as well as the text areas outside tables. It is used to store information on documents and reports as structured data.</p>
|
||||
</li><li id="ocr_01_0028__li8944114716216">General Text<p id="ocr_01_0028__p9945144712217"><a name="ocr_01_0028__li8944114716216"></a><a name="li8944114716216"></a>Detects and extracts text and their locations from images and converts them into structured data.</p>
|
||||
</li><li id="ocr_01_0028__li15352464367">Smart Document Recognizer<p id="ocr_01_0028__p118058293716"><a name="ocr_01_0028__li15352464367"></a><a name="li15352464367"></a>Recognizes text, analyzes layout, extracts key-value pairs, identifies tables in various formatted documents such as certificates, receipts, and forms, and converts the results into a structured JSON format.</p>
|
||||
</li></ul>
|
||||
</div>
|
||||
<div class="section" id="ocr_01_0028__section983663013214"><h4 class="sectiontitle">Use Cases</h4><ul id="ocr_01_0028__ul596658129"><li id="ocr_01_0028__li179665815210">Digitalizing paper documents<p id="ocr_01_0028__p18960584218"><a name="ocr_01_0028__li179665815210"></a><a name="li179665815210"></a>Automatically detects and extracts text, signatures, and seals from document images and converts them into structured data for faster review.</p>
|
||||
</li><li id="ocr_01_0028__li113013917386"><p id="ocr_01_0028__p113101110382"><a name="ocr_01_0028__li113013917386"></a><a name="li113013917386"></a>Expense review</p>
|
||||
<p id="ocr_01_0028__p17181100428">Automatically recognizes and digitally inputs employees' invoices, reducing labor costs and enhancing efficiency.</p>
|
||||
</li></ul>
|
||||
</div>
|
||||
</div>
|
||||
|
||||
@ -2,7 +2,7 @@
|
||||
|
||||
<h1 class="topictitle1">Is It Possible to Call the OCR Service From a Different Region Than OBS Resources?</h1>
|
||||
<div id="body0000001704982986"><p id="ocr_01_0078__p158886233512">Cross-region OBS is not supported, and the OBS region must match the region of the service being called.</p>
|
||||
<p id="ocr_01_0078__p98881427358">For OBS resources <a href="https://docs.otc.t-systems.com/object-storage-service/umn/obs_console_operation_guide/permission_control/configuring_a_bucket_policy/configuring_a_standard_bucket_policy.html#obs-03-0142" target="_blank" rel="noopener noreferrer">with public read authorization</a>, they can be accessed over the Internet and can support cross-region calls. Although this is convenient, there is a risk of sensitive information leakage, such as personal private data. It is recommended that you use OCR and OBS services in the same region to avoid this risk.</p>
|
||||
<p id="ocr_01_0078__p98881427358">For OBS resources with public read authorization, they can be accessed over the Internet and can support cross-region calls. Although this is convenient, there is a risk of sensitive information leakage, such as personal private data. It is recommended that you use OCR and OBS services in the same region to avoid this risk.</p>
|
||||
</div>
|
||||
<div>
|
||||
<div class="familylinks">
|
||||
|
||||
@ -3,7 +3,7 @@
|
||||
<h1 class="topictitle1">How Do I Handle the Error APIG.0307?</h1>
|
||||
<div id="body0000001704993894"><p id="ocr_01_0089__p69900714713">If error message "The token must be updated." and error code "APIG.0307" are displayed when you call an OCR API, the token has expired and needs to be updated.</p>
|
||||
<p id="ocr_01_0089__p0990197124714">Perform the following steps to rectify the fault:</p>
|
||||
<ul id="ocr_01_0089__ul499010794711"><li id="ocr_01_0089__li1699014713478">The validity period of a token is 24 hours. Obtain the token again to call the API.</li><li id="ocr_01_0089__li199012714476">Check whether the <a href="https://docs.otc.t-systems.com/additional/endpoints.html" target="_blank" rel="noopener noreferrer">endpoint</a> in the API URL is correct. Services deployed in different regions cannot be called across regions. If APIs in different regions are called, the token is invalid and error code APIG.0307 is displayed.</li></ul>
|
||||
<ul id="ocr_01_0089__ul499010794711"><li id="ocr_01_0089__li1699014713478">The validity period of a token is 24 hours. Obtain the token again to call the API.</li><li id="ocr_01_0089__li199012714476">Check whether the <a href="https://docs.otc.t-systems.com/regions-and-endpoints/index.html" target="_blank" rel="noopener noreferrer">endpoint</a> in the API URL is correct. Services deployed in different regions cannot be called across regions. If APIs in different regions are called, the token is invalid and error code APIG.0307 is displayed.</li></ul>
|
||||
</div>
|
||||
<div>
|
||||
<div class="familylinks">
|
||||
|
||||
@ -7,7 +7,7 @@
|
||||
<p id="ocr_01_0153__p1441321118"><a href="#ocr_01_0153__section169054167464">Step 2: Configuring the Environment</a></p>
|
||||
<p id="ocr_01_0153__p89181638172212"><a href="#ocr_01_0153__section92251373345">Step 3: Using a Token for Authentication</a></p>
|
||||
<p id="ocr_01_0153__p91691491810"><a href="#ocr_01_0153__section26131714406">Step 4: Calling the Service</a></p>
|
||||
<div class="section" id="ocr_01_0153__section1471165201415"><a name="ocr_01_0153__section1471165201415"></a><a name="section1471165201415"></a><h4 class="sectiontitle">Step 1: Subscribing to a Service</h4><ol id="ocr_01_0153__ol168583537176"><li id="ocr_01_0153__li1185819538178">Log in to the OCR management console.<p id="ocr_01_0153__p1880721511339"><a name="ocr_01_0153__li1185819538178"></a><a name="li1185819538178"></a>Select a region based on service requirements. For details about the region where each service is deployed, see <a href="https://docs.otc.t-systems.com/additional/endpoints.html" target="_blank" rel="noopener noreferrer">Regions and Endpoints</a>.</p>
|
||||
<div class="section" id="ocr_01_0153__section1471165201415"><a name="ocr_01_0153__section1471165201415"></a><a name="section1471165201415"></a><h4 class="sectiontitle">Step 1: Subscribing to a Service</h4><ol id="ocr_01_0153__ol168583537176"><li id="ocr_01_0153__li1185819538178">Log in to the OCR management console.<p id="ocr_01_0153__p1880721511339"><a name="ocr_01_0153__li1185819538178"></a><a name="li1185819538178"></a>Select a region based on service requirements. For details about the region where each service is deployed, see <a href="https://docs.otc.t-systems.com/regions-and-endpoints/index.html" target="_blank" rel="noopener noreferrer">Regions and Endpoints</a>.</p>
|
||||
</li><li id="ocr_01_0153__li107813841214">On the page displayed, select and subscribe to your desired APIs.<p id="ocr_01_0153__p1633017350389"><a name="ocr_01_0153__li107813841214"></a><a name="li107813841214"></a>For this example, subscribe to the General Text OCR API.</p>
|
||||
</li></ol>
|
||||
</div>
|
||||
|
||||
Reference in New Issue
Block a user