ocr_api_20250311

Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Co-authored-by: Su, Xiaomeng <suxiaomeng1@huawei.com>
Co-committed-by: Su, Xiaomeng <suxiaomeng1@huawei.com>
This commit is contained in:
2025-03-26 09:05:18 +00:00
committed by zuul
parent 70ca474258
commit 13aa021d1d
13 changed files with 1198 additions and 32 deletions

View File

@ -27,7 +27,7 @@
"code":"3"
},
{
"desc":"An endpoint is the request address used to call an API. Different services have different endpoints for different regions. You can query all service endpoints at Regions ",
"desc":"An endpoint is the request address for calling an API. Endpoints vary depending on services and regions. For more information, see Regions and Endpoints.",
"product_code":"ocr",
"title":"Endpoint",
"uri":"ocr_03_0062.html",
@ -38,7 +38,7 @@
{
"desc":"Only images in PNG, JPG, JPEG, BMP, or TIFF format can be recognized. No side of the image can be smaller than 15 or larger than 8,192 pixels. The area to be recognized mus",
"product_code":"ocr",
"title":"Constraints and Limitations",
"title":"Notes and Constraints",
"uri":"ocr_03_0063.html",
"doc_type":"api",
"p_code":"1",
@ -72,7 +72,7 @@
"code":"8"
},
{
"desc":"Log in to the OCR management console.Select a region based on your business needs. For details about the regions where services are deployed, see Regions and Endpoints.Se",
"desc":"Log in to the OCR management console. Select a region based on service requirements. For details about the regions where services are deployed, see Regions and Endpoints. S",
"product_code":"ocr",
"title":"Subscribing to an OCR Service",
"uri":"ocr_03_0043.html",
@ -134,6 +134,24 @@
"p_code":"13",
"code":"15"
},
{
"desc":"This API recognizes text, analyzes layout, extracts key-value pairs, identifies tables in various formatted documents such as certificates, receipts, and forms, and conve",
"product_code":"ocr",
"title":"Smart Document Recognizer",
"uri":"ocr_03_0161.html",
"doc_type":"api",
"p_code":"13",
"code":"16"
},
{
"desc":"This section describes how you can use Identity and Access Management (IAM) for fine-grained permissions management of your OCR resources. If your account does not need i",
"product_code":"ocr",
"title":"Permissions Policies and Supported Actions",
"uri":"ocr_03_0162.html",
"doc_type":"api",
"p_code":"",
"code":"17"
},
{
"desc":"HUAWEI CLOUD Help Center presents technical documents to help you quickly get started with HUAWEI CLOUD services. The technical documents include Service Overview, Price Details, Purchase Guide, User Guide, API Reference, Best Practices, FAQs, and Videos.",
"product_code":"ocr",
@ -141,7 +159,7 @@
"uri":"ocr_03_0048.html",
"doc_type":"api",
"p_code":"",
"code":"16"
"code":"18"
},
{
"desc":"An HTTP status code consists of three digits and falls into one of five categories: 1xx: related information; 2xx: operation successful; 3xx: redirection; 4xx: client",
@ -149,8 +167,8 @@
"title":"Status Codes",
"uri":"ocr_03_0090.html",
"doc_type":"api",
"p_code":"16",
"code":"17"
"p_code":"18",
"code":"19"
},
{
"desc":"No data will be returned if an API fails to be called. You can locate the error cause based on the error code of each API. When an API call fails, HTTPS status code 4xx o",
@ -158,8 +176,8 @@
"title":"Error Codes",
"uri":"ocr_03_0028.html",
"doc_type":"api",
"p_code":"16",
"code":"18"
"p_code":"18",
"code":"20"
},
{
"desc":"A project ID or project name is required in some API requests. You need to obtain the project ID and name before calling an API. Log in to the management console. In the up",
@ -167,8 +185,8 @@
"title":"Obtaining the Project ID",
"uri":"ocr_03_0130.html",
"doc_type":"api",
"p_code":"16",
"code":"19"
"p_code":"18",
"code":"21"
},
{
"desc":"HUAWEI CLOUD Help Center presents technical documents to help you quickly get started with HUAWEI CLOUD services. The technical documents include Service Overview, Price Details, Purchase Guide, User Guide, API Reference, Best Practices, FAQs, and Videos.",
@ -177,6 +195,6 @@
"uri":"ocr_03_0029.html",
"doc_type":"api",
"p_code":"",
"code":"20"
"code":"22"
}
]

View File

@ -8,6 +8,8 @@
</li>
<li class="ulchildlink"><strong><a href="ocr_03_0031.html">General Table</a></strong><br>
</li>
<li class="ulchildlink"><strong><a href="ocr_03_0161.html">Smart Document Recognizer</a></strong><br>
</li>
</ul>
</div>

View File

@ -13,6 +13,16 @@
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.1.2.3.1.2 "><p id="ocr_03_0029__p1385552911554">This issue is the first official release.</p>
</td>
</tr>
<tr id="ocr_03_0029__row8566121641318"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.1.2.3.1.1 "><p id="ocr_03_0029__p55664167137">2024-11-15</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.1.2.3.1.2 "><p id="ocr_03_0029__p1356613164135">Changed the default value of the <strong id="ocr_03_0029__b20947143314417">language</strong> parameter in the General Text OCR API. If this parameter is not specified, German and English are recognized by default.</p>
</td>
</tr>
<tr id="ocr_03_0029__row557115251065"><td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.1.2.3.1.1 "><p id="ocr_03_0029__p2057116251169">2025-03-04</p>
</td>
<td class="cellrowborder" valign="top" width="50%" headers="mcps1.3.1.2.3.1.2 "><ul id="ocr_03_0029__ul329505772016"><li id="ocr_03_0029__li12295205718204">Enhanced General Text OCR to recognize images in PNG, JPG, JPEG, BMP, GIF, TIFF, WebP, PCX, ICO, PSD, or PDF format. Added recognition of Khmer and Hebrew.</li><li id="ocr_03_0029__li848219582205">Added the Smart Document Recognizer API.</li></ul>
</td>
</tr>
</tbody>
</table>
</div>

View File

@ -96,7 +96,7 @@
</td>
<td class="cellrowborder" valign="top" width="13.268673132686729%" headers="mcps1.3.4.3.2.5.1.3 "><p id="ocr_03_0031__p113251161618">String</p>
</td>
<td class="cellrowborder" valign="top" width="55.0944905509449%" headers="mcps1.3.4.3.2.5.1.4 "><p id="ocr_03_0031__p1932111161619">Set either this parameter or <strong id="ocr_03_0031__b4303087575854">image</strong>. Image URL. Currently, the following URLs are supported:</p>
<td class="cellrowborder" valign="top" width="55.0944905509449%" headers="mcps1.3.4.3.2.5.1.4 "><p id="ocr_03_0031__p1932111161619">Set either this parameter or <strong id="ocr_03_0031__b4303087575854">image</strong>. The image file has a size limit of 10 MB. The following image URLs are currently supported:</p>
<ul id="ocr_03_0031__ul1832111114162"><li id="ocr_03_0031__li125979581460">Public HTTP/HTTPS URL</li><li id="ocr_03_0031__li193210115162">URL provided by OBS.</li></ul>
<div class="note" id="ocr_03_0031__note356121013169"><span class="notetitle"> NOTE: </span><div class="notebody"><ul id="ocr_03_0031__ul933141171616"><li id="ocr_03_0031__li83381110163">The API response time depends on the image download time. If the image download takes a long time, the API call will fail.</li><li id="ocr_03_0031__li1733211131613">Ensure that the storage service where the images to be detected reside is stable and reliable. OBS is recommended for storing image data.</li><li id="ocr_03_0031__li1331411101616">The URL cannot contain Chinese characters. If Chinese characters exist, they must be encoded using UTF-8.</li></ul>
</div></div>

View File

@ -3,7 +3,7 @@
<h1 class="topictitle1">General Text</h1>
<div id="body0000001696801984"><div class="section" id="ocr_03_0042__section19654449133413"><h4 class="sectiontitle">Function</h4><p id="ocr_03_0042__p1085875063413">This API detects and extracts text from images and converts the text and coordinates into JSON format. It can be used in various scenarios, such as scanned documents, electronic documents, books, receipts, and forms.</p>
</div>
<div class="section" id="ocr_03_0042__section19659149173410"><h4 class="sectiontitle">Constraints and Limitations</h4><ul id="ocr_03_0042__ul785914506344"><li id="ocr_03_0042__li16859150173412">Only images in PNG, JPG, JPEG, BMP, GIF, or TIFF format can be recognized.</li><li id="ocr_03_0042__li19859165011342">No side of the image can be smaller than 15 or larger than 8,192 pixels.</li><li id="ocr_03_0042__li2085919502343">The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that all text and its surrounding area are included in the image.</li><li id="ocr_03_0042__li1185925023417">An image can be rotated to any angle.</li><li id="ocr_03_0042__li138590503341">Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted text cannot be recognized.</li><li id="ocr_03_0042__li4859175093414">Supported languages: Chinese, English, some traditional Chinese, Malay, Ukrainian, Hindi, Russian, Vietnamese, Indonesian, Thai, Arabic, German, Latin, French, Italian, Spanish, Portuguese, Romanian, Polish Amharic, Japanese, Korean, Turkish, Norwegian, Danish, and Swedish.</li></ul>
<div class="section" id="ocr_03_0042__section19659149173410"><h4 class="sectiontitle">Constraints and Limitations</h4><ul id="ocr_03_0042__ul785914506344"><li id="ocr_03_0042__li16859150173412">Only images in PNG, JPG, JPEG, BMP, GIF, TIFF, WebP, PCX, ICO, PSD, or PDF format can be recognized.</li><li id="ocr_03_0042__li19859165011342">No side of the image can be smaller than 15 or larger than 8,192 pixels.</li><li id="ocr_03_0042__li2085919502343">The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that all text and its surrounding area are included in the image.</li><li id="ocr_03_0042__li1185925023417">An image can be rotated to any angle.</li><li id="ocr_03_0042__li138590503341">Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted text cannot be recognized.</li><li id="ocr_03_0042__li4859175093414">Supported languages: Chinese, English, some traditional Chinese, Malay, Ukrainian, Hindi, Russian, Vietnamese, Indonesian, Thai, Arabic, German, Latin, French, Italian, Spanish, Portuguese, Romanian, Polish, Amharic, Japanese, Korean, Turkish, Norwegian, Danish, Swedish, Khmer, and Hebrew.</li></ul>
</div>
<div class="section" id="ocr_03_0042__section1370104913413"><h4 class="sectiontitle">URI</h4><p id="ocr_03_0042__p14859150183415">POST /v2/{project_id}/ocr/general-text</p>
@ -96,7 +96,7 @@
</td>
<td class="cellrowborder" valign="top" width="13.268673132686729%" headers="mcps1.3.4.3.2.5.1.3 "><p id="ocr_03_0042__p10862350123414">String</p>
</td>
<td class="cellrowborder" valign="top" width="55.0944905509449%" headers="mcps1.3.4.3.2.5.1.4 "><p id="ocr_03_0042__p4862650193416">Set either this parameter or <strong id="ocr_03_0042__b16840541667581">image</strong>. Image URL. Currently, the following URLs are supported:</p>
<td class="cellrowborder" valign="top" width="55.0944905509449%" headers="mcps1.3.4.3.2.5.1.4 "><p id="ocr_03_0042__p4862650193416">Set either this parameter or <strong id="ocr_03_0042__b16840541667581">image</strong>. The image file has a size limit of 10 MB. The following image URLs are currently supported:</p>
<ul id="ocr_03_0042__ul386211505341"><li id="ocr_03_0042__li28621250153414">Public HTTP/HTTPS URL</li><li id="ocr_03_0042__li1486214507345">URL provided by OBS.</li></ul>
<div class="note" id="ocr_03_0042__note1594105124215"><span class="notetitle"> NOTE: </span><div class="notebody"><ul id="ocr_03_0042__ul9597105418426"><li id="ocr_03_0042__li12597115414213">The API response time depends on the image download time. If the image download takes a long time, the API call will fail.</li><li id="ocr_03_0042__li14597105412421">Ensure that the storage service where the images to be detected reside is stable and reliable. OBS is recommended for storing image data.</li><li id="ocr_03_0042__li659735474218">The URL cannot contain Chinese characters. If Chinese characters exist, they must be encoded using UTF-8.</li></ul>
</div></div>
@ -142,8 +142,8 @@
</td>
<td class="cellrowborder" valign="top" width="13.268673132686729%" headers="mcps1.3.4.3.2.5.1.3 "><p id="ocr_03_0042__p1786419506342">String</p>
</td>
<td class="cellrowborder" valign="top" width="55.0944905509449%" headers="mcps1.3.4.3.2.5.1.4 "><p id="ocr_03_0042__p108641650153414">Language. If this parameter is not specified, Chinese and English will be used by default. The options are as follows:</p>
<ul id="ocr_03_0042__ul1986495016344"><li id="ocr_03_0042__li1486410509349"><strong id="ocr_03_0042__b10258144657581">auto</strong>: automatic language classification</li><li id="ocr_03_0042__li10864195023415"><strong id="ocr_03_0042__b7357133317581">ms</strong>: Malay</li><li id="ocr_03_0042__li1286445083417"><strong id="ocr_03_0042__b19988043547581">uk</strong>: Ukrainian</li><li id="ocr_03_0042__li2864205020346"><strong id="ocr_03_0042__b4104808187581">hi</strong>: Hindi</li><li id="ocr_03_0042__li148641350203419"><strong id="ocr_03_0042__b18818987717581">ru</strong>: Russian</li><li id="ocr_03_0042__li7864125013420"><strong id="ocr_03_0042__b16337539167581">vi</strong>: Vietnamese</li><li id="ocr_03_0042__li108641250133419"><strong id="ocr_03_0042__b7680799647581">id</strong>: Indonesian</li><li id="ocr_03_0042__li18641450113420"><strong id="ocr_03_0042__b18527074197581">th</strong>: Thai</li><li id="ocr_03_0042__li2864145014344"><strong id="ocr_03_0042__b18829267547581">zh</strong>: Chinese and English</li><li id="ocr_03_0042__li11864195018344"><strong id="ocr_03_0042__b18922174677581">ar</strong>: Arabic</li><li id="ocr_03_0042__li586415017344"><strong id="ocr_03_0042__b12941179727581">de</strong>: German</li><li id="ocr_03_0042__li886414507341"><strong id="ocr_03_0042__b1937897827581">la</strong>: Latin</li><li id="ocr_03_0042__li58641350143418"><strong id="ocr_03_0042__b18697063487581">fr</strong>: French</li><li id="ocr_03_0042__li16864145010344"><strong id="ocr_03_0042__b21456063727581">it</strong>: Italian</li><li id="ocr_03_0042__li1986425016349"><strong id="ocr_03_0042__b3339480527581">es</strong>: Spanish</li><li id="ocr_03_0042__li2864205018344"><strong id="ocr_03_0042__b548223067581">pt</strong>: Portuguese</li><li id="ocr_03_0042__li88641250123410"><strong id="ocr_03_0042__b6136934517581">ro</strong>: Romanian</li><li id="ocr_03_0042__li78645501346"><strong id="ocr_03_0042__b14503051207581">pl</strong>: Polish</li><li 
id="ocr_03_0042__li11865750143414"><strong id="ocr_03_0042__b14046501537581">am</strong>: Amharic</li><li id="ocr_03_0042__li286565033419"><strong id="ocr_03_0042__b13251689587581">ja</strong>: Japanese</li><li id="ocr_03_0042__li18865750153412"><strong id="ocr_03_0042__b6962543117581">ko</strong>: Korean</li><li id="ocr_03_0042__li123001028311"><strong id="ocr_03_0042__b10682510267581">tr</strong>: Turkish</li><li id="ocr_03_0042__li5916176524"><strong id="ocr_03_0042__b2840835287581">no</strong>: Norwegian</li><li id="ocr_03_0042__li1651313311411"><strong id="ocr_03_0042__b9167636007581">da</strong>: Danish</li><li id="ocr_03_0042__li51351415312"><strong id="ocr_03_0042__b19162741097581">sv</strong>: Swedish</li></ul>
<td class="cellrowborder" valign="top" width="55.0944905509449%" headers="mcps1.3.4.3.2.5.1.4 "><p id="ocr_03_0042__p108641650153414">Language. If this parameter is not specified, German and English will be used by default. The options are:</p>
<ul id="ocr_03_0042__ul1986495016344"><li id="ocr_03_0042__li1486410509349"><strong id="ocr_03_0042__b10258144657581">auto</strong>: automatic language classification</li><li id="ocr_03_0042__li10864195023415"><strong id="ocr_03_0042__b7357133317581">ms</strong>: Malay</li><li id="ocr_03_0042__li1286445083417"><strong id="ocr_03_0042__b19988043547581">uk</strong>: Ukrainian</li><li id="ocr_03_0042__li2864205020346"><strong id="ocr_03_0042__b4104808187581">hi</strong>: Hindi</li><li id="ocr_03_0042__li148641350203419"><strong id="ocr_03_0042__b18818987717581">ru</strong>: Russian</li><li id="ocr_03_0042__li7864125013420"><strong id="ocr_03_0042__b16337539167581">vi</strong>: Vietnamese</li><li id="ocr_03_0042__li108641250133419"><strong id="ocr_03_0042__b7680799647581">id</strong>: Indonesian</li><li id="ocr_03_0042__li18641450113420"><strong id="ocr_03_0042__b18527074197581">th</strong>: Thai</li><li id="ocr_03_0042__li2864145014344"><strong id="ocr_03_0042__b18829267547581">zh</strong>: Chinese and English</li><li id="ocr_03_0042__li11864195018344"><strong id="ocr_03_0042__b18922174677581">ar</strong>: Arabic</li><li id="ocr_03_0042__li586415017344"><strong id="ocr_03_0042__b12941179727581">de</strong>: German</li><li id="ocr_03_0042__li886414507341"><strong id="ocr_03_0042__b1937897827581">la</strong>: Latin</li><li id="ocr_03_0042__li58641350143418"><strong id="ocr_03_0042__b18697063487581">fr</strong>: French</li><li id="ocr_03_0042__li16864145010344"><strong id="ocr_03_0042__b21456063727581">it</strong>: Italian</li><li id="ocr_03_0042__li1986425016349"><strong id="ocr_03_0042__b3339480527581">es</strong>: Spanish</li><li id="ocr_03_0042__li2864205018344"><strong id="ocr_03_0042__b548223067581">pt</strong>: Portuguese</li><li id="ocr_03_0042__li88641250123410"><strong id="ocr_03_0042__b6136934517581">ro</strong>: Romanian</li><li id="ocr_03_0042__li78645501346"><strong id="ocr_03_0042__b14503051207581">pl</strong>: Polish</li><li 
id="ocr_03_0042__li11865750143414"><strong id="ocr_03_0042__b14046501537581">am</strong>: Amharic</li><li id="ocr_03_0042__li286565033419"><strong id="ocr_03_0042__b13251689587581">ja</strong>: Japanese</li><li id="ocr_03_0042__li18865750153412"><strong id="ocr_03_0042__b6962543117581">ko</strong>: Korean</li><li id="ocr_03_0042__li123001028311"><strong id="ocr_03_0042__b10682510267581">tr</strong>: Turkish</li><li id="ocr_03_0042__li5916176524"><strong id="ocr_03_0042__b2840835287581">no</strong>: Norwegian</li><li id="ocr_03_0042__li1651313311411"><strong id="ocr_03_0042__b9167636007581">da</strong>: Danish</li><li id="ocr_03_0042__li51351415312"><strong id="ocr_03_0042__b19162741097581">sv</strong>: Swedish</li><li id="ocr_03_0042__li942724173113"><strong id="ocr_03_0042__b2061653113227">km</strong>: Khmer</li><li id="ocr_03_0042__li8403161716236"><strong id="ocr_03_0042__b12997133482217">he</strong>: Hebrew</li></ul>
</td>
</tr>
<tr id="ocr_03_0042__row1786515016347"><td class="cellrowborder" valign="top" width="15.308469153084689%" headers="mcps1.3.4.3.2.5.1.1 "><p id="ocr_03_0042__p186565063411">single_orientation_mode</p>
@ -157,6 +157,15 @@
<p id="ocr_03_0042__p48651750153410">If this parameter is not specified, <strong id="ocr_03_0042__b17713359797581">false</strong> is used by default. In this case, text fields in the image are treated as potentially having multiple orientations.</p>
</td>
</tr>
<tr id="ocr_03_0042__row10618155717818"><td class="cellrowborder" valign="top" width="15.308469153084689%" headers="mcps1.3.4.3.2.5.1.1 "><p id="ocr_03_0042__p78966318201">pdf_page_number</p>
</td>
<td class="cellrowborder" valign="top" width="16.32836716328367%" headers="mcps1.3.4.3.2.5.1.2 "><p id="ocr_03_0042__p18896163102016">No</p>
</td>
<td class="cellrowborder" valign="top" width="13.268673132686729%" headers="mcps1.3.4.3.2.5.1.3 "><p id="ocr_03_0042__p489603192014">Integer</p>
</td>
<td class="cellrowborder" valign="top" width="55.0944905509449%" headers="mcps1.3.4.3.2.5.1.4 "><p id="ocr_03_0042__p08969322011">Page of the PDF file to recognize. If this parameter is specified, only the content on that page is recognized. If it is not specified, the first page is recognized by default.</p>
</td>
</tr>
</tbody>
</table>
</div>
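The request parameters in the table above can be assembled as in the following minimal sketch. It is not the service SDK: the function name `build_general_text_request`, the `endpoint` argument, and the example host are hypothetical. Only the URI path and the parameter names (`image`, `url`, `language`, `single_orientation_mode`, `pdf_page_number`) come from this document. The sketch reads "Set either this parameter or image" as "exactly one of the two", which is an assumption, and it omits authentication headers.

```python
import json
from typing import Optional, Tuple

# Language codes listed for the `language` parameter in this section.
LANGUAGE_CODES = {
    "auto", "ms", "uk", "hi", "ru", "vi", "id", "th", "zh", "ar", "de", "la",
    "fr", "it", "es", "pt", "ro", "pl", "am", "ja", "ko", "tr", "no", "da",
    "sv", "km", "he",
}


def build_general_text_request(
    endpoint: str,                      # hypothetical: base address of the OCR service
    project_id: str,
    image: Optional[str] = None,
    url: Optional[str] = None,
    language: Optional[str] = None,
    single_orientation_mode: Optional[bool] = None,
    pdf_page_number: Optional[int] = None,
) -> Tuple[str, str]:
    """Return (request URL, JSON body) for POST /v2/{project_id}/ocr/general-text."""
    # Assumption: `image` and `url` are mutually exclusive and one is required.
    if (image is None) == (url is None):
        raise ValueError("set exactly one of 'image' or 'url'")
    if language is not None and language not in LANGUAGE_CODES:
        raise ValueError(f"unsupported language code: {language!r}")

    body = {"image": image} if image is not None else {"url": url}
    optional = {
        "language": language,
        "single_orientation_mode": single_orientation_mode,
        "pdf_page_number": pdf_page_number,
    }
    # Leave unset parameters out so the service applies its own defaults.
    body.update({key: value for key, value in optional.items() if value is not None})

    request_url = f"{endpoint.rstrip('/')}/v2/{project_id}/ocr/general-text"
    return request_url, json.dumps(body)
```

Omitting `language` and `pdf_page_number` from the body leaves the documented defaults in effect (German and English, first page of a PDF).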
@ -350,11 +359,11 @@
"direction" : 67.6506,
"words_block_count" : 1,
"words_block_list" : [ {
"words": "<em id="ocr_03_0042__i19614103687581">Word</em>",
"words": "<em id="ocr_03_0042__i451313173254">Word</em>",
"confidence" : 0.9999,
"location" : [ [ 517, 447 ], [ 540, 504 ], [ 505, 518 ], [ 482, 461 ] ],
"char_list" : [ {
"char": "<em id="ocr_03_0042__i9560226097581">Character</em>",
"char": "<em id="ocr_03_0042__i1968525152510">Character</em>",
"char_location" : [ [ 517, 447 ], [ 530, 479 ], [ 495, 493 ], [ 482, 461 ] ],
"char_confidence" : 0.9999
}, {

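The response fragment above can be read as in the following minimal sketch. It assumes `result` is the already-parsed JSON object that directly holds `words_block_list`; where that object sits in the full response is not shown in this excerpt. The function name `extract_words` and the `min_confidence` threshold are hypothetical.

```python
from typing import Any, Dict, List, Tuple


def extract_words(result: Dict[str, Any], min_confidence: float = 0.0) -> List[Tuple[str, float]]:
    """Collect (words, confidence) pairs from `words_block_list`.

    Blocks whose confidence is below `min_confidence` are skipped.
    """
    words = []
    for block in result.get("words_block_list", []):
        confidence = block.get("confidence", 0.0)
        if confidence >= min_confidence:
            words.append((block["words"], confidence))
    return words
```

Filtering on `confidence` is a design choice of this sketch, not something the API requires; callers that need per-character detail can read `char_list` from each block in the same way.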
View File

@ -1,7 +1,7 @@
<a name="ocr_03_0043"></a><a name="ocr_03_0043"></a>
<h1 class="topictitle1">Subscribing to an OCR Service</h1>
<div id="body0000001708231908"><ol id="ocr_03_0043__ol168583537176"><li id="ocr_03_0043__li1185819538178">Log in to the OCR management console.<p id="ocr_03_0043__p1880721511339"><a name="ocr_03_0043__li1185819538178"></a><a name="li1185819538178"></a>Select a region based on your business needs. For details about the regions where services are deployed, see <a href="https://docs.otc.t-systems.com/additional/endpoints.html" target="_blank" rel="noopener noreferrer">Regions and Endpoints</a>.</p>
<div id="body0000001708231908"><ol id="ocr_03_0043__ol168583537176"><li id="ocr_03_0043__li1185819538178">Log in to the OCR management console.<p id="ocr_03_0043__p1880721511339"><a name="ocr_03_0043__li1185819538178"></a><a name="li1185819538178"></a>Select a region based on service requirements. For details about the regions where services are deployed, see <a href="https://docs.otc.t-systems.com/regions-and-endpoints/index.html" target="_blank" rel="noopener noreferrer">Regions and Endpoints</a>.</p>
</li><li id="ocr_03_0043__li107813841214">On the page displayed, select and subscribe to your desired APIs.</li></ol>
</div>
<div>

View File

@ -19,6 +19,11 @@
<td class="cellrowborder" valign="top" width="79.07%" headers="mcps1.3.2.2.3.1.2 "><p id="ocr_03_0047__p1833746705">This API detects and extracts text from images of general tables and converts the text into a structured format.</p>
</td>
</tr>
<tr id="ocr_03_0047__row9471124212118"><td class="cellrowborder" valign="top" width="20.93%" headers="mcps1.3.2.2.3.1.1 "><p id="ocr_03_0047__p64711942192116"><a href="ocr_03_0161.html">Smart Document Recognizer</a></p>
</td>
<td class="cellrowborder" valign="top" width="79.07%" headers="mcps1.3.2.2.3.1.2 "><p id="ocr_03_0047__p947144210212">This API recognizes text, analyzes layout, extracts key-value pairs, and identifies tables in documents of various formats, such as certificates, receipts, and forms, and then converts the results into a structured JSON format.</p>
</td>
</tr>
</tbody>
</table>
</div>

View File

@ -10,7 +10,7 @@
</li>
<li class="ulchildlink"><strong><a href="ocr_03_0062.html">Endpoint</a></strong><br>
</li>
<li class="ulchildlink"><strong><a href="ocr_03_0063.html">Constraints and Limitations</a></strong><br>
<li class="ulchildlink"><strong><a href="ocr_03_0063.html">Notes and Constraints</a></strong><br>
</li>
<li class="ulchildlink"><strong><a href="ocr_03_0064.html">Basic Concepts</a></strong><br>
</li>

View File

@ -1,7 +1,7 @@
<a name="ocr_03_0062"></a><a name="ocr_03_0062"></a>
<h1 class="topictitle1">Endpoint</h1>
<div id="body0000001696542620"><p id="ocr_03_0062__p10378132342811">An endpoint is the <strong id="ocr_03_0062__b781011643016">request address</strong> used to call an API. Different services have different endpoints for different regions. You can query all service endpoints at <u id="ocr_03_0062__u1331154519423"><a href="https://docs.otc.t-systems.com/additional/endpoints.html" target="_blank" rel="noopener noreferrer">Regions and Endpoints</a></u>.</p>
<div id="body0000001696542620"><p id="ocr_03_0062__p10378132342811">An endpoint is the <strong id="ocr_03_0062__b11117013148">request address</strong> for calling an API. Endpoints vary depending on services and regions. For more information, see <a href="https://docs.otc.t-systems.com/regions-and-endpoints/index.html" target="_blank" rel="noopener noreferrer">Regions and Endpoints</a>.</p>
</div>
<div>
<div class="familylinks">

View File

@ -1,9 +1,11 @@
<a name="ocr_03_0063"></a><a name="ocr_03_0063"></a>
<h1 class="topictitle1">Constraints and Limitations</h1>
<h1 class="topictitle1">Notes and Constraints</h1>
<div id="body0000001744422777"><div class="section" id="ocr_03_0063__section22225132010"><h4 class="sectiontitle">General Table OCR</h4><ul id="ocr_03_0063__ul82234142011"><li id="ocr_03_0063__li192233119200">Only images in PNG, JPG, JPEG, BMP, or TIFF format can be recognized.</li><li id="ocr_03_0063__li1022310119207">No side of the image can be smaller than 15 or larger than 8,192 pixels.</li><li id="ocr_03_0063__li722317112201">The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that the entire table and its surrounding area are included in the image.</li><li id="ocr_03_0063__li12234182017">An image can be rotated to any angle.</li><li id="ocr_03_0063__li22236162017">Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted table lines cannot be recognized.</li><li id="ocr_03_0063__li14223513206">English and Chinese are supported but support for traditional Chinese characters is limited.</li></ul>
</div>
<div class="section" id="ocr_03_0063__section64471203375"><h4 class="sectiontitle">General Text OCR</h4><ul id="ocr_03_0063__ul19448820203713"><li id="ocr_03_0063__li16450202063714">Only images in PNG, JPG, JPEG, BMP, GIF, or TIFF format can be recognized.</li><li id="ocr_03_0063__li14521820103711">No side of the image can be smaller than 15 or larger than 8,192 pixels.</li><li id="ocr_03_0063__li11455182013379">The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that all text and its surrounding area are included in the image.</li><li id="ocr_03_0063__li1745662013714">An image can be rotated to any angle.</li><li id="ocr_03_0063__li54571820143719">Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted text cannot be recognized.</li><li id="ocr_03_0063__li1554071215919">Supported languages: Chinese, English, some traditional Chinese, Malay, Ukrainian, Hindi, Russian, Vietnamese, Indonesian, Thai, Arabic, German, Latin, French, Italian, Spanish, Portuguese, Romanian, Polish Amharic, Japanese, Korean, Turkish, Norwegian, Danish, and Swedish.</li></ul>
<div class="section" id="ocr_03_0063__section64471203375"><h4 class="sectiontitle">General Text OCR</h4><ul id="ocr_03_0063__ul19448820203713"><li id="ocr_03_0063__li16450202063714">Only images in PNG, JPG, JPEG, BMP, GIF, TIFF, WebP, PCX, ICO, PSD, or PDF format can be recognized.</li><li id="ocr_03_0063__li14521820103711">No side of the image can be smaller than 15 or larger than 8,192 pixels.</li><li id="ocr_03_0063__li11455182013379">The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that all text and its surrounding area are included in the image.</li><li id="ocr_03_0063__li1745662013714">An image can be rotated to any angle.</li><li id="ocr_03_0063__li54571820143719">Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted text cannot be recognized.</li><li id="ocr_03_0063__li1554071215919">Supported languages: Chinese, English, some traditional Chinese, Malay, Ukrainian, Hindi, Russian, Vietnamese, Indonesian, Thai, Arabic, German, Latin, French, Italian, Spanish, Portuguese, Romanian, Polish, Amharic, Japanese, Korean, Turkish, Norwegian, Danish, Swedish, Khmer, and Hebrew.</li></ul>
</div>
<div class="section" id="ocr_03_0063__section12870144482912"><h4 class="sectiontitle">Smart Document Recognizer</h4><ul id="ocr_03_0063__ul7622153124420"><li id="ocr_03_0063__li86221318446">English and Chinese are supported, but support for traditional Chinese characters is limited.</li><li id="ocr_03_0063__li136239319446">Only images in PNG, JPG, JPEG, BMP, GIF, TIFF, WebP, PCX, ICO, or PSD format and PDF files can be recognized. PDF files are recognized one page at a time; use the <strong id="ocr_03_0063__b98210560140">pdf_page_number</strong> parameter to specify the page to recognize.</li><li id="ocr_03_0063__li11623193134410">No side of the image can be smaller than 15 or larger than 8,192 pixels.</li><li id="ocr_03_0063__li1362313104411">The area to be recognized must occupy more than 80% of the image. When scanning a table, ensure that all text and its surrounding area are included in the image.</li><li id="ocr_03_0063__li15623133117442">An image can be rotated to any angle.</li><li id="ocr_03_0063__li1862313316443">For more accurate recognition results, keep the number of characters on a single page to 1,800 or fewer.</li><li id="ocr_03_0063__li46238319446">Text in images with complex backgrounds (such as outdoor scenery or anti-counterfeit watermarks) or distorted text cannot be analyzed.</li></ul>
</div>
</div>
<div>

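The format and size constraints listed above for General Text OCR can be checked on the client before a request is sent. The following is a minimal sketch under stated assumptions: the function name `check_general_text_image` is hypothetical, the caller is assumed to already know the file format and pixel dimensions, and how the pixel limits apply to PDF pages is not covered because this document does not say.

```python
from typing import List

# Formats listed for General Text OCR in this section (lowercase, no dot).
GENERAL_TEXT_FORMATS = {
    "png", "jpg", "jpeg", "bmp", "gif", "tiff", "webp", "pcx", "ico", "psd", "pdf",
}
MIN_SIDE_PX = 15      # no side may be smaller than this
MAX_SIDE_PX = 8192    # no side may be larger than this


def check_general_text_image(file_format: str, width: int, height: int) -> List[str]:
    """Return a list of constraint violations; an empty list means none were found."""
    problems = []
    if file_format.lower().lstrip(".") not in GENERAL_TEXT_FORMATS:
        problems.append(f"unsupported format: {file_format}")
    for name, side in (("width", width), ("height", height)):
        if not MIN_SIDE_PX <= side <= MAX_SIDE_PX:
            problems.append(
                f"{name} {side} px is outside the {MIN_SIDE_PX}-{MAX_SIDE_PX} px range"
            )
    return problems
```

Returning a list of problems rather than raising on the first one lets a caller report every violation at once. The sketch does not check the requirement that the recognized area occupy more than 80% of the image, which cannot be determined from dimensions alone.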