doc-exports/docs/dli/sqlreference/dli_08_0204.html
Su, Xiaomeng 76a5b1ee83 dli_sqlreference_20240227
Reviewed-by: Pruthi, Vineet <vineet.pruthi@t-systems.com>
Co-authored-by: Su, Xiaomeng <suxiaomeng1@huawei.com>
Co-committed-by: Su, Xiaomeng <suxiaomeng1@huawei.com>
2024-03-27 22:02:33 +00:00

307 lines
44 KiB
HTML

<a name="dli_08_0204"></a><a name="dli_08_0204"></a>
<h1 class="topictitle1">Creating a DLI Table Using the Hive Syntax</h1>
<div id="body8662426"><div class="section" id="dli_08_0204__en-us_topic_0156816283_en-us_topic_0093946816_s03a9a8df01184a68831600f336283a25"><h4 class="sectiontitle">Function</h4><p id="dli_08_0204__en-us_topic_0156816283_en-us_topic_0093946816_ae11f3982a5344cc59623245f67a56358">This Hive syntax is used to create a DLI table. The main differences between the DataSource and the Hive syntax lie in the supported data formats and the number of supported partitions. For details, see syntax and precautions.</p>
</div>
<div class="section" id="dli_08_0204__en-us_topic_0241764532_dli_08_0077_en-us_topic_0114776171_en-us_topic_0093946792_se27973c28c9447c7adf942223c2e7e07"><h4 class="sectiontitle">Precautions</h4><ul id="dli_08_0204__en-us_topic_0241764532_dli_08_0077_ul9531201710213"><li id="dli_08_0204__en-us_topic_0241764532_dli_08_0077_li25313179210">Table properties cannot be specified using CTAS table creation statements.</li><li id="dli_08_0204__li173575713551"><strong id="dli_08_0204__b65896540193756">Instructions on using partitioned tables:</strong><ul id="dli_08_0204__en-us_topic_0241764532_dli_08_0077_en-us_topic_0114776171_en-us_topic_0093946792_ua8a2d573dd514bf5be3d34e03ed365c5"><li id="dli_08_0204__en-us_topic_0241764532_dli_08_0077_en-us_topic_0114776171_en-us_topic_0093946792_li47893176101914">When you create a partitioned table, ensure that the specified column in <strong id="dli_08_0204__b102380907311428">PARTITIONED BY</strong> is not a column in the table and the data type is specified. The partition column supports only the open-source Hive table types including <strong id="dli_08_0204__b6678751053">string</strong>, <strong id="dli_08_0204__b1367817513519">boolean</strong>, <strong id="dli_08_0204__b36780511518">tinyint</strong>, <strong id="dli_08_0204__b16781651654">smallint</strong>, <strong id="dli_08_0204__b11679851252">short</strong>, <strong id="dli_08_0204__b567905856">int</strong>, <strong id="dli_08_0204__b46791354518">bigint</strong>, <strong id="dli_08_0204__b6679551251">long</strong>, <strong id="dli_08_0204__b5679951517">decimal</strong>, <strong id="dli_08_0204__b1967916516516">float</strong>, <strong id="dli_08_0204__b176801754519">double</strong>, <strong id="dli_08_0204__b166804519514">date</strong>, and <strong id="dli_08_0204__b20680051253">timestamp</strong>.</li><li id="dli_08_0204__en-us_topic_0241764532_dli_08_0077_en-us_topic_0114776171_li87213195420">Multiple partition fields can be specified. The partition fields need to be specified after the <strong id="dli_08_0204__b169022910511">PARTITIONED BY</strong> keyword, instead of the table name. Otherwise, an error occurs.</li><li id="dli_08_0204__li779941410571">A maximum of 200,000 partitions can be created in a single table.</li><li id="dli_08_0204__li28731941124715">CTAS table creation statements cannot be used to create partitioned tables.</li></ul>
</li></ul>
</div>
<div class="section" id="dli_08_0204__en-us_topic_0156816283_en-us_topic_0093946816_s9c100e961a0c4e9085c14200525b2305"><h4 class="sectiontitle">Syntax</h4><div class="codecoloring" codetype="Sql" id="dli_08_0204__en-us_topic_0156816283_screen33001332152012"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal"> 1</span>
<span class="normal"> 2</span>
<span class="normal"> 3</span>
<span class="normal"> 4</span>
<span class="normal"> 5</span>
<span class="normal"> 6</span>
<span class="normal"> 7</span>
<span class="normal"> 8</span>
<span class="normal"> 9</span>
<span class="normal">10</span>
<span class="normal">11</span>
<span class="normal">12</span>
<span class="normal">13</span>
<span class="normal">14</span>
<span class="normal">15</span>
<span class="normal">16</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">CREATE</span><span class="w"> </span><span class="k">TABLE</span><span class="w"> </span><span class="p">[</span><span class="k">IF</span><span class="w"> </span><span class="k">NOT</span><span class="w"> </span><span class="k">EXISTS</span><span class="p">]</span><span class="w"> </span><span class="p">[</span><span class="n">db_name</span><span class="p">.]</span><span class="k">table_name</span><span class="w"> </span>
<span class="w"> </span><span class="p">[(</span><span class="n">col_name1</span><span class="w"> </span><span class="n">col_type1</span><span class="w"> </span><span class="p">[</span><span class="k">COMMENT</span><span class="w"> </span><span class="n">col_comment1</span><span class="p">],</span><span class="w"> </span><span class="p">...)]</span>
<span class="w"> </span><span class="p">[</span><span class="k">COMMENT</span><span class="w"> </span><span class="n">table_comment</span><span class="p">]</span><span class="w"> </span>
<span class="w"> </span><span class="p">[</span><span class="n">PARTITIONED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="p">(</span><span class="n">col_name2</span><span class="w"> </span><span class="n">col_type2</span><span class="p">,</span><span class="w"> </span><span class="p">[</span><span class="k">COMMENT</span><span class="w"> </span><span class="n">col_comment2</span><span class="p">],</span><span class="w"> </span><span class="p">...)]</span><span class="w"> </span>
<span class="w"> </span><span class="p">[</span><span class="k">ROW</span><span class="w"> </span><span class="n">FORMAT</span><span class="w"> </span><span class="n">row_format</span><span class="p">]</span>
<span class="w"> </span><span class="n">STORED</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="n">file_format</span><span class="w"> </span>
<span class="w"> </span><span class="p">[</span><span class="n">TBLPROPERTIES</span><span class="w"> </span><span class="p">(</span><span class="k">key</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="n">value</span><span class="p">)]</span>
<span class="w"> </span><span class="p">[</span><span class="k">AS</span><span class="w"> </span><span class="n">select_statement</span><span class="p">];</span>
<span class="n">row_format</span><span class="p">:</span>
<span class="w"> </span><span class="p">:</span><span class="w"> </span><span class="n">SERDE</span><span class="w"> </span><span class="n">serde_cls</span><span class="w"> </span><span class="p">[</span><span class="k">WITH</span><span class="w"> </span><span class="n">SERDEPROPERTIES</span><span class="w"> </span><span class="p">(</span><span class="n">key1</span><span class="o">=</span><span class="n">val1</span><span class="p">,</span><span class="w"> </span><span class="n">key2</span><span class="o">=</span><span class="n">val2</span><span class="p">,</span><span class="w"> </span><span class="p">...)]</span>
<span class="w"> </span><span class="o">|</span><span class="w"> </span><span class="n">DELIMITED</span><span class="w"> </span><span class="p">[</span><span class="n">FIELDS</span><span class="w"> </span><span class="n">TERMINATED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="nb">char</span><span class="w"> </span><span class="p">[</span><span class="n">ESCAPED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="nb">char</span><span class="p">]]</span>
<span class="w"> </span><span class="p">[</span><span class="n">COLLECTION</span><span class="w"> </span><span class="n">ITEMS</span><span class="w"> </span><span class="n">TERMINATED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="nb">char</span><span class="p">]</span>
<span class="w"> </span><span class="p">[</span><span class="k">MAP</span><span class="w"> </span><span class="n">KEYS</span><span class="w"> </span><span class="n">TERMINATED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="nb">char</span><span class="p">]</span>
<span class="w"> </span><span class="p">[</span><span class="n">LINES</span><span class="w"> </span><span class="n">TERMINATED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="nb">char</span><span class="p">]</span>
<span class="w"> </span><span class="p">[</span><span class="k">NULL</span><span class="w"> </span><span class="k">DEFINED</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="nb">char</span><span class="p">]</span>
</pre></div></td></tr></table></div>
</div>
</div>
<div class="section" id="dli_08_0204__en-us_topic_0156816283_en-us_topic_0093946816_s9ceb61496680404b879ef5439843c6c7"><h4 class="sectiontitle">Keywords</h4><ul id="dli_08_0204__en-us_topic_0156816283_ul74261946687"><li id="dli_08_0204__en-us_topic_0156816283_li12814539191914">IF NOT EXISTS: Prevents system errors when the created table exists.</li><li id="dli_08_0204__en-us_topic_0156816283_li11143171444414">COMMENT: Field or table description.</li><li id="dli_08_0204__en-us_topic_0156816283_li572161314209">PARTITIONED BY: Partition field.</li><li id="dli_08_0204__li4301114017139">ROW FORMAT: Row data format.</li><li id="dli_08_0204__en-us_topic_0156816283_en-us_topic_0093946792_l2b4d833ae66e470dae553c300f0783b8">STORED AS: Specifies the format of the file to be stored. Currently, only the TEXTFILE, AVRO, ORC, SEQUENCEFILE, RCFILE, and PARQUET format are supported. This keyword is mandatory when you create DLI tables.</li><li id="dli_08_0204__dli_08_0076_li6331130191815">TBLPROPERTIES: This keyword is used to add a <strong id="dli_08_0204__b121051353165016">key/value</strong> property to a table.<ul id="dli_08_0204__dli_08_0076_ul1611116494185"><li id="dli_08_0204__li19918111553615">If the table storage format is Parquet, you can use <strong id="dli_08_0204__b154211665119">TBLPROPERTIES(parquet.compression = 'zstd')</strong> to set the table compression format to <strong id="dli_08_0204__b854210612510">zstd</strong>.</li></ul>
</li><li id="dli_08_0204__en-us_topic_0156816283_li739819389205">AS: Run the CREATE TABLE AS statement to create a table.</li></ul>
</div>
<div class="section" id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_section1254323371313"><h4 class="sectiontitle">Parameters</h4>
<div class="tablenoborder"><table cellpadding="4" cellspacing="0" summary="" id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_table175787333130" frame="border" border="1" rules="all"><caption><b>Table 1 </b>Parameters</caption><thead align="left"><tr id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_row991835319515"><th align="left" class="cellrowborder" valign="top" width="20.09%" id="mcps1.3.5.2.2.4.1.1"><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p5918185319510"><strong id="dli_08_0204__b79870843893930">Parameter</strong></p>
</th>
<th align="left" class="cellrowborder" valign="top" width="9.950000000000001%" id="mcps1.3.5.2.2.4.1.2"><p id="dli_08_0204__p1838381253217"><strong id="dli_08_0204__b11517102951081">Mandatory</strong></p>
</th>
<th align="left" class="cellrowborder" valign="top" width="69.96%" id="mcps1.3.5.2.2.4.1.3"><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p1691816531254"><strong id="dli_08_0204__b149052530793937">Description</strong></p>
</th>
</tr>
</thead>
<tbody><tr id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_row18919165314510"><td class="cellrowborder" valign="top" width="20.09%" headers="mcps1.3.5.2.2.4.1.1 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p1691818532053">db_name</p>
</td>
<td class="cellrowborder" valign="top" width="9.950000000000001%" headers="mcps1.3.5.2.2.4.1.2 "><p id="dli_08_0204__p9383612103219">No</p>
</td>
<td class="cellrowborder" valign="top" width="69.96%" headers="mcps1.3.5.2.2.4.1.3 "><p id="dli_08_0204__p357760171519">Database name</p>
<p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p29187535520">The value can contain letters, numbers, and underscores (_), but it cannot contain only numbers or start with a number or underscore (_).</p>
</td>
</tr>
<tr id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_row792017532518"><td class="cellrowborder" valign="top" width="20.09%" headers="mcps1.3.5.2.2.4.1.1 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p792011536519">table_name</p>
</td>
<td class="cellrowborder" valign="top" width="9.950000000000001%" headers="mcps1.3.5.2.2.4.1.2 "><p id="dli_08_0204__p17355193421410">Yes</p>
</td>
<td class="cellrowborder" valign="top" width="69.96%" headers="mcps1.3.5.2.2.4.1.3 "><p id="dli_08_0204__p1037215991517">Table name in the database</p>
<p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p1292085310518">The value can contain letters, numbers, and underscores (_), but it cannot contain only numbers or start with a number or underscore (_). The matching rule is <strong id="dli_08_0204__b550525134916">^(?!_)(?![0-9]+$)[A-Za-z0-9_$]*$</strong>. If special characters are required, use single quotation marks ('') to enclose them.</p>
</td>
</tr>
<tr id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_row89201537510"><td class="cellrowborder" valign="top" width="20.09%" headers="mcps1.3.5.2.2.4.1.1 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p69201153552">col_name</p>
</td>
<td class="cellrowborder" valign="top" width="9.950000000000001%" headers="mcps1.3.5.2.2.4.1.2 "><p id="dli_08_0204__p8701417135216">Yes</p>
</td>
<td class="cellrowborder" valign="top" width="69.96%" headers="mcps1.3.5.2.2.4.1.3 "><p id="dli_08_0204__en-us_topic_0241764532_dli_08_0077_en-us_topic_0114776171_en-us_topic_0114776170_p27571023024">Column name</p>
<p id="dli_08_0204__dli_08_0076_en-us_topic_0114776170_p27571023024">The column field can contain letters, numbers, and underscores (_), but it cannot contain only numbers and must contain at least one letter.</p>
<p id="dli_08_0204__p1236220301333">The column name is case insensitive.</p>
</td>
</tr>
<tr id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_row8920953453"><td class="cellrowborder" valign="top" width="20.09%" headers="mcps1.3.5.2.2.4.1.1 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p1992019538511">col_type</p>
</td>
<td class="cellrowborder" valign="top" width="9.950000000000001%" headers="mcps1.3.5.2.2.4.1.2 "><p id="dli_08_0204__p147018173522">Yes</p>
</td>
<td class="cellrowborder" valign="top" width="69.96%" headers="mcps1.3.5.2.2.4.1.3 "><p id="dli_08_0204__en-us_topic_0241764532_dli_08_0077_en-us_topic_0114776171_en-us_topic_0114776170_p197578239211">Data type of a column field, which is primitive.</p>
</td>
</tr>
<tr id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_row14921353755"><td class="cellrowborder" valign="top" width="20.09%" headers="mcps1.3.5.2.2.4.1.1 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p892015320519">col_comment</p>
</td>
<td class="cellrowborder" valign="top" width="9.950000000000001%" headers="mcps1.3.5.2.2.4.1.2 "><p id="dli_08_0204__p19701217105220">No</p>
</td>
<td class="cellrowborder" valign="top" width="69.96%" headers="mcps1.3.5.2.2.4.1.3 "><p id="dli_08_0204__dli_08_0076_en-us_topic_0114776170_p675715235211">Column field description, which can only be string constants.</p>
</td>
</tr>
<tr id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_row6193115414135"><td class="cellrowborder" valign="top" width="20.09%" headers="mcps1.3.5.2.2.4.1.1 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_p82281641104318">row_format</p>
</td>
<td class="cellrowborder" valign="top" width="9.950000000000001%" headers="mcps1.3.5.2.2.4.1.2 "><p id="dli_08_0204__p1270117125218">Yes</p>
</td>
<td class="cellrowborder" valign="top" width="69.96%" headers="mcps1.3.5.2.2.4.1.3 "><p id="dli_08_0204__en-us_topic_0241764532_dli_08_0077_p72284418433">Row data format</p>
</td>
</tr>
<tr id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_row19211953658"><td class="cellrowborder" valign="top" width="20.09%" headers="mcps1.3.5.2.2.4.1.1 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p10921125314510">file_format</p>
</td>
<td class="cellrowborder" valign="top" width="9.950000000000001%" headers="mcps1.3.5.2.2.4.1.2 "><p id="dli_08_0204__p4355183417140">Yes</p>
</td>
<td class="cellrowborder" valign="top" width="69.96%" headers="mcps1.3.5.2.2.4.1.3 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p1330615431399">DLI table storage format, which can be <strong id="dli_08_0204__b783102395614">TEXTFILE</strong>, <strong id="dli_08_0204__b4832923155610">AVRO</strong>, <strong id="dli_08_0204__b1283202325611">ORC</strong>, <strong id="dli_08_0204__b208329239565">SEQUENCEFILE</strong>, <strong id="dli_08_0204__b783252325610">RCFILE</strong>, or <strong id="dli_08_0204__b1683216233567">PARQUET</strong>.</p>
</td>
</tr>
<tr id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_row4661422104814"><td class="cellrowborder" valign="top" width="20.09%" headers="mcps1.3.5.2.2.4.1.1 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p441312774716">table_comment</p>
</td>
<td class="cellrowborder" valign="top" width="9.950000000000001%" headers="mcps1.3.5.2.2.4.1.2 "><p id="dli_08_0204__p143551534181414">No</p>
</td>
<td class="cellrowborder" valign="top" width="69.96%" headers="mcps1.3.5.2.2.4.1.3 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p1741310794711">Table description, which can only be string constants.</p>
</td>
</tr>
<tr id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_row1639886153716"><td class="cellrowborder" valign="top" width="20.09%" headers="mcps1.3.5.2.2.4.1.1 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_p148499132374">key = value</p>
</td>
<td class="cellrowborder" valign="top" width="9.950000000000001%" headers="mcps1.3.5.2.2.4.1.2 "><p id="dli_08_0204__p03551234161415">No</p>
</td>
<td class="cellrowborder" valign="top" width="69.96%" headers="mcps1.3.5.2.2.4.1.3 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_p0850713163711">Set table properties and values.</p>
<p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_p163630663811">If the table storage format is Parquet, you can use <strong id="dli_08_0204__b1925754110519">TBLPROPERTIES(parquet.compression = 'zstd')</strong> to set the table compression format to <strong id="dli_08_0204__b02571414516">zstd</strong>.</p>
</td>
</tr>
<tr id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_row16660102274814"><td class="cellrowborder" valign="top" width="20.09%" headers="mcps1.3.5.2.2.4.1.1 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p55824491274">select_statement</p>
</td>
<td class="cellrowborder" valign="top" width="9.950000000000001%" headers="mcps1.3.5.2.2.4.1.2 "><p id="dli_08_0204__p8356113414143">No</p>
</td>
<td class="cellrowborder" valign="top" width="69.96%" headers="mcps1.3.5.2.2.4.1.3 "><p id="dli_08_0204__en-us_topic_0241764535_dli_08_0204_en-us_topic_0156816283_en-us_topic_0114776192_p105821649182719">The <strong id="dli_08_0204__b73392258114755">CREATE TABLE AS</strong> statement is used to insert the <strong id="dli_08_0204__b1248996802114755">SELECT</strong> query result of the source table or a data record to a newly created DLI table.</p>
</td>
</tr>
</tbody>
</table>
</div>
</div>
<div class="section" id="dli_08_0204__section139223276592"><a name="dli_08_0204__section139223276592"></a><a name="section139223276592"></a><h4 class="sectiontitle">Example 1: Creating a DLI Non-Partitioned Table</h4><p id="dli_08_0204__p1021316360015">Example description: Create a DLI non-partitioned table named <strong id="dli_08_0204__b9479135335618">table1</strong> and use the <strong id="dli_08_0204__b194801053115614">STORED AS</strong> keyword to set the storage format of the table to <strong id="dli_08_0204__b9480253125615">orc</strong>.</p>
<p id="dli_08_0204__p62114310163">You can save DLI tables in the <strong id="dli_08_0204__b1863549576">textfile</strong>, <strong id="dli_08_0204__b1064174105712">avro</strong>, <strong id="dli_08_0204__b6647412570">orc</strong>, <strong id="dli_08_0204__b865174145712">sequencefile</strong>, <strong id="dli_08_0204__b1865104165710">rcfile</strong>, or <strong id="dli_08_0204__b15669495717">parquet</strong> format.</p>
<div class="codecoloring" codetype="Sql" id="dli_08_0204__screen185181211212"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal">1</span>
<span class="normal">2</span>
<span class="normal">3</span>
<span class="normal">4</span>
<span class="normal">5</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">CREATE</span><span class="w"> </span><span class="k">TABLE</span><span class="w"> </span><span class="k">IF</span><span class="w"> </span><span class="k">NOT</span><span class="w"> </span><span class="k">EXISTS</span><span class="w"> </span><span class="n">table1</span><span class="w"> </span><span class="p">(</span>
<span class="w"> </span><span class="n">col_1</span><span class="w"> </span><span class="n">STRING</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_2</span><span class="w"> </span><span class="nb">INT</span>
<span class="p">)</span>
<span class="n">STORED</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="n">orc</span><span class="p">;</span>
</pre></div></td></tr></table></div>
</div>
</div>
<div class="section" id="dli_08_0204__section288243544316"><h4 class="sectiontitle">Example 2: Creating a DLI Partitioned Table</h4><p id="dli_08_0204__p287518264445">Example description: Create a partitioned table named <strong id="dli_08_0204__b1531771320112836">student</strong>, which is partitioned using <strong id="dli_08_0204__b417745897112836">facultyNo</strong> and <strong id="dli_08_0204__b1618559438112836">classNo</strong>.</p>
<p id="dli_08_0204__p387420166445">In practice, you can select a proper partitioning field and add it to the end of the <strong id="dli_08_0204__b190278666811290">PARTITIONED BY</strong> keyword.</p>
<div class="codecoloring" codetype="Sql" id="dli_08_0204__screen166364584414"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal">1</span>
<span class="normal">2</span>
<span class="normal">3</span>
<span class="normal">4</span>
<span class="normal">5</span>
<span class="normal">6</span>
<span class="normal">7</span>
<span class="normal">8</span>
<span class="normal">9</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">CREATE</span><span class="w"> </span><span class="k">TABLE</span><span class="w"> </span><span class="k">IF</span><span class="w"> </span><span class="k">NOT</span><span class="w"> </span><span class="k">EXISTS</span><span class="w"> </span><span class="n">student</span><span class="p">(</span>
<span class="w"> </span><span class="n">id</span><span class="w"> </span><span class="nb">int</span><span class="p">,</span>
<span class="w"> </span><span class="n">name</span><span class="w"> </span><span class="n">STRING</span>
<span class="p">)</span>
<span class="n">STORED</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="n">avro</span>
<span class="n">PARTITIONED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="p">(</span>
<span class="w"> </span><span class="n">facultyNo</span><span class="w"> </span><span class="nb">INT</span><span class="p">,</span>
<span class="w"> </span><span class="n">classNo</span><span class="w"> </span><span class="nb">INT</span>
<span class="p">);</span>
</pre></div></td></tr></table></div>
</div>
</div>
<div class="section" id="dli_08_0204__section17654112105220"><h4 class="sectiontitle">Example 3: Using CTAS to Create a DLI Table Using All or Part of the Data in the Source Table</h4><p id="dli_08_0204__p19201175614">Example description: Based on the DLI table <strong id="dli_08_0204__b523323765713">table1</strong> created in <a href="#dli_08_0204__section139223276592">Example 1: Creating a DLI Non-Partitioned Table</a>, use the CTAS syntax to copy data from <strong id="dli_08_0204__b122341237185718">table1</strong> to <strong id="dli_08_0204__b8235037185716">table1_ctas</strong>.</p>
<p id="dli_08_0204__p776612190912">When using CTAS to create a table, you can ignore the syntax used to create the table being copied. This means that regardless of the syntax used to create <strong id="dli_08_0204__b15749860793656">table1</strong>, you can use the DataSource syntax to create <strong id="dli_08_0204__b24423658193656">table1_ctas</strong>.</p>
<p id="dli_08_0204__p1838135118569">In this example, the storage format of <strong id="dli_08_0204__b16308747125714">table1</strong> is <strong id="dli_08_0204__b130994745718">orc</strong>, and the storage format of <strong id="dli_08_0204__b53091047205713">table1_ctas</strong> may be <strong id="dli_08_0204__b23091747145712">parquet</strong>. This means that the storage format of the table created by CTAS may be different from that of the original table.</p>
<p id="dli_08_0204__p1676015436555">Use the <strong id="dli_08_0204__b13763364583">SELECT</strong> statement following the <strong id="dli_08_0204__b1676376175818">AS</strong> keyword to select required data and insert the data to <strong id="dli_08_0204__b87642645813">table1_ctas</strong>.</p>
<p id="dli_08_0204__p184204104544">The <strong id="dli_08_0204__b143277575694832">SELECT</strong> syntax is as follows: <strong id="dli_08_0204__b145261521294832">SELECT &lt;</strong><em id="dli_08_0204__i24491214494832">Column name</em><strong id="dli_08_0204__b48412519294832"> &gt; FROM &lt;</strong><em id="dli_08_0204__i205895317194832">Table name</em><strong id="dli_08_0204__b9036107094832"> &gt; WHERE &lt;</strong><em id="dli_08_0204__i112808902694832">Related filter criteria</em><strong id="dli_08_0204__b10824416794832">&gt;</strong>.</p>
<ul id="dli_08_0204__ul143001810811"><li id="dli_08_0204__li183001109118">In the example, <strong id="dli_08_0204__b19432038183212">select * from table1</strong> indicates that all statements are selected from <strong id="dli_08_0204__b11527184519324">table1</strong> and copied to <strong id="dli_08_0204__b19895353219">table1_ctas</strong>.<div class="codecoloring" codetype="Sql" id="dli_08_0204__screen133742268101"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal">1</span>
<span class="normal">2</span>
<span class="normal">3</span>
<span class="normal">4</span>
<span class="normal">5</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">CREATE</span><span class="w"> </span><span class="k">TABLE</span><span class="w"> </span><span class="k">IF</span><span class="w"> </span><span class="k">NOT</span><span class="w"> </span><span class="k">EXISTS</span><span class="w"> </span><span class="n">table1_ctas</span>
<span class="n">STORED</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="n">sequencefile</span>
<span class="k">AS</span>
<span class="k">SELECT</span><span class="w"> </span><span class="o">*</span>
<span class="k">FROM</span><span class="w"> </span><span class="n">table1</span><span class="p">;</span>
</pre></div></td></tr></table></div>
</div>
</li><li id="dli_08_0204__li88207208112">If you do not need all data in <strong id="dli_08_0204__b787313210810">table1</strong>, change <strong id="dli_08_0204__b1267612410810">AS SELECT * FROM table1</strong> to <strong id="dli_08_0204__b755780918">AS SELECT col_1 FROM table1 WHERE col_1 = Ann</strong>. In this way, you can run the <strong id="dli_08_0204__b1465472116327">SELECT</strong> statement to insert all rows whose <strong id="dli_08_0204__b86326343913">col_1</strong> column is <strong id="dli_08_0204__b9128193918913">Ann</strong> from <strong id="dli_08_0204__b169616421395">table1</strong> to <strong id="dli_08_0204__b8851184410918">table1_ctas</strong>.<div class="codecoloring" codetype="Sql" id="dli_08_0204__screen1825923319119"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal">1</span>
<span class="normal">2</span>
<span class="normal">3</span>
<span class="normal">4</span>
<span class="normal">5</span>
<span class="normal">6</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">CREATE</span><span class="w"> </span><span class="k">TABLE</span><span class="w"> </span><span class="k">IF</span><span class="w"> </span><span class="k">NOT</span><span class="w"> </span><span class="k">EXISTS</span><span class="w"> </span><span class="n">table1_ctas</span>
<span class="k">USING</span><span class="w"> </span><span class="n">parquet</span>
<span class="k">AS</span>
<span class="k">SELECT</span><span class="w"> </span><span class="n">col_1</span>
<span class="k">FROM</span><span class="w"> </span><span class="n">table1</span>
<span class="k">WHERE</span><span class="w"> </span><span class="n">col_1</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="s1">'Ann'</span><span class="p">;</span>
</pre></div></td></tr></table></div>
</div>
</li></ul>
</div>
<div class="section" id="dli_08_0204__section675283719111"><h4 class="sectiontitle">Example 4: Creating a DLI Non-Partitioned Table and Customizing the Data Type of a Column Field</h4><p id="dli_08_0204__p168826548311">Example description: Create a DLI non-partitioned table named <strong id="dli_08_0204__b1204710469115124">table2</strong>. You can customize the native data types of column fields based on service requirements.</p>
<ul id="dli_08_0204__ul10265111414415"><li id="dli_08_0204__li926512141646"><strong id="dli_08_0204__b3686422659576">STRING</strong>, <strong id="dli_08_0204__b4500541389576">CHAR</strong>, or <strong id="dli_08_0204__b2965888129576">VARCHAR</strong> can be used for text characters.</li><li id="dli_08_0204__li1126531412420"><strong id="dli_08_0204__b86972519195855">TIMESTAMP</strong> or <strong id="dli_08_0204__b140775655295855">DATE</strong> can be used for time characters.</li><li id="dli_08_0204__li19265514646"><strong id="dli_08_0204__b139700506295925">INT</strong>, <strong id="dli_08_0204__b80319664195925">SMALLINT/SHORT</strong>, <strong id="dli_08_0204__b42595555595925">BIGINT/LONG</strong>, or <strong id="dli_08_0204__b13239375895925">TINYINT</strong> can be used for integer characters.</li><li id="dli_08_0204__li17265171414419"><strong id="dli_08_0204__b94329012195941">FLOAT</strong>, <strong id="dli_08_0204__b126693959995941">DOUBLE</strong>, or <strong id="dli_08_0204__b35220863095941">DECIMAL</strong> can be used for decimal calculation.</li><li id="dli_08_0204__li126519141646"><strong id="dli_08_0204__b15278217641014">BOOLEAN</strong> can be used if only logical switches are involved.</li></ul>
<p id="dli_08_0204__p117310234312">For details, see "Data Types" &gt; "Primitive Data Types".</p>
<div class="codecoloring" codetype="Sql" id="dli_08_0204__screen177818151356"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal"> 1</span>
<span class="normal"> 2</span>
<span class="normal"> 3</span>
<span class="normal"> 4</span>
<span class="normal"> 5</span>
<span class="normal"> 6</span>
<span class="normal"> 7</span>
<span class="normal"> 8</span>
<span class="normal"> 9</span>
<span class="normal">10</span>
<span class="normal">11</span>
<span class="normal">12</span>
<span class="normal">13</span>
<span class="normal">14</span>
<span class="normal">15</span>
<span class="normal">16</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">CREATE</span><span class="w"> </span><span class="k">TABLE</span><span class="w"> </span><span class="k">IF</span><span class="w"> </span><span class="k">NOT</span><span class="w"> </span><span class="k">EXISTS</span><span class="w"> </span><span class="n">table2</span><span class="w"> </span><span class="p">(</span>
<span class="w"> </span><span class="n">col_01</span><span class="w"> </span><span class="n">STRING</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_02</span><span class="w"> </span><span class="nb">CHAR</span><span class="w"> </span><span class="p">(</span><span class="mi">2</span><span class="p">),</span>
<span class="w"> </span><span class="n">col_03</span><span class="w"> </span><span class="nb">VARCHAR</span><span class="w"> </span><span class="p">(</span><span class="mi">32</span><span class="p">),</span>
<span class="w"> </span><span class="n">col_04</span><span class="w"> </span><span class="k">TIMESTAMP</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_05</span><span class="w"> </span><span class="nb">DATE</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_06</span><span class="w"> </span><span class="nb">INT</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_07</span><span class="w"> </span><span class="nb">SMALLINT</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_08</span><span class="w"> </span><span class="nb">BIGINT</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_09</span><span class="w"> </span><span class="n">TINYINT</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_10</span><span class="w"> </span><span class="nb">FLOAT</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_11</span><span class="w"> </span><span class="n">DOUBLE</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_12</span><span class="w"> </span><span class="nb">DECIMAL</span><span class="w"> </span><span class="p">(</span><span class="mi">10</span><span class="p">,</span><span class="w"> </span><span class="mi">3</span><span class="p">),</span>
<span class="w"> </span><span class="n">col_13</span><span class="w"> </span><span class="nb">BOOLEAN</span>
<span class="p">)</span>
<span class="n">STORED</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="n">parquet</span><span class="p">;</span>
</pre></div></td></tr></table></div>
</div>
</div>
<div class="section" id="dli_08_0204__section9184929957"><h4 class="sectiontitle">Example 5: Creating a DLI Partitioned Table and Customizing TBLPROPERTIES Parameters</h4><p id="dli_08_0204__p195179383233">Example description: Create a DLI partitioned table named <strong id="dli_08_0204__b16432112338">table3</strong> and partition the table based on <strong id="dli_08_0204__b1644101183317">col_3</strong>. Set <strong id="dli_08_0204__b259087840113241">dli.multi.version.enable</strong>, <strong id="dli_08_0204__b1939307255113241">comment</strong>, <strong id="dli_08_0204__b497484564113241">orc.compress</strong>, and <strong id="dli_08_0204__b2045513248113241">auto.purge</strong> in <strong id="dli_08_0204__b2096612000113241">TBLPROPERTIES</strong>.</p>
<ul id="dli_08_0204__ul172625151156"><li id="dli_08_0204__li526261515155"><strong id="dli_08_0204__b964998380113326">dli.multi.version.enable</strong>: In this example, set this parameter to <strong id="dli_08_0204__b1956990292113326">true</strong>, indicating that the DLI data versioning function is enabled for table data backup and restoration.</li><li id="dli_08_0204__li026291521511"><strong id="dli_08_0204__b2486912153419">comment</strong>: table description, which can be modified later.</li><li id="dli_08_0204__li0262515141518"><strong id="dli_08_0204__b1645579383113445">orc.compress</strong>: compression mode of the <strong id="dli_08_0204__b1928470252113445">orc</strong> format, which is <strong id="dli_08_0204__b1755372869113445">ZLIB</strong> in this example.</li><li id="dli_08_0204__li5262715201513"><strong id="dli_08_0204__b1532402503113557">auto.purge</strong>: In this example, set this parameter to <strong id="dli_08_0204__b1125892485113557">true</strong>, indicating that data that is deleted or overwritten will bypass the recycle bin and be permanently deleted.</li></ul>
<div class="codecoloring" codetype="Sql" id="dli_08_0204__screen868716251265"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal"> 1</span>
<span class="normal"> 2</span>
<span class="normal"> 3</span>
<span class="normal"> 4</span>
<span class="normal"> 5</span>
<span class="normal"> 6</span>
<span class="normal"> 7</span>
<span class="normal"> 8</span>
<span class="normal"> 9</span>
<span class="normal">10</span>
<span class="normal">11</span>
<span class="normal">12</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">CREATE</span><span class="w"> </span><span class="k">TABLE</span><span class="w"> </span><span class="k">IF</span><span class="w"> </span><span class="k">NOT</span><span class="w"> </span><span class="k">EXISTs</span><span class="w"> </span><span class="n">table3</span><span class="w"> </span><span class="p">(</span>
<span class="w"> </span><span class="n">col_1</span><span class="w"> </span><span class="n">STRING</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_2</span><span class="w"> </span><span class="n">STRING</span>
<span class="p">)</span>
<span class="n">PARTITIONED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="p">(</span><span class="n">col_3</span><span class="w"> </span><span class="nb">DATE</span><span class="p">)</span>
<span class="n">STORED</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="n">rcfile</span>
<span class="n">TBLPROPERTIES</span><span class="w"> </span><span class="p">(</span>
<span class="w"> </span><span class="n">dli</span><span class="p">.</span><span class="n">multi</span><span class="p">.</span><span class="k">version</span><span class="p">.</span><span class="n">enable</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="k">true</span><span class="p">,</span>
<span class="w"> </span><span class="k">comment</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="s1">'Created by dli'</span><span class="p">,</span>
<span class="w"> </span><span class="n">orc</span><span class="p">.</span><span class="n">compress</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="s1">'ZLIB'</span><span class="p">,</span>
<span class="w"> </span><span class="n">auto</span><span class="p">.</span><span class="n">purge</span><span class="w"> </span><span class="o">=</span><span class="w"> </span><span class="k">true</span>
<span class="p">);</span>
</pre></div></td></tr></table></div>
</div>
</div>
<div class="section" id="dli_08_0204__section18908162241516"><h4 class="sectiontitle">Example 6: Creating a Non-Partitioned Table in Textfile Format and Setting ROW FORMAT</h4><p id="dli_08_0204__p16176113683510">Example description: In this example, create a non-partitioned table named <strong id="dli_08_0204__b65031328113518">table4</strong> in the <strong id="dli_08_0204__b1450319284352">textfile</strong> format and set <strong id="dli_08_0204__b0503112813350">ROW FORMAT</strong> (the ROW FORMAT function is available only for textfile tables).</p>
<ul id="dli_08_0204__ul169261339131011"><li id="dli_08_0204__li177621581632"><strong id="dli_08_0204__b743120410370">Fields</strong>: columns in a table. Each field has a name and data type. Fields in a table are separated by slashes (/).</li><li id="dli_08_0204__li9671621649"><strong id="dli_08_0204__b046122314375">COLLECTION ITEMS</strong>: A collection item refers to an element in a group of data, which can be an array, a list, or a collection. Collection items in <strong id="dli_08_0204__b176311548133716">table4</strong> are separated by $.</li><li id="dli_08_0204__li48022361836"><strong id="dli_08_0204__b22827515379">MAP KEYS</strong>: A map key is a data structure of key-value pairs and is used to store a group of associated data. Map keys in a table are separated by number signs (#).</li><li id="dli_08_0204__li136971101340"><strong id="dli_08_0204__b112382011389">Rows</strong>: rows in a table. Each row contains a group of field values. Rows in a table end with <strong id="dli_08_0204__b62395112384">\n</strong>. (Note that only <strong id="dli_08_0204__b323941183818">\n</strong> can be used as the row separator.)</li><li id="dli_08_0204__li792663951016"><strong id="dli_08_0204__b124587941011439">NULL</strong>: a special value that represents a missing or unknown value. In a table, <strong id="dli_08_0204__b33988554114320">NULL</strong> indicates that the field has no value or the value is unknown. When there is a null value in the data, it is represented by the string <strong id="dli_08_0204__b1693122322114428">null</strong>.</li></ul>
<div class="codecoloring" codetype="Sql" id="dli_08_0204__screen6591111419353"><div class="highlight"><table class="highlighttable"><tr><td class="linenos"><div class="linenodiv"><pre><span class="normal"> 1</span>
<span class="normal"> 2</span>
<span class="normal"> 3</span>
<span class="normal"> 4</span>
<span class="normal"> 5</span>
<span class="normal"> 6</span>
<span class="normal"> 7</span>
<span class="normal"> 8</span>
<span class="normal"> 9</span>
<span class="normal">10</span>
<span class="normal">11</span></pre></div></td><td class="code"><div><pre><span></span><span class="k">CREATE</span><span class="w"> </span><span class="k">TABLE</span><span class="w"> </span><span class="k">IF</span><span class="w"> </span><span class="k">NOT</span><span class="w"> </span><span class="k">EXISTS</span><span class="w"> </span><span class="n">table4</span><span class="w"> </span><span class="p">(</span>
<span class="w"> </span><span class="n">col_1</span><span class="w"> </span><span class="n">STRING</span><span class="p">,</span>
<span class="w"> </span><span class="n">col_2</span><span class="w"> </span><span class="nb">INT</span>
<span class="p">)</span>
<span class="n">STORED</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="n">TEXTFILE</span>
<span class="k">ROW</span><span class="w"> </span><span class="n">FORMAT</span>
<span class="n">DELIMITED</span><span class="w"> </span><span class="n">FIELDS</span><span class="w"> </span><span class="n">TERMINATED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="s1">'/'</span>
<span class="n">COLLECTION</span><span class="w"> </span><span class="n">ITEMS</span><span class="w"> </span><span class="n">TERMINATED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="s1">'$'</span>
<span class="k">MAP</span><span class="w"> </span><span class="n">KEYS</span><span class="w"> </span><span class="n">TERMINATED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="s1">'#'</span>
<span class="n">LINES</span><span class="w"> </span><span class="n">TERMINATED</span><span class="w"> </span><span class="k">BY</span><span class="w"> </span><span class="s1">'\n'</span>
<span class="k">NULL</span><span class="w"> </span><span class="k">DEFINED</span><span class="w"> </span><span class="k">AS</span><span class="w"> </span><span class="s1">'NULL'</span><span class="p">;</span>
</pre></div></td></tr></table></div>
</div>
</div>
</div>
<div>
<div class="familylinks">
<div class="parentlink"><strong>Parent topic:</strong> <a href="dli_08_0224.html">Creating a DLI Table</a></div>
</div>
</div>