To manage data using ModelArts, create a dataset. Then you can perform operations on the dataset, such as labeling data, importing data, and publishing the dataset.
After the dataset is created, the dataset management page is displayed. You can perform the following operations on the dataset: label data, publish dataset versions, manage dataset versions, modify the dataset, import data, and delete the dataset. For details about the operations supported by different types of datasets, see .
Parameter |
Description |
---|---|
Input Dataset Path |
Select the OBS path to the input dataset. |
Output Dataset Path |
Select the OBS path to the output dataset. NOTE:
The output dataset path cannot be the same as the input dataset path or cannot be the subdirectory of the input dataset path. Select an empty directory as the Output Dataset Path. |
Label Set |
|
Team Labeling |
Enable or disable team labeling. Image segmentation does not support team labeling. Therefore, this parameter is unavailable when you use image segmentation. After enabling team labeling, enter the name and type of the team labeling task, and select the labeling team and team members. For details about the parameter settings, see Creating Team Labeling Tasks. Before enabling team labeling, ensure that you have added a team and members on the Labeling Teams page. If no labeling team is available, click the link on the page to go to the Labeling Teams page, and add your team and members. For details, see Introduction to Team Labeling. After a dataset is created with team labeling enabled, you can view the Team Labeling mark in Labeling Type. |
Parameter |
Description |
---|---|
Input Dataset Path |
Select the OBS path to the input dataset. |
Output Dataset Path |
Select the OBS path to the output dataset. NOTE:
The output dataset path cannot be the same as the input dataset path or cannot be the subdirectory of the input dataset path. Select an empty directory as the Output Dataset Path. |
Label Set (Sound Classification) |
Set labels only for datasets of the sound classification type.
|
Label Management (Speech Paragraph Labeling) |
Only datasets for speech paragraph labeling support multiple labels.
|
Speech Labeling (Speech Paragraph Labeling) |
Only datasets for speech paragraph labeling support speech labeling. By default, speech labeling is disabled. If this function is enabled, you can label speech content. |
Parameter |
Description |
---|---|
Input Dataset Path |
Select the OBS path to the input dataset. NOTE:
Labeled text classification data can be identified only when you import data. When creating a dataset, set an empty OBS directory. After the dataset is created, import the labeled data into it. For details about the format of the data to be imported, see Specifications for Importing Data from an OBS Directory. |
Output Dataset Path |
Select the OBS path to the output dataset. NOTE:
The output dataset path cannot be the same as the input dataset path or cannot be the subdirectory of the input dataset path. Select an empty directory as the Output Dataset Path. |
Label Set (for text classification and named entity recognition) |
|
Label Set (for text triplet) |
For datasets of the text triplet type, set entity labels and relationship labels.
|
Team Labeling |
Enable or disable team labeling. After enabling team labeling, enter the name and type of the team labeling task, and select the labeling team and team members. For details about the parameter settings, see Creating Team Labeling Tasks. Before enabling team labeling, ensure that you have added a team and members on the Labeling Teams page. If no labeling team is available, click the link on the page to go to the Labeling Teams page, and add your team and members. For details, see Introduction to Team Labeling. After a dataset is created with team labeling enabled, you can view the Team Labeling mark in Labeling Type. |
Parameter |
Description |
---|---|
Input Dataset Path |
Select the OBS path to the input dataset. |
Output Dataset Path |
Select the OBS path to the output dataset. NOTE:
The output dataset path cannot be the same as the input dataset path or cannot be the subdirectory of the input dataset path. Select an empty directory as the Output Dataset Path. |