7.4 Create Data Entities within the Data Flow editor

If you have already created or imported your target data entity, drag it onto the Design Canvas and complete the column mappings and options.

To create the definition of an entity while in the Data Flow editor:

  1. Drag the data entity onto the Design Canvas.
  2. Select the component and click the Add Data Entity icon at the top right corner of the target component.
  3. The Add Data Entity page appears, allowing you to configure the following details of the target component:

    General tab

    • In the Name text box, enter the name of the newly created Data Entity.
    • From the Entity Type drop-down, select the data entity type.
      • Table
      • ML Model
        When you select this entity type, the user interface changes as follows:
        • The Connection Type drop-down lists Oracle as the only option. Consequently, the Connection drop-down lists only the Oracle connections that you have created.
        • The Add Data Entity wizard displays the Properties tab where you can select the Type of Learning, Function, Algorithm, and configure parameters to define the ML Model. See Machine Learning (ML) Models for detailed information about creating and using an ML Model data entity.
    • From the Connection Type drop-down, select the connection type for the newly created Data Entity. The Connection drop-down is then populated with the connections that you have created for the selected connection type.
    • From the Connection drop-down, select the server name from which you wish to add the newly created Data Entity.
    • In the Schema drop-down, all schemas corresponding to the selected connection are listed in two groups:
      • New Database Schema (schemas that you have not imported from before)
      • Existing Database Schema (schemas that you have imported from before and whose data entities you are potentially replacing)
      From the Schema drop-down, select the required schema.

      Note:

      For Oracle Object Storage connections, the Schema drop-down lists the name of the bucket that you specified in the URL when you created the connection.
    • In the Tags text box, enter a tag of your choice. You can use tags to filter the Data Entities displayed in the Data Entity Page.
    • For Oracle database connections, you can mark the data entities as a feature group. Expand Advanced Options and click the Treat as Feature Group checkbox.
    • Click Next.

    Columns tab

    • Click the Add Columns icon to add new columns to the newly created Data Entity.

      A new column is added to the displayed table.

    • The table displays the following columns:
      • Name
      • Data Type - Click the cell to configure the required Data Type.
      • Scale
      • Length
      • Actions - Click the cross icon to delete the created column.
    • To delete columns in bulk, select the columns and click the Delete icon.
    • To search for a column, enter the column name in the Search text box and press Enter. The details of the matching column are displayed.
    • Click Next.

    Preview Data Entity tab

    This tab displays a preview of all the created columns and their configured details. If the data entity belongs to an Oracle database, you can also view statistics of the table. See View Statistics of Data Entities for more information.

  4. Click Save.

    The new target Data Entity is created.

  5. Expand the Properties Panel in the right pane to view the following settings of the created component:
    • General - Displays the Name of the component along with its Connection and Schema details.
    • Attributes - Displays the details of all the attributes associated with the component.
    • Column Mapping - Click Auto Map to map all the columns automatically.
    • Preview - Click to preview the component.
    • Options - Change the options as appropriate.
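
To make the relationship between the wizard fields concrete, the following is a minimal Python sketch of the information collected in the General and Columns tabs, plus a simple stand-in for Auto Map. It is not a Data Transforms API: the class names, the validate and auto_map helpers, and the sample values (SALES_TARGET, my_adw, SALES) are all hypothetical. The source does not say how Auto Map pairs columns, so the sketch assumes case-insensitive matching by column name.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# Hypothetical model of the wizard's fields; not an actual product API.
ENTITY_TYPES = {"Table", "ML Model"}


@dataclass
class Column:
    # Mirrors the Columns tab: Name, Data Type, Scale, Length.
    name: str
    data_type: str
    length: Optional[int] = None
    scale: Optional[int] = None


@dataclass
class DataEntity:
    # Mirrors the General tab fields.
    name: str
    entity_type: str
    connection_type: str
    connection: str
    schema: str
    tags: List[str] = field(default_factory=list)
    columns: List[Column] = field(default_factory=list)

    def validate(self) -> List[str]:
        """Return a list of problems; an empty list means the definition is consistent."""
        problems = []
        if not self.name:
            problems.append("Name is required")
        if self.entity_type not in ENTITY_TYPES:
            problems.append(f"Unknown entity type: {self.entity_type}")
        # Per the General tab: ML Model entities only offer Oracle connections.
        if self.entity_type == "ML Model" and self.connection_type != "Oracle":
            problems.append("ML Model entities require an Oracle connection type")
        return problems


def auto_map(source_columns: List[str], target: DataEntity) -> Dict[str, str]:
    """Pair each target column with a same-named source column.

    Assumption: matching is by name, ignoring case. Target columns
    without a match are left out of the result.
    """
    by_name = {s.lower(): s for s in source_columns}
    return {
        c.name: by_name[c.name.lower()]
        for c in target.columns
        if c.name.lower() in by_name
    }


entity = DataEntity(
    name="SALES_TARGET",
    entity_type="Table",
    connection_type="Oracle",
    connection="my_adw",
    schema="SALES",
    tags=["demo"],
    columns=[
        Column("ID", "NUMBER", length=10, scale=0),
        Column("REGION", "VARCHAR2", length=50),
    ],
)
print(entity.validate())
print(auto_map(["id", "Region", "amount"], entity))
```

Keeping validation separate from construction mirrors the wizard, where each tab is filled in before Save commits the definition. A real Auto Map may use different matching rules, so treat the mapping logic here only as an illustration.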