3.4.1 Access Data

Access the data set from the SH Schema and explore the data to understand the attributes.

Remember:

The data set used for this use case is from the SH schema. The SH schema can be readily accessed in Oracle Autonomous Database. For on-premises databases, the schema is installed during the installation or can be manually installed by downloading the scripts. See Installing the Sample Schemas.

To understand the data, you will perform the following:
  • Access the data.
  • Examine the various attributes or columns of the data set.
  • Assess data quality (by exploring the data).

Access Data

You will use SALES table data from the SH schema.

Examine Data

The following table displays information about the attributes from SALES:

Attribute Name Information
PROD_ID The ID of the product
CUST_ID The ID of the customer
TIME_ID The timestamp of the purchase of the product in yyy-mm-dd hh:mm:ss format
CHANNEL_ID The channel ID of the channel sales data
PROMO_ID The product promotion ID
QUANTITY_SOLD The number of items sold
AMOUNT_SOLD The amount or sales data

Identify Target Variable

In this use case, the task is to train a model that predicts the amount sold. Therefore, the target variable is the attribute AMOUNT_SOLD.