During training, Oracle Text determines the ratio of positive to negative documents. If this ratio is not in the range of 0.4 to 0.6, then a warning written to the CTX log indicates that the sentiment classifier is skewed. After the sentiment classifier is trained, it is ready to be used in sentiment queries to perform sentiment analysis.
In the following example, clsfier_camera
is the name of the sentiment classifier that is being trained, review_id
is the name of the document ID column in the document training set, train_category
is the name of the category table that contains the labels for the training set documents, doc_id
is the document ID column in the category table, category
is the category column in the category table, and clsfier
is the name of the sentiment classifier preference that is used to train the classifier.
exec ctx_cls.sa_train_model('clsfier_camera','docx','review_id','train_category','doc_id','category','clsfier');
Note:
If you do not specify a sentiment classifier preference when running the CTX_CLS.SA_TRAIN_MODEL
procedure, then Oracle Text uses the default preference CTXSYS.DEFAULT_SENTIMENT_CLASSIFIER.