Create and Use Custom Language Data
Create and use your own language-specific conditions (such as common abbreviations) when chunking data.
Here, you use the chunker helper function
CREATE_LANG_DATA
from the DBMS_VECTOR_CHAIN
package to load the data file for Simplified Chinese. This data file contains abbreviation tokens for your chosen language.
After loading the language data, you can now use language-specific chunking by specifying the
LANGUAGE
chunking parameter with VECTOR_CHUNKS
or UTL_TO_CHUNKS
.
Related Topics
Parent topic: Configure Chunking Parameters