About OpenCSV Standards
The CSV parser in the extract service for file extractors uses Opencsv. The csv files that are processed by extractservice must be compliant with the Opencsv standards.
See Opencsv File Standards. In addition to the CSV parser, the extract service supports files that are compliant with RFC4180 specification. The RFC 4180 CSV parser enables you to ingest single-line and multi-line data within your .csv files. The RFC 4180 parser supports ingesting data records with up to 99 line breaks. For more information on the RFC 4180 specification, see Common Format and MIME Type for Comma-Separated Values (CSV) Files.
Keep in mind the following:
- While using special characters:
- For strings without special characters, quotes are optional.
- For strings with special characters, quotes are mandatory. For example, if a string has a comma, then you must use quotes for the string such as "Abc, 123".
- Escapes (backslash character) are optional.
- Backslash characters must always be escaped. For example, if there is a backslash in your data, use the following format: "Double backslash ( \\ ) abc".
- To manage quotes inside a quoted string, use a backslash inside the quotes: "Asd \" asd".
- The Opencsv parser allows you to select one of these available characters as a delimiter:
- Comma (,)
- Semi-colon ( ; )
- Pipe (|)
- Tab ( )