Filtering
To avoid indexing binary data (which is not useful in etx searches), filter your documents before they are indexed. Filtering refers to the process of stripping away all the proprietary formatting information from a document so that only its text content remains in ASCII format.
For example, Microsoft™ Word documents usually contain formatting information that describes the fonts, paragraph styles, character styles, and layout of the text. Although this information can be indexed, it is not useful for users who want to search the content of the document. Its inclusion in an etx index can significantly increase the size of the index and affect the performance of text searches. Filtering removes all this information and leaves standard ASCII text.
CREATE INDEX abstract_index ON my_table (abstract etx_clob_ops)
USING etx (FILTER = 'STOP_ON_ERROR');