WebSphere Commerce Search indexes are created separately based on a specific master
catalog. Deploy the WebSphere Commerce Search index, then separately manage and rebuild each index
to refresh its data.
Search indexing methods
A full reindexing rebuilds the entire search index, while a delta reindexing performs only
incremental updates on the existing operational search index.
Important: You must
periodically fully rebuild the search index to ensure that the index size on the file system remains
manageable. That is, when only delta reindexing is performed, the index files that marked for
deletion continue to accumulate. When a full reindexing is performed, an
optimize request is passed in that fully deletes the index files identified
from previous delta reindexes.
Before you deploy your index, consider the index-building scenarios,
depending on your WebSphere Commerce Search environment.
The following topologies are typical:
A production server with a dedicated index-building machine.
A staging and clustered search server.
It is recommended that you use the same operating system across all search servers (master
search server, subordinate servers, and search server cluster).
Search index types and subtypes
WebSphere Commerce Search contains the following search index types to suit your business and
search requirements:
Catalog entry index
A search index for catalog entries in both master and sales catalog.
The catalog entry search index contains the following content:
Structured content
Structured content includes items in the
product catalog and delivers search results based on items that are sold in your
store.
Unstructured content
Unstructured site content includes documents
that do not adhere to a specific data model, such as product attachments contained in various
formats. For example, content such as user manuals and warranty information are considered
unstructured content. Its elements, construction, and organization are typically unknown and can
vary depending on its file type.
Site content
Site content includes HTML and other site files from
WebSphere Commerce starter stores. It is fetched and crawled by the site content
crawler.
Catalog group index
A search index for categories in both master and sales catalog.
WebSphere Commerce Search contains the following search index subtypes, which keeps data in a
separate core for performance reasons:
Inventory
The inventory index, a separate index that
contains index data, is an extension of the product index. For accurate inventory status, you can
refresh the inventory index more frequently than the product index.
Important:
Non-ATP and DOM inventory systems are supported by index inventory data in WebSphere Commerce Search. ATP, External, and No-inventory types are not supported with the Search inventory
index.
The filterCache and documentCache are required when an extension index such as Inventory exists
in WebSphere Commerce Search. For more information, see WebSphere Commerce Search performance tuning.
Price
Sets up a subindex for price data. Prices are indexed using Index Load, as it can populate a
large amount of data into a separate extension index faster than the Catalog Entry index can index
price data. For more information, see Index Load.
The following sample topologies are available for small to large index sizes:
Important: In all scenarios, it is recommended to only perform indexing in an authoring or
staging environment. That is, an environment that is used for quality assurance (QA) purposes, and
if required, containing access to the production database. Otherwise, the runtime search performance
or index integrity can be affected by indexing within a production environment.
Small or medium index size: WebSphere Commerce production server with a dedicated
index-building machine
A dedicated index-building machine, which is known as the master, is used to build the index. The
index is then replicated to other search machines, which are known as targets. The runtime search
performance is not affected during index-building.
Note: Building the search index can still be a timely process, and might occasionally lead to
inconsistencies between the database and search index. However, the runtime search performance is
not affected during index-building due to the offset of the processing load to another
machine.
The following diagram illustrates a typical WebSphere Commerce Search deployment for a small or
medium index size:
Recommended:
Large index size: Staging propagation and clustered search servers
The index-building is performed on the WebSphere Commerce staging database and search staging machine. Business users can test their new data in
the staging database with the updated index. After successfully completing their tests, the data is
propagated from the staging database to the production database, and the index is replicated from
the search staging machine to the search production machines using the master indexing server. This
is the recommended approach, as this option imposes the least amount of risk to the production
environment and provides a more flexible environment for making incremental changes.
The following diagram illustrates a typical WebSphere Commerce Search deployment for a large index size: