Provides full-text indexing for filenames and files in the ]po[ filestorages. Uses a number of external filters to periodically scan the ]po[ file storage for new files, builds up a Full Text Index and allows the user to retrieve the files using the normal search interface.
intranet-search-pg-files requires the following software to extract indexable strings from different file formats:
The package will periodically (default: every 5 minutes) check a maximum number of objects (default: 100) for new files. Please see below for the parameters controlling the indexing behaviour.
This scheduled behaviour is necessary in order to balance the desire for fast indexing with the considerable load that full text indexing will pose on your database.
To add new file type please see ~/packages/intranet-search-pg-files-procs.tcl and search for "intranet_search_pg_files_fti_content". Very basic TCL skills are sufficient to add a new converter once you have the converter running on the shell level.
To control indexing please see the page http://<your_server>/intranet-search-pg-files/. In this page you can see the files found by the indexer and you can re-index certain business objects.
Please see the error log at ~/log/error.log for detailed messages.
tcl/intranet-search-pg-files-procs.tcl File Search Library
im_package_intranet_pg_files_id Returns the package id of the intranet-search-pg-files module intranet_search_pg_files_fti_content Extract and normalize the file contents - using a best effort attempt using variuos filters intranet_search_pg_files_index_all Index the entire server intranet_search_pg_files_index_object Index the files of a single object such as a project, company or user. intranet_search_pg_files_search_indexer Index the entire server.
|index.tcl||Show files that are not indexed by the FTS|
|reindex-biz-object.tcl||Show files that are not indexed by the FTS|