Search Site          
 
Home  

Chapter 4. Configuring mnoGoSearch in detail

Note

To make the configuration options available, you have to start Service first. Press the green Start button in main mnoGoSearch menu.

Common tab

Database

This section covers main database configuration, including Database Type, Data source, Word storage mode and lets you create tables in you database.

To configure database go to Common tab of mnoGoSearch window. Press Edit button in Database section.

In Database Settings window select database type you wish to use, select DSN you created to store indexer information. Select database word storage mode (for details see Chapter 17., Storage modes)

Now press Create indexer tables button to create tables in database. Click Start and when tables are created - Exit.

Parameters

Configure more detailed mnoGoSearch parameters here. By pressing Edit button you can configure the following options:

  • DocSize limit: Maximum size of a document that can be indexed in bytes. Any document larger than this limit will be ignored by the indexer. The default value is 1048576 bytes, i.e. 1 MB.

  • Local charset: the local character set used by your server. It should match the one specified in search.htm template.

  • Force 1251: This option is useful for users which deal with Cyrillic content and broken (or misconfigured?) Microsoft IIS web servers, which tends to not report charset correctly. This option is turned on it is assumed that all servers which reports as 'Microsoft' or 'IIS' have content in Windows-1251 charset.

  • Valid word length: specify the range of word length mnoGoSearch will index. Any words above or below the limit will not be indexed.

  • Number is a valid word: specify whether to index numbers or not.

  • Num/char sequence is a valid word: specify whether to index words containing both letters and numbers or not.

  • Get ISpell data from: specify whether to get ISpell data from text files or from a database. See Chapter 15., Ispell for details.

    If you choose to load ispell data from the database, you have to import ispell data into database first. Press Import filesbutton and select ispell files to import them into database.

  • Crosswords: this feature allows to assign words between <a href="xxx"> and </a> also to a document this link leads to. It works in SQL database mode and is not supported in built-in database and Cache mode. To enable Crosswords, please use "CrossWord yes" command search.htm also.

  • Phrases: enable phrase search support.

Indexer settings

Press Edit button to change indexer settings.

  • Delete no server: delete from the database those URLs that are not listed in "Servers".

  • Detect clones: detect documents that are identical but are stored in various locations and index only one of them.

  • Store valid words: store documents that are found in dictionary.

  • Store invalid words: store documents that are not found in dictionary.

  • User agent: the User-Agent string is the text that programs use to identify themselves to HTTP, mail and news servers, for usage tracking and other purposes. You can specify any string you wish.

To specify indexer's priority (e.g. if you wish to lower indexer's CPU usage, use the Priority command in search.htm template. Just insert this command into search.htm next to other commands (DBAddr, etc.). Priority values range from -2 to 2, where -2 is the lower priority and 2 is the highest. For example to set the highest priority for the indexer, insert the following command:

					
Priority 2
				

Stop words data files

You can specify text files containing stop words. These are the words that are not taken into account during indexing and later - searching. Press Add button to specify a stop words file, Edit button to choose another, and Delete to remove selected file from the list.

Custom HTTP Header lines

You can specify custom HTTP header that mnoGoSearch sends with its request to HTTP server. E.g. you can request server to return only pages in specific language if possible.




Copyright © 2000-2011 Lavtech.Com Corp.