Efficient Full-Text Searches on Massive Data
The scheme for the full-text search has drawn much attention due to its popular use in web document searches and enterprises' document searches. The full-text search leads to a large size of index files and thus may consume massive computing resources for its processing. In this paper, the authors present both the system architecture of a full-text search engines with a huge volume of indexed data and its multi-level cache scheme. The presented system architecture and cache scheme were implemented in a commercial search engine, which has capacity enough to process more than 5-milion queries per day and index about 70-milion web documents crawled in Korea. In economic respect, the proposed cache scheme is very crucial for user's full-text search engine.