Implementation of a File-Based Indexing Framework for the TopX Search Engine

Abstract

TopX is an efficient and effective search engine for text and semistructured data. The current, Java-based implementation of TopX relies on index structures that are stored in a relational database, which creates problems with usability, index size, and efficient accesses. The goal of this thesis is therefore to develop a new, file-based indexing framework that replaces the existing TopX index and does no longer use a relational database, not even for temporary storage.

Important goals of this thesis are

The indexing framework should be implemented in C++, hence excellent programming skills in C++ are mandatory.

Organization

Advisor: Ralf Schenkel, Andreas Broschart
Student: Levan Kasradze
Level:    Master
Status:   running
Start
:     2007
Prerequisites: Excellent programming skills in C++, some SQL and XML experience

Additional Information and Literature

Back to the list of topics.

last change: Ralf Schenkel, January 8, 2008.