Algorithms for Parallel Searching in XML Datasets

Kruliš, Martin

Paralelní vyhledávání nad XML daty

diploma thesis (DEFENDED)

View/Open

Záznam o průběhu obhajoby (185.7Kb)

Permanent link

http://hdl.handle.net/20.500.11956/30691

Identifiers

Study Information System: 65409

Referee

Mareš, Martin

Faculty / Institute

Faculty of Mathematics and Physics

Discipline

Software Systems

Department

Department of Software Engineering

Date of defense

7. 9. 2009

Publisher

Univerzita Karlova, Matematicko-fyzikální fakulta

Language

English

Grade

Excellent

Práce se bude zabývat problematikou indexace XML dat a efektivního vyhledávání s indexem. Hlavním těžíštěm bude snaha navrhnout datové struktury a algoritmy, které umožní úlohu vyhledávání maximálně paralelizovat a využít tak potenciálu vícejádrových procesorů. Součástí práce bude i pilotní implementace navržených algoritmů. Dotazování bude probíhat pomocí vlastního jazyka (ne nutně v textové podobě), což umožní používat různé vyhledávací jazyky jako frontendy. Jako ukázka použití bude implementován vzorový frontend umožňující vyhledávání pomocí podmnožiny jazyka XPath.

Abstract (English)

This thesis will address the problems of indexing XML datasets and finding effective searching methods for indexed data. Defining data structures and algorithms that take highly parallel approach to the searching problem is considered to be main objective, therefore the implementation may benefit from the power of multicore CPUs. Prototype of such implementation will be presented with the thesis. Internal query model will be developed as a common layer. Multiple front-ends representing different query languages will be able to operate on the top of this model, thus the engine will not be dependent on any particular language. Sample front-end for a subset of XPath will be implemented along with the prototype.

Citace dokumentu

Metadata

Show full item record