Nástroj pro testování algoritmů pro učení jazyků

Krejčová, Martina

A tool for testing of algorithms for learning languages

bakalářská práce (OBHÁJENO)

Zobrazit/otevřít

Záznam o průběhu obhajoby (21.57Kb)

Trvalý odkaz

http://hdl.handle.net/20.500.11956/26763

Identifikátory

SIS: 50692

Oponent práce

Kukačka, Marek

Fakulta / součást

Matematicko-fyzikální fakulta

Obor

Programování

Katedra / ústav / klinika

Knihovna fakulty

Datum obhajoby

22. 6. 2009

Nakladatel

Univerzita Karlova, Matematicko-fyzikální fakulta

Jazyk

Čeština

Známka

Výborně

Úkolem gramatické inference je nalezení pravidelností v datech. Je-li pro data z techto odpozorovaných pravidel vytvoren model, mužeme pomocí tohoto modelu data napríklad zkomprimovat, vytvorit data nová, která budou tato pravidla také splnovat, ci mužeme urcit, která data tomuto modelu odpovídají a která ne. Cílem této práce bylo vytvorit testovací prostredí pro algoritmy gramatické inference bezkontextových jazyku. Model dat v našem prostredí tvorí bezkontextové gramatiky. Je zde popsán postup generování gramatik a následne generování dat, z kterých se algoritmus muže pokusit bezkontextový jazyk naucit. Za úcelem srovnání úspešnosti algoritmu na datech generovaných z více ruzných bezkontextových gramatik je rešen pojem složitosti bezkontextových gramatik. Dále je navrženo nekolik variant vyhodnocení úspešnosti algoritmu.

Abstrakt (anglicky)

Goal of this work was to develop a tool for testing algorithms of grammatical inference of context-free languages. Grammatical inference is a process of learning of grammars and languages from data. Learning could mean finding a suitable model that describes data. Due to this model we could for instance compress this data, create new data or find out which data are consistent with this model. The model in our tool is context-free grammar. We describe how to generate the context-free grammar and data from which the algorithm can try to learn the language of this grammar. The problem of context-free grammar complexity was solved in order to evaluate success achieved by an algorithm in different learning tasks. Also some alternatives of evaluating success of an algorithm are described. This tool is also useful to create data for publishing results of the algorithm.

Citace dokumentu

Metadata

Zobrazit celý záznam