Nástroj pro testování algoritmů pro učení jazyků

Krejčová, Martina

A tool for testing of algorithms for learning languages

bachelor thesis (DEFENDED)

View/Open

Záznam o průběhu obhajoby (21.57Kb)

Permanent link

http://hdl.handle.net/20.500.11956/26763

Identifiers

Study Information System: 50692

CU Caralogue: 990011340130106986

Referee

Kukačka, Marek

Faculty / Institute

Faculty of Mathematics and Physics

Discipline

Programming

Department

Library

Date of defense

22. 6. 2009

Publisher

Univerzita Karlova, Matematicko-fyzikální fakulta

Language

Czech

Grade

Excellent

Úkolem gramatické inference je nalezení pravidelností v datech. Je-li pro data z techto odpozorovaných pravidel vytvoren model, mužeme pomocí tohoto modelu data napríklad zkomprimovat, vytvorit data nová, která budou tato pravidla také splnovat, ci mužeme urcit, která data tomuto modelu odpovídají a která ne. Cílem této práce bylo vytvorit testovací prostredí pro algoritmy gramatické inference bezkontextových jazyku. Model dat v našem prostredí tvorí bezkontextové gramatiky. Je zde popsán postup generování gramatik a následne generování dat, z kterých se algoritmus muže pokusit bezkontextový jazyk naucit. Za úcelem srovnání úspešnosti algoritmu na datech generovaných z více ruzných bezkontextových gramatik je rešen pojem složitosti bezkontextových gramatik. Dále je navrženo nekolik variant vyhodnocení úspešnosti algoritmu.

Abstract (English)

Goal of this work was to develop a tool for testing algorithms of grammatical inference of context-free languages. Grammatical inference is a process of learning of grammars and languages from data. Learning could mean finding a suitable model that describes data. Due to this model we could for instance compress this data, create new data or find out which data are consistent with this model. The model in our tool is context-free grammar. We describe how to generate the context-free grammar and data from which the algorithm can try to learn the language of this grammar. The problem of context-free grammar complexity was solved in order to evaluate success achieved by an algorithm in different learning tasks. Also some alternatives of evaluating success of an algorithm are described. This tool is also useful to create data for publishing results of the algorithm.

Citace dokumentu

Metadata

Show full item record