Shluková analýza pro funkcionální data

Zemanová, Barbora

Cluster analysis for functional data

diploma thesis (DEFENDED)

View/Open

Záznam o průběhu obhajoby (114.7Kb)

Permanent link

http://hdl.handle.net/20.500.11956/39803

Identifiers

Study Information System: 75982

Referee

Hušková, Marie

Faculty / Institute

Faculty of Mathematics and Physics

Discipline

Probability, mathematical statistics and econometrics

Department

Department of Probability and Mathematical Statistics

Date of defense

14. 5. 2012

Publisher

Univerzita Karlova, Matematicko-fyzikální fakulta

Language

Czech

Grade

Excellent

Keywords (Czech)

funkcionální data, shluková analýza, snížení dimenze dat, směs rozdělení, EM-algoritmus

Keywords (English)

functional data, cluster analysis, reduction of data dimension, mixture of distribution, EM-algorithm

V této práci se zabýváme shlukovou analýzou pro funkcionální data. Funkcionální data obsahují soubor subjektů, které jsou charakterizovány opakovanými měřeními určité proměnné. Na základě těchto měření budeme chtít subjekty rozdělit do skupin (shluků) tak, aby si subjekty v jednom shluku byly podobné a lišily se od subjektů v ostatních shlucích. Prvním přístupem, který použijeme, je snížení dimenze dat a následné použití shlukovací metody K-means. Druhým přístupem je použití konečné směsi normálních lineárních smíšených modelů. Parametry tohoto modelu odhadneme metodou maximální věrohodnosti pomocí EM-algoritmu. Během celé práce aplikujeme popsané postupy na reálná meteorologická data.

Abstract (English)

In this work we deal with cluster analysis for functional data. Functional data contain a set of subjects that are characterized by repeated measurements of a variable. Based on these measurements we want to split the subjects into groups (clusters). The subjects in a single cluster should be similar and differ from subjects in the other clusters. The first approach we use is the reduction of data dimension followed by the clustering method K-means. The second approach is to use a finite mixture of normal linear mixed models. We estimate parameters of the model by maximum likelihood using the EM algorithm. Throughout the work we apply all described procedures to real meteorological data.

Citace dokumentu

Metadata

Show full item record