Akcelerace vzájemné korelace pomocí GPU

Maděra, Karel

Accelerating cross-correlation with GPUs

dc.contributor.advisor	Kruliš, Martin
dc.creator	Maděra, Karel
dc.date.accessioned	2022-10-04T16:59:39Z
dc.date.available	2022-10-04T16:59:39Z
dc.date.issued	2022
dc.identifier.uri	http://hdl.handle.net/20.500.11956/176386
dc.description.abstract	Cross-correlation is a commonly used tool in the field of signal processing, with ap- plications in pattern recognition, particle physics, electron tomography, and many other areas. For many of these applications, it is often the limiting factor on system perfor- mance due to its computational complexity. In this thesis, we analyze the cross-correlation algorithm and its optimization and parallelization possibilities. We then implement sev- eral optimizations of the definition-based algorithm, mainly focused on parallelization using the Graphical processing unit (GPU). Even though the definition-based algorithm provides many possibilities for parallelization, the implementation needs to solve several problems, such as the algorithm's low arithmetic intensity. Furthermore, the problems differ between computation types, which include cross-correlating a pair of inputs, one in- put with many other inputs, or many inputs with many other inputs. Lastly, we compare the optimizations of the definition-based algorithm with the asymptotically faster and commonly used algorithm based on the Fast Fourier Transform. Depending on the total size of the data, we achieve parity between the two algorithms for matrix sizes ranging from 60x60 to 150x150, allowing performance improvements for systems using matrices smaller...	en_US
dc.description.abstract	Vzájemná korelace je často používaný nástroj v oboru zpracování signálu, který je možné aplikovat pro rozpoznávání obrazu, částicovou fyziku, elektronovou tomografii a pro mnoho dalších oblastí. Pro mnohé z těchto aplikací je výkon vzájemné korelace lim- itujícím faktorem pro celkový výkon systému z důvodů její výpočetní náročnosti. V této práci provedeme analýzu vzájemné korelace vzhledem k možnostem pro její optimalizaci a paralelizaci. Následně implementujeme několik optimalizací algoritmu odvozeného z definice vzájemné korelace, se zaměřením na paralelizaci pomocí grafických karet (GPU). Přestože tento algoritmus poskytuje mnoho možností pro paralelizaci, je pro jejich využití potřeba vyřešit několik problémů, jako je například nízká aritmetická intenzita algoritmu. Problémy se nadále liší podle typu vstupních dat, mezi které patří korelace jednoho páru vstupů, jednoho vstupu s množinou jiných vstupů, případně korelace mnoha vstupů s mnoha jinými vstupy. V závěru práce poté porovnáme námi implementované optimal- izace algoritmu založeného na definici vzájemné korelace s asymptoticky rychlejším a často používaným algoritmem založeným na Rychlé Fourierově transformaci (FFT). V závislosti na celkové velikosti vstupních dat dosahuje naše implementace stejné rychlosti jako algoritmus založený na FFT při...	cs_CZ
dc.language	Čeština	cs_CZ
dc.language.iso	cs_CZ
dc.publisher	Univerzita Karlova, Matematicko-fyzikální fakulta	cs_CZ
dc.subject	cross-correlation\|signal processing\|parallel\|GPU\|CUDA	en_US
dc.subject	korelace\|zpracování signálu\|paralelní\|GPU\|CUDA	cs_CZ
dc.title	Akcelerace vzájemné korelace pomocí GPU	cs_CZ
dc.type	diplomová práce	cs_CZ
dcterms.created	2022
dcterms.dateAccepted	2022-09-13
dc.description.department	Department of Distributed and Dependable Systems	en_US
dc.description.department	Katedra distribuovaných a spolehlivých systémů	cs_CZ
dc.description.faculty	Matematicko-fyzikální fakulta	cs_CZ
dc.description.faculty	Faculty of Mathematics and Physics	en_US
dc.identifier.repId	242076
dc.title.translated	Accelerating cross-correlation with GPUs	en_US
dc.contributor.referee	Horáček, Jan
thesis.degree.name	Mgr.
thesis.degree.level	navazující magisterské	cs_CZ
thesis.degree.discipline	Software Systems	en_US
thesis.degree.discipline	Softwarové systémy	cs_CZ
thesis.degree.program	Computer Science	en_US
thesis.degree.program	Informatika	cs_CZ
uk.thesis.type	diplomová práce	cs_CZ
uk.taxonomy.organization-cs	Matematicko-fyzikální fakulta::Katedra distribuovaných a spolehlivých systémů	cs_CZ
uk.taxonomy.organization-en	Faculty of Mathematics and Physics::Department of Distributed and Dependable Systems	en_US
uk.faculty-name.cs	Matematicko-fyzikální fakulta	cs_CZ
uk.faculty-name.en	Faculty of Mathematics and Physics	en_US
uk.faculty-abbr.cs	MFF	cs_CZ
uk.degree-discipline.cs	Softwarové systémy	cs_CZ
uk.degree-discipline.en	Software Systems	en_US
uk.degree-program.cs	Informatika	cs_CZ
uk.degree-program.en	Computer Science	en_US
thesis.grade.cs	Výborně	cs_CZ
thesis.grade.en	Excellent	en_US
uk.abstract.cs	Vzájemná korelace je často používaný nástroj v oboru zpracování signálu, který je možné aplikovat pro rozpoznávání obrazu, částicovou fyziku, elektronovou tomografii a pro mnoho dalších oblastí. Pro mnohé z těchto aplikací je výkon vzájemné korelace lim- itujícím faktorem pro celkový výkon systému z důvodů její výpočetní náročnosti. V této práci provedeme analýzu vzájemné korelace vzhledem k možnostem pro její optimalizaci a paralelizaci. Následně implementujeme několik optimalizací algoritmu odvozeného z definice vzájemné korelace, se zaměřením na paralelizaci pomocí grafických karet (GPU). Přestože tento algoritmus poskytuje mnoho možností pro paralelizaci, je pro jejich využití potřeba vyřešit několik problémů, jako je například nízká aritmetická intenzita algoritmu. Problémy se nadále liší podle typu vstupních dat, mezi které patří korelace jednoho páru vstupů, jednoho vstupu s množinou jiných vstupů, případně korelace mnoha vstupů s mnoha jinými vstupy. V závěru práce poté porovnáme námi implementované optimal- izace algoritmu založeného na definici vzájemné korelace s asymptoticky rychlejším a často používaným algoritmem založeným na Rychlé Fourierově transformaci (FFT). V závislosti na celkové velikosti vstupních dat dosahuje naše implementace stejné rychlosti jako algoritmus založený na FFT při...	cs_CZ
uk.abstract.en	Cross-correlation is a commonly used tool in the field of signal processing, with ap- plications in pattern recognition, particle physics, electron tomography, and many other areas. For many of these applications, it is often the limiting factor on system perfor- mance due to its computational complexity. In this thesis, we analyze the cross-correlation algorithm and its optimization and parallelization possibilities. We then implement sev- eral optimizations of the definition-based algorithm, mainly focused on parallelization using the Graphical processing unit (GPU). Even though the definition-based algorithm provides many possibilities for parallelization, the implementation needs to solve several problems, such as the algorithm's low arithmetic intensity. Furthermore, the problems differ between computation types, which include cross-correlating a pair of inputs, one in- put with many other inputs, or many inputs with many other inputs. Lastly, we compare the optimizations of the definition-based algorithm with the asymptotically faster and commonly used algorithm based on the Fast Fourier Transform. Depending on the total size of the data, we achieve parity between the two algorithms for matrix sizes ranging from 60x60 to 150x150, allowing performance improvements for systems using matrices smaller...	en_US
uk.file-availability	V
uk.grantor	Univerzita Karlova, Matematicko-fyzikální fakulta, Katedra distribuovaných a spolehlivých systémů	cs_CZ
thesis.grade.code	1
uk.publication-place	Praha	cs_CZ
uk.thesis.defenceStatus	O