Using convolutional neural networks to detect protein-ligand binding sites

Soukup, Samuel

Využití konvolučních neuronových sítí pro detekci protein-ligand vazebných míst

bachelor thesis (DEFENDED)

View/Open

Záznam o průběhu obhajoby (347.2Kb)

Permanent link

http://hdl.handle.net/20.500.11956/197420

Identifiers

Study Information System: 268603

Referee

Škoda, Petr

Faculty / Institute

Faculty of Mathematics and Physics

Discipline

Computer Science with specialisation in Artificial Intelligence

Department

Department of Software Engineering

Date of defense

10. 2. 2025

Publisher

Univerzita Karlova, Matematicko-fyzikální fakulta

Language

English

Grade

Excellent

Keywords (Czech)

protein|bioinformatika|strojové učení|cnn

Keywords (English)

protein|bioinformatics|machine learning|cnn

Použití konvolučních neuronových sítí (CNN) pro vysokorozměrná data, jako jsou obrázky, ukázalo některé slibné výsledky, ale má problémy s featurami bez prostorové korelace. V této práci jsem otestoval zjednodušenou verzi modelu REFINED (REp- resentation of Features as Images with Neighborhood Dependencies) a porovnával její výkon na datasetu detekce vazebného místa ligandu. Dále jsem se pokusil najít korelaci mezi prediktivní silou CNN a použitou metodou pro transformaci vstupních vektorů do obrázků. Nenašel jsem žádný výsledek, podporující hypotézu, že by REFINED mělo lepší prediktivní sílu než nejmo-dernější přístupy nebo jenom CNN využívající matice s náhodně přiřazenými pozicemi prvků. 1

Abstract (English)

Using convolutional neural networks (CNN) for high dimensional data such as images has shown some promising results, but it has issues with features without spatial correla- tion. I test a streamlined version of the REFINED (REpresentation of Features as Images with NEighborhood Dependencies) model and compare its performance on a ligand bind- ing site detection dataset. I explore the correlation between the CNN performance and the technique used to transform input vectors into images. I find no significant evidence supporting that REFINED has better predictive power than state-of-the-art approaches or CNNs using matrices with randomly assigned feature positions. 1

Citace dokumentu

Metadata

Show full item record