SNP distributed representation using entity embedding

doi:10.28919/cmbn/7962

SNP distributed representation using entity embedding

Francisco Calvin Arnel Ferano, Jonathan Christian Setyono, Ardivo Virsa Siswanto, Nicholas Dominic, Bens Pardamean

Abstract

A single Nucleotide Polymorphism (SNP) array is the largest variation of genetic information to detect specific traits in organisms. SNP is located in a specific locus of DNA sequences. To the day this study was conducted, the representation of SNPs for machine learning models is still questionable. Based on the previous works, we proposed a comparative study of distributed representation methods against SNPs data. This study used 1,232 SNPs from the genomic data of 687 Indonesian rice samples collected from four distinct rice fields. The SNP data used was converted into an encoded format. Entity embedding (Embedder) and several comparative models, i.e., Node2Vec, Struc2Vec, and LINE, were chosen to predict the rice yield of the SNP data. The entity embedding using Embedder outperformed the comparative methods used in this study, namely Node2Vec, Struc2Vec, and LINE with the best R2 and MSE scores of 0.9368 and 0.2425 respectively.

Full Text: PDF

Published: 2023-05-22

How to Cite this Article:

Francisco Calvin Arnel Ferano, Jonathan Christian Setyono, Ardivo Virsa Siswanto, Nicholas Dominic, Bens Pardamean, SNP distributed representation using entity embedding, Commun. Math. Biol. Neurosci., 2023 (2023), Article ID 51

Copyright © 2023 Francisco Calvin Arnel Ferano, Jonathan Christian Setyono, Ardivo Virsa Siswanto, Nicholas Dominic, Bens Pardamean. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Commun. Math. Biol. Neurosci.

ISSN 2052-2541

Editorial Office: [email protected]

Username
Password

Communications in Mathematical Biology and Neuroscience

SNP distributed representation using entity embedding

Abstract