QUANTITATIVE TRAIT LOCI MAPPING AND STATISTICAL MODELING OF COMPLEX TRAITS USING GENOMIC DATA

Authors

  • Dr. Narayana Varalakshmi Akula Hospitalist, Independent Researcher, India Author
  • Anusha A. T. M. K Assistant Professor, Meenakshi College of Allied Health Sciences, Meenakshi Academy of Higher Education and Research Author
  • Thilagavathi T Assistant Professor, Nutrition and Dietetics, Meenakshi College of Arts and Science, Meenakshi Academy of Higher Education and Research Author
  • Dr. Nesamani Daniel Ponraj Assistant Professor, Radiodiagnosis, Sree Balaji Medical College and Hospital, Bharath Institute of Higher Education and Research Author
  • Dr. Saikumari V Professor and Head, Department of Management Studies, Easwari Engineering College, Ramapuram, Chennai – 600089, Tamil Nadu, India Author

DOI:

https://doi.org/10.4238/s9p2zy18

Keywords:

Quantitative Trait Loci (QTL), Genome-Wide Association Study (GWAS), Complex Trait Prediction, Mixed Linear Model, SNP Genotyping, Machine Learning in Genomics, Genomic Prediction, XGBoost

Abstract

Complex traits are affected by several genetic loci, and also complex interactions of the genotype and environment, and thus their accurate prediction is a major challenge of genomic studies research. This paper hypothesises a combined methodology that consists of quantitative trait loci (QTL) mapping and statistical modelling to describe and forecast complex traits with high-dimensional genomic data. After quality control, probable minor allele frequency filtering and missing data imputation were applied, a high-density single nucleotide polymorphism (SNP) dataset containing 3, 200 samples and 52, 400 filtered markers were used. A mixed linear model (MLM) was utilised to estimate the effects of a population structure and kinship in genome-wide QTL mapping to effectively identify important loci. The identified QTLs were then introduced as input features in statistical and machine learning analyses including linear regression, random forest, and extreme gradient boosting (XGBoost) in predicting phenotypic characteristics. Cross-validation indices like RMSE and coefficient of determination (R 2 ) were used to assess the model performance. The highest predictive accuracy (R 2 = 0.87, RMSE = 2.05) was obtained with the use of the XGBoost model which was better than the classification with the help of traditional linear techniques. The findings reveal that the combination of QTL mapping and machine learning can substantially increase the accuracy of predictions and predetermine the discovery of biologically important genomic areas. This model offers an efficient and scalable method of new breeding that utilises genomics in the study of complex traits.

Downloads

Published

2026-03-20

Issue

Section

Articles

How to Cite

QUANTITATIVE TRAIT LOCI MAPPING AND STATISTICAL MODELING OF COMPLEX TRAITS USING GENOMIC DATA. (2026). Genetics and Molecular Research. https://doi.org/10.4238/s9p2zy18

Most read articles by the same author(s)

Similar Articles

91-100 of 208

You may also start an advanced similarity search for this article.