Estimating species richness from virome data accounting for variations within the virus population

dc.contributor.authorHerath, H.M.D.K.
dc.contributor.authorTang, S.L.
dc.date.accessioned2025-11-17T05:26:43Z
dc.date.available2025-11-17T05:26:43Z
dc.date.issued2023-09-20
dc.description.abstractSpecies richness is a key species diversity measure. It corresponds to the number of species in an environmental sample. Estimating species richness of a metagenome of viruses (i.e., a virome) based on the reference data is challenging because of the limited amount of sequence data of viruses available in reference databases. A limitation identified with the methods that do not rely on reference sequence data in estimating species richness while being based on the contig spectrum is the assumption of equal genome length for all the species in the sample. This work aims to formulate a mathematical model to estimate species richness from a virome considering the variability of the genome lengths of species in the sample in contrast to the mentioned methods. A model is derived for the expected contig spectrum and the parameters of the model including the species richness is estimated through optimization for the least error between expected and observed contig spectra. Genetic Algorithm is used as the optimization algorithm in parameter estimation. The optimisation procedure incorporated in the proposed approach is shown to be robust based on the results with simulated data. This work enables inference of genome lengths distribution from the metagenomic sequence data in addition to estimating the species richness and can be applied to virome originating from any environmental sample.
dc.description.sponsorshipFinancial assistance given by the University of Peraden iya -University Research Grant (Grant No. URG/2021/15/E) is acknowledged.
dc.identifier.citationProceedings of the Peradeniya University International Research Sessions (iPURSE) – 2023, University of Peradeniya, P 105
dc.identifier.issn1391-4111
dc.identifier.urihttps://ir.lib.pdn.ac.lk/handle/20.500.14444/6699
dc.language.isoen_US
dc.publisherUniversity of Peradeniya, Sri Lanka
dc.subjectMetagenomics
dc.subjectPhages
dc.subjectSpecies Richness
dc.subjectOptimisation
dc.titleEstimating species richness from virome data accounting for variations within the virus population
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Herath, H.M.D.K..pdf
Size:
7.18 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description:

Collections