An application of kernel methods to variety identification based on SSR markers genetic fingerprinting

Martin, F.

BMC Bioinformatics

Published

May 20, 2011

DOI

10.1186/1471-2105-12-177

PMID

21595989


Summary

Background: In crop production systems, genetic markers are increasingly used to distinguish individuals within a larger population based on their genetic make-up. Supervised approaches cannot be applied directly to genotyping data because such data are neither continuous, nor nominal, nor ordinal, but only partially ordered. A strategy is therefore needed to encode the polymorphism between samples so that known supervised approaches can be applied. Moreover, finding a minimal set of molecular markers with optimal ability to discriminate, for example, between given groups of varieties is important, as the genotyping process can be costly in terms of laboratory consumables, labor, and time. This feature selection problem also needs special care because of the specific nature of the data.

Results: An approach encoding SSR polymorphisms in a positive definite kernel is presented, which then allows the use of any kernel-based supervised method. The polymorphism between samples is encoded through the Nei-Li genetic distance, which is shown to define a positive definite kernel between the genotyped samples. Additionally, a greedy feature selection algorithm for selecting SSR marker kits is presented to build economical and efficient prediction models for discrimination. The algorithm is a filter method and outperforms other filter methods adapted to this setting. When combined with kernel linear discriminant analysis, or with kernel principal component analysis followed by linear discriminant analysis, the approach leads to very satisfactory prediction models.

Conclusions: The main advantage of the approach is its flexible way of encoding polymorphisms in a kernel. Combined with a feature selection algorithm that yields a few specific markers, it leads to accurate and economical identification models based on SSR genotyping.
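As an illustration of the encoding described in the Results, the following is a minimal sketch of a Nei-Li similarity matrix, S(A, B) = 2|A ∩ B| / (|A| + |B|), computed between samples. It assumes each sample is represented as a binary presence/absence vector over the observed SSR alleles. That encoding, the function name `nei_li_similarity`, and the toy data are assumptions made for illustration and are not taken from the paper, whose exact kernel construction is not given in this summary.

```python
import numpy as np


def nei_li_similarity(X):
    """Nei-Li (Dice) similarity between all pairs of samples.

    X is an (n_samples, n_alleles) binary matrix: X[i, j] = 1 if allele j
    was observed in sample i (assumed encoding, for illustration only).
    """
    X = np.asarray(X, dtype=float)
    shared = X @ X.T                      # number of alleles shared by each pair
    counts = X.sum(axis=1)                # number of alleles present per sample
    denom = counts[:, None] + counts[None, :]
    # 2 * shared / (|A| + |B|); pairs with no alleles at all are set to 0
    return np.divide(2.0 * shared, denom,
                     out=np.zeros_like(shared), where=denom > 0)


# Toy data: three hypothetical samples scored on five alleles
X = np.array([
    [1, 1, 0, 0, 1],
    [1, 0, 0, 1, 1],
    [0, 1, 1, 0, 0],
])
K = nei_li_similarity(X)

# Smallest eigenvalue of the similarity matrix; a non-negative value is
# consistent with K being usable as a kernel on this toy example
min_eig = np.linalg.eigvalsh(K).min()
print(K)
print(min_eig)
```

A matrix such as `K` could then be passed to any kernel method that accepts a precomputed kernel. The eigenvalue check only inspects this toy example and does not prove positive definiteness in general.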
