Sampling Issues Affecting Accuracy of Likelihood-based Classification Using Genetical Data

We demonstrate the effectiveness of a genetic algorithm for discovering multi-locus combinations that provide accurate individual assignment decisions and estimates of mixture composition based on likelihood classification. Using simulated data representing different levels of inter-population diffe...

Full description

Saved in:
Bibliographic Details
Published in:Environmental biology of fishes Vol. 69; no. 1-4; pp. 245 - 259
Main Authors: Guinand, B, Scribner, K T, Topchy, A, Page, K S, Punch, W, Burnham-Curtis, M K
Format: Journal Article
Language:English
Published: Dordrecht Springer Nature B.V 01-03-2004
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:We demonstrate the effectiveness of a genetic algorithm for discovering multi-locus combinations that provide accurate individual assignment decisions and estimates of mixture composition based on likelihood classification. Using simulated data representing different levels of inter-population differentiation (F sub(st) similar to 0.01 and 0.10), genetic diversities (four or eight alleles per locus), and population sizes (20, 40, 100 individuals in baseline populations), we show that subsets of loci can be identified that provide comparable levels of accuracy in classification decisions relative to entire multi-locus data sets, where 5, 10, or 20 loci were considered. Microsatellite data sets from hatchery strains of lake trout, Salvelinus namaycush, representing a comparable range of inter-population levels of differentiation in allele frequencies confirmed simulation results. For both simulated and empirical data sets, assignment accuracy was achieved using fewer loci (e.g., three or four loci out of eight for empirical lake trout studies). Simulation results were used to investigate properties of the 'leave-one-out' (L1O) method for estimating assignment error rates. Accuracy of population assignments based on L1O methods should be viewed with caution under certain conditions, particularly when baseline population sample sizes are low (<50).
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
ObjectType-Conference-3
SourceType-Conference Papers & Proceedings-2
ISSN:0378-1909
1573-5133
DOI:10.1023/B:EBFI.0000022869.72448.cd