- Browse by Subject
Browsing by Subject "Population genetics"
Now showing 1 - 3 of 3
Results Per Page
Sort Options
Item Association of Host and Microbial Species Diversity across Spatial Scales in Desert Rodent Communities(PLOS (Public Library of Science), 2014-10-24) Gavish, Yoni; Kedem, Hadar; Messika, Irit; Cohen, Carmit; Toh, Evelyn; Munro, Daniel; Dong, Qunfeng; Fuqua, Clay; Clay, Keith; Hawlena, Hadas; Department of Microbiology & Immunology, School of MedicineRelationships between host and microbial diversity have important ecological and applied implications. Theory predicts that these relationships will depend on the spatio-temporal scale of the analysis and the niche breadth of the organisms in question, but representative data on host-microbial community assemblage in nature is lacking. We employed a natural gradient of rodent species richness and quantified bacterial communities in rodent blood at several hierarchical spatial scales to test the hypothesis that associations between host and microbial species diversity will be positive in communities dominated by organisms with broad niches sampled at large scales. Following pyrosequencing of rodent blood samples, bacterial communities were found to be comprised primarily of broad niche lineages. These communities exhibited positive correlations between host diversity, microbial diversity and the likelihood for rare pathogens at the regional scale but not at finer scales. These findings demonstrate how microbial diversity is affected by host diversity at different spatial scales and suggest that the relationships between host diversity and overall disease risk are not always negative, as the dilution hypothesis predicts.Item Computational modeling for identification of low-frequency single nucleotide variants(2015-11-16) Hao, Yangyang; Liu, Yunlong; Edenberg, Howard J.; Li, Lang; Nakshatr, HarikrishnaReliable detection of low-frequency single nucleotide variants (SNVs) carries great significance in many applications. In cancer genetics, the frequencies of somatic variants from tumor biopsies tend to be low due to contamination with normal tissue and tumor heterogeneity. Circulating tumor DNA monitoring also faces the challenge of detecting low-frequency variants due to the small percentage of tumor DNA in blood. Moreover, in population genetics, although pooled sequencing is cost-effective compared with individual sequencing, pooling dilutes the signals of variants from any individual. Detection of low frequency variants is difficult and can be cofounded by multiple sources of errors, especially next-generation sequencing artifacts. Existing methods are limited in sensitivity and mainly focus on frequencies around 5%; most fail to consider differential, context-specific sequencing artifacts. To face this challenge, we developed a computational and experimental framework, RareVar, to reliably identify low-frequency SNVs from high-throughput sequencing data. For optimized performance, RareVar utilized a supervised learning framework to model artifacts originated from different components of a specific sequencing pipeline. This is enabled by a customized, comprehensive benchmark data enriched with known low-frequency SNVs from the sequencing pipeline of interest. Genomic-context-specific sequencing error model was trained on the benchmark data to characterize the systematic sequencing artifacts, to derive the position-specific detection limit for sensitive low-frequency SNV detection. Further, a machine-learning algorithm utilized sequencing quality features to refine SNV candidates for higher specificity. RareVar outperformed existing approaches, especially at 0.5% to 5% frequency. We further explored the influence of statistical modeling on position specific error modeling and showed zero-inflated negative binomial as the best-performed statistical distribution. When replicating analyses on an Illumina MiSeq benchmark dataset, our method seamlessly adapted to technologies with different biochemistries. RareVar enables sensitive detection of low-frequency SNVs across different sequencing platforms and will facilitate research and clinical applications such as pooled sequencing, cancer early detection, prognostic assessment, metastatic monitoring, and relapses or acquired resistance identification.Item Forensic applications of associating human scalp hair morphology and pigmentation analysis at the microscopic and molecular level(2017-08) Stubbs, Wesli Kay; Walsh, Susan; Picard, Christine; Berbari, NicholasCriminal investigation and the science behind evidence analysis is an ever- growing niche, and forensic DNA phenotyping (FDP) is no exception. For years the only information given to authorities regarding DNA found at a crime scene was STR analysis and matching to a comparative DNA sample from a known source. However, what happens when there is no suspect to compare DNA profiles, or the case involves a missing person where the only available piece of evidence is a biological sample found at the scene? Before FDP, not much could be done with the DNA sample and the investigation would be stalled. Now it is becoming possible to statistically predict an individual’s visual characteristics using FDP. Currently, with the use of Irisplex, HIrisplex, and HIrisplex-S, statistical analyses and predictions can be done for categorical eye, hair, and skin color by looking at specific genes and their associative SNPs, such as HERC2 and OCA2. The more that is understood about trait-determining genes and their functional significance with regards to our physical traits, the more phenotypes can be added to these prediction tools. In an effort to discover additional genes associated with human phenotypes, this study looked at thirty-two pigmentation-associated candidate genes, and eleven hair structure and morphology associated genes in owl monkey pelage samples. Although the samples were not of human origin, it is important to point out the high conservation between humans and their non-human primate relatives. The owl monkeys used in this study were helpful for tracking expression levels of genes controlling differentpigmentation and hair structure types, because each monkey had intra-individual variation in thickness and in coat color which allowed the generation of potential candidate genes for human investigation. Of the 43 total candidate genes analyzed, 36 had successful amplification, and 28 showed a significant difference in expression when comparing the different samples. The second part of this study was to compare quantitative characteristics of human hair in physical samples and two-dimensional (2D) photos. A test set of 45 individuals had 3-5 hairs from the vertex of their head plucked and analyzed, and a 2D photograph was taken of their scalp hair. The idea was to be able to make quantitative phenotypes in hair (such as hair width, amount of curl) from 2D imagery, when physical samples are not available for analysis. This is due to the fact that the majority of genotype-phenotype databases consist solely of photographic imagery, and seldom have hairs that can be microscopically prepared for analysis. Defining measurable phenotypes from 2D photos that strongly correlate with their physical counterparts allow for the generation of a more accurate phenotype for future genome wide association studies (GWAS) within and outside this laboratory that study hair thickness and hair curl. Three different quantitative phenotypes were compared between the microscopic and 2D photo- analysis.