-
Genetic analyses of smoking initiation, persistence, quantity, and age-at-onset of regular cigarette use in Brazilian families: the Baependi Heart Study
Background:
The purpose of this study was to estimate the genetic influences on the initiation of cigarette smoking, the persistence, quantity and age-at-onset of regular cigarette use in Brazilian families.
Methods:
The data set consisted of 1,694 individuals enrolled in the Baependi Heart Study. The heritability and the heterogeneity in genetic and environmental variance components by gender were estimated from variance components approaches, using the SOLAR (Sequential Oligogenic Linkage Analysis Routines) computer package. The mixed-effects Cox model was used for the genetic analysis of the age-at onset of regular cigarette use.
Results:
The heritability estimates were high (> 50%) for smoking initiation and were intermediate, ranging from 23.4 to 31.9%, for smoking persistence and quantity. Significant evidence for heterogeneity in variance components by gender was observed for smoking initiation and age-at-onset of regular cigarette use. Genetic factors play an important role in the interindividual variation of these phenotypes in females, while in males there is a predominant environmental component, which could be explained by greater social influences in the initiation of tobacco use.
Conclusions:
Significant heritabilities were observed in smoking phenotypes for both males and females from the Brazilian population. These data add to the literature and are concordant with the notion of significant biological determination in smoking behavior. Samples from the Baependi Heart Study may be valuable for the mapping of genetic loci that modulate this complex biological trait.
-
Multi-locus stepwise regression: a haplotype-based algorithm for finding genetic associations applied to atopic dermatitis
Background:
Genome-wide association studies (GWAS) provide an increasing number of single nucleotide polymorphisms (SNPs) associated with diseases. Our aim is to exploit those closely spaced SNPs in candidate regions for a deeper analysis of association beyond single SNP analysis, combining the classical stepwise regression approach with haplotype analysis to identify risk haplotypes for complex diseases.
Methods:
Our proposed multi-locus stepwise regression starts with an evaluation of all pair-wise SNP combinations and then extends each SNP combination stepwise by one SNP from the region, carrying out haplotype regression in each step. The best associated haplotype patterns are kept for the next step and must be corrected for multiple testing at the end. These haplotypes should also be replicated in an independent data set. We applied the method to a region of 259 SNPs from the epidermal differentiation complex (EDC) on chromosome 1q21 of a German GWAS using a case control set (1,914 individuals) and to 268 families with at least two affected children as replication.
Results:
A 4-SNP haplotype pattern with high statistical significance in the case control set (p=4.13x10-7 after Bonferroni correction) could be identified which remained significant in the family set after Bonferroni correction (p=0.0398). Further analysis revealed that this pattern reflects mainly the effect of the well-known FLG gene; however, a FLG-independent haplotype in case control set (OR=1.71, 95% CI: 1.32 2.23, p=5.6x10-5) and family set (OR=1.68, 95% CI: 1.18-2.38, p=2.19x10-3) could be found in addition.
Conclusion:
Our approach is a useful tool for finding allele combinations associated with diseases beyond single SNP analysis in chromosomal candidate regions.
-
Data mining of high density genomic variant data for prediction of Alzheimer's disease risk.
Background:
The discovery of genetic associations is an important factor in the understanding of human illness to derive disease pathways. Identifying multiple interacting genetic mutations associated with disease remains challenging in studying the etiology of complex diseases. And although recently new single nucleotide polymorphisms (SNPs) at genes implicated in immune response, cholesterol/lipid metabolism, and cell membrane processes have been confirmed by genome-wide association studies (GWAS) to be associated with late-onset Alzheimer's disease (LOAD), a percentage of AD heritability continues to be unexplained. We try to find other genetic variants that may influence LOAD risk utilizing data mining methods.
Methods:
Two different approaches were devised to select SNPs associated with LOAD in a publicly available GWAS data set consisting of three cohorts. In both approaches, single-locus analysis (logistic regression) was conducted to filter the data with a less conservative p-value than the Bonferroni threshold; this resulted in a subset of SNPs used next in multi-locus analysis (random forest (RF)). In the second approach, we took into account prior biological knowledge, and performed sample stratification and linkage disequilibrium (LD) in addition to logistic regression analysis to preselect loci to input into the RF classifier construction step.
Results:
The first approach gave 199 SNPs mostly associated with genes in calcium signaling, cell adhesion, endocytosis, immune response, and synaptic function. These SNPs together with APOE and GAB2 SNPs formed a predictive subset for LOAD status with an average error of 9.8% using 10-fold cross validation (CV) in RF modeling. Nineteen variants in LD with ST5, TRPC1, ATG10, ANO3, NDUFA12, and NISCH respectively, genes linked directly or indirectly with neurobiology, were identified with the second approach. These variants were part of a model that included APOE and GAB2 SNPs to predict LOAD risk which produced a 10-fold CV average error of 17.5% in the classification modeling.
Conclusions:
With the two proposed approaches, we identified a large subset of SNPs in genes mostly clustered around specific pathways/functions and a smaller set of SNPs, within or in proximity to five genes not previously reported, that may be relevant for the prediction/understanding of AD.
-
The C allele of the JAK2 rs4495487 is an additional candidate locus that contributes to myeloproliferative neoplasm predisposition in the Japanese population
Background:
Polycythemia vera (PV), essential thrombocythemia (ET), and primary myelofibrosis (PMF) are myeloproliferative neoplasms (MPNs) characterized in most cases by a unique somatic mutation, JAK2 V617F. Recent studies revealed that JAK2 V617F occurs more frequently in a specific JAK2 haplotype, named JAK2 46/1 or GGCC haplotype, which is tagged by rs10974944 (C/G) and/or rs12343867 (T/C). This study examined the impact of single nucleotide polymorphisms (SNPs) of the JAK2 locus on MPNs in a Japanese population.
Methods:
We sequenced 24 JAK2 SNPs in Japanese patients with PV. We then genotyped 138 MPN patients (33 PV, 96 ET, and 9 PMF) with known JAK2 mutational status and 107 controls for a novel SNP, in addition to two SNPs known to be part of the 46/1 haplotype (rs10974944 and rs12343867). Associations with risk of MPN were estimated by odds ratios and their 95% confidence intervals using logistic regression.
Results:
A novel locus, rs4495487 (T/C), with a mutated T allele was significantly associated with PV. Similar to rs10974944 and rs12343867, rs4495487 in the JAK2 locus is significantly associated with JAK2-positive MPN. Based on the results of SNP analysis of the three JAK2 locus, we defined the "GCC genotype" as having at least one minor allele in each SNP (G allele in rs10974944, C allele in rs4495487, and C allele in rs12343867). The GCC genotype was associated with increased risk of both JAK2 V617F-positive and JAK2 V617F-negative MPN. In ET patients, leukocyte count and hemoglobin were significantly associated with JAK2 V617F, rather than the GCC genotype. In contrast, none of the JAK2 V617F-negative ET patients without the GCC genotype had thrombosis, and splenomegaly was frequently seen in this subset of ET patients. PV patients without the GCC genotype were significantly associated with high platelet count.
Conclusions:
Our results indicate that the C allele of JAK2 rs4495487, in addition to the 46/1 haplotype, contributes significantly to the occurrence of JAK2 V617F-positive and JAK2 V617F-negative MPNs in the Japanese population. Because lack of the GCC genotype represents a distinct clinical-hematological subset of MPN, analyzing JAK2 SNPs and quantifying JAK2 V617F mutations will provide further insights into the molecular pathogenesis of MPN.
-
A genome wide association study of pulmonary tuberculosis susceptibility in Indonesians
Background:
There is reason to expect strong genetic influences on the risk of developing active pulmonary tuberculosis (TB) among latently infected individuals. Many of the genome wide linkage and association studies (GWAS) to date have been conducted on African populations. In order to identify additional targets in genetically dissimilar populations, and to enhance our understanding of this disease, we performed a multi-stage GWAS in a Southeast Asian cohort from Indonesia.
Methods:
In stage 1, we used the Affymetrix 100K SNP GeneChip marker set to genotype 259 Indonesian samples. After quality control filtering, 108 cases and 115 controls were analyzed for association of 95,207 SNPs. In stage 2, we attempted validation of 2,453 SNPs with promising associations from the first stage, in 1,189 individuals from the same Indonesian cohort, and finally in stage 3 we selected 251 SNPs from this stage to test TB association in an independent Caucasian cohort (n= 3,760) from Russia.
Results:
Our study suggests evidence of association (P = 0.0004-0.0067) for 8 independent loci (nominal significance P < 0.05), which are located within or near the following genes involved in immune signaling: JAG1, DYNLRB2, EBF1, TMEFF2, CCL17, HAUS6, PENK and TXNDC4.
Conclusions:
Mechanisms of immune defense suggested by some of the identified genes exhibit biological plausibility and may suggest novel pathways involved in the host containment of infection with TB.
-
Isolated brachydactyly type E can be caused by a HOXD13 nonsense mutation: a case report
Background:
Brachydactyly type E (BDE; MIM#113300) is characterized by shortening of the metacarpal, metatarsal, and often phalangeal bones, and predominantly affects postaxial ray(s) of the limb. BDE may occur as an isolated trait or as part of a syndrome. Isolated BDE is rare and in the majority of cases the molecular pathogenesis has so far not been resolved. Originally, the molecular cause of isolated BDE has been unravelled in 2 families and shown to result from heterozygous missense mutations in the homeodomain of the HOXD13 gene. Since the initial manuscript, one further HOXD13 mutation has been reported only in a single family manifesting isolated BDE.Case PresentationIn this paper, we report on a Polish family exhibiting isolated BDE caused by a novel nonsense heterozygous HOXD13 mutation. We investigated a Polish female proband and her father, both affected by isolated BDE, in whom we identified a nonsense heterozygous mutation c.820C>T(p.R274X) in the HOXD13 gene. So far, only two missense HOXD13 substitutions (p.S308C and p.I314L), localized within the homeodomain of the HOXD13 transcription factor, as well as a single nonsense mutation (p.E181X) were associated with BDE. Both missense changes were supposed to alter DNA binding affinity of the protein.
Conclusion:
The variant p.R274X identified in our proband is the fourth HOXD13 mutation, and the second truncating (nonsense) mutation, reported to result in typical isolated BDE. We refer our clinical and molecular findings to the previously described HOXD13 associated phenotypes and mutations.
-
Atypical case of Wolfram syndrome revealed through targeted exome sequencing in a patient with suspected mitochondrial disease
Background:
Mitochondrial diseases comprise a diverse set of clinical disorders that affect multiple organ systems with varying severity and age of onset. Due to their clinical and genetic heterogeneity, these diseases are difficult to diagnose. We have developed a targeted exome sequencing approach to improve our ability to properly diagnose mitochondrial diseases and apply it here to an individual patient. Our method targets mitochondrial DNA (mtDNA) and the exons of 1,600 nuclear genes involved in mitochondrial biology or Mendelian disorders with multi-system phenotypes, thereby allowing for simultaneous evaluation of multiple disease loci.Case Presentation: Targeted exome sequencing was performed on a patient initially suspected to have a mitochondrial disorder. The patient presented with diabetes mellitus, diffuse brain atrophy, autonomic neuropathy, optic nerve atrophy, and a severe amnestic syndrome. Further work-up revealed multiple heteroplasmic mtDNA deletions as well as profound thiamine deficiency without a clear nutritional cause. Targeted exome sequencing revealed a homozygous c.1672C>T (p.R558C) missense mutation in exon 8 of WFS1 that has previously been reported in a patient with Wolfram syndrome.
Conclusion:
This case demonstrates how clinical application of next-generation sequencing technology can enhance the diagnosis of patients suspected to have rare genetic disorders. Furthermore, the finding of unexplained thiamine deficiency in a patient with Wolfram syndrome suggests a potential link between WFS1 biology and thiamine metabolism that has implications for the clinical management of Wolfram syndrome patients.
-
Possible role of EMID2 on nasal polyps pathogenesis in Korean asthma patients
Background:
Since subepithelial fibrosis and protruded extracellular matrix are among the histological characteristics of polyps, the emilin/multimerin domain-containing protein 2 (EMID2) gene is speculated to be involved in the presence of nasal polyps in asthma and aspirin-hypersensitive patients.
Methods:
To investigate the association between EMID2 and nasal polyposis, 49 single-nucleotide polymorphisms (SNPs) were genotyped in 467 asthmatics of Korean ancestry who were stratified further into 114 aspirin exacerbated respiratory disease (AERD) and 353 aspirin-tolerant asthma (ATA) subgroups. From pairwise comparison of the genotyped polymorphisms, 14 major haplotypes (frequency > 0.05) were inferred and selected for association analysis. Differences in the frequency distribution of EMID2 variations between polyp-positive cases and polyp-negative controls were determined using logistic analyses.
Results:
Initially, 13 EMID2 variants were significantly associated with the presence of nasal polyps in the overall asthma group (P = 0.0008-0.05, OR = 0.54-1.32 using various modes of genetic inheritance). Although association signals from 12 variants disappeared after multiple testing corrections, the relationship between EMID2_BL1_ht2 and nasal polyposis remained significant via a codominant mechanism (Pcorr = 0.03). On the other hand, the nominal associations observed between the genetic variants tested for the presence of nasal polyps in AERD (P = 0.003-0.05, OR = 0.25-1.82) and ATA (P = 0.01-0.04, OR = 0.46-10.96) subgroups disappeared after multiple comparisons, suggesting lack of associations.
Conclusions:
These preliminary findings suggest that EMID2_BL1_ht2 may be a susceptibility marker of inflammation of the nasal passages among Korean asthma patients.
-
Association of polymorphisms in survivin gene with the risk of hepatocellular carcinoma in Chinese han population: a case control study
Background:
Survivin, one of the strongest apoptosis inhibitors, plays a critical role in the development and progression of hepatocellular carcinoma (HCC). By comparison, relatively little is known about the effect of survivin gene polymorphisms on HCC susceptibility. Our study aimed to investigate the association of survivin gene polymorphisms with the risk of HCC in Chinese han population.
Methods:
A case-control study was conducted in Chinese han population consisting of 178 HCC cases and 196 cancer-free controls. Information on demographic data and related risk factors was collected for all subjects. Polymorphisms of the survivin gene, including three loci of rs8073069, rs9904341 and rs1042489, were selected and genotyped by a polymerase chain reaction- restriction fragment length polymorphism (PCR-RFLP) technique. Association analysis of genotypes/alleles and haplotypes from these loci with the risk of HCC was conducted under different genetic models.
Results:
Using univariate analysis of rs8073069, rs9904341 and rs1042489 under different genetic models, no statistically significant difference was found in genotype or allele distribution of HCC cases relative to the controls (P > 0.05). Linkage disequilibrium (LD) analysis showed that these loci were in LD. Multivariate logistic regression indicated that with no G-C-T haplotype as reference, the haplotype of G-C-T from these loci was associated with a lower risk for HCC under the recessive model (OR = 0.46, 95% confidence interval (CI): 0.24~0.90, P = 0.023). Both HBsAg+ and the medical history of viral hepatitis type B were risk factors for HCC. However, no statistically significant haplotype-environment interaction existed.
Conclusions:
No association between rs8073069, rs9904341 or rs1042489 in survivin gene and the risk of HCC is found in Chinese han population, but rs8073069G-rs9904341C- rs1042489T is perhaps a protective haplotype for HCC.
-
Candidate gene analysis of spontaneous preterm delivery: New insights from re-analysis of a case-control study using case-parent triads and control-mother dyads
Background:
Spontaneous preterm delivery (PTD) has a multifactorial etiology with evidence of a genetic contribution to its pathogenesis. A number of candidate gene case-control studies have been performed on spontaneous PTD, but the results have been inconsistent, and do not fully assess the role of how two genotypes can impact outcome. To elucidate this latter point we re-analyzed data from a previously published case-control candidate gene study, using a case-parent triad design and a hybrid design combining case-parent triads and control-mother dyads. These methods offer a robust approach to genetic association studies for PTD compared to traditional case-control designs.
Methods:
The study participants were obtained from the Norwegian Mother and Child Cohort Study (MoBa). A total of 196 case triads and 211 control dyads were selected for the analysis. A case-parent triad design as well as a hybrid design was used to analyze 1,326 SNPs from 159 candidate genes. We compared our results to those from a previous case-control study on the same samples. Haplotypes were analyzed using a sliding window of three SNPs and a pathway analysis was performed to gain biological insight into the pathophysiology of preterm delivery.
Results:
The most consistent significant fetal gene across all analyses was COL5A2. The functionally similar COL5A1 was significant when combining fetal and maternal genotypes. PON1 was significant with analytical approaches for single locus association of fetal genes alone, but was possibly confounded by maternal effects. Focal adhesion (hsa04510), Cell Communication (hsa01430) and ECM receptor interaction (hsa04512) were the most constant significant pathways.
Conclusion:
This study suggests a fetal association of COL5A2 and a combined fetal-maternal association of COL5A1 with spontaneous PTD. In addition, the pathway analysis implied interactions of genes affecting cell communication and extracellular matrix.
|