Citation analyticsNew
Protocol design
Bioinformatics tools
MAPK triggered chromatin reprogramming by histone deacetylase in plant innate immunity
Microbial-associated molecular patterns activate several MAP kinases, which are major regulators of the innate immune response in Arabidopsis thaliana that induce large-scale changes in gene expression. Here, we determine whether microbial-associated molecular pattern-triggered gene expression involves modifications at the chromatin level. Histone acetylation and deacetylation are major regulators of microbial-associated molecular pattern-triggered gene expression and implicate the histone deacetylase HD2B in the reprogramming of defence gene expression and innate immunity. The MAP kinase MPK3 directly interacts with and phosphorylates HD2B, thereby regulating the intra-nuclear compartmentalization and function of the histone deacetylase. By studying a number of gene loci that undergo microbial-associated molecular pattern-dependent activation or repression, our data reveal a mechanistic model for how protein kinase signaling directly impacts chromatin reprogramming in plant defense. The online version of this article (doi:10.1186/s13059-017-1261-8) contains supplementary material, which is available to authorized users.
Whole Genome Sequence of Coxiella burnetii Nine Mile RSA439 (Phase II, Clone 4), a Laboratory Workhorse Strain
Here, we report the whole-genome sequence of Coxiella burnetii Nine Mile RSA439 (phase II, clone 4), a laboratory strain used extensively to investigate the biology of this intracellular bacterial pathogen. The genome consists of a 1.97-Mb chromosome and a 37.32-kb plasmid.
Draft Genome Sequence of the Clover (Trifolium repens L.) Root Endophyte Paraburkholderia sp. Strain A27
Paraburkholderia sp. strain A27, isolated from the root material of white clover, has plant growth-promoting activity on a range of agriculturally important plants. The draft genome of this bacterium is 7,393,089 bp and harbors a range of genes putatively involved in host colonization.
Draft Genome Sequence of a Kale (Brassica oleracea L.) Root Endophyte, Pseudomonas sp. Strain C9
Pseudomonas sp. strain C9 is a plant growth–promoting bacterium isolated from the root tissue of Brassica oleracea L. grown in soil from Marlborough, New Zealand. Its draft genome of 6,350,161 bp contains genes associated with plant growth promotion and biological control.
Genome Sequence of the Potato Plant Pathogen Dickeya dianthicola Strain RNS04.9
Dickeya dianthicola is one of the causative agents of soft rot and blackleg diseases, which are currently identified in European countries in a wide range of crops. Here, we report the draft genome sequence of D. dianthicola strain RNS04.9, which was isolated from a potato plant with blackleg symptoms in 2004.
Draft Genome Sequences of Pseudomonas fluorescens Strains PA4C2 and PA3G8 and Pseudomonas putida PA14H7, Three Biocontrol Bacteria against Dickeya Phytopathogens
Pseudomonas fluorescens strains PA4C2 and PA3G8 and Pseudomonas putida strain PA14H7 were isolated from potato rhizosphere and show an ability to inhibit the growth of Dickeya phytopathogens. Here, we report their draft genome sequences, which provide a basis for understanding the molecular mechanisms involved in antibiosis against Dickeya.
Draft Genome Sequence of Bacillus thuringiensis var. thuringiensis Strain T01 328, a Brazilian Isolate That Produces a Soluble Pesticide Protein, Cry1Ia
Bacillus thuringiensis var. thuringiensis strain T01-328, isolated from Cubatão county (São Paulo State, Brazil), produces a soluble pesticide protein, Cry1Ia, during vegetative growth. Here, we report the 7.089-Mbp draft genome sequence, composed of a 5.5-Mb chromosome and 14 plasmids, which is the largest B. thuringiensis genome sequenced to date.
Draft Genome Sequence of Sphingobacterium sp. CZ UAM, Isolated from a Methanotrophic Consortium
Sphingobacterium sp. CZ-UAM was isolated from a methanotrophic consortium in mineral medium using methane as the only carbon source. A draft genome of 5.84 Mb with a 40.77% G+C content is reported here. This genome sequence will allow the investigation of potential methanotrophy in this isolated strain.
Complete Genomic Sequence of “Thermofilum adornatus” Strain 1910bT, a Hyperthermophilic Anaerobic Organotrophic Crenarchaeon
The complete genomic sequence of a novel hyperthermophilic crenarchaeon, strain 1910bT, was determined. The genome comprises a 1,750,259-bp circular chromosome containing single copies of 3 rRNA genes, 43 tRNA genes, and 1,896 protein-coding sequences. In silico genome-genome hybridization suggests the proposal of a novel species, “Thermofilum adornatus” strain 1910bT.
Analysis of circulating angiopoietin like protein 3 and genetic variants in lipid metabolism and liver health: the DiOGenes study
Angiopoietin-like protein 3 (ANGPTL3), a liver-derived protein, plays an important role in the lipid and lipoprotein metabolism. Using data available from the DiOGenes study, we assessed the link with clinical improvements (weight, plasma lipid, and insulin levels) and changes in liver markers, alanine aminotransferase, aspartate aminotransferase (AST), adiponectin, fetuin A and B, and cytokeratin 18 (CK-18), upon low-calorie diet (LCD) intervention. We also examined the role of genetic variation in determining the level of circulating ANGPTL3 and the relation between the identified genetic markers and markers of hepatic steatosis. DiOGenes is a multicenter, controlled dietary intervention where obese participants followed an 8-week LCD (800 kcal/day, using a meal replacement product). Plasma ANGPTL3 and liver markers were measured using the SomaLogic (Boulder, CO) platform. Protein quantitative trait locus (pQTL) analyses assessed the link between more than four million common variants and the level of circulating ANGPTL3 at baseline and changes in levels during the LCD intervention. Changes in ANGPTL3 during weight loss showed only marginal association with changes in triglycerides (nominal p = 0.02) and insulin (p = 0.04); these results did not remain significant after correcting for multiple testing. However, significant association (after multiple-testing correction) were observed between changes in ANGPTL3 and AST during weight loss (p = 0.004) and between ANGPTL3 and CK-18 (baseline p = 1.03 × 10−7, during weight loss p = 1.47 × 10−13). Our pQTL study identified two loci significantly associated with changes in ANGPTL3. One of these loci (the APOA4-APOA5-ZNF259-BUD13 gene cluster) also displayed significant association with changes in CK-18 levels during weight loss (p = 0.007). We clarify the link between circulating levels of ANGPTL3 and specific markers of liver function. We demonstrate that changes in ANGPLT3 and CK-18 during LCD are under genetic control from trans-acting variants. Our results suggest an extended function of ANGPTL3 in the inflammatory state of liver steatosis and toward liver metabolic processes. The online version of this article (10.1186/s12263-018-0597-3) contains supplementary material, which is available to authorized users.
Hidden Effects of Seed Quality Breeding on Germination in Oilseed Rape (Brassica napus L.)
Intense selection for specific seed qualities in winter oilseed rape breeding has had an inadvertent negative influence on seed germination performance. In a panel of 215 diverse winter oilseed rape varieties spanning over 50 years of breeding progress in winter-type rapeseed, we found that low seed erucic acid content and reduced seed glucosinolate content were significantly related with prolonged germination time. Genome-wide association mapping revealed that this relationship is caused by linkage drag between important loci for seed quality and germination traits. One QTL for mean germination time on chromosome A09 co-localized with significant but minor QTL for both seed erucic acid and seed glucosinolate content. This suggested either potential pleiotropy or close linkage of minor factors influencing all three traits. Therefore, a reduction in germination performance may be due to inadvertent co-selection of genetic variants associated with 00 seed quality that have a negative influence on germination. Our results suggest that marker-assisted selection of positive alleles for mean germination time within the modern quality pool can help breeders to maintain maximal germination capacity in new 00-quality oilseed rape cultivars.
Meta analysis of gene–environment wide association scans accounting for education level identifies additional loci for refractive error
Myopia is the most common human eye disorder and it results from complex genetic and environmental causes. The rapidly increasing prevalence of myopia poses a major public health challenge. Here, the CREAM consortium performs a joint meta-analysis to test single-nucleotide polymorphism (SNP) main effects and SNP × education interaction effects on refractive error in 40,036 adults from 25 studies of European ancestry and 10,315 adults from 9 studies of Asian ancestry. In European ancestry individuals, we identify six novel loci (FAM150B-ACP1, LINC00340, FBN1, DIS3L-MAP2K1, ARID2-SNAT1 and SLC14A2) associated with refractive error. In Asian populations, three genome-wide significant loci AREG, GABRR1 and PDE10A also exhibit strong interactions with education (P<8.5 × 10−5), whereas the interactions are less evident in Europeans. The discovery of these loci represents an important advance in understanding how gene and environment interactions contribute to the heterogeneity of myopia. This report by the Consortium for Refractive Error and Myopia uses gene-environment-wide interaction study (GEWIS) to identify genetic loci that affect environmental influence in myopia development, and identifies ethnic specific genetic loci that attribute to eye refractive errors.
Comprehensive genome wide evaluation of lapatinib induced liver injury yields a single genetic signal centered on known risk allele HLA DRB1*07:01
Lapatinib is associated with a low incidence of serious liver injury. Previous investigations have identified and confirmed the Class II allele HLA-DRB1*07:01 to be strongly associated with lapatinib-induced liver injury; however, the moderate positive predictive value limits its clinical utility. To assess whether additional genetic variants located within the major histocompatibility complex locus or elsewhere in the genome may influence lapatinib-induced liver injury risk, and potentially lead to a genetic association with improved predictive qualities, we have taken two approaches: a genome-wide association study and a whole-genome sequencing study. This evaluation did not reveal additional associations other than the previously identified association for HLA-DRB1*07:01. The present study represents the most comprehensive genetic evaluation of drug-induced liver injury (DILI) or hypersensitivity, and suggests that investigation of possible human leukocyte antigen associations with DILI and other hypersensitivities represents an important first step in understanding the mechanism of these events.
Draft Genome Sequence of Burkholderia stabilis LA20W, a Trehalose Producer That Uses Levulinic Acid as a Substrate
Burkholderia stabilis LA20W produces trehalose using levulinic acid (LA) as a substrate. Here, we report the 7.97-Mb draft genome sequence of B. stabilis LA20W, which will be useful in investigations of the enzymes involved in LA metabolism and the mechanism of LA-induced trehalose production.
Cytosine methylation at CpCpG sites triggers accumulation of non CpG methylation in gene bodies
Methylation of cytosine is an epigenetic mark involved in the regulation of transcription, usually associated with transcriptional repression. In mammals, methylated cytosines are found predominantly in CpGs but in plants non-CpG methylation (in the CpHpG or CpHpH contexts, where H is A, C or T) is also present and is associated with the transcriptional silencing of transposable elements. In addition, CpG methylation is found in coding regions of active genes. In the absence of the demethylase of lysine 9 of histone 3 (IBM1), a subset of body-methylated genes acquires non-CpG methylation. This was shown to alter their expression and affect plant development. It is not clear why only certain body-methylated genes gain non-CpG methylation in the absence of IBM1 and others do not. Here we describe a link between CpG methylation and the establishment of methylation in the CpHpG context that explains the two classes of body-methylated genes. We provide evidence that external cytosines of CpCpG sites can only be methylated when internal cytosines are methylated. CpCpG sites methylated in both cytosines promote spreading of methylation in the CpHpG context in genes protected by IBM1. In contrast, CpCpG sites remain unmethylated in IBM1-independent genes and do not promote spread of CpHpG methylation.
Common exonic missense variants in the C2 domain of the human KIBRA protein modify lipid binding and cognitive performance
The human KIBRA gene has been linked to human cognition through a lead intronic single-nucleotide polymorphism (SNP; rs17070145) that is associated with episodic memory performance and the risk to develop Alzheimer's disease. However, it remains unknown how this relates to the function of the KIBRA protein. Here, we identified two common missense SNPs (rs3822660G/T [M734I], rs3822659T/G [S735A]) in exon 15 of the human KIBRA gene to affect cognitive performance, and to be in almost complete linkage disequilibrium with rs17070145. The identified SNPs encode variants of the KIBRA C2 domain with distinct Ca2+ dependent binding preferences for monophosphorylated phosphatidylinositols likely due to differences in the dynamics and folding of the lipid-binding pocket. Our results further implicate the KIBRA protein in higher brain function and provide direction to the cellular pathways involved.
Enrichment of schizophrenia heritability in both neuronal and glia cell regulatory elements
Genome-wide association studies have identified over 100 robust risk loci for schizophrenia with thousands of variants mediating genetic heritability, the majority of which reside in non-coding regions. Analytical approaches have shown this heritability is strongly enriched at variants within regulatory elements identified from human post-mortem brain tissue. However, bulk post-mortem brain tissue has a heterogeneous cell composition, making biological interpretations difficult. We sought to refine the cell types mediating schizophrenia heritability by separating neuronal and glial signals using data from: (1) NeuN-sorted post-mortem brain and (2) cell culture systems. Schizophrenia heritability was partitioned using linkage disequilbrium (LD) score regression. Variants within genomic regions marked by H3K4me3 (marker of active promoters) from NeuN-positive (neuronal) and NeuN-negative (non-neuronal) cells explained a significant amount of schizophrenia heritability (P = 1.38 × 10−10 and P = 7.97 × 10−10). However, variants located in H3K4me3 sites specific to NeuN-positive (neuronal) cells were enriched (P = 3.13 × 10−4), while those specific to NeuN-negative (non-neuronal) cells were not (P = 0.470). Data from cell culture systems mimicked this pattern of association. We show the previously observed enrichment of heritability from variants at brain H3K4me3 sites is mediated by both neuronal and non-neuronal brain cell types. However, only neuronal cell populations showed a unique contribution driven by cell-type specific regulatory elements. Cell culture systems recapitulate disease relevant gene-regulatory landscapes, validating them as a tool for future investigation of genetic mechanisms underlying schizophrenia. Identifying the cell types in which risk variants operate will greatly increase our understanding of schizophrenia pathobiology and aid in the development of novel model systems and therapies.
Comparison of genomic signatures of selection on Plasmodium falciparum between different regions of a country with high malaria endemicity
Genome wide sequence analyses of malaria parasites from widely separated areas of the world have identified contrasting population structures and signatures of selection. To compare relatively closely situated but ecologically contrasting regions within an endemic African country, population samples of Plasmodium falciparum clinical isolates were collected in Ghana from Kintampo in the central forest-savannah area, and Navrongo in a drier savannah area ~350 km to the north with more seasonally-restricted transmission. Parasite DNA was sequenced and paired-end reads mapped to the P. falciparum reference genome. High coverage genome wide sequence data for 85 different clinical isolates enabled analysis of 121,712 single nucleotide polymorphisms (SNPs). The local populations had similar proportions of mixed genotype infections, similar SNP allele frequency distributions, and eleven chromosomal regions had elevated integrated haplotype scores (|iHS|) in both. A between-population Rsb metric comparing extended haplotype homozygosity indicated a stronger signal within Kintampo for one of these regions (on chromosome 14) and in Navrongo for two of these regions (on chromosomes 10 and 13). At least one gene in each of these identified regions is a potential target of locally varying selection. The candidates include genes involved in parasite development in mosquitoes, members of variant-expressed multigene families, and a leading vaccine-candidate target of immunity. Against a background of very similar population structure and selection signatures in the P. falciparum populations of Ghana, three narrow genomic regions showed evidence indicating local differences in historical timing or intensity of selection. Sampling of closely situated populations across heterogeneous environments has potential to refine the mapping of important loci under temporally or spatially varying selection. The online version of this article (doi:10.1186/s12864-015-1746-3) contains supplementary material, which is available to authorized users.
Genetic mutations associated with metastatic clear cell renal cell carcinoma
Metastasis is the major cause of death among cancer patients, yet early detection and intervention of metastasis could significantly improve their clinical outcomes. We have sequenced and analyzed RNA (Expression) and DNA (Mutations) from the primary tumor (PT), tumor extension (TE) and lymphatic metastatic (LM) sites of patients with clear cell renal cell carcinoma (CCRCC) before treatment. Here, we report a three-nucleotide deletion near the C-region of Plk5 that is specifically associated with the lymphatic metastasis. This mutation is un-detectable in the PT, becomes detectable in the TE and dominates the LM tissue. So while only a few primary cancer cells carry this mutation, the majority of metastatic cells have this mutation. The increasing frequency of this mutation in metastatic tissue suggests that this Plk5 deletion could be used as an early indicator of CCRCC metastasis, and be identified by low cost PCR assay. A large scale clinical trial could reveal whether a simple PCR assay for this mutation at the time of nephrectomy could identify and stratify high-risk CCRCC patients for treatments.
Temporal Shift of Circadian Mediated Gene Expression and Carbon Fixation Contributes to Biomass Heterosis in Maize Hybrids
Heterosis has been widely used in agriculture, but the molecular mechanism for this remains largely elusive. In Arabidopsis hybrids and allopolyploids, increased photosynthetic and metabolic activities are linked to altered expression of circadian clock regulators, including CIRCADIAN CLOCK ASSOCIATED1 (CCA1). It is unknown whether a similar mechanism mediates heterosis in maize hybrids. Here we report that higher levels of carbon fixation and starch accumulation in the maize hybrids are associated with altered temporal gene expression. Two maize CCA1 homologs, ZmCCA1a and ZmCCA1b, are diurnally up-regulated in the hybrids. Expressing ZmCCA1 complements the cca1 mutant phenotype in Arabidopsis, and overexpressing ZmCCA1b disrupts circadian rhythms and biomass heterosis. Furthermore, overexpressing ZmCCA1b in maize reduced chlorophyll content and plant height. Reduced height stems from reduced node elongation but not total node number in both greenhouse and field conditions. Phenotypes are less severe in the field than in the greenhouse, suggesting that enhanced light and/or metabolic activities in the field can compensate for altered circadian regulation in growth vigor. Chromatin immunoprecipitation-sequencing (ChIP-seq) analysis reveals a temporal shift of ZmCCA1-binding targets to the early morning in the hybrids, suggesting that activation of morning-phased genes in the hybrids promotes photosynthesis and growth vigor. This temporal shift of ZmCCA1-binding targets correlated with nonadditive and additive gene expression in early and late stages of seedling development. These results could guide breeding better hybrid crops to meet the growing demand in food and bioenergy. All corn in the USA is grown as hybrids, which grow more vigorously and produce higher yield than their parents, a phenomenon known as heterosis. The molecular basis for heterosis remains elusive. Heterosis is predicted to arise from allelic interactions between parental genomes, leading to altered regulatory networks that promote the growth and fitness of hybrids. One such regulator is the circadian clock, which is functionally conserved in Arabidopsis and maize. Disrupting corn CCA1 expression reduces growth vigor. In corn hybrids, CCA1 proteins target thousands of output genes early in the morning, as if the hybrids wake up early to promote photosynthesis, starch metabolism and biomass accumulation. This early acting mechanism could guide breeding and selection of high-yield hybrids.
Associations of NINJ2 Sequence Variants with Incident Ischemic Stroke in the Cohorts for Heart and Aging in Genomic Epidemiology (CHARGE) Consortium
Stroke, the leading neurologic cause of death and disability, has a substantial genetic component. We previously conducted a genome-wide association study (GWAS) in four prospective studies from the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) consortium and demonstrated that sequence variants near the NINJ2 gene are associated with incident ischemic stroke. Here, we sought to fine-map functional variants in the region and evaluate the contribution of rare variants to ischemic stroke risk. We sequenced 196 kb around NINJ2 on chromosome 12p13 among 3,986 European ancestry participants, including 475 ischemic stroke cases, from the Atherosclerosis Risk in Communities Study, Cardiovascular Health Study, and Framingham Heart Study. Meta-analyses of single-variant tests for 425 common variants (minor allele frequency [MAF] ≥ 1%) confirmed the original GWAS results and identified an independent intronic variant, rs34166160 (MAF = 0.012), most significantly associated with incident ischemic stroke (HR = 1.80, p = 0.0003). Aggregating 278 putatively-functional variants with MAF≤ 1% using count statistics, we observed a nominally statistically significant association, with the burden of rare NINJ2 variants contributing to decreased ischemic stroke incidence (HR = 0.81; p = 0.026). Common and rare variants in the NINJ2 region were nominally associated with incident ischemic stroke among a subset of CHARGE participants. Allelic heterogeneity at this locus, caused by multiple rare, low frequency, and common variants with disparate effects on risk, may explain the difficulties in replicating the original GWAS results. Additional studies that take into account the complex allelic architecture at this locus are needed to confirm these findings.
Natural Genetic Variation in the Caenorhabditis elegans Response to Pseudomonas aeruginosa
Caenorhabditis elegans responds to pathogenic microorganisms by activating its innate immune system, which consists of physical barriers, behavioral responses, and microbial killing mechanisms. We examined whether natural variation plays a role in the response of C. elegans to Pseudomonas aeruginosa using two C. elegans strains that carry the same allele of npr-1, a gene that encodes a G-protein-coupled receptor related to mammalian neuropeptide Y receptors, but that differ in their genetic backgrounds. Strains carrying an allele for the NPR-1 215F isoform have been shown to exhibit lack of pathogen avoidance behavior and deficient immune response toward P. aeruginosa relative to the wild-type (N2) strain. We found that the wild isolate from Germany RC301, which carries the allele for NPR-1 215F, shows an enhanced resistance to P. aeruginosa infection when compared with strain DA650, which also carries NPR-1 215F but in an N2 background. Using a whole-genome sequencing single-nucleotide polymorphism (WGS-SNP) mapping strategy, we determined that the resistance to P. aeruginosa infection maps to a region on chromosome V. Furthermore, we demonstrated that the mechanism for the enhanced resistance to P. aeruginosa infection relies exclusively on strong P. aeruginosa avoidance behavior, and does not involve the main immune, stress, and lifespan extension pathways in C. elegans. Our findings underscore the importance of pathogen-specific behavioral immune defense in the wild, which seems to be favored over the more energy-costly mechanism of activation of physiological cellular defenses.
First Complete Genomic Sequence of a Rabies Virus from the Republic of Tajikistan Obtained Directly from a Flinders Technology Associates Card
A brain homogenate derived from a rabid dog in the district of Tojikobod, Republic of Tajikistan, was applied to a Flinders Technology Associates (FTA) card. A full-genome sequence of rabies virus (RABV) was generated from the FTA card directly without extraction, demonstrating the utility of these cards for readily obtaining genetic data.
Comprehensive Analysis of Transcriptome Variation Uncovers Known and Novel Driver Events in T Cell Acute Lymphoblastic Leukemia
RNA-seq is a promising technology to re-sequence protein coding genes for the identification of single nucleotide variants (SNV), while simultaneously obtaining information on structural variations and gene expression perturbations. We asked whether RNA-seq is suitable for the detection of driver mutations in T-cell acute lymphoblastic leukemia (T-ALL). These leukemias are caused by a combination of gene fusions, over-expression of transcription factors and cooperative point mutations in oncogenes and tumor suppressor genes. We analyzed 31 T-ALL patient samples and 18 T-ALL cell lines by high-coverage paired-end RNA-seq. First, we optimized the detection of SNVs in RNA-seq data by comparing the results with exome re-sequencing data. We identified known driver genes with recurrent protein altering variations, as well as several new candidates including H3F3A, PTK2B, and STAT5B. Next, we determined accurate gene expression levels from the RNA-seq data through normalizations and batch effect removal, and used these to classify patients into T-ALL subtypes. Finally, we detected gene fusions, of which several can explain the over-expression of key driver genes such as TLX1, PLAG1, LMO1, or NKX2-1; and others result in novel fusion transcripts encoding activated kinases (SSBP2-FER and TPM3-JAK2) or involving MLLT10. In conclusion, we present novel analysis pipelines for variant calling, variant filtering, and expression normalization on RNA-seq data, and successfully applied these for the detection of translocations, point mutations, INDELs, exon-skipping events, and expression perturbations in T-ALL. The quest for somatic mutations underlying oncogenic processes is a central theme in today's cancer research. High-throughput genomics approaches including amplicon re-sequencing, exome re-sequencing, full genome re-sequencing, and SNP arrays have contributed to cataloguing driver genes across cancer types. Thus far transcriptome sequencing by RNA-seq has been mainly used for the detection of fusion genes, while few studies have assessed its value for the combined detection of SNPs, INDELs, fusions, gene expression changes, and alternative transcript events. Here we apply RNA-seq to 49 T-ALL samples and perform a critical assessment of the bioinformatics pipelines and filters to identify each type of aberration. By comparing to exome re-sequencing, and by exploiting the catalogues of known cancer drivers, we identified many known and several novel driver genes in T-ALL. We also determined an optimal normalization strategy to obtain accurate gene expression levels and used these to identify over-expressed transcription factors that characterize different T-ALL subtypes. Finally, by PCR, cloning, and in vitro cellular assays we uncover new fusion genes that have consequences at the level of gene expression, oncogenic chimaeras, and tumor suppressor inactivation. In conclusion, we present the first RNA-seq data set across T-ALL patients and identify new driver events.
Quantitative trait locus analysis of body shape divergence in nine spined sticklebacks based on high density SNP panel
Heritable phenotypic differences between populations, caused by the selective effects of distinct environmental conditions, are of commonplace occurrence in nature. However, the actual genomic targets of this kind of selection are still poorly understood. We conducted a quantitative trait locus (QTL) mapping study to identify genomic regions responsible for morphometric differentiation between genetically and phenotypically divergent marine and freshwater nine-spined stickleback (Pungitius pungitius) populations. Using a dense panel of SNP-markers obtained by restriction site associated DNA sequencing of an F2 recombinant cross, we found 22 QTL that explained 3.5–12.9% of phenotypic variance in the traits under investigation. We detected one fairly large-effect (PVE = 9.6%) QTL for caudal peduncle length–a trait with a well-established adaptive function showing clear differentiation among marine and freshwater populations. We also identified two large-effect QTL for lateral plate numbers, which are different from the lateral plate QTL reported in earlier studies of this and related species. Hence, apart from identifying several large-effect QTL in shape traits showing adaptive differentiation in response to different environmental conditions, the results suggest intra- and interspecific heterogeneity in the genomic basis of lateral plate number variation.
Activation of individual L1 retrotransposon instances is restricted to cell type dependent permissive loci
LINE-1 (L1) retrotransposons represent approximately one sixth of the human genome, but only the human-specific L1HS-Ta subfamily acts as an endogenous mutagen in modern humans, reshaping both somatic and germline genomes. Due to their high levels of sequence identity and the existence of many polymorphic insertions absent from the reference genome, the transcriptional activation of individual genomic L1HS-Ta copies remains poorly understood. Here we comprehensively mapped fixed and polymorphic L1HS-Ta copies in 12 commonly-used somatic cell lines, and identified transcriptional and epigenetic signatures allowing the unambiguous identification of active L1HS-Ta copies in their genomic context. Strikingly, only a very restricted subset of L1HS-Ta loci - some being polymorphic among individuals - significantly contributes to the bulk of L1 expression, and these loci are differentially regulated among distinct cell lines. Thus, our data support a local model of L1 transcriptional activation in somatic cells, governed by individual-, locus-, and cell-type-specific determinants. DOI: Retrotransposons, also known as jumping genes, have invaded the genomes of most living organisms. These fragments of DNA have the ability to move or copy themselves from one location of a chromosome to another. Depending on where they insert themselves, retrotransposons can modify the sequence of nearby genes, which can alter or even abolish their activity. Although these genetic modifications have contributed significantly to the evolution of different species, retrotransposons can also have detrimental effects; for example, by causing new cases of genetic diseases. Adult human cells have a number of mechanisms that work to keep the activity of retrotransposons at a very low level. However, in many types of cancers retrotransposons escape these defense mechanisms and ‘jump’ actively. This is thought to contribute to the development and spread of cancerous tumors. To understand how jumping genes are mobilized, a fundamental question must be answered: is the high jumping gene activity observed in some cell types a result of activating many copies of the retrotransposons, or only a few of them? This question has been difficult to address because there are more than one hundred copies of retrotransposons that could potentially move in humans, many of which have not even been referenced in the human genome map. Furthermore, each copy is almost identical to another one, making it difficult to discriminate between them. Philippe et al. have now developed an approach that can map where individual retrotransposons are located in the genome of normal and cancerous cells and measure how active these jumping genes are. This revealed that only a very restricted number of them are active in any given cell type. Moreover, different subsets of jumping genes are active in different cell types, and their locations in the genome often do not overlap. Thus, whether jumping genes are activated depends on the cell type and their position in the genome. These results are in contrast to the prevalent view that retrotransposons are activated in a more widespread manner across the genome, at least in cancerous cells. Overall, Philippe et al.’s findings pave the way towards characterizing the chromosome regions in which retrotransposons are frequently activated and understanding how they contribute to cancer and other diseases. DOI:
Genome wide Analysis Reveals Extensive Functional Interaction between DNA Replication Initiation and Transcription in the Genome of Trypanosoma brucei
Identification of replication initiation sites, termed origins, is a crucial step in understanding genome transmission in any organism. Transcription of the Trypanosoma brucei genome is highly unusual, with each chromosome comprising a few discrete transcription units. To understand how DNA replication occurs in the context of such organization, we have performed genome-wide mapping of the binding sites of the replication initiator ORC1/CDC6 and have identified replication origins, revealing that both localize to the boundaries of the transcription units. A remarkably small number of active origins is seen, whose spacing is greater than in any other eukaryote. We show that replication and transcription in T. brucei have a profound functional overlap, as reducing ORC1/CDC6 levels leads to genome-wide increases in mRNA levels arising from the boundaries of the transcription units. In addition, ORC1/CDC6 loss causes derepression of silent Variant Surface Glycoprotein genes, which are critical for host immune evasion. ► DNA replication origins are widely dispersed in T. brucei chromosome cores ► Origins and ORC1/CDC6 localize at the boundaries of multigene transcription units ► Localization of T. brucei ORC1/CDC6 is distinct in chromosome cores and subtelomeres ► ORC1/CDC6 acts in transcription regulation, including of some VSGs, in T. brucei Designation of DNA replication origins, the sites where DNA synthesis initiates, remains poorly understood in most eukaryotic genomes, including the relationship of this reaction with transcription. McCulloch, Bell, and colleagues now map origins and the binding sites of a key replication initiator, termed ORC1/CDC6, in the African trypanosome genome, which is highly unusual among eukaryotes in that each chromosome comprises a few discrete multigene transcription units. These data, coupled with global transcript mapping, reveal extensive functional overlap between DNA replication and transcription.
Genome Wide Association Studies in Dogs and Humans Identify ADAMTS20 as a Risk Variant for Cleft Lip and Palate
Cleft lip with or without cleft palate (CL/P) is the most commonly occurring craniofacial birth defect. We provide insight into the genetic etiology of this birth defect by performing genome-wide association studies in two species: dogs and humans. In the dog, a genome-wide association study of 7 CL/P cases and 112 controls from the Nova Scotia Duck Tolling Retriever (NSDTR) breed identified a significantly associated region on canine chromosome 27 (unadjusted p=1.1 x 10-13; adjusted p= 2.2 x 10-3). Further analysis in NSDTR families and additional full sibling cases identified a 1.44 Mb homozygous haplotype (chromosome 27: 9.29 – 10.73 Mb) segregating with a more complex phenotype of cleft lip, cleft palate, and syndactyly (CLPS) in 13 cases. Whole-genome sequencing of 3 CLPS cases and 4 controls at 15X coverage led to the discovery of a frameshift mutation within ADAMTS20 (c.1360_1361delAA (p.Lys453Ilefs*3)), which segregated concordant with the phenotype. In a parallel study in humans, a family-based association analysis (DFAM) of 125 CL/P cases, 420 unaffected relatives, and 392 controls from a Guatemalan cohort, identified a suggestive association (rs10785430; p =2.67 x 10-6) with the same gene, ADAMTS20. Sequencing of cases from the Guatemalan cohort was unable to identify a causative mutation within the coding region of ADAMTS20, but four coding variants were found in additional cases of CL/P. In summary, this study provides genetic evidence for a role of ADAMTS20 in CL/P development in dogs and as a candidate gene for CL/P development in humans. Cleft lip with or without cleft palate (CL/P) is a commonly occurring birth defect that can lead to a lifetime of complications in affected children. To better understand the genetic cause of these disorders, we investigated CL/P in both dogs and humans. Genome-wide association studies in both species independently identify ADAMTS20 as a candidate gene for CL/P development. In dogs, a deletion within a functional domain of ADAMTS20 is responsible for CL/P in the Nova Scotia Duck Tolling Retriever dog breed. In humans, an associated region containing the same gene, ADAMTS20, was identified in a study population of native Guatemalans. Subsequent sequencing in humans was unable to identify a causative mutation within the coding region of ADAMTS20 in the Guatemalan cohort; however, sequencing of ADAMTS20 in additional cases with CL/P identified four novel coding variants. This work provides genetic evidence for a role for ADAMTS20 in CL/P development in both dogs and humans.
DisAp dependent striated fiber elongation is required to organize ciliary arrays
DisAp is a novel kinetodesmal fiber component that is essential for force-dependent fiber elongation and the alignment of basal body orientation in multiciliary arrays. Cilia-organizing basal bodies (BBs) are microtubule scaffolds that are visibly asymmetrical because they have attached auxiliary structures, such as striated fibers. In multiciliated cells, BB orientation aligns to ensure coherent ciliary beating, but the mechanisms that maintain BB orientation are unclear. For the first time in Tetrahymena thermophila, we use comparative whole-genome sequencing to identify the mutation in the BB disorientation mutant disA-1. disA-1 abolishes the localization of the novel protein DisAp to T. thermophila striated fibers (kinetodesmal fibers; KFs), which is consistent with DisAp’s similarity to the striated fiber protein SF-assemblin. We demonstrate that DisAp is required for KFs to elongate and to resist BB disorientation in response to ciliary forces. Newly formed BBs move along KFs as they approach their cortical attachment sites. However, because they contain short KFs that are rotated, BBs in disA-1 cells display aberrant spacing and disorientation. Therefore, DisAp is a novel KF component that is essential for force-dependent KF elongation and BB orientation in multiciliary arrays.
Genome Wide SNP Identification and Characterization in Two Soybean Cultivars with Contrasting Mungbean Yellow Mosaic India Virus Disease Resistance Traits
Mungbean yellow mosaic India virus (MYMIV) is a bipartite Geminivirus, which causes severe yield loss in soybean (Glycine max). Considering this, the present study was conducted to develop large-scale genome-wide single nucleotide polymorphism (SNP) markers and identify potential markers linked with known disease resistance loci for their effective use in genomics-assisted breeding to impart durable MYMIV tolerance. The whole-genome re-sequencing of MYMIV resistant cultivar ‘UPSM-534’ and susceptible Indian cultivar ‘JS-335’ was performed to identify high-quality SNPs and InDels (insertion and deletions). Approximately 234 and 255 million of 100-bp paired-end reads were generated from UPSM-534 and JS-335, respectively, which provided ~98% coverage of reference soybean genome. A total of 3083987 SNPs (1559556 in UPSM-534 and 1524431 in JS-335) and 562858 InDels (281958 in UPSM-534 and 280900 in JS-335) were identified. Of these, 1514 SNPs were found to be present in 564 candidate disease resistance genes. Among these, 829 non-synonymous and 671 synonymous SNPs were detected in 266 and 286 defence-related genes, respectively. Noteworthy, a non-synonymous SNP (in chromosome 18, named 18-1861613) at the 149th base-pair of LEUCINE-RICH REPEAT RECEPTOR-LIKE PROTEIN KINASE gene responsible for a G/C transversion [proline (CCC) to alanine(GCC)] was identified and validated in a set of 12 soybean cultivars. Taken together, the present study generated a large-scale genomic resource such as, SNPs and InDels at a genome-wide scale that will facilitate the dissection of various complex traits through construction of high-density linkage maps and fine mapping. In the present scenario, these markers can be effectively used to design high-density SNP arrays for their large-scale validation and high-throughput genotyping in diverse natural and mapping populations, which could accelerate genomics-assisted MYMIV disease resistance breeding in soybean.
Activation of Bicyclic Nitro drugs by a Novel Nitroreductase (NTR2) in Leishmania
Drug discovery pipelines for the “neglected diseases” are now heavily populated with nitroheterocyclic compounds. Recently, the bicyclic nitro-compounds (R)-PA-824, DNDI-VL-2098 and delamanid have been identified as potential candidates for the treatment of visceral leishmaniasis. Using a combination of quantitative proteomics and whole genome sequencing of susceptible and drug-resistant parasites we identified a putative NAD(P)H oxidase as the activating nitroreductase (NTR2). Whole genome sequencing revealed that deletion of a single cytosine in the gene for NTR2 that is likely to result in the expression of a non-functional truncated protein. Susceptibility of leishmania was restored by reintroduction of the wild-type gene into the resistant line, which was accompanied by the ability to metabolise these compounds. Overexpression of NTR2 in wild-type parasites rendered cells hyper-sensitive to bicyclic nitro-compounds, but only marginally to the monocyclic nitro-drugs, nifurtimox and fexinidazole sulfone, known to be activated by a mitochondrial oxygen-insensitive nitroreductase (NTR1). Conversely, a double knockout NTR2 null cell line was completely resistant to bicyclic nitro-compounds and only marginally resistant to nifurtimox. Sensitivity was fully restored on expression of NTR2 in the null background. Thus, NTR2 is necessary and sufficient for activation of these bicyclic nitro-drugs. Recombinant NTR2 was capable of reducing bicyclic nitro-compounds in the same rank order as drug sensitivity in vitro. These findings may aid the future development of better, novel anti-leishmanial drugs. Moreover, the discovery of anti-leishmanial nitro-drugs with independent modes of activation and independent mechanisms of resistance alleviates many of the concerns over the continued development of these compound series. Visceral leishmaniasis (kala-azar) is a serious vector borne disease afflicting people, particularly in parts of Asia, Africa and Latin America. There are approximately 400,000 new cases and an estimated 40,000 deaths each year, making it the second biggest parasitic killer after malaria. We recently discovered that delamanid–an oral nitro-drug used for the treatment of tuberculosis–shows promise for the treatment of leishmaniasis with potential to provide a much needed alternative to the current unsatisfactory anti-leishmanial drugs. Understanding how a drug works is important for selecting the most appropriate partner drugs to be used to increase efficacy and decrease toxicity in patients, to minimise the risk of drug resistance emerging and in designing second generation drugs. Using a combination of biochemical and genetic approaches we have discovered a novel nitroreductase (NTR2) that is necessary and sufficient for the anti-leishmanial activity of delamanid and related experimental drugs containing a nitro-group attached to two fused rings. This enzyme is responsible for activating bicyclic nitro-compounds to form toxic products that kill the parasite. In contrast, the previously identified nitroreductase (NTR1), which specifically activates monocyclic drugs, is not involved in this process. This knowledge can be applied to develop novel treatments for this disease.
Genomic analysis and clinical management of adolescent cutaneous melanoma
Melanoma in young children is rare; however, its incidence in adolescents and young adults is rising. We describe the clinical course of a 15‐year‐old female diagnosed with AJCC stage IB non‐ulcerated primary melanoma, who died from metastatic disease 4 years after diagnosis despite three lines of modern systemic therapy. We also present the complete genomic profile of her tumour and compare this to a further series of 13 adolescent melanomas and 275 adult cutaneous melanomas. A somatic BRAFV 600E mutation and a high mutational load equivalent to that found in adult melanoma and composed primarily of C>T mutations were observed. A germline genomic analysis alongside a series of 23 children and adolescents with melanoma revealed no mutations in known germline melanoma‐predisposing genes. Adolescent melanomas appear to have genomes that are as complex as those arising in adulthood and their clinical course can, as with adults, be unpredictable.
Glucocerebrosidase Deficiency in Drosophila Results in α Synuclein Independent Protein Aggregation and Neurodegeneration
Mutations in the glucosidase, beta, acid (GBA1) gene cause Gaucher’s disease, and are the most common genetic risk factor for Parkinson’s disease (PD) and dementia with Lewy bodies (DLB) excluding variants of low penetrance. Because α-synuclein-containing neuronal aggregates are a defining feature of PD and DLB, it is widely believed that mutations in GBA1 act by enhancing α-synuclein toxicity. To explore this hypothesis, we deleted the Drosophila GBA1 homolog, dGBA1b, and compared the phenotypes of dGBA1b mutants in the presence and absence of α-synuclein expression. Homozygous dGBA1b mutants exhibit shortened lifespan, locomotor and memory deficits, neurodegeneration, and dramatically increased accumulation of ubiquitinated protein aggregates that are normally degraded through an autophagic mechanism. Ectopic expression of human α-synuclein in dGBA1b mutants resulted in a mild enhancement of dopaminergic neuron loss and increased α-synuclein aggregation relative to controls. However, α-synuclein expression did not substantially enhance other dGBA1b mutant phenotypes. Our findings indicate that dGBA1b plays an important role in the metabolism of protein aggregates, but that the deleterious consequences of mutations in dGBA1b are largely independent of α-synuclein. Future work with dGBA1b mutants should reveal the mechanism by which mutations in dGBA1b lead to accumulation of protein aggregates, and the potential influence of this protein aggregation on neuronal integrity. Mutations in the glucosidase, beta, acid (GBA1) gene cause Gaucher’s disease (GD), a lysosomal storage disease that includes neurodegenerative phenotypes. Recently, mutations in GBA1 were identified as the strongest genetic risk factor for Parkinson’s disease (PD) and dementia with Lewy bodies (DLB), which are neurodegenerative conditions characterized by intraneuronal protein aggregates containing α-synuclein. To explore how GBA1 mutations lead to neurodegeneration in GD, PD and DLB, we developed a novel invertebrate model of GBA1 insufficiency by deleting the Drosophila GBA1 homolog, dGBA1b. We found that dGBA1b mutants have multiple phenotypes consistent with neuronal dysfunction as seen in PD, DLB and GD, and dramatically increased protein aggregation that is normally cleared through an autophagic mechanism. dGBA1b mutants expressing human α-synuclein in dopaminergic neurons led to dopaminergic neuron loss and α-synuclein aggregation. However, α-synuclein expression had minimal effect on dGBA1b mutant phenotypes, suggesting that the deleterious consequences of glucocerebrosidase deficiency are independent of α-synuclein. These findings significantly contribute to our understanding of the role of GBA1 mutations in the pathogenesis of PD, DLB, and GD, and further studies using this model should elucidate mechanisms underlying these diseases.
Analysis of nuclear and organellar genomes of Plasmodium knowlesi in humans reveals ancient population structure and recent recombination among host specific subpopulations
The macaque parasite Plasmodium knowlesi is a significant concern in Malaysia where cases of human infection are increasing. Parasites infecting humans originate from genetically distinct subpopulations associated with the long-tailed (Macaca fascicularis (Mf)) or pig-tailed macaques (Macaca nemestrina (Mn)). We used a new high-quality reference genome to re-evaluate previously described subpopulations among human and macaque isolates from Malaysian-Borneo and Peninsular-Malaysia. Nuclear genomes were dimorphic, as expected, but new evidence of chromosomal-segment exchanges between subpopulations was found. A large segment on chromosome 8 originating from the Mn subpopulation and containing genes encoding proteins expressed in mosquito-borne parasite stages, was found in Mf genotypes. By contrast, non-recombining organelle genomes partitioned into 3 deeply branched lineages, unlinked with nuclear genomic dimorphism. Subpopulations which diverged in isolation have re-connected, possibly due to deforestation and disruption of wild macaque habitats. The resulting genomic mosaics reveal traits selected by host-vector-parasite interactions in a setting of ecological transition. Plasmodium knowlesi, a common malaria parasite of long-tailed and pig-tailed macaques, is now recognized as a significant cause of human malaria, accounting for up to 70% of malaria cases in certain areas in Southeast Asia including Malaysian Borneo. Rapid human population growth, deforestation and encroachment on wild macaque habitats potentially increase contact with humans and drive up the prevalence of human Plasmodium knowlesi infections. Appropriate molecular tools and sampling are needed to assist surveillance by malaria control programmes, and to understand the genetics underpinning Plasmodium knowlesi transmission and switching of hosts from macaques to humans. We report a comprehensive analysis of the largest assembled set of Plasmodium knowlesi genome sequences from Malaysia. It reveals genetic regions that have been recently exchanged between long-tailed and pig-tailed macaques, which contain genes with signals indicative of rapid contemporary ecological change, including deforestation. Additional analyses partition Plasmodium knowlesi infections in Borneo into 3 deeply branched lineages of ancient origin, which founded the two divergent populations associated with long-tailed and pig-tailed macaques and a third, highly diverse population, on the Peninsular mainland. Overall, the complex Plasmodium parasite evolution observed and likelihood of further host transitions are potential challenges to malaria control in Malaysia.
The transcriptome of Leishmania major in the axenic promastigote stage: transcript annotation and relative expression levels by RNA seq
Although the genome sequence of the protozoan parasite Leishmania major was determined several years ago, the knowledge of its transcriptome was incomplete, both regarding the real number of genes and their primary structure. Here, we describe the first comprehensive transcriptome analysis of a parasite from the genus Leishmania. Using high-throughput RNA sequencing (RNA-seq), a total of 10285 transcripts were identified, of which 1884 were considered novel, as they did not match previously annotated genes. In addition, our data indicate that current annotations should be modified for many of the genes. The detailed analysis of the transcript processing sites revealed extensive heterogeneity in the spliced leader (SL) and polyadenylation addition sites. As a result, around 50% of the genes presented multiple transcripts differing in the length of the UTRs, sometimes in the order of hundreds of nucleotides. This transcript heterogeneity could provide an additional source for regulation as the different sizes of UTRs could modify RNA stability and/or influence the efficiency of RNA translation. In addition, for the first time for the Leishmania major promastigote stage, we are providing relative expression transcript levels. This study provides a concise view of the global transcriptome of the L. major promastigote stage, providing the basis for future comparative analysis with other development stages or other Leishmania species.
Predictive Studies Suggest that the Risk for the Selection of Antibiotic Resistance by Biocides Is Likely Low in Stenotrophomonas maltophilia
Biocides are used without restriction for several purposes. As a consequence, large amounts of biocides are released without any control in the environment, a situation that can challenge the microbial population dynamics, including selection of antibiotic resistant bacteria. Previous work has shown that triclosan selects Stenotrophomonas maltophilia antibiotic resistant mutants overexpressing the efflux pump SmeDEF and induces expression of this pump triggering transient low-level resistance. In the present work we analyze if two other common biocides, benzalkonium chloride and hexachlorophene, trigger antibiotic resistance in S. maltophilia. Bioinformatic and biochemical methods showed that benzalkonium chloride and hexachlorophene bind the repressor of smeDEF, SmeT. Only benzalkonium chloride triggers expression of smeD and its effect in transient antibiotic resistance is minor. None of the hexachlorophene-selected mutants was antibiotic resistant. Two benzalkonium chloride resistant mutants presented reduced susceptibility to antibiotics and were impaired in growth. Metabolic profiling showed they were more proficient than their parental strain in the use of some dipeptides. We can then conclude that although bioinformatic predictions and biochemical studies suggest that both hexachlorophene and benzalkonium chloride should induce smeDEF expression leading to transient S. maltophilia resistance to antibiotics, phenotypic assays showed this not to be true. The facts that hexachlorophene resistant mutants are not antibiotic resistant and that the benzalkonium chloride resistant mutants presenting altered susceptibility to antibiotics were impaired in growth suggests that the risk for the selection (and fixation) of S. maltophilia antibiotic resistant mutants by these biocides is likely low, at least in the absence of constant selection pressure.
De novo mutations in PURA are associated with hypotonia and developmental delay
PURA is the leading candidate gene responsible for the developmental phenotype in the 5q31.3 microdeletion syndrome. De novo mutations in PURA were recently reported in 15 individuals with developmental features similar to the 5q31.3 microdeletion syndrome. Here we describe six unrelated children who were identified by clinical whole-exome sequencing (WES) to have novel de novo variants in PURA with a similar phenotype of hypotonia and developmental delay and frequently associated with seizures. The protein Purα (encoded by PURA) is involved in neuronal proliferation, dendrite maturation, and the transport of mRNA to translation sites during neuronal development. Mutations in PURA may alter normal brain development and impair neuronal function, leading to developmental delay and the seizures observed in patients with mutations in PURA.
Finding the sources of missing heritability in a yeast cross
For many traits, including susceptibility to common diseases in humans, causal loci uncovered by genetic mapping studies explain only a minority of the heritable contribution to trait variation. Multiple explanations for this “missing heritability” have been proposed1. Here we use a large cross between two yeast strains to accurately estimate different sources of heritable variation for 46 quantitative traits and to detect underlying loci with high statistical power. We find that the detected loci explain nearly the entire additive contribution to heritable variation for the traits studied. We also show that the contribution to heritability of gene-gene interactions varies among traits, from near zero to approximately 50%. Detected two-locus interactions explain only a minority of this contribution. These results substantially advance our understanding of the missing heritability problem and have important implications for future studies of complex and quantitative traits.
First step in using molecular data for microbial food safety risk assessment; hazard identification of Escherichia coli O157:H7 by coupling genomic data with in vitro adherence to human epithelial cells
The potential for using whole genome sequencing (WGS) data in microbiological risk assessment (MRA) has been discussed on several occasions since the beginning of this century. Still, the proposed heuristic approaches have never been applied in a practical framework. This is due to the non-trivial problem of mapping microbial information consisting of thousands of loci onto a probabilistic scale for risks. The paradigm change for MRA involves translation of multidimensional microbial genotypic information to much reduced (integrated) phenotypic information and onwards to a single measure of human risk (i.e. probability of illness). In this paper a first approach in methodology development is described for the application of WGS data in MRA; this is supported by a practical example. That is, combining genetic data (single nucleotide polymorphisms; SNPs) for Shiga toxin-producing Escherichia coli (STEC) O157 with phenotypic data (in vitro adherence to epithelial cells as a proxy for virulence) leads to hazard identification in a Genome Wide Association Study (GWAS). This application revealed practical implications when using SNP data for MRA. These can be summarized by considering the following main issues: optimum sample size for valid inference on population level, correction for population structure, quantification and calibration of results, reproducibility of the analysis, links with epidemiological data, anchoring and integration of results into a systems biology approach for the translation of molecular studies to human health risk. Future developments in genetic data analysis for MRA should aim at resolving the mapping problem of processing genetic sequences to come to a quantitative description of risk. The development of a clustering scheme focusing on biologically relevant information of the microbe involved would be a useful approach in molecular data reduction for risk assessment.
Hemizygous mutations in SNAP29 unmask autosomal recessive conditions and contribute to atypical findings in patients with 22q11.2DS
22q11.2 deletion syndrome (22q11.2DS) is the most common microdeletion disorder, affecting an estimated 1 : 2000–4000 live births. Patients with 22q11.2DS have a broad spectrum of phenotypic abnormalities which generally includes congenital cardiac abnormalities, palatal anomalies, and immunodeficiency. Additional findings, such as skeletal anomalies and autoimmune disorders, can confer significant morbidity in a subset of patients. 22q11.2DS is a contiguous gene DS and over 40 genes are deleted in patients; thus deletion of several genes within this region contributes to the clinical features. Mutations outside or on the remaining 22q11.2 allele are also known to modify the phenotype. We utilised whole exome, targeted exome and/or Sanger sequencing to examine the genome of 17 patients with 22q11.2 deletions and phenotypic features found in <10% of affected individuals. In four unrelated patients, we identified three novel mutations in SNAP29, the gene implicated in the autosomal recessive condition cerebral dysgenesis, neuropathy, ichthyosis and keratoderma (CEDNIK). SNAP29 maps to 22q11.2 and encodes a soluble SNARE protein that is predicted to mediate vesicle fusion at the endoplasmic reticulum or Golgi membranes. This work confirms that the phenotypic variability observed in a subset of patients with 22q11.2DS is due to mutations on the non-deleted chromosome, which leads to unmasking of autosomal recessive conditions such as CEDNIK, Kousseff, and a potentially autosomal recessive form of Opitz G/BBB syndrome. Furthermore, our work implicates SNAP29 as a major modifier of variable expressivity in 22q11.2 DS patients.
Caenorhabditis elegans BUB 3 and SAN 1/MAD3 Spindle Assembly Checkpoint Components Are Required for Genome Stability in Response to Treatment with Ionizing Radiation
Relatively little is known about the cross-talk between the spindle assembly checkpoint and the DNA damage response, especially in multicellular organisms. We performed a Caenorhabditis elegans forward genetic screen to uncover new genes involved in the repair of DNA damage induced by ionizing radiation. We isolated a mutation, gt2000, which confers hypersensitivity to ionizing radiation and showed that gt2000 introduces a premature stop in bub-3. BUB-3 is a key component of the spindle assembly checkpoint. We provide evidence that BUB-3 acts during development and in the germline; irradiated bub-3(gt2000) larvae are developmentally retarded and form abnormal vulvae. Moreover, bub-3(gt2000) embryos sired from irradiated worms show increased levels of lethality. Both bub-3 and san-1 (the C. elegans homolog of MAD3) deletion alleles confer hypersensitivity to ionizing radiation, consistent with the notion that the spindle assembly checkpoint pathway is required for the DNA damage response. bub-3(gt2000) is moderately sensitive to the cross-linking drug cisplatin but not to ultraviolet light or methyl methanesulfonate. This is consistent with a role in dealing with DNA double-strand breaks and not with base damage. Double mutant analysis revealed that bub-3 does not act within any of the three major pathways involved in the repair of double-strand breaks. Finally, the cdc-20 gain-of-function mutant cdc-20/fzy-1(av15), which is refractory to the cell cycle delay conferred by the spindle checkpoint, showed phenotypes similar to bub-3 and san-1 mutants. We speculate that BUB-3 is involved in the DNA damage response through regulation of cell cycle timing.
Dynamic hydroxymethylation of deoxyribonucleic acid marks differentiation associated enhancers
Enhancers are developmentally controlled transcriptional regulatory regions whose activities are modulated through histone modifications or histone variant deposition. In this study, we show by genome-wide mapping that the newly discovered deoxyribonucleic acid (DNA) modification 5-hydroxymethylcytosine (5hmC) is dynamically associated with transcription factor binding to distal regulatory sites during neural differentiation of mouse P19 cells and during adipocyte differentiation of mouse 3T3-L1 cells. Functional annotation reveals that regions gaining 5hmC are associated with genes expressed either in neural tissues when P19 cells undergo neural differentiation or in adipose tissue when 3T3-L1 cells undergo adipocyte differentiation. Furthermore, distal regions gaining 5hmC together with H3K4me2 and H3K27ac in P19 cells behave as differentiation-dependent transcriptional enhancers. Identified regions are enriched in motifs for transcription factors regulating specific cell fates such as Meis1 in P19 cells and PPARγ in 3T3-L1 cells. Accordingly, a fraction of hydroxymethylated Meis1 sites were associated with a dynamic engagement of the 5-methylcytosine hydroxylase Tet1. In addition, kinetic studies of cytosine hydroxymethylation of selected enhancers indicated that DNA hydroxymethylation is an early event of enhancer activation. Hence, acquisition of 5hmC in cell-specific distal regulatory regions may represent a major event of enhancer progression toward an active state and participate in selective activation of tissue-specific genes.
The Burden of Dengue Fever and Chikungunya in Southern Coastal Ecuador: Epidemiology, Clinical Presentation, and Phylogenetics from the First Two Years of a Prospective Study
Here, we report the findings from the first 2 years (2014–2015) of an arbovirus surveillance study conducted in Machala, Ecuador, a dengue-endemic region. Patients with suspected dengue virus (DENV) infections (index cases, N = 324) were referred from five Ministry of Health clinical sites. A subset of DENV-positive index cases (N = 44) were selected, and individuals from the index household and four neighboring homes within 200 m were recruited (N = 400). Individuals who entered the study, other than the index cases, are referred to as associates. In 2014, 70.9% of index cases and 35.6% of associates had acute or recent DENV infections. In 2015, 28.3% of index cases and 12.8% of associates had acute or recent DENV infections. For every DENV infection captured by passive surveillance, we detected an additional three acute or recent DENV infections in associates. Of associates with acute DENV infections, 68% reported dengue-like symptoms, with the highest prevalence of symptomatic acute infections in children aged less than 10 years. The first chikungunya virus (CHIKV) infections were detected on epidemiological week 12 in 2015; 43.1% of index cases and 3.5% of associates had acute CHIKV infections. No Zika virus infections were detected. Phylogenetic analyses of isolates of DENV from 2014 revealed genetic relatedness and shared ancestry of DENV1, DENV2, and DENV4 genomes from Ecuador with those from Venezuela and Colombia, indicating the presence of viral flow between Ecuador and surrounding countries. Enhanced surveillance studies, such as this, provide high-resolution data on symptomatic and inapparent infections across the population.
Methylation of Histone H3 on Lysine 79 Associates with a Group of Replication Origins and Helps Limit DNA Replication Once per Cell Cycle
Mammalian DNA replication starts at distinct chromosomal sites in a tissue-specific pattern coordinated with transcription, but previous studies have not yet identified a chromatin modification that correlates with the initiation of DNA replication at particular genomic locations. Here we report that a distinct fraction of replication initiation sites in the human genome are associated with a high frequency of dimethylation of histone H3 lysine K79 (H3K79Me2). H3K79Me2-containing chromatin exhibited the highest genome-wide enrichment for replication initiation events observed for any chromatin modification examined thus far (23.39% of H3K79Me2 peaks were detected in regions adjacent to replication initiation events). The association of H3K79Me2 with replication initiation sites was independent and not synergistic with other chromatin modifications. H3K79 dimethylation exhibited wider distribution on chromatin during S-phase, but only regions with H3K79 methylation in G1 and G2 were enriched in replication initiation events. H3K79 was dimethylated in a region containing a functional replicator (a DNA sequence capable of initiating DNA replication), but the methylation was not evident in a mutant replicator that could not initiate replication. Depletion of DOT1L, the sole enzyme responsible for H3K79 methylation, triggered limited genomic over-replication although most cells could continue to proliferate and replicate DNA in the absence of methylated H3K79. Thus, prevention of H3K79 methylation might affect regulatory processes that modulate the order and timing of DNA replication. These data are consistent with the hypothesis that dimethylated H3K79 associates with some replication origins and marks replicated chromatin during S-phase to prevent re-replication and preserve genomic stability. Before each cell division, cells must accurately duplicate their chromosomes. It is critical that cells coordinate the replication of DNA with the packaging of DNA into chromosomes to insure that all genetic and epigenetic information is accurately transmitted to the next generation. In eukaryotes, replication starts at multiple sites, called “replication origins,” which are distributed throughout the genome and initiate replication in a strict order to maintain genomic stability and prevent cancer. Previous studies looked at the effect of chemical modifications on histone proteins, which affect chromosome packaging, on replication but no particular histone modifications distinctly associated with replication start sites. Here, we took advantage of recent advances in whole genome sequencing to map replication origins and histone modifications for the entire DNA in human cancer cells. One of the histone modifications we tested, methylation of lysine 79 on histone H3, was remarkably enriched at a group of replication origins. Inhibiting the enzyme that catalyzes this histone modification caused some DNA to replicate more than once during a single cell cycle, suggesting that methylation of histone H3 on lysine 79 might play an important role in controlling DNA replication.
Mutations of RagA GTPase in mTORC1 Pathway Are Associated with Autosomal Dominant Cataracts
Cataracts are a significant public health problem with no proven methods for prevention. Discovery of novel disease mechanisms to delineate new therapeutic targets is of importance in cataract prevention and therapy. Herein, we report that mutations in the RagA GTPase (RRAGA), a key regulator of the mechanistic rapamycin complex 1 (mTORC1), are associated with autosomal dominant cataracts. We performed whole exome sequencing in a family with autosomal dominant juvenile-onset cataracts, and identified a novel p.Leu60Arg mutation in RRAGA that co-segregated with the disease, after filtering against the dbSNP database, and at least 123,000 control chromosomes from public and in-house exome databases. In a follow-up direct screening of RRAGA in another 22 families and 142 unrelated patients with congenital or juvenile-onset cataracts, RRAGA was found to be mutated in two unrelated patients (p.Leu60Arg and c.-16G>A respectively). Functional studies in human lens epithelial cells revealed that the RRAGA mutations exerted deleterious effects on mTORC1 signaling, including increased relocation of RRAGA to the lysosomes, up-regulated mTORC1 phosphorylation, down-regulated autophagy, altered cell growth or compromised promoter activity. These data indicate that the RRAGA mutations, associated with autosomal dominant cataracts, play a role in the disease by acting through disruption of mTORC1 signaling. A group of guanine nucleotide-binding molecules called Rag GTPases are known to play a crucial role in regulation of mTORC1 signaling cascade. In the current study, whole exome sequencing has led to the identification of the RagA GTPase (RRAGA) gene for cataracts and we proceeded to study properties of the RRAGA protein. We captured and sequenced the whole exome for four affected patients from a family with autosomal dominant juvenile-onset posterior cataracts, and found a novel rare mutation in RagA GTPase (RRAGA). To validate this finding, we then sequenced more families and patients, and observed RRAGA mutations in unrelated patients with related phenotypes, suggesting that RRAGA could be mutated in congenital and juvenile-onset cataracts. We further demonstrated supporting evidence that in human lens epithelial cells the RRAGA mutations exerted deleterious effects on relocation of RRAGA to the lysosomes, mTORC1 phosphorylation, autophagy and cell growth. This study gives important new insight into the roles of RRAGA and mTROC1 signaling in the etiology of cataracts.
Mapping and Characterization of the fefe Gene That Controls Iron Uptake in Melon (Cucumis melo L.)
Iron (Fe) deficiency in plants limits crop growth and productivity. Molecular mechanisms that plants use to sense and respond to Fe deficiency by coordinated expression of Fe-uptake genes are not fully understood. The C940-fe chlorotic melon (Cucumis melo) mutant known as fefe is unable to upregulate Fe-uptake genes, however, the FeFe gene had not been identified. In this study, we used two F2 mapping populations to map and identify the FeFe gene as bHLH38, a homolog of subgroup Ib bHLH genes from Arabidopsis thaliana that are involved in transcriptional regulation of Fe-uptake genes in partnership with the FIT gene. A Ty1-copia type retrotransposon insertion of 5.056 kb within bHLH38 is responsible for the defect in bHLH38 in fefe, based on sequencing and expression analysis. This retrotransposon insertion results in multiple non-functional transcripts expected to result in an altered and truncated protein sequence. Hairy root transformation of fefe plants using wild-type bHLH38 resulted in functional complementation of the chlorotic fefe phenotype. Using a yeast-2-hybrid assay, the transcription factor Fit interacted with the wild-type bHLH38 protein, but did not interact with the fefe bHLH38 protein, suggesting that heterodimer formation of Fit/bHLH38 to regulate Fe-uptake genes does not occur in fefe roots. The second subgroup Ib bHLH gene in the melon genome is not functionally redundant to bHLH38, in contrast to Arabidopsis where four subgroup Ib bHLH genes are functionally redundant. Whereas the Arabidopsis bHLH transcript levels are upregulated by Fe deficiency, melon bHLH38 was not regulated at the transcript level. Thus, the fefe mutant may provide a platform for studying bHLH38 genes and proteins from other plant species.
The Fusarium graminearum Histone Acetyltransferases Are Important for Morphogenesis, DON Biosynthesis, and Pathogenicity
Post-translational modifications of chromatin structure by histone acetyltransferase (HATs) play a central role in the regulation of gene expression and various biological processes in eukaryotes. Although HAT genes have been studied in many fungi, few of them have been functionally characterized. In this study, we identified and characterized four putative HATs (FgGCN5, FgRTT109, FgSAS2, FgSAS3) in the plant pathogenic ascomycete Fusarium graminearum, the causal agent of Fusarium head blight of wheat and barley. We replaced the genes and all mutant strains showed reduced growth of F. graminearum. The ΔFgSAS3 and ΔFgGCN5 mutant increased sensitivity to oxidative and osmotic stresses. Additionally, ΔFgSAS3 showed reduced conidia sporulation and perithecium formation. Mutant ΔFgGCN5 was unable to generate any conidia and lost its ability to form perithecia. Our data showed also that FgSAS3 and FgGCN5 are pathogenicity factors required for infecting wheat heads as well as tomato fruits. Importantly, almost no Deoxynivalenol (DON) was produced either in ΔFgSAS3 or ΔFgGCN5 mutants, which was consistent with a significant downregulation of TRI genes expression. Furthermore, we discovered for the first time that FgSAS3 is indispensable for the acetylation of histone site H3K4, while FgGCN5 is essential for the acetylation of H3K9, H3K18, and H3K27. H3K14 can be completely acetylated when FgSAS3 and FgGCN5 were both present. The RNA-seq analyses of the two mutant strains provide insight into their functions in development and metabolism. Results from this study clarify the functional divergence of HATs in F. graminearum, and may provide novel targeted strategies to control secondary metabolite expression and infections of F. graminearum.
Efficiency of whole genome amplification of single circulating tumor cells enriched by CellSearch and sorted by FACS
Tumor cells in the blood of patients with metastatic carcinomas are associated with poor survival. Knowledge of the cells’ genetic make-up can help to guide targeted therapy. We evaluated the efficiency and quality of isolation and amplification of DNA from single circulating tumor cells (CTC). The efficiency of the procedure was determined by spiking blood with SKBR-3 cells, enrichment with the CellSearch system, followed by single cell sorting by fluorescence-activated cell sorting (FACS) and whole genome amplification. A selection of single cell DNA from fixed and unfixed SKBR-3 cells was exome sequenced and the DNA quality analyzed. Single CTC from patients with lung cancer were used to demonstrate the potential of single CTC molecular characterization. The overall efficiency of the procedure from spiked cell to amplified DNA was approximately 20%. Losses attributed to the CellSearch system were around 20%, transfer to FACS around 25%, sorting around 5% and DNA amplification around 25%. Exome sequencing revealed that the quality of the DNA was affected by the fixation of the cells, amplification, and the low starting quantity of DNA. A single fixed cell had an average coverage at 20× depth of 30% when sequencing to an average of 40× depth, whereas a single unfixed cell had 45% coverage. GenomiPhi-amplified genomic DNA had a coverage of 72% versus a coverage of 87% of genomic DNA. Twenty-one percent of the CTC from patients with lung cancer identified by the CellSearch system could be isolated individually and amplified. CTC enriched by the CellSearch system were sorted by FACS, and DNA retrieved and amplified with an overall efficiency of 20%. Analysis of the sequencing data showed that this DNA could be used for variant calling, but not for quantitative measurements such as copy number detection. Close to 55% of the exome of single SKBR-3 cells were successfully sequenced to 20× depth making it possible to call 72% of the variants. The overall coverage was reduced to 30% at 20× depth, making it possible to call 56% of the variants in CellSave-fixed cells.
Agammaglobulinaemia despite terminal B cell differentiation in a patient with a novel LRBA mutation
Mutations in lipopolysaccharide-responsive vesicle trafficking, beach and anchor-containing protein (LRBA) cause immune deficiency and inflammation. Here, we are reporting a novel homozygous mutation in LRBA allele in 7-year-old Omani boy, born to consanguineous parents. He presented with type 1 diabetes, autoimmune haematological cytopenia, recurrent chest infections and lymphocytic interstitial lung disease. The patient was treated with CTLA4-Ig (abatacept) with good outcome every 2 weeks for a period of 3 months. He developed complete IgG deficiency, but remarkably, histological examination revealed germinal centres and plasma cells in lymphoid and inflamed lung tissue. Further charatecterisation showed these cells to express IgM but not IgG. This ex vivo analysis suggests that LRBA mutation confers a defect in class switching despite plasma cell formation.
Exploring the Phenotypic Space and the Evolutionary History of a Natural Mutation in Drosophila melanogaster
A major challenge of modern Biology is elucidating the functional consequences of natural mutations. Although we have a good understanding of the effects of laboratory-induced mutations on the molecular- and organismal-level phenotypes, the study of natural mutations has lagged behind. In this work, we explore the phenotypic space and the evolutionary history of a previously identified adaptive transposable element insertion. We first combined several tests that capture different signatures of selection to show that there is evidence of positive selection in the regions flanking FBti0019386 insertion. We then explored several phenotypes related to known phenotypic effects of nearby genes, and having plausible connections to fitness variation in nature. We found that flies with FBti0019386 insertion had a shorter developmental time and were more sensitive to stress, which are likely to be the adaptive effect and the cost of selection of this mutation, respectively. Interestingly, these phenotypic effects are not consistent with a role of FBti0019386 in temperate adaptation as has been previously suggested. Indeed, a global analysis of the population frequency of FBti0019386 showed that climatic variables explain well the FBti0019386 frequency patterns only in Australia. Finally, although FBti0019386 insertion could be inducing the formation of heterochromatin by recruiting HP1a (Heterochromatin Protein 1a) protein, the insertion is associated with upregulation of sra in adult females. Overall, our integrative approach allowed us to shed light on the evolutionary history, the relevant fitness effects, and the likely molecular mechanisms of an adaptive mutation and highlights the complexity of natural genetic variants.
Whole Genome Sequence of CMY 2 β Lactamase Producing Salmonella enterica Serovar Typhimurium Strain L 3553
Salmonella enterica serovar Typhimurium pulsed-field gel electrophoresis cluster VII has been isolated from cattle populations in Japan since the mid-2000s. Some cluster VII isolates exhibited extended-spectrum cephalosporin resistance defined by the blaCMY-2 gene located in a chromosomal genomic island, GI-VII-6. We determined the whole-genome sequence of strain L-3553 as the reference strain.
WOX11 recruits a histone H3K27me3 demethylase to promote gene expression during shoot development in rice
WUSCHEL-related homeobox (WOX) genes are key regulators of meristem activity and plant development, the chromatin mechanism of which to reprogram gene expression remains unclear. Histone H3K27me3 is a chromatin mark of developmentally repressed genes. How the repressive mark is removed from specific genes during plant development is largely unknown. Here, we show that WOX11 interacts with the H3K27me3 demethylase JMJ705 to activate gene expression during shoot development in rice. Genetic analysis indicates that WOX11 and JMJ705 cooperatively control shoot growth and commonly regulate the expression of a set of genes involved in meristem identity, chloroplast biogenesis, and energy metabolism in the shoot apex. Loss of WOX11 led to increased H3K27me3 and overexpression of JMJ705 decreased the methylation levels at a subset of common targets. JMJ705 is associated with most of the WOX11-binding sites found in the tested common targets in vivo, regardless of presence or absence of the JMJ705-binding motif. Furthermore, wox11 mutation reduced JMJ705-binding to many targets genome-wide. The results suggest that recruitment of JMJ705 to specific developmental pathway genes is promoted by DNA-binding transcription factors and that WOX11 functions to stimulate shoot growth through epigenetic reprogramming of genes involved in meristem development and energy-generating pathways.
Genomic and Molecular Characterization of Miltefosine Resistance in Leishmania infantum Strains with Either Natural or Acquired Resistance through Experimental Selection of Intracellular Amastigotes
During the last decade miltefosine (MIL) has been used as first-line treatment for visceral leishmaniasis in endemic areas with antimonial resistance, but a decline in clinical effectiveness is now being reported. While only two MIL-resistant Leishmania infantum strains from HIV co-infected patients have been documented, phenotypic MIL-resistance for L. donovani has not yet been identified in the laboratory. Hence, a better understanding of the factors contributing to increased MIL-treatment failure is necessary. Given the paucity of defined MIL-resistant L. donovani clinical isolates, this study used an experimental amastigote-selected MIL-resistant L. infantum isolate (LEM3323). In-depth exploration of the MIL-resistant phenotype was performed by coupling genomic with phenotypic data to gain insight into gene function and the mutant phenotype. A naturally MIL-resistant L. infantum clinical isolate (LEM5159) was included to compare both datasets. Phenotypically, resistance was evaluated by determining intracellular amastigote susceptibility in vitro and actual MIL-uptake. Genomic analysis provided supportive evidence that the resistance selection model on intracellular amastigotes can be a good proxy for the in vivo field situation since both resistant strains showed mutations in the same inward transporter system responsible for the acquired MIL-resistant phenotype. In line with previous literature findings in promastigotes, our data confirm a defective import machinery through inactivation of the LiMT/LiRos3 protein complex as the main mechanism for MIL-resistance also in intracellular amastigotes. Whole genome sequencing analysis of LEM3323 revealed a 2 base pair deletion in the LiMT gene that led to the formation an early stop codon and a truncation of the LiMT protein. Interestingly, LEM5159 revealed mutations in both the LiMT and LiRos3 genes, resulting in an aberrant expression of the LiMT protein. To verify that these mutations were indeed accountable for the acquired resistance, transfection experiments were performed to re-establish MIL-susceptibility. In LEM3323, susceptibility was restored upon expression of a LiMT wild-type gene, whereas the MIL-susceptibility of LEM5159 could be reversed after expression of the LiRos3 wild-type gene. The aberrant expression profile of the LiMT protein could be restored upon rescue of the LiRos3 gene both in the LEM5159 clinical isolate and a ΔLiRos3 strain, showing that expression of LdMT is dependent on LdRos3 expression. The present findings clearly corroborate the pivotal role of the LiMT/LiRos3 complex in resistance towards MIL.
Epigenetic variation between urban and rural populations of Darwin’s finches
The molecular basis of evolutionary change is assumed to be genetic variation. However, growing evidence suggests that epigenetic mechanisms, such as DNA methylation, may also be involved in rapid adaptation to new environments. An important first step in evaluating this hypothesis is to test for the presence of epigenetic variation between natural populations living under different environmental conditions. In the current study we explored variation between populations of Darwin’s finches, which comprise one of the best-studied examples of adaptive radiation. We tested for morphological, genetic, and epigenetic differences between adjacent “urban” and “rural” populations of each of two species of ground finches, Geospiza fortis and G. fuliginosa, on Santa Cruz Island in the Galápagos. Using data collected from more than 1000 birds, we found significant morphological differences between populations of G. fortis, but not G. fuliginosa. We did not find large size copy number variation (CNV) genetic differences between populations of either species. However, other genetic variants were not investigated. In contrast, we did find dramatic epigenetic differences between the urban and rural populations of both species, based on DNA methylation analysis. We explored genomic features and gene associations of the differentially DNA methylated regions (DMR), as well as their possible functional significance. In summary, our study documents local population epigenetic variation within each of two species of Darwin’s finches. The online version of this article (doi:10.1186/s12862-017-1025-9) contains supplementary material, which is available to authorized users.
Multidrug resistant Escherichia coli from canine urinary tract infections tend to have commensal phylotypes, lower prevalence of virulence determinants and ampC replicons☆
Multidrug-resistant Escherichia coli is an emerging clinical challenge in domestic species. Treatment options in many cases are limited. This study characterized MDR E. coli isolates from urinary tract infections in dogs, collected between 2002 and 2011. Isolates were evaluated in terms of β-lactamase production, phylogenetic group, ST type, replicon type and virulence marker profile. Comparisons were made with antibiotic susceptible isolates also collected from dogs with urinary tract infections. AmpC β-lactamase was produced in 67% of the MDR isolates (12/18). Of these, 8 could be specifically attributed to the CMY-2 gene. None of the isolates tested in either group expressed ESBLs. Phylo-group distribution was as expected in the susceptible isolates, with an over representation of the pathogenic B2 phylo-group (67%). In contrast, the phylogenetic background for the MDR group was mixed, with representation of commensal phylo-groups A and B1. The B2 phylo-group represented the smallest proportion (A, B1, B2 or D was 28%, 22%, 11% and 33%, respectively). Virulence marker profiles, evaluated using Identibac® microarray, discriminated between the two groups. Marker sequences for a core panel of virulence determinants were identified in most of the susceptible isolates, but not in most of the MDR isolates. These findings indicate that for MDR isolates, plasmid-mediated AmpC is an important resistance mechanism, and while still capable of causing clinical disease, there is evidence for a shift towards phylogenetic groups of reduced inferred virulence potential. There was no evidence of zoonotic potential in either the susceptible or MDR urinary tract isolates in this study.
Coordinate Regulation of Antimycin and Candicidin Biosynthesis
Natural products produced by members of the phylum Actinobacteria underpin many industrially and medically important compounds; however, the majority of the ~30 biosynthetic pathways harbored by an average species are not expressed in the laboratory. Understanding the diversity of regulatory strategies controlling the expression of these pathways is therefore critical if their biosynthetic potential is to be explored for new drug leads. Our findings reveal that the candicidin cluster-situated regulator FscRI coordinately controls the biosynthesis of both candicidin and antimycin, which is the first observation of cross-regulation of disparate biosynthetic gene clusters specifying unrelated natural products. We anticipate that this will emerge as a major strategy by which members of the phylum Actinobacteria coordinately produce natural products, which will advance our understanding of how the expression of secondary metabolism is controlled and will aid the pursuit of “silent” biosynthetic pathway activation. Streptomyces species produce an incredible array of high-value specialty chemicals and medicinal therapeutics. A single species typically harbors ~30 biosynthetic pathways, but only a few them are expressed in the laboratory; thus, poor understanding of how natural-product biosynthesis is regulated is a major bottleneck in drug discovery. Antimycins are a large family of anticancer compounds widely produced by Streptomyces species, and their regulation is atypical compared to that of most other natural products. Here we demonstrate that antimycin production by Streptomyces albus S4 is regulated by FscRI, a PAS-LuxR family cluster-situated regulator of the polyene antifungal agent candicidin. We report that heterologous production of antimycins by Streptomyces coelicolor is dependent on FscRI and show that FscRI activates the transcription of key biosynthetic genes. We also demonstrate through chromatin immunoprecipitation sequencing that FscRI regulation is direct, and we provide evidence that this regulation strategy is conserved and unique to short-form antimycin gene clusters. Our study provides direct in vivo evidence of the cross-regulation of disparate biosynthetic gene clusters specifying unrelated natural products and expands the paradigmatic understanding of the regulation of secondary metabolism. IMPORTANCE Natural products produced by members of the phylum Actinobacteria underpin many industrially and medically important compounds; however, the majority of the ~30 biosynthetic pathways harbored by an average species are not expressed in the laboratory. Understanding the diversity of regulatory strategies controlling the expression of these pathways is therefore critical if their biosynthetic potential is to be explored for new drug leads. Our findings reveal that the candicidin cluster-situated regulator FscRI coordinately controls the biosynthesis of both candicidin and antimycin, which is the first observation of cross-regulation of disparate biosynthetic gene clusters specifying unrelated natural products. We anticipate that this will emerge as a major strategy by which members of the phylum Actinobacteria coordinately produce natural products, which will advance our understanding of how the expression of secondary metabolism is controlled and will aid the pursuit of “silent” biosynthetic pathway activation.
Time programmable drug dosing allows the manipulation, suppression and reversal of antibiotic drug resistance in vitro
Multi-drug strategies have been attempted to prolong the efficacy of existing antibiotics, but with limited success. Here we show that the evolution of multi-drug-resistant Escherichia coli can be manipulated in vitro by administering pairs of antibiotics and switching between them in ON/OFF manner. Using a multiplexed cell culture system, we find that switching between certain combinations of antibiotics completely suppresses the development of resistance to one of the antibiotics. Using this data, we develop a simple deterministic model, which allows us to predict the fate of multi-drug evolution in this system. Furthermore, we are able to reverse established drug resistance based on the model prediction by modulating antibiotic selection stresses. Our results support the idea that the development of antibiotic resistance may be potentially controlled via continuous switching of drugs. It is unclear whether strategies involving antibiotic cycling can efficiently control the emergence of antibiotic-resistant bacteria. Here, Yoshida et al. show that the evolution of multi-drug-resistant bacteria in vitro can be manipulated by administering pairs of antibiotics and switching between them.
Whole genome sequencing for prediction of Mycobacterium tuberculosis drug susceptibility and resistance: a retrospective cohort study
Diagnosing drug-resistance remains an obstacle to the elimination of tuberculosis. Phenotypic drug-susceptibility testing is slow and expensive, and commercial genotypic assays screen only common resistance-determining mutations. We used whole-genome sequencing to characterise common and rare mutations predicting drug resistance, or consistency with susceptibility, for all first-line and second-line drugs for tuberculosis. Between Sept 1, 2010, and Dec 1, 2013, we sequenced a training set of 2099 Mycobacterium tuberculosis genomes. For 23 candidate genes identified from the drug-resistance scientific literature, we algorithmically characterised genetic mutations as not conferring resistance (benign), resistance determinants, or uncharacterised. We then assessed the ability of these characterisations to predict phenotypic drug-susceptibility testing for an independent validation set of 1552 genomes. We sought mutations under similar selection pressure to those characterised as resistance determinants outside candidate genes to account for residual phenotypic resistance. We characterised 120 training-set mutations as resistance determining, and 772 as benign. With these mutations, we could predict 89·2% of the validation-set phenotypes with a mean 92·3% sensitivity (95% CI 90·7–93·7) and 98·4% specificity (98·1–98·7). 10·8% of validation-set phenotypes could not be predicted because uncharacterised mutations were present. With an in-silico comparison, characterised resistance determinants had higher sensitivity than the mutations from three line-probe assays (85·1% vs 81·6%). No additional resistance determinants were identified among mutations under selection pressure in non-candidate genes. A broad catalogue of genetic mutations enable data from whole-genome sequencing to be used clinically to predict drug resistance, drug susceptibility, or to identify drug phenotypes that cannot yet be genetically predicted. This approach could be integrated into routine diagnostic workflows, phasing out phenotypic drug-susceptibility testing while reporting drug resistance early. Wellcome Trust, National Institute of Health Research, Medical Research Council, and the European Union.
RNA sequencing provides evidence for functional variability between naturally co existing Alteromonas macleodii lineages
Alteromonas macleodii is a ubiquitous gammaproteobacterium shown to play a biogeochemical role in marine environments. Two A. macleodii strains (AltDE and AltDE1) isolated from the same sample (i.e., the same place at the same time) show considerable genomic differences. In this study, we investigate the transcriptional response of these two strains to varying growth conditions in order to investigate differences in their ability to adapt to varying environmental parameters. RNA sequencing revealed transcriptional changes between all growth conditions examined (e.g., temperature and medium) as well as differences between the two A. macleodii strains within a given condition. The main inter-strain differences were more marked in the adaptation to grow on minimal medium with glucose and, even more so, under starvation. These differences suggested that AltDE1 may have an advantage over AltDE when glucose is the major carbon source, and co-culture experiments confirmed this advantage. Additional differences were observed between the two strains in the expression of ncRNAs and phage-related genes, as well as motility. This study shows that the genomic diversity observed in closely related strains of A. macleodii from a single environment result in different transcriptional responses to changing environmental parameters. This data provides additional support for the idea that greater diversity at the strain level of a microbial community could enhance the community’s ability to adapt to environmental shifts. The online version of this article (doi:10.1186/1471-2164-15-938) contains supplementary material, which is available to authorized users.
Genetic variability of the activity of bidirectional promoters: a pilot study in bovine muscle
Bidirectional promoters are regulatory regions co-regulating the expression of two neighbouring genes organized in a head-to-head orientation. In recent years, these regulatory regions have been studied in many organisms; however, no investigation to date has been done to analyse the genetic variation of the activity of this type of promoter regions. In our study, we conducted an investigation to first identify bidirectional promoters sharing genes expressed in bovine Longissimus thoracis and then to find genetic variants affecting the activity of some of these bidirectional promoters. Combining bovine gene information and expression data obtained using RNA-Seq, we identified 120 putative bidirectional promoters active in bovine muscle. We experimentally validated in vitro 16 of these bidirectional promoters. Finally, using gene expression and whole-genome genotyping data, we explored the variability of the activity in muscle of the identified bidirectional promoters and discovered genetic variants affecting their activity. We found that the expression level of 77 genes is correlated with the activity of 12 bidirectional promoters. We also identified 57 single nucleotide polymorphisms associated with the activity of 5 bidirectional promoters. To our knowledge, our study is the first analysis in any species of the genetic variability of the activity of bidirectional promoters.
Absence of population structure across elevational gradients despite large phenotypic variation in mountain chickadees (Poecile gambeli)
Montane habitats are characterized by predictably rapid heterogeneity along elevational gradients and are useful for investigating the consequences of environmental heterogeneity for local adaptation and population genetic structure. Food-caching mountain chickadees inhabit a continuous elevation gradient in the Sierra Nevada, and birds living at harsher, high elevations have better spatial memory ability and exhibit differences in male song structure and female mate preference compared to birds inhabiting milder, low elevations. While high elevation birds breed, on average, two weeks later than low elevation birds, the extent of gene flow between elevations is unknown. Despite phenotypic variation and indirect evidence for local adaptation, population genetic analyses based on 18 073 single nucleotide polymorphisms across three transects of high and low elevation populations provided no evidence for genetic differentiation. Analyses based on individual genotypes revealed no patterns of clustering, pairwise estimates of genetic differentiation (FST, Nei's D) were very low, and AMOVA revealed no evidence for genetic variation structured by transect or by low and high elevation sites within transects. In addition, we found no consistent evidence for strong parallel allele frequency divergence between low and high elevation sites within the three transects. Large elevation-related phenotypic variation may be maintained by strong selection despite gene flow and future work should focus on the mechanisms underlying such variation.
Altered Levels of Mitochondrial DNA Are Associated with Female Age, Aneuploidy, and Provide an Independent Measure of Embryonic Implantation Potential
Mitochondria play a vital role in embryo development. They are the principal site of energy production and have various other critical cellular functions. Despite the importance of this organelle, little is known about the extent of variation in mitochondrial DNA (mtDNA) between individual human embryos prior to implantation. This study investigated the biological and clinical relevance of the quantity of mtDNA in 379 embryos. These were examined via a combination of microarray comparative genomic hybridisation (aCGH), quantitative PCR and next generation sequencing (NGS), providing information on chromosomal status, amount of mtDNA, and presence of mutations in the mitochondrial genome. The quantity of mtDNA was significantly higher in embryos from older women (P=0.003). Additionally, mtDNA levels were elevated in aneuploid embryos, independent of age (P=0.025). Assessment of clinical outcomes after transfer of euploid embryos to the uterus revealed that blastocysts that successfully implanted tended to contain lower mtDNA quantities than those failing to implant (P=0.007). Importantly, an mtDNA quantity threshold was established, above which implantation was never observed. Subsequently, the predictive value of this threshold was confirmed in an independent blinded prospective study, indicating that abnormal mtDNA levels are present in 30% of non-implanting euploid embryos, but are not seen in embryos forming a viable pregnancy. NGS did not reveal any increase in mutation in blastocysts with elevated mtDNA levels. The results of this study suggest that increased mtDNA may be related to elevated metabolism and are associated with reduced viability, a possibility consistent with the ‘quiet embryo’ hypothesis. Importantly, the findings suggest a potential role for mitochondria in female reproductive aging and the genesis of aneuploidy. Of clinical significance, we propose that mtDNA content represents a novel biomarker with potential value for in vitro fertilisation (IVF) treatment, revealing chromosomally normal blastocysts incapable of producing a viable pregnancy. Mitochondria are small membrane-enclosed structures and are found inside the cells of the body. Mitochondria actively participate in cellular life, and their main function is to generate energy which is used by the cell. For this reason mitochondria are considered as the powerhouses of cells. Unlike other cellular organelles, mitochondria contain their own DNA (mtDNA). MtDNA carries important genetic information concerning cellular metabolism and the generation of energy. It has been suggested that mitochondria and mtDNA could be of significance during early embryo development. Our work confirms this hypothesis. Specifically, our findings implicate mitochondria and their genome in female reproductive aging and the generation of embryonic chromosome abnormalities. Importantly, we describe a direct relationship between mtDNA quantity and the potential of an embryo to successfully become a baby. We propose that assessment of mtDNA quantity could be a novel way of identifying embryos with the highest ability to lead to healthy pregnancies and live births.
The B. subtilis Accessory Helicase PcrA Facilitates DNA Replication through Transcription Units
In bacteria the concurrence of DNA replication and transcription leads to potentially deleterious encounters between the two machineries, which can occur in either the head-on (lagging strand genes) or co-directional (leading strand genes) orientations. These conflicts lead to replication fork stalling and can destabilize the genome. Both eukaryotic and prokaryotic cells possess resolution factors that reduce the severity of these encounters. Though Escherichia coli accessory helicases have been implicated in the mitigation of head-on conflicts, direct evidence of these proteins mitigating co-directional conflicts is lacking. Furthermore, the endogenous chromosomal regions where these helicases act, and the mechanism of recruitment, have not been identified. We show that the essential Bacillus subtilis accessory helicase PcrA aids replication progression through protein coding genes of both head-on and co-directional orientations, as well as rRNA and tRNA genes. ChIP-Seq experiments show that co-directional conflicts at highly transcribed rRNA, tRNA, and head-on protein coding genes are major targets of PcrA activity on the chromosome. Partial depletion of PcrA renders cells extremely sensitive to head-on conflicts, linking the essential function of PcrA to conflict resolution. Furthermore, ablating PcrA’s ATPase/helicase activity simultaneously increases its association with conflict regions, while incapacitating its ability to mitigate conflicts, and leads to cell death. In contrast, disruption of PcrA’s C-terminal RNA polymerase interaction domain does not impact its ability to mitigate conflicts between replication and transcription, its association with conflict regions, or cell survival. Altogether, this work establishes PcrA as an essential factor involved in mitigating transcription-replication conflicts and identifies chromosomal regions where it routinely acts. As both conflicts and accessory helicases are found in all domains of life, these results are broadly relevant. In bacteria the concurrence of DNA replication and transcription leads to potentially deleterious encounters between the two machineries. These encounters can destabilize the genome and lead to mutations. Both eukaryotic and prokaryotic cells possess conflict resolution factors that reduce the detrimental effects of these collisions. In this study we show that without the essential Bacillus subtilis accessory DNA helicase, PcrA, the replication machinery slows down at certain regions of the chromosome in a transcription-dependent manner. PcrA is essential to life but incomplete depletion of PcrA only partially inhibits cell survival. We find that, under these conditions, partial survival defects are significantly exacerbated in the presence of a single severe conflict. In summary our work identifies a high degree of conservation for accessory helicase function in conflict resolution, directly establishes PcrA’s role in co-directional conflict resolution, and maps the natural chromosomal regions where such activities are routinely needed. Because both conflicts and accessory helicases are found in all domains of life, the results of this work are broadly relevant.
A novel strategy for clustering major depression individuals using whole genome sequencing variant data
Major depressive disorder (MDD) is highly prevalent, resulting in an exceedingly high disease burden. The identification of generic risk factors could lead to advance prevention and therapeutics. Current approaches examine genotyping data to identify specific variations between cases and controls. Compared to genotyping, whole-genome sequencing (WGS) allows for the detection of private mutations. In this proof-of-concept study, we establish a conceptually novel computational approach that clusters subjects based on the entirety of their WGS. Those clusters predicted MDD diagnosis. This strategy yielded encouraging results, showing that depressed Mexican-American participants were grouped closer; in contrast ethnically-matched controls grouped away from MDD patients. This implies that within the same ancestry, the WGS data of an individual can be used to check whether this individual is within or closer to MDD subjects or to controls. We propose a novel strategy to apply WGS data to clinical medicine by facilitating diagnosis through genetic clustering. Further studies utilising our method should examine larger WGS datasets on other ethnical groups.
Genome sequences of 12 isolates of the EU1 lineage of Phytophthora ramorum, a fungus like pathogen that causes extensive damage and mortality to a wide range of trees and other plants
Here we present genome sequences for twelve isolates of the invasive pathogen Phytophthora ramorum EU1. The assembled genome sequences and raw sequence data are available via BioProject accession number PRJNA177509. These data will be useful in developing molecular tools for specific detection and identification of this pathogen.
A human immunodeficiency syndrome caused by mutations in CARMIL2
Human T-cell function is dependent on T-cell antigen receptor (TCR) and co-signalling as evidenced by immunodeficiencies affecting TCR-dependent signalling pathways. Here, we show four human patients with EBV+ disseminated smooth muscle tumours that carry two homozygous loss-of-function mutations in the CARMIL2 (RLTPR) gene encoding the capping protein regulator and myosin 1 linker 2. These patients lack regulatory T cells without evidence of organ-specific autoimmunity, and have defective CD28 co-signalling associated with impaired T-cell activation, differentiation and function, as well as perturbed cytoskeletal organization associated with T-cell polarity and migration disorders. Human CARMIL2-deficiency is therefore an autosomal recessive primary immunodeficiency disorder associated with defective CD28-mediated TCR co-signalling and impaired cytoskeletal dynamics. CARMIL2 (Rltpr) is involved in T-cell function. Here, the authors identify human CARMIL2-deficiency as an autosomal recessive primary immunodeficiency disorder characterized by EBV+ smooth muscle tumours, CD28 co-signalling deficiency and impaired cytoskeletal dynamics.
Loss of AXIN1 drives acquired resistance to WNT pathway blockade in colorectal cancer cells carrying RSPO3 fusions
In colorectal cancer (CRC), WNT pathway activation by genetic rearrangements of RSPO3 is emerging as a promising target. However, its low prevalence severely limits availability of preclinical models for in‐depth characterization. Using a pipeline designed to suppress stroma‐derived signal, we find that RSPO3 “outlier” expression in CRC samples highlights translocation and fusion transcript expression. Outlier search in 151 CRC cell lines identified VACO6 and SNU1411 cells as carriers of, respectively, a canonical PTPRK(e1)‐RSPO3(e2) fusion and a novel PTPRK(e13)‐RSPO3(e2) fusion. Both lines displayed marked in vitro and in vivo sensitivity to WNT blockade by the porcupine inhibitor LGK974, associated with transcriptional and morphological evidence of WNT pathway suppression. Long‐term treatment of VACO6 cells with LGK974 led to the emergence of a resistant population carrying two frameshift deletions of the WNT pathway inhibitor AXIN1, with consequent protein loss. Suppression of AXIN1 in parental VACO6 cells by RNA interference conferred marked resistance to LGK974. These results provide the first mechanism of secondary resistance to WNT pathway inhibition.
Nested Levels of Adaptive Divergence: The Genetic Basis of Craniofacial Divergence and Ecological Sexual Dimorphism
Exemplary systems for adaptive divergence are often characterized by their large degrees of phenotypic variation. This variation represents the outcome of generations of diversifying selection. However, adaptive radiations can also contain a hierarchy of differentiation nested within them where species display only subtle phenotypic differences that still have substantial effects on ecology, function, and ultimately fitness. Sexual dimorphisms are also common in species displaying adaptive divergence and can be the result of differential selection between sexes that produce ecological differences between sexes. Understanding the genetic basis of subtle variation (between certain species or sexes) is therefore important for understanding the process of adaptive divergence. Using cichlids from the dramatic adaptive radiation of Lake Malawi, we focus on understanding the genetic basis of two aspects of relatively subtle phenotypic variation. This included a morphometric comparison of the patterns of craniofacial divergence between two ecologically similar species in relation to the larger adaptive radiation of Malawi, and male–female morphological divergence between their F2 hybrids. We then genetically map craniofacial traits within the context of sex and locate several regions of the genome that contribute to variation in craniofacial shape that is relevant to sexual dimorphism within species and subtle divergence between closely related species, and possibly to craniofacial divergence in the Malawi radiation as a whole. To enhance our search for candidate genes we take advantage of population genomic data and a genetic map that is anchored to the cichlid genome to determine which genes within our QTL regions are associated with SNPs that are alternatively fixed between species. This study provides a holistic understanding of the genetic underpinnings of adaptive divergence in craniofacial shape.
Genome Wide Diversity and Phylogeography of Mycobacterium avium subsp. paratuberculosis in Canadian Dairy Cattle
Mycobacterium avium subsp. paratuberculosis (MAP) is the causative bacterium of Johne’s disease (JD) in ruminants. The control of JD in the dairy industry is challenging, but can be improved with a better understanding of the diversity and distribution of MAP subtypes. Previously established molecular typing techniques used to differentiate MAP have not been sufficiently discriminatory and/or reliable to accurately assess the population structure. In this study, the genetic diversity of 182 MAP isolates representing all Canadian provinces was compared to the known global diversity, using single nucleotide polymorphisms identified through whole genome sequencing. MAP isolates from Canada represented a subset of the known global diversity, as there were global isolates intermingled with Canadian isolates, as well as multiple global subtypes that were not found in Canada. One Type III and six “Bison type” isolates were found in Canada as well as one Type II subtype that represented 86% of all Canadian isolates. Rarefaction estimated larger subtype richness in Québec than in other Canadian provinces using a strict definition of MAP subtypes and lower subtype richness in the Atlantic region using a relaxed definition. Significant phylogeographic clustering was observed at the inter-provincial but not at the intra-provincial level, although most major clades were found in all provinces. The large number of shared subtypes among provinces suggests that cattle movement is a major driver of MAP transmission at the herd level, which is further supported by the lack of spatial clustering on an intra-provincial scale.
Cell cycle dependent resolution of DNA double strand breaks
DNA double strand breaks (DSBs) elicit prompt activation of DNA damage response (DDR), which arrests cell-cycle either in G1/S or G2/M in order to avoid entering S and M phase with damaged DNAs. Since mammalian tissues contain both proliferating and quiescent cells, there might be fundamental difference in DDR between proliferating and quiescent cells (or G0-arrested). To investigate these differences, we studied recruitment of DSB repair factors and resolution of DNA lesions induced at site-specific DSBs in asynchronously proliferating, G0-, or G1-arrested cells. Strikingly, DSBs occurring in G0 quiescent cells are not repaired and maintain a sustained activation of the p53-pathway. Conversely, re-entry into cell cycle of damaged G0-arrested cells, occurs with a delayed clearance of DNA repair factors initially recruited to DSBs, indicating an inefficient repair when compared to DSBs induced in asynchronously proliferating or G1-synchronized cells. Moreover, we found that initial recognition of DSBs and assembly of DSB factors is largely similar in asynchronously proliferating, G0-, or G1-synchronized cells. Our study thereby demonstrates that repair and resolution of DSBs is strongly dependent on the cell-cycle state.
A high throughput RNA seq approach to profile transcriptional responses
In recent years RNA-seq protocols have been developed to investigate a variety of biological problems by measuring the abundance of different RNAs. Many study designs involve performing expensive preliminary studies to screen or optimize experimental conditions. Testing a large number of conditions in parallel may be more cost effective. For example, analyzing tissue/environment-specific gene expression generally implies screening a large number of cellular conditions and samples, without prior knowledge of which conditions are most informative (e.g., some cell types may not respond to certain treatments). To circumvent these challenges, we have established a new two-step high-throughput RNA-seq approach: the first step consists of gene expression screening of a large number of conditions, while the second step focuses on deep sequencing of the most relevant conditions (e.g., largest number of differentially expressed genes). This study design allows for a fast and economical screen in step one, with a more efficient allocation of resources for the deep sequencing of the most biologically relevant libraries in step two. We have applied this approach to study the response to 23 treatments in three lymphoblastoid cell lines demonstrating that it should also be useful for other high-throughput transcriptome profiling applications requiring iterative refinement or screening.
Innate Immune Response and Off Target Mis splicing Are Common Morpholino Induced Side Effects in Xenopus
Antisense morpholino oligomers (MOs) have been indispensable tools for developmental biologists to transiently knock down (KD) genes rather than to knock them out (KO). Here we report on the implications of genetic KO versus MO-mediated KD of the mesoderm-specifying Brachyury paralogs in the frog Xenopus tropicalis. While both KO and KD embryos fail to activate the same core gene regulatory network, resulting in virtually identical morphological defects, embryos injected with control or target MOs also show a systemic GC content-dependent immune response and many off-target splicing defects. Optimization of MO dosage and increasing incubation temperatures can mitigate, but not eliminate, these MO side effects, which are consistent with the high affinity measured between MO and off-target sequence in vitro. We conclude that while MOs can be useful to profile loss-of-function phenotypes at a molecular level, careful attention must be paid to their immunogenic and off-target side effects. • Brachyury KO and KD in frog equally affect phenotype-causing downstream genes • Other transcriptional anomalies are unique to morpholino-based KDs and controls • Morpholinos can trigger an innate immune response and off-target mis-splicing • Optimization of KD conditions mitigates but does not eliminate these side effects Brachyury KO and KD in frog equally affect phenotype-causing downstream genes Other transcriptional anomalies are unique to morpholino-based KDs and controls Morpholinos can trigger an innate immune response and off-target mis-splicing Optimization of KD conditions mitigates but does not eliminate these side effects Antisense morpholino oligomers have frequently been used to knock down genes. By systematically comparing the knockdown and knockout of the same genes in frog embryos, Gentsch et al. reveal that morpholinos have significant side effects. They trigger an innate immune response and cause off-target splicing defects.
The Transcriptome of the Zoanthid Protopalythoa variabilis (Cnidaria, Anthozoa) Predicts a Basal Repertoire of Toxin like and Venom Auxiliary Polypeptides
Protopalythoa is a zoanthid that, together with thousands of predominantly marine species, such as hydra, jellyfish, and sea anemones, composes the oldest eumetazoan phylum, i.e., the Cnidaria. Some of these species, such as sea wasps and sea anemones, are highly venomous organisms that can produce deadly toxins for preying, for defense or for territorial disputes. Despite the fact that hundreds of organic and polypeptide toxins have been characterized from sea anemones and jellyfish, practically nothing is known about the toxin repertoire in zoanthids. Here, based on a transcriptome analysis of the zoanthid Protopalythoa variabilis, numerous predicted polypeptides with canonical venom protein features are identified. These polypeptides comprise putative proteins from different toxin families: neurotoxic peptides, hemostatic and hemorrhagic toxins, membrane-active (pore-forming) proteins, protease inhibitors, mixed-function venom enzymes, and venom auxiliary proteins. The synthesis and functional analysis of two of these predicted toxin products, one related to the ShK/Aurelin family and the other to a recently discovered anthozoan toxin, displayed potent in vivo neurotoxicity that impaired swimming in larval zebrafish. Altogether, the complex array of venom-related transcripts that are identified in P. variabilis, some of which are first reported in Cnidaria, provides novel insight into the toxin distribution among species and might contribute to the understanding of composition and evolution of venom polypeptides in toxiferous animals.
Identification and Analysis of RNA Editing Sites in the Chloroplast Transcripts of Aegilops tauschii L.
RNA editing is an important way to convert cytidine (C) to uridine (U) at specific sites within RNA molecules at a post-transcriptional level in the chloroplasts of higher plants. Although it has been systematically studied in many plants, little is known about RNA editing in the wheat D genome donor Aegilops tauschii L. Here, we investigated the chloroplast RNA editing of Ae. tauschii and compared it with other wheat relatives to trace the evolution of wheat. Through bioinformatics prediction, a total of 34 C-to-U editing sites were identified, 17 of which were validated using RT-PCR product sequencing. Furthermore, 60 sites were found by the RNA-Seq read mapping approach, 24 of which agreed with the prediction and six were validated experimentally. The editing sites were biased toward tCn or nCa trinucleotides and 5′-pyrimidines, which were consistent with the flanking bases of editing sites of other seed plants. Furthermore, the editing events could result in the alteration of the secondary structures and topologies of the corresponding proteins, suggesting that RNA editing might impact the function of target genes. Finally, comparative analysis found some evolutionarily conserved editing sites in wheat and two species-specific sites were also obtained. This study is the first to report on RNA editing in Aegilops tauschii L, which not only sheds light on the evolution of wheat from the point of view of RNA editing, but also lays a foundation for further studies to identify the mechanisms of C-to-U alterations.
Genome Sequence of Hafnia alvei bta3_1, a Bacterium with Antimicrobial Properties Isolated from Honey Bee Gut
Hafnia alvei bta3_1, a strain with antibacterial properties, was isolated from honey bee gut and cultured under aerobic and anaerobic conditions. To explore the potential genetic bases of its antibacterial and possible pathogenic properties, the complete genome of this organism was sequenced and analyzed.
hnRNPK Recruits PCGF3/5 PRC1 to the Xist RNA B Repeat to Establish Polycomb Mediated Chromosomal Silencing
The Polycomb-repressive complexes PRC1 and PRC2 play a key role in chromosome silencing induced by the non-coding RNA Xist. Polycomb recruitment is initiated by the PCGF3/5-PRC1 complex, which catalyzes chromosome-wide H2A lysine 119 ubiquitylation, signaling recruitment of other PRC1 complexes, and PRC2. However, the molecular mechanism for PCGF3/5-PRC1 recruitment by Xist RNA is not understood. Here we define the Xist RNA Polycomb Interaction Domain (XR-PID), a 600 nt sequence encompassing the Xist B-repeat element. Deletion of XR-PID abolishes Xist-dependent Polycomb recruitment, in turn abrogating Xist-mediated gene silencing and reversing Xist-induced chromatin inaccessibility. We identify the RNA-binding protein hnRNPK as the principal XR-PID binding factor required to recruit PCGF3/5-PRC1. Accordingly, synthetically tethering hnRNPK to Xist RNA lacking XR-PID is sufficient for Xist-dependent Polycomb recruitment. Our findings define a key pathway for Polycomb recruitment by Xist RNA, providing important insights into mechanisms of chromatin modification by non-coding RNA. • A 600 nt element in Xist RNA, XR-PID, is required for Polycomb recruitment • Deletion of XR-PID abrogates Xist-mediated chromosome silencing • hnRNPK binds XR-PID to recruit the Polycomb-initiating complex PCGF3/5-PRC1 • Tethering hnRNPK to Xist RNA bypasses the requirement for XR-PID A 600 nt element in Xist RNA, XR-PID, is required for Polycomb recruitment Deletion of XR-PID abrogates Xist-mediated chromosome silencing hnRNPK binds XR-PID to recruit the Polycomb-initiating complex PCGF3/5-PRC1 Tethering hnRNPK to Xist RNA bypasses the requirement for XR-PID This study advances our understanding of the molecular mechanism of X chromosome inactivation in mammals, defining XR-PID, the critical element in Xist RNA that recruits Polycomb complexes to the inactive X chromosome, and further demonstrating that the RNA binding protein hnRNPK bridges XR-PID with the initiating Polycomb complex, PCGF3/5-PRC1.
Constitutively Active SMAD2/3 Are Broad Scope Potentiators of Transcription Factor Mediated Cellular Reprogramming
Reprogramming of cellular identity using exogenous expression of transcription factors (TFs) is a powerful and exciting tool for tissue engineering, disease modeling, and regenerative medicine. However, generation of desired cell types using this approach is often plagued by inefficiency, slow conversion, and an inability to produce mature functional cells. Here, we show that expression of constitutively active SMAD2/3 significantly improves the efficiency of induced pluripotent stem cell (iPSC) generation by the Yamanaka factors. Mechanistically, SMAD3 interacts with reprogramming factors and co-activators and co-occupies OCT4 target loci during reprogramming. Unexpectedly, active SMAD2/3 also markedly enhances three other TF-mediated direct reprogramming conversions, from B cells to macrophages, myoblasts to adipocytes, and human fibroblasts to neurons, highlighting broad and general roles for SMAD2/3 as cell-reprogramming potentiators. Our results suggest that co-expression of active SMAD2/3 could enhance multiple types of TF-based cell identity conversion and therefore be a powerful tool for cellular engineering. • Enhancement of iPSC reprogramming by TGF-βR inhibition is independent of SMAD2/3 • Constitutively active SMAD2/3 improves reprogramming efficiency and kinetics • SMAD3 co-occupies OCT4 targets during reprogramming • Constitutively active SMAD2/3 also boosts TF-mediated direct lineage reprogramming Enhancement of iPSC reprogramming by TGF-βR inhibition is independent of SMAD2/3 Constitutively active SMAD2/3 improves reprogramming efficiency and kinetics SMAD3 co-occupies OCT4 targets during reprogramming Constitutively active SMAD2/3 also boosts TF-mediated direct lineage reprogramming Ruetz et al. show that constitutively active SMAD2/3 has a surprising ability to boost the efficiency of cell reprogramming both to iPSCs and across lineages and may therefore be a general factor that can enhance transcription-factor-mediated reprogramming in a variety of contexts.
Next generation exome sequencing of paediatric inflammatory bowel disease patients identifies rare and novel variants in candidate genes
Multiple genes have been implicated by association studies in altering inflammatory bowel disease (IBD) predisposition. Paediatric patients often manifest more extensive disease and a particularly severe disease course. It is likely that genetic predisposition plays a more substantial role in this group. To identify the spectrum of rare and novel variation in known IBD susceptibility genes using exome sequencing analysis in eight individual cases of childhood onset severe disease. DNA samples from the eight patients underwent targeted exome capture and sequencing. Data were processed through an analytical pipeline to align sequence reads, conduct quality checks, and identify and annotate variants where patient sequence differed from the reference sequence. For each patient, the entire complement of rare variation within strongly associated candidate genes was catalogued. Across the panel of 169 known IBD susceptibility genes, approximately 300 variants in 104 genes were found. Excluding splicing and HLA-class variants, 58 variants across 39 of these genes were classified as rare, with an alternative allele frequency of <5%, of which 17 were novel. Only two patients with early onset Crohn's disease exhibited rare deleterious variations within NOD2: the previously described R702W variant was the sole NOD2 variant in one patient, while the second patient also carried the L1007 frameshift insertion. Both patients harboured other potentially damaging mutations in the GSDMB, ERAP2 and SEC16A genes. The two patients severely affected with ulcerative colitis exhibited a distinct profile: both carried potentially detrimental variation in the BACH2 and IL10 genes not seen in other patients. For each of the eight individuals studied, all non-synonymous, truncating and frameshift mutations across all known IBD genes were identified. A unique profile of rare and potentially damaging variants was evident for each patient with this complex disease.
Identification and Fine Mapping of a Stably Expressed QTL for Cold Tolerance at the Booting Stage Using an Interconnected Breeding Population in Rice
Cold stress is one of the major abiotic stresses that impede rice production. A interconnected breeding (IB) population consisted of 497 advanced lines developed using HHZ as the recurrent parent and eight diverse elite indica lines as the donors were used to identify stably expressed QTLs for CT at the booting stage. A total of 41,754 high-quality SNPs were obtained through re-sequencing of the IB population. Phenotyping was conducted under field conditions in two years and three locations. Association analysis identified six QTLs for CT on the chromosomes 3, 4 and 12. QTL qCT-3-2 that showed stable CT across years and locations was fine-mapped to an approximately 192.9 kb region. Our results suggested that GWAS applied to an IB population allows better integration of gene discovery and breeding. QTLs can be mapped in high resolution and quickly utilized in breeding.
Spatiotemporal coupling and decoupling of gene transcription with DNA replication origins during embryogenesis in C. elegans
The primary task of developing embryos is genome replication, yet how DNA replication is integrated with the profound cellular changes that occur through development is largely unknown. Using an approach to map DNA replication at high resolution in C. elegans, we show that replication origins are marked with specific histone modifications that define gene enhancers. We demonstrate that the level of enhancer associated modifications scale with the efficiency at which the origin is utilized. By mapping replication origins at different developmental stages, we show that the positions and activity of origins is largely invariant through embryogenesis. Contrary to expectation, we find that replication origins are specified prior to the broad onset of zygotic transcription, yet when transcription initiates it does so in close proximity to the pre-defined replication origins. Transcription and DNA replication origins are correlated, but the association breaks down when embryonic cell division ceases. Collectively, our data indicate that replication origins are fundamental organizers and regulators of gene activity through embryonic development. DOI:
Efficient termination of nuclear lncRNA transcription promotes mitochondrial genome maintenance
Most DNA in the genomes of higher organisms does not code for proteins. RNA Polymerase II (Pol II) transcribes non-coding DNA into long non-coding RNAs (lncRNAs), but biological roles of lncRNA are unclear. We find that mutations in the yeast lncRNA CUT60 result in poor growth. Defective termination of CUT60 transcription causes read-through transcription across the ATP16 gene promoter. Read-through transcription localizes chromatin signatures associated with Pol II elongation to the ATP16 promoter. The act of Pol II elongation across this promoter represses functional ATP16 expression by a Transcriptional Interference (TI) mechanism. Atp16p function in the mitochondrial ATP-synthase complex promotes mitochondrial DNA stability. ATP16 repression by TI through inefficient termination of CUT60 therefore triggers mitochondrial genome loss. Our results expand the functional and mechanistic implications of non-coding DNA in eukaryotes by highlighting termination of nuclear lncRNA transcription as mechanism to stabilize an organellar genome. Genes are made up of DNA and contain the information to make proteins, which carry out a variety of roles in the cell and the body. First, the information found on DNA needs to be transcribed into RNA molecules, which then act as a template to build the actual proteins. However, the vast majority of DNA does not encode proteins. Nevertheless, these non-coding regions of DNA (often given the popular but misleading name ‘junk-DNA’) are still transcribed into non-coding RNA. The purpose of this type of RNA is largely unclear, although some are known to activate certain genes. The transcription of non-coding RNA is also sensitive to environmental changes, suggesting it may play other important roles in the cell. Not all DNA – including non-coding DNA– is copied into RNA in one go. Usually, every DNA sequence is transcribed separately as one unit. These units have clearly marked start and end points. If these marker points are overridden the transcription process can overlap onto the next sequence.Thus, in the case of coding DNA, proteins may not form properly. However, until now it was unclear if missed marker points in non-coding RNA may also have consequences. To investigate this further, du Mee et al. mutated several non-coding parts of the DNA in yeast. The experiments showed that a non-coding RNA sequence called CUT60, appeared to be important to help yeast cells grow. When CUT60 was modified so that it lacked the end marker, its RNA transcript fused with the neighbouring gene called ATP16. As a result, the protein of the ATP16 gene could no longer be produced properly. Normally, ATP16 plays important roles in a cell structure called the mitochondrion, also known as the energy powerhouse of the cell. The mitochondrion has its own DNA, and without CUT60 and ATP16, the yeast cells lost their mitochondrial DNA and could not grow as quickly. This shows that non-coding DNA sequences can have a purpose and can affect other parts of the cell. Moreover, start and end markers of transcription are also important in non-coding DNA sequences. The same mechanism could be at play in other genes or even other organisms. As well as revealing a new role for non-coding DNA, the findings could also help to develop a new method to cleanse yeast cells of disease-causing mutations in their mitochondrial DNA.
Impact of the number of mutations in survival and response outcomes to hypomethylating agents in patients with myelodysplastic syndromes or myelodysplastic/myeloproliferative neoplasms
The prognostic and predictive value of sequencing analysis in myelodysplastic syndromes (MDS) has not been fully integrated into clinical practice. We performed whole exome sequencing (WES) of bone marrow samples from 83 patients with MDS and 31 with MDS/MPN identifying 218 driver mutations in 31 genes in 98 (86%) patients. A total of 65 (57%) patients received therapy with hypomethylating agents. By univariate analysis, mutations in BCOR, STAG2, TP53 and SF3B1 significantly influenced survival. Increased number of mutations (≥ 3), but not clonal heterogeneity, predicted for shorter survival and LFS. Presence of 3 or more mutations also predicted for lower likelihood of response (26 vs 50%, p = 0.055), and shorter response duration (3.6 vs 26.5 months, p = 0.022). By multivariate analysis, TP53 mutations (HR 3.1, CI 1.3–7.5, p = 0.011) and number of mutations (≥ 3) (HR 2.5, CI 1.3–4.8, p = 0.005) predicted for shorter survival. A novel prognostic model integrating this mutation data with IPSS-R separated patients into three categories with median survival of not reached, 29 months and 12 months respectively (p < 0.001) and increased stratification potential, compared to IPSS-R, in patients with high/very-high IPSS-R. This model was validated in a separate cohort of 413 patients with untreated MDS. Although the use of WES did not provide significant more information than that obtained with targeted sequencing, our findings indicate that increased number of mutations is an independent prognostic factor in MDS and that mutation data can add value to clinical prognostic models.
The genetic landscape of high risk neuroblastoma
Neuroblastoma is a malignancy of the developing sympathetic nervous system that often presents with widespread metastatic disease, resulting in survival rates of less than 50%1. To determine the spectrum of somatic mutation in high-risk neuroblastoma, we studied 240 cases using a combination of whole exome, genome and transcriptome sequencing as part of the Therapeutically Applicable Research to Generate Effective Treatments (TARGET) initiative. Here we report a low median exonic mutation frequency of 0.60 per megabase (0.48 non-silent), and remarkably few recurrently mutated genes in these tumors. Genes with significant somatic mutation frequencies included ALK (9.2% of cases), PTPN11 (2.9%), ATRX (2.5%, an additional 7.1% had focal deletions), MYCN (1.7%, a recurrent p.Pro44Leu alteration), and NRAS (0.83%). Rare, potentially pathogenic germline variants were significantly enriched in ALK, CHEK2, PINK1, and BARD1. The relative paucity of recurrent somatic mutations in neuroblastoma challenges current therapeutic strategies reliant upon frequently altered oncogenic drivers.
Integrative analysis of genomic alterations in triple negative breast cancer in association with homologous recombination deficiency
Triple-negative breast cancer (TNBC) cells do not express estrogen receptors, progesterone receptors, or human epidermal growth factor receptor 2. Currently, apart from poly ADP-ribose polymerase inhibitors, there are few effective therapeutic options for this type of cancer. Here, we present comprehensive characterization of the genetic alterations in TNBC performed by high coverage whole genome sequencing together with transcriptome and whole exome sequencing. Silencing of the BRCA1 gene impaired the homologous recombination pathway in a subset of TNBCs, which exhibited similar phenotypes to tumors with BRCA1 mutations; they harbored many structural variations (SVs) with relative enrichment for tandem duplication. Clonal analysis suggested that TP53 mutations and methylation of CpG dinucleotides in the BRCA1 promoter were early events of carcinogenesis. SVs were associated with driver oncogenic events such as amplification of MYC, NOTCH2, or NOTCH3 and affected tumor suppressor genes including RB1, PTEN, and KMT2C. Furthermore, we identified putative TGFA enhancer regions. Recurrent SVs that affected the TGFA enhancer region led to enhanced expression of the TGFA oncogene that encodes one of the high affinity ligands for epidermal growth factor receptor. We also identified a variety of oncogenes that could transform 3T3 mouse fibroblasts, suggesting that individual TNBC tumors may undergo a unique driver event that can be targetable. Thus, we revealed several features of TNBC with clinically important implications. Cancer can result from genetic alterations, some of which can be good drug targets. To reveal genetic alterations that provide important information for the development of ideal therapeutic strategies for triple-negative breast cancer (TNBC), TNBC tumor samples were subjected to comprehensive genomic analyses. We identified novel recurrent structural variations associated with enhanced expression of the TGFA gene that encodes one of the high affinity ligands for epidermal growth factor receptor (EGFR). Although TGFA expression is known to be elevated in a subset of TNBC tumors, this is the first report of the mechanistic basis of this phenomenon. It is of particular importance considering that anti-EGFR agents are possible therapeutic options for TNBC patients. Our study also revealed several features associated with “BRCAness”, which is critical for identification of patients who may be responsive to platinum agents and/or poly ADP-ribose polymerase inhibitors. Thus, the data presented in this report may advance our understanding of the pathogenesis of TNBC.
Diverse drug resistance mechanisms can emerge from drug tolerant cancer persister cells
Cancer therapy has traditionally focused on eliminating fast-growing populations of cells. Yet, an increasing body of evidence suggests that small subpopulations of cancer cells can evade strong selective drug pressure by entering a ‘persister' state of negligible growth. This drug-tolerant state has been hypothesized to be part of an initial strategy towards eventual acquisition of bona fide drug-resistance mechanisms. However, the diversity of drug-resistance mechanisms that can expand from a persister bottleneck is unknown. Here we compare persister-derived, erlotinib-resistant colonies that arose from a single, EGFR-addicted lung cancer cell. We find, using a combination of large-scale drug screening and whole-exome sequencing, that our erlotinib-resistant colonies acquired diverse resistance mechanisms, including the most commonly observed clinical resistance mechanisms. Thus, the drug-tolerant persister state does not limit—and may even provide a latent reservoir of cells for—the emergence of heterogeneous drug-resistance mechanisms. Cancer cells that survive initial drug treatment can persist in the presence of drugs. Here, the authors generate persister cells that are resistant to the EGFR tyrosine kinase inhibitor erlotinib and show by single cell analysis that multiple mechanism give rise to the drug-resistant persister state.
A comprehensive comparison of tools for differential ChIP seq analysis
ChIP-seq has become a widely adopted genomic assay in recent years to determine binding sites for transcription factors or enrichments for specific histone modifications. Beside detection of enriched or bound regions, an important question is to determine differences between conditions. While this is a common analysis for gene expression, for which a large number of computational approaches have been validated, the same question for ChIP-seq is particularly challenging owing to the complexity of ChIP-seq data in terms of noisiness and variability. Many different tools have been developed and published in recent years. However, a comprehensive comparison and review of these tools is still missing. Here, we have reviewed 14 tools, which have been developed to determine differential enrichment between two conditions. They differ in their algorithmic setups, and also in the range of applicability. Hence, we have benchmarked these tools on real data sets for transcription factors and histone modifications, as well as on simulated data sets to quantitatively evaluate their performance. Overall, there is a great variety in the type of signal detected by these tools with a surprisingly low level of agreement. Depending on the type of analysis performed, the choice of method will crucially impact the outcome.
DNA methylation directs genomic localization of Mbd2 and Mbd3 in embryonic stem cells
Cytosine methylation is an epigenetic and regulatory mark that functions in part through recruitment of chromatin remodeling complexes containing methyl-CpG binding domain (MBD) proteins. Two MBD proteins, Mbd2 and Mbd3, were previously shown to bind methylated or hydroxymethylated DNA, respectively; however, both of these findings have been disputed. Here, we investigated this controversy using experimental approaches and re-analysis of published data and find no evidence for methylation-independent functions of Mbd2 or Mbd3. We show that chromatin localization of Mbd2 and Mbd3 is highly overlapping and, unexpectedly, we find Mbd2 and Mbd3 are interdependent for chromatin association. Further investigation reveals that both proteins are required for normal levels of cytosine methylation and hydroxymethylation in murine embryonic stem cells. Furthermore, Mbd2 and Mbd3 regulate overlapping sets of genes that are also regulated by DNA methylation/hydroxymethylation factors. These findings reveal an interdependent regulatory mechanism mediated by the DNA methylation machinery and its readers. DOI:
Pichia pastoris regulates its gene specific response to different carbon sources at the transcriptional, rather than the translational, level
The methylotrophic, Crabtree-negative yeast Pichia pastoris is widely used as a heterologous protein production host. Strong inducible promoters derived from methanol utilization genes or constitutive glycolytic promoters are typically used to drive gene expression. Notably, genes involved in methanol utilization are not only repressed by the presence of glucose, but also by glycerol. This unusual regulatory behavior prompted us to study the regulation of carbon substrate utilization in different bioprocess conditions on a genome wide scale. We performed microarray analysis on the total mRNA population as well as mRNA that had been fractionated according to ribosome occupancy. Translationally quiescent mRNAs were defined as being associated with single ribosomes (monosomes) and highly-translated mRNAs with multiple ribosomes (polysomes). We found that despite their lower growth rates, global translation was most active in methanol-grown P. pastoris cells, followed by excess glycerol- or glucose-grown cells. Transcript-specific translational responses were found to be minimal, while extensive transcriptional regulation was observed for cells grown on different carbon sources. Due to their respiratory metabolism, cells grown in excess glucose or glycerol had very similar expression profiles. Genes subject to glucose repression were mainly involved in the metabolism of alternative carbon sources including the control of glycerol uptake and metabolism. Peroxisomal and methanol utilization genes were confirmed to be subject to carbon substrate repression in excess glucose or glycerol, but were found to be strongly de-repressed in limiting glucose-conditions (as are often applied in fed batch cultivations) in addition to induction by methanol. P. pastoris cells grown in excess glycerol or glucose have similar transcript profiles in contrast to S. cerevisiae cells, in which the transcriptional response to these carbon sources is very different. The main response to different growth conditions in P. pastoris is transcriptional; translational regulation was not transcript-specific. The high proportion of mRNAs associated with polysomes in methanol-grown cells is a major finding of this study; it reveals that high productivity during methanol induction is directly linked to the growth condition and not only to promoter strength. The online version of this article (doi:10.1186/s12864-015-1393-8) contains supplementary material, which is available to authorized users.
Transcriptional profile of immediate response to ionizing radiation exposure
Astronauts participating in long duration space missions are likely to be exposed to ionizing radiation associated with highly energetic and charged heavy particles. Previously proposed gene biomarkers for radiation exposure include phosphorylated H2A Histone Family, Member X (γH2AX), Tumor Protein 53 (TP53), and Cyclin-Dependent Kinase Inhibitor 1A (CDKN1A). However, transcripts of these genes may not be the most suitable biomarkers for radiation exposure due to a lack of sensitivity or specificity. As part of a larger effort to develop lab-on-a-chip methods for detecting radiation exposure events using blood samples, we designed a dose–course microarray study in order to determine coding and non-coding RNA transcripts undergoing differential expression immediately following radiation exposure. The main goal was to elicit a small set of sensitive and specific radiation exposure biomarkers at low, medium, and high levels of ionizing radiation exposure. Four separate levels of radiation were considered: 0 Gray (Gy) control; 0.3 Gy; 1.5 Gy; and 3.0 Gy with four replicates at each radiation level. This report includes raw gene expression data files from the resulting microarray experiments from all three radiation levels ranging from a lower, typical exposure than an astronaut might see (0.3 Gy) to high, potentially lethal, levels of radiation (3.0 Gy). The data described here is available in NCBI's Gene Expression Omnibus (GEO), accession GSE64375.
Global Gene Expression Analysis of Murine Limb Development
Detailed information about stage-specific changes in gene expression is crucial for understanding the gene regulatory networks underlying development and the various signal transduction pathways contributing to morphogenesis. Here we describe the global gene expression dynamics during early murine limb development, when cartilage, tendons, muscle, joints, vasculature and nerves are specified and the musculoskeletal system of limbs is established. We used whole-genome microarrays to identify genes with differential expression at 5 stages of limb development (E9.5 to 13.5), during fore- and hind-limb patterning. We found that the onset of limb formation is characterized by an up-regulation of transcription factors, which is followed by a massive activation of genes during E10.5 and E11.5 which levels off at later time points. Among the 3520 genes identified as significantly up-regulated in the limb, we find ∼30% to be novel, dramatically expanding the repertoire of candidate genes likely to function in the limb. Hierarchical and stage-specific clustering identified expression profiles that are likely to correlate with functional programs during limb development and further characterization of these transcripts will provide new insights into specific tissue patterning processes. Here, we provide for the first time a comprehensive analysis of developmentally regulated genes during murine limb development, and provide some novel insights into the expression dynamics governing limb morphogenesis.
Growth Phase Specific Modulation of Cell Morphology and Gene Expression by an Archaeal Histone Protein
In all three domains of life, organisms use nonspecific DNA-binding proteins to compact and organize the genome as well as to regulate transcription on a global scale. Histone is the primary eukaryotic nucleoprotein, and its evolutionary roots can be traced to the archaea. However, not all archaea use this protein as the primary DNA-packaging component, raising questions regarding the role of histones in archaeal chromatin function. Here, quantitative phenotyping, transcriptomic, and proteomic assays were performed on deletion and overexpression mutants of the sole histone protein of the hypersaline-adapted haloarchaeal model organism Halobacterium salinarum. This protein is highly conserved among all sequenced haloarchaeal species and maintains hallmark residues required for eukaryotic histone functions. Surprisingly, despite this conservation at the sequence level, unlike in other archaea or eukaryotes, H. salinarum histone is required to regulate cell shape but is not necessary for survival. Genome-wide expression changes in histone deletion strains were global, significant but subtle in terms of fold change, bidirectional, and growth phase dependent. Mass spectrometric proteomic identification of proteins from chromatin enrichments yielded levels of histone and putative nucleoid-associated proteins similar to those of transcription factors, consistent with an open and transcriptionally active genome. Taken together, these data suggest that histone in H. salinarum plays a minor role in DNA compaction but important roles in growth-phase-dependent gene expression and regulation of cell shape. Histone function in haloarchaea more closely resembles a regulator of gene expression than a chromatin-organizing protein like canonical eukaryotic histone. Histones comprise the major protein component of eukaryotic chromatin and are required for both genome packaging and global regulation of expression. The current paradigm maintains that archaea whose genes encode histone also use these proteins to package DNA. In contrast, here we demonstrate that the sole histone encoded in the genome of the salt-adapted archaeon Halobacterium salinarum is both unessential and unlikely to be involved in DNA compaction despite conservation of residues important for eukaryotic histones. Rather, H. salinarum histone is required for global regulation of gene expression and cell shape. These data are consistent with the hypothesis that H. salinarum histone, strongly conserved across all other known salt-adapted archaea, serves a novel role in gene regulation and cell shape maintenance. Given that archaea possess the ancestral form of eukaryotic histone, this study has important implications for understanding the evolution of histone function.
Fibroblasts profiling in scarring trachoma identifies IL 6 as a functional component of a fibroblast macrophage pro fibrotic and pro inflammatory feedback loop
Trachoma is a conjunctiva scarring disease, which is the leading infectious cause of blindness worldwide. Yet, the molecular mechanisms underlying progressive fibrosis in trachoma are unknown. To investigate the contribution of local resident fibroblasts to disease progression, we isolated conjunctival fibroblasts from patients with scarring trachoma and matching control individuals, and compared their gene expression profiles and functional properties in vitro. We show that scarring trachoma fibroblasts substantially differ from control counterparts, displaying pro-fibrotic and pro-inflammatory features matched by an altered gene expression profile. This pro-inflammatory signature was exemplified by increased IL-6 expression and secretion, and a stronger response to macrophage-mediated stimulation of contraction. We further demonstrate that scarring trachoma fibroblasts can promote Akt phosphorylation in macrophages in an IL-6 –dependent manner. Overall this work has uncovered a distinctive molecular fingerprint for scarring trachoma fibroblasts, and identified IL-6- as a potential contributor to the chronic conjunctival fibrosis, mediating reciprocal pro-fibrotic/pro-inflammatory interactions between macrophages and fibroblasts.
Microarray analysis revealing common and distinct functions of promyelocytic leukemia protein (PML) and tumor necrosis factor alpha (TNFα) signaling in endothelial cells
Promyelocytic leukemia protein (PML) is a tumor suppressor that is highly expressed in endothelial cells nonetheless its role in endothelial cell biology remains elusive. Tumor necrosis factor alpha (TNFα) is an important cytokine associated with many inflammation-related diseases. We have previously demonstrated that TNFα induces PML protein accumulation. We hypothesized that PML may play a role in TNFα signaling pathway. To identify potential PML target genes and investigate the putative crosstalk between PML’s function and TNFα signaling in endothelial cells, we carried out a microarray analysis in human primary umbilical endothelial cells (HUVECs). We found that PML and TNFα regulate common and distinct genes involved in a similar spectrum of biological processes, pathways and human diseases. More importantly, we found that PML is required for fine-tuning of TNFα-mediated immune and inflammatory responses. Furthermore, our data suggest that PML and TNFα synergistically regulate cell adhesion by engaging multiple molecular mechanisms. Our biological functional assays exemplified that adhesion of U937 human leukocytes to HUVECs is co-regulated by PML and TNFα signaling. Together, our study identified PML as an essential regulator of TNFα signaling by revealing the crosstalk between PML knockdown-mediated effects and TNFα-elicited signaling, thereby providing novel insights into TNFα signaling in endothelial cells.
The non coding RNA landscape of human hematopoiesis and leukemia
Non-coding RNAs have emerged as crucial regulators of gene expression and cell fate decisions. However, their expression patterns and regulatory functions during normal and malignant human hematopoiesis are incompletely understood. Here we present a comprehensive resource defining the non-coding RNA landscape of the human hematopoietic system. Based on highly specific non-coding RNA expression portraits per blood cell population, we identify unique fingerprint non-coding RNAs—such as LINC00173 in granulocytes—and assign these to critical regulatory circuits involved in blood homeostasis. Following the incorporation of acute myeloid leukemia samples into the landscape, we further uncover prognostically relevant non-coding RNA stem cell signatures shared between acute myeloid leukemia blasts and healthy hematopoietic stem cells. Our findings highlight the importance of the non-coding transcriptome in the formation and maintenance of the human blood hierarchy. While micro-RNAs are known regulators of haematopoiesis and leukemogenesis, the role of long non-coding RNAs is less clear. Here the authors provide a non-coding RNA expression landscape of the human hematopoietic system, highlighting their role in the formation and maintenance of the human blood hierarchy.
Molecular adaptations to phosphorus deprivation and comparison with nitrogen deprivation responses in the diatom Phaeodactylum tricornutum
Phosphorus, an essential element for all living organisms, is a limiting nutrient in many regions of the ocean due to its fast recycling. Changes in phosphate (Pi) availability in aquatic systems affect diatom growth and productivity. We investigated the early adaptive mechanisms in the marine diatom Phaeodactylum tricornutum to P deprivation using a combination of transcriptomics, metabolomics, physiological and biochemical experiments. Our analysis revealed strong induction of gene expression for proteins involved in phosphate acquisition and scavenging, and down-regulation of processes such as photosynthesis, nitrogen assimilation and nucleic acid and ribosome biosynthesis. P deprivation resulted in alterations of carbon allocation through the induction of the pentose phosphate pathway and cytosolic gluconeogenesis, along with repression of the Calvin cycle. Reorganization of cellular lipids was indicated by coordinated induced expression of phospholipases, sulfolipid biosynthesis enzymes and a putative betaine lipid biosynthesis enzyme. A comparative analysis of nitrogen- and phosphorus-deprived P. tricornutum revealed both common and distinct regulation patterns in response to phosphate and nitrate stress. Regulation of central carbon metabolism and amino acid metabolism was similar, whereas unique responses were found in nitrogen assimilation and phosphorus scavenging in nitrogen-deprived and phosphorus-deprived cells, respectively.
Pridopidine activates neuroprotective pathways impaired in Huntington Disease
Pridopidine has demonstrated improvement in Huntington Disease (HD) motor symptoms as measured by secondary endpoints in clinical trials. Originally described as a dopamine stabilizer, this mechanism is insufficient to explain the clinical and preclinical effects of pridopidine. This study therefore explored pridopidine’s potential mechanisms of action. The effect of pridopidine versus sham treatment on genome-wide expression profiling in the rat striatum was analysed and compared to the pathological expression profile in Q175 knock-in (Q175 KI) vs Q25 WT mouse models. A broad, unbiased pathway analysis was conducted, followed by testing the enrichment of relevant pathways. Pridopidine upregulated the BDNF pathway (P = 1.73E-10), and its effect on BDNF secretion was sigma 1 receptor (S1R) dependent. Many of the same genes were independently found to be downregulated in Q175 KI mice compared to WT (5.2e-7 < P < 0.04). In addition, pridopidine treatment upregulated the glucocorticoid receptor (GR) response, D1R-associated genes and the AKT/PI3K pathway (P = 1E-10, P = 0.001, P = 0.004, respectively). Pridopidine upregulates expression of BDNF, D1R, GR and AKT/PI3K pathways, known to promote neuronal plasticity and survival, as well as reported to demonstrate therapeutic benefit in HD animal models. Activation of S1R, necessary for its effect on the BDNF pathway, represents a core component of the mode of action of pridopidine. Since the newly identified pathways are downregulated in neurodegenerative diseases, including HD, these findings suggest that pridopidine may exert neuroprotective effects beyond its role in alleviating some symptoms of HD.
The distribution and evolution of Arabidopsis thaliana cis natural antisense transcripts
Natural antisense transcripts (NATs) are regulatory RNAs that contain sequence complementary to other RNAs, these other RNAs usually being messenger RNAs. In eukaryotic genomes, cis-NATs overlap the gene they complement. Here, our goal is to analyze the distribution and evolutionary conservation of cis-NATs for a variety of available data sets for Arabidopsis thaliana, to gain insights into cis-NAT functional mechanisms and their significance. Cis-NATs derived from traditional sequencing are largely validated by other data sets, although different cis-NAT data sets have different prevalent cis-NAT topologies with respect to overlapping protein-coding genes. A. thaliana cis-NATs have substantial conservation (28-35% in the three substantive data sets analyzed) of expression in A. lyrata. We examined evolutionary sequence conservation at cis-NAT loci in Arabidopsis thaliana across nine sequenced Brassicaceae species (picked for optimal discernment of purifying selection), focussing on the parts of their sequences not overlapping protein-coding transcripts (dubbed ‘NOLPs’). We found significant NOLP sequence conservation for 28-34% NATs across different cis-NAT sets. This NAT NOLP sequence conservation versus A. lyrata is generally significantly correlated with conservation of expression. We discover a significant enrichment of transcription factor binding sites (as evidenced by CHIP-seq data) in NOLPs compared to randomly sampled near-gene NOLP-like DNA , that is linked to significant sequence conservation. Conversely, there is no such evidence for a general significant link between NOLPs and formation of small interfering RNAs (siRNAs), with the substantial majority of unique siRNAs arising from the overlapping portions of the cis-NATs. In aggregate, our results suggest that many cis-NAT NOLPs function in the regulation of conserved promoter/regulatory elements that they ‘over-hang’. The online version of this article (doi:10.1186/s12864-015-1587-0) contains supplementary material, which is available to authorized users.
Analysis of tigecycline resistance development in clinical Acinetobacter baumannii isolates through a combined genomic and transcriptomic approach
Tigecycline (Tgc) is considered a last-resort antibiotic for the treatment of multi-drug resistant bacteria. To study Tgc resistance development in the important nosocomial pathogen Acinetobacter baumannii, we adopted six clinical isolates from three patients undergoing antibiotic treatment, and bacterial genomic sequences and seven strand-specific transcriptomes were studied. Interestingly, the Tgc-intermediate 2015ZJAB1 only differed from Tgc-resistant 2015ZJAB2 in an SNP-clustered region including OprD, a sugar-type MFS permease, and a LuxR-type transcriptional regulator. Surprisingly, an almost identical region was found in 2015ZJAB3, which supports the possibility of a homologous recombination event that increased Tgc resistance. Furthermore, comparative transcriptomic analysis identified significantly regulated genes associated with Tgc resistance, which was verified using qRT-PCR. Three enriched COG categories included amino acid transport and metabolism, transcription, and inorganic ion transport and metabolism. KEGG analysis revealed common features under Tgc conditions, including up regulated benzoate degradation and a less active TCA cycle. This may be related to selective antimicrobial pressure in the environment and adaptation by lowering metabolism. This study provides the first report of an in vivo evolutionary process that included a putative homologous recombination event conferring Tgc resistance in clinical A. baumannii isolates in which transcriptome analysis revealed resistance-conferring genes and related metabolism characteristics.
Comparative phylogenetic analysis and transcriptional profiling of MADS box gene family identified DAM and FLC like genes in apple (Malusx domestica)
The MADS-box transcription factors play essential roles in various processes of plant growth and development. In the present study, phylogenetic analysis of 142 apple MADS-box proteins with that of other dicotyledonous species identified six putative Dormancy-Associated MADS-box (DAM) and four putative Flowering Locus C-like (FLC-like) proteins. In order to study the expression of apple MADS-box genes, RNA-seq analysis of 3 apical and 5 spur bud stages during dormancy, 6 flower stages and 7 fruit development stages was performed. The dramatic reduction in expression of two MdDAMs, MdMADS063 and MdMADS125 and two MdFLC-like genes, MdMADS135 and MdMADS136 during dormancy release suggests their role as flowering-repressors in apple. Apple orthologs of Arabidopsis genes, FLOWERING LOCUS T, FRIGIDA, SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 and LEAFY exhibit similar expression patterns as reported in Arabidopsis, suggesting functional conservation in floral signal integration and meristem determination pathways. Gene ontology enrichment analysis of predicted targets of DAM revealed their involvement in regulation of reproductive processes and meristematic activities, indicating functional conservation of SVP orthologs (DAM) in apple. This study provides valuable insights into the functions of MADS-box proteins during apple phenology, which may help in devising strategies to improve important traits in apple.
Comparative Transcriptome Analysis Reveals the Influence of Abscisic Acid on the Metabolism of Pigments, Ascorbic Acid and Folic Acid during Strawberry Fruit Ripening
A comprehensive investigation of abscisic acid (ABA) biosynthesis and its influence on other important phytochemicals is critical for understanding the versatile roles that ABA plays during strawberry fruit ripening. Using RNA-seq technology, we sampled strawberry fruit in response to ABA or nordihydroguaiaretic acid (NDGA; an ABA biosynthesis blocker) treatment during ripening and assessed the expression changes of genes involved in the metabolism of pigments, ascorbic acid (AsA) and folic acid in the receptacles. The transcriptome analysis identified a lot of genes differentially expressed in response to ABA or NDGA treatment. In particular, genes in the anthocyanin biosynthesis pathway were actively regulated by ABA, with the exception of the gene encoding cinnamate 4-hydroxylase. Chlorophyll degradation was accelerated by ABA mainly owing to the higher expression of gene encoding pheide a oxygenase. The decrease of β-carotene content was accelerated by ABA treatment and delayed by NDGA. A high negative correlation rate was found between ABA and β-carotene content, indicating the importance of the requirement for ABA synthesis during fruit ripening. In addition, evaluation on the folate biosynthetic pathway indicate that ABA might have minor function in this nutrient’s biosynthesis process, however, it might be involved in its homeostasis. Surprisingly, though AsA content accumulated during fruit ripening, expressions of genes involved in its biosynthesis in the receptacles were significantly lower in ABA-treated fruits. This transcriptome analysis expands our understanding of ABA’s role in phytochemical metabolism during strawberry fruit ripening and the regulatory mechanisms of ABA on these pathways were discussed. Our study provides a wealth of genetic information in the metabolism pathways and may be helpful for molecular manipulation in the future.