Table of content

BMC Genomics

ISSN/EISSN: 14712164
Subject: Genetics
Publisher: BioMed Central
Country: United Kingdom
Language: English
Start year 2000
Publication fee: Yes --- Further Information

Journal homepage at publisher site


Table of content: 2010 volume:11 issue:1

Article
Adaptation of Hansenula polymorpha to methanol: a transcriptome analysis

Authors: van Zutphen Tim --- Baerends Richard JS --- Susanna Kim A --- de Jong Anne --- et al.
DOI: 10.1186/1471-2164-11-1
Pages: 1
Loading...
Loading...
Abstract

Abstract

Background

Methylotrophic yeast species (e.g. Hansenula polymorpha, Pichia pastoris) can grow on methanol as sole source of carbon and energy. These organisms are important cell factories for the production of recombinant proteins, but are also used in fundamental research as model organisms to study peroxisome biology. During exponential growth on glucose, cells of H. polymorpha typically contain a single, small peroxisome that is redundant for growth while on methanol multiple, enlarged peroxisomes are present. These organelles are crucial to support growth on methanol, as they contain key enzymes of methanol metabolism.

In this study, changes in the transcriptional profiles during adaptation of H. polymorpha cells from glucose- to methanol-containing media were investigated using DNA-microarray analyses.

Results

Two hours after the shift of cells from glucose to methanol nearly 20% (1184 genes) of the approximately 6000 annotated H. polymorpha genes were significantly upregulated with at least a two-fold differential expression. Highest upregulation (> 300-fold) was observed for the genes encoding the transcription factor Mpp1 and formate dehydrogenase, an enzyme of the methanol dissimilation pathway. Upregulated genes also included genes encoding other enzymes of methanol metabolism as well as of peroxisomal β-oxidation.

A moderate increase in transcriptional levels (up to 4-fold) was observed for several PEX genes, which are involved in peroxisome biogenesis. Only PEX11 and PEX32 were higher upregulated. In addition, an increase was observed in expression of the several ATG genes, which encode proteins involved in autophagy and autophagy processes. The strongest upregulation was observed for ATG8 and ATG11.

Approximately 20% (1246 genes) of the genes were downregulated. These included glycolytic genes as well as genes involved in transcription and translation.

Conclusion

Transcriptional profiling of H. polymorpha cells shifted from glucose to methanol showed the expected downregulation of glycolytic genes together with upregulation of the methanol utilisation pathway. This serves as a confirmation and validation of the array data obtained. Consistent with this, also various PEX genes were upregulated. The strong upregulation of ATG genes is possibly due to induction of autophagy processes related to remodeling of the cell architecture required to support growth on methanol. These processes may also be responsible for the enhanced peroxisomal β-oxidation, as autophagy leads to recycling of membrane lipids. The prominent downregulation of transcription and translation may be explained by the reduced growth rate on methanol (td glucose 1 h vs td methanol 4.5 h).

Keywords


Article
Complete genome sequence of the fire blight pathogen Erwinia pyrifoliae DSM 12163T and comparative genomic insights into plant pathogenicity

Authors: Smits Theo HM --- Jaenicke Sebastian --- Rezzonico Fabio --- Kamber Tim --- et al.
DOI: 10.1186/1471-2164-11-2
Pages: 2
Loading...
Loading...
Abstract

Abstract

Background

Erwinia pyrifoliae is a newly described necrotrophic pathogen, which causes fire blight on Asian (Nashi) pear and is geographically restricted to Eastern Asia. Relatively little is known about its genetics compared to the closely related main fire blight pathogen E. amylovora.

Results

The genome of the type strain of E. pyrifoliae strain DSM 12163T, was sequenced using both 454 and Solexa pyrosequencing and annotated. The genome contains a circular chromosome of 4.026 Mb and four small plasmids. Based on their respective role in virulence in E. amylovora or related organisms, we identified several putative virulence factors, including type III and type VI secretion systems and their effectors, flagellar genes, sorbitol metabolism, iron uptake determinants, and quorum-sensing components. A deletion in the rpoS gene covering the most conserved region of the protein was identified which may contribute to the difference in virulence/host-range compared to E. amylovora. Comparative genomics with the pome fruit epiphyte Erwinia tasmaniensis Et1/99 showed that both species are overall highly similar, although specific differences were identified, for example the presence of some phage gene-containing regions and a high number of putative genomic islands containing transposases in the E. pyrifoliae DSM 12163T genome.

Conclusions

The E. pyrifoliae genome is an important addition to the published genome of E. tasmaniensis and the unfinished genome of E. amylovora providing a foundation for re-sequencing additional strains that may shed light on the evolution of the host-range and virulence/pathogenicity of this important group of plant-associated bacteria.

Keywords


Article
Extensive analysis of D-J-C arrangements allows the identification of different mechanisms enhancing the diversity in sheep T cell receptor β-chain repertoire

Authors: Di Tommaso Silvia --- Antonacci Rachele --- Ciccarese Salvatrice --- Massari Serafina
DOI: 10.1186/1471-2164-11-3
Pages: 3
Loading...
Loading...
Abstract

Abstract

Background

In most species of mammals, the TRB locus has the common feature of a library of TRBV genes positioned at the 5'- end of two in tandem aligned D-J-C gene clusters, each composed of a single TRBD gene, 6-7 TRBJ genes and one TRBC gene. An enhancer located at the 3'end of the last TRBC and a well-defined promoter situated at the 5'end of the TRBD gene and/or a undefined promoter situated at the 5'end of the TRBD2 are sufficient to generate the full recombinase accessibility at the locus. In ruminant species, the 3'end of the TRB locus is characterized by the presence of three D-J-C clusters, each constituted by a single TRBD, 5-7 TRBJ and one TRBC genes with the center cluster showing a structure combined with the clusters upstream and downstream, suggesting that a unequal crossover occurred in the duplication. An enhancer downstream the last TRBC, and a promoter at the 5'-end of each TRBD gene are also present.

Results

In this paper we focused our attention on the analysis of a large number of sheep TR β-chain transcripts derived from four different lymphoid tissues of three diverse sheep breed animals to certify the use and frequency of the three gene clusters in the β-chain repertoire. As the sheep TRB locus genomic organization is known, the exact interpretation of the V-D-J rearrangements was fully determined. Our results clearly demonstrate that sheep β-chain constitutes a level of variability that is substantially larger than that described in other mammalian species. This is due not only to the increase of the number of D and J genes available to the somatic recombination, but also to the presence of the trans-rearrangement process. Moreover, the functional complexity of β-chain repertoire is resolved by other mechanisms such as alternative cis- and trans-splicing and recombinational diversification that seems to affect the variety of the constant region.

Conclusion

All together our data demonstrate that a disparate set of molecular mechanisms operate to perform a diversified repertoire in the sheep β-chain and this could confer some special biological properties to the corresponding αβ T cells in the ruminant lineage.

Keywords


Article
Analysis of newly established EST databases reveals similarities between heart regeneration in newt and fish

Authors: Borchardt Thilo --- Looso Mario --- Bruckskotten Marc --- Weis Patrick --- et al.
DOI: 10.1186/1471-2164-11-4
Pages: 4
Loading...
Loading...
Abstract

Abstract

Background

The newt Notophthalmus viridescens possesses the remarkable ability to respond to cardiac damage by formation of new myocardial tissue. Surprisingly little is known about changes in gene activities that occur during the course of regeneration. To begin to decipher the molecular processes, that underlie restoration of functional cardiac tissue, we generated an EST database from regenerating newt hearts and compared the transcriptional profile of selected candidates with genes deregulated during zebrafish heart regeneration.

Results

A cDNA library of 100,000 cDNA clones was generated from newt hearts 14 days after ventricular injury. Sequencing of 11520 cDNA clones resulted in 2894 assembled contigs. BLAST searches revealed 1695 sequences with potential homology to sequences from the NCBI database. BLAST searches to TrEMBL and Swiss-Prot databases assigned 1116 proteins to Gene Ontology terms. We also identified a relatively large set of 174 ORFs, which are likely to be unique for urodele amphibians. Expression analysis of newt-zebrafish homologues confirmed the deregulation of selected genes during heart regeneration. Sequences, BLAST results and GO annotations were visualized in a relational web based database followed by grouping of identified proteins into clusters of GO Terms. Comparison of data from regenerating zebrafish hearts identified biological processes, which were uniformly overrepresented during cardiac regeneration in newt and zebrafish.

Conclusion

We concluded that heart regeneration in newts and zebrafish led to the activation of similar sets of genes, which suggests that heart regeneration in both species might follow similar principles. The design of the newly established newt EST database allows identification of molecular pathways important for heart regeneration.

Keywords


Article
Normal colon epithelium: a dataset for the analysis of gene expression and alternative splicing events in colon disease

Authors: Mojica Wilfrido --- Hawthorn Lesleyann
DOI: 10.1186/1471-2164-11-5
Pages: 5
Loading...
Loading...
Abstract

Abstract

Background

Studies using microarray analysis of colorectal cancer have been generally beleaguered by the lack of a normal cell population of the same lineage as the tumor cell. One of the main objectives of this study was to generate a reference gene expression data set for normal colonic epithelium which can be used in comparisons with diseased tissues, as well as to provide a dataset that could be used as a baseline for studies in alternative splicing.

Results

We present a dependable expression reference data set for non-neoplastic colonic epithelial cells. An enriched population of fresh colon epithelial cells were obtained from non-neoplastic, colectomy specimens and analyzed using Affymetrix GeneChip EXON 1.0 ST arrays. For demonstration purposes, we have compared the data derived from these cells to a publically available set of tumor and matched normal colon data. This analysis allowed an assessment of global gene expression alterations and demonstrated that adjacent normal tissues, with a high degree of cellular heterogeneity, are not always representative of normal cells for comparison to tumors which arise from the colon epithelium. We also examined alternative splicing events in tumors compared to normal colon epithelial cells.

Conclusions

The findings from this study represent the first comprehensive expression profile for non-neoplastic colonic epithelial cells reported. Our analysis of splice variants illustrate that this is a very labor intensive procedure, requiring vigilant examination of the data. It is projected that the contribution of this set of data derived from pure colonic epithelial cells will enhance studies in colon-related disease and offer a vital baseline for studies aimed at elucidating the mechanisms of alternative splicing.

Keywords


Article
An evolutionary consequence of dosage compensation on Drosophila melanogaster female X-chromatin structure?

Authors: Zhang Yu --- Oliver Brian
DOI: 10.1186/1471-2164-11-6
Pages: 6
Loading...
Loading...
Abstract

Abstract

Background

X chromosomes are subject to dosage compensation in Drosophila males. Dosage compensation requires cis sequence features of the X chromosome that are present in both sexes by definition and trans acting factors that target chromatin modifying machinery to the X specifically in males. The evolution of this system could result in neutral X chromatin changes that will be apparent in females.

Results

We find that the general chromatin structure of female X chromosomes is distinct from autosomes. Additionally, specific histone marks associated with dosage compensation and active chromatin marks on the male X chromosome are also enriched on the X chromosomes of females, albeit to a lesser degree.

Conclusions

Our data indicate that X chromatin structure is fundamentally different from autosome structure in both sexes. We suggest that the differences between the X chromosomes and autosomes in females are a consequence of mechanisms that have evolved to ensure sufficient X chromosome expression in the soma of males.

Keywords


Article
Decoding pooled RNAi screens by means of barcode tiling arrays

Authors: Boettcher Michael --- Fredebohm Johannes --- Gholami Amin --- Hachmo Yafit --- et al.
DOI: 10.1186/1471-2164-11-7
Pages: 7
Loading...
Loading...
Abstract

Abstract

Background

RNAi screens via pooled short hairpin RNAs (shRNAs) have recently become a powerful tool for the identification of essential genes in mammalian cells. In the past years, several pooled large-scale shRNA screens have identified a variety of genes involved in cancer cell proliferation. All of those studies employed microarray analysis, utilizing either the shRNA's half hairpin sequence or an additional shRNA-associated 60 nt barcode sequence as a molecular tag. Here we describe a novel method to decode pooled RNAi screens, namely barcode tiling array analysis, and demonstrate how this approach can be used to precisely quantify the abundance of individual shRNAs from a pool.

Results

We synthesized DNA microarrays with six overlapping 25 nt long tiling probes complementary to each unique 60 nt molecular barcode sequence associated with every shRNA expression construct. By analyzing dilution series of expression constructs we show how our approach allows quantification of shRNA abundance from a pool and how it clearly outperforms the commonly used analysis via the shRNA's half hairpin sequences. We further demonstrate how barcode tiling arrays can be used to predict anti-proliferative effects of individual shRNAs from pooled negative selection screens. Out of a pool of 305 shRNAs, we identified 28 candidate shRNAs to fully or partially impair the viability of the breast carcinoma cell line MDA-MB-231. Individual validation of a subset of eleven shRNA expression constructs with potential inhibitory, as well as non-inhibitory, effects on the cell line proliferation provides further evidence for the accuracy of the barcode tiling approach.

Conclusions

In summary, we present an improved method for the rapid, quantitative and statistically robust analysis of pooled RNAi screens. Our experimental approach, coupled with commercially available lentiviral vector shRNA libraries, has the potential to greatly facilitate the discovery of putative targets for cancer therapy as well as sensitizers of drug toxicity.

Keywords


Article
Detecting positive selection from genome scans of linkage disequilibrium

Authors: Huff Chad D --- Harpending Henry C --- Rogers Alan R
DOI: 10.1186/1471-2164-11-8
Pages: 8
Loading...
Loading...
Abstract

Abstract

Background

Though a variety of linkage disequilibrium tests have recently been introduced to measure the signal of recent positive selection, the statistical properties of the various methods have not been directly compared. While most applications of these tests have suggested that positive selection has played an important role in recent human history, the results of these tests have varied dramatically.

Results

Here, we evaluate the performance of three statistics designed to detect incomplete selective sweeps, LRH and iHS, and ALnLH. To analyze the properties of these tests, we introduce a new computational method that can model complex population histories with migration and changing population sizes to simulate gene trees influenced by recent positive selection. We demonstrate that iHS performs substantially better than the other two statistics, with power of up to 0.74 at the 0.01 level for the variation best suited for full genome scans and a power of over 0.8 at the 0.01 level for the variation best suited for candidate gene tests. The performance of the iHS statistic was robust to complex demographic histories and variable recombination rates. Genome scans involving the other two statistics suffer from low power and high false positive rates, with false discovery rates of up to 0.96 for ALnLH. The difference in performance between iHS and ALnLH, did not result from the properties of the statistics, but instead from the different methods for mitigating the multiple comparison problem inherent in full genome scans.

Conclusions

We introduce a new method for simulating genealogies influenced by positive selection with complex demographic scenarios. In a power analysis based on this method, iHS outperformed LRH and ALnLH in detecting incomplete selective sweeps. We also show that the single-site iHS statistic is more powerful in a candidate gene test than the multi-site statistic, but that the multi-site statistic maintains a low false discovery rate with only a minor loss of power when applied to a scan of the entire genome. Our results highlight the need for careful consideration of multiple comparison problems when evaluating and interpreting the results of full genome scans for positive selection.

Keywords


Article
An integrated functional genomic study of acute phenobarbital exposure in the rat

Authors: Waterman Claire L --- Currie Richard A --- Cottrell Lisa A --- Dow Jacky --- et al.
DOI: 10.1186/1471-2164-11-9
Pages: 9
Loading...
Loading...
Abstract

Abstract

Background

Non-genotoxic carcinogens are notoriously difficult to identify as they do not damage DNA directly and have diverse modes of action, necessitating long term in vivo studies. The early effects of the classic rodent non-genotoxic hepatocarcinogen phenobarbital have been investigated in the Fisher rat using a combination of metabolomics and transcriptomics, to investige early stage mechanistic changes that are predictive of longer term pathology.

Results

Liver and blood plasma were profiled across 14 days, and multivariate statistics used to identify perturbed pathways. Both metabolomics and transcriptomics detected changes in the liver which were dose dependent, even after one day of exposure. Integration of the two datasets associated perturbations with specific pathways. Hepatic glycogen was decreased due to a decrease in synthesis, and plasma triglycerides were decreased due to an increase in fatty acid uptake by the liver. Hepatic succinate was increased and this was associated with increased heme biosynthesis. Glutathione synthesis was also increased, presumably in response to oxidative stress. Liquid Chromatography Mass Spectrometry demonstrated a remodeling of lipid species, possibly resulting from proliferation of the smooth endoplasmic reticulum.

Conclusions

The data fusion of metabolomic and transcriptomic changes proved to be a highly sensitive approach for monitoring early stage changes in altered hepatic metabolism, oxidative stress and cytochrome P450 induction simultaneously. This approach is particularly useful in interpreting changes in metabolites such as succinate which are hubs of metabolism.

Keywords


Article
The dynamic architecture of the metabolic switch in Streptomyces coelicolor

Loading...
Loading...
Abstract

Abstract

Background

During the lifetime of a fermenter culture, the soil bacterium S. coelicolor undergoes a major metabolic switch from exponential growth to antibiotic production. We have studied gene expression patterns during this switch, using a specifically designed Affymetrix genechip and a high-resolution time-series of fermenter-grown samples.

Results

Surprisingly, we find that the metabolic switch actually consists of multiple finely orchestrated switching events. Strongly coherent clusters of genes show drastic changes in gene expression already many hours before the classically defined transition phase where the switch from primary to secondary metabolism was expected. The main switch in gene expression takes only 2 hours, and changes in antibiotic biosynthesis genes are delayed relative to the metabolic rearrangements. Furthermore, global variation in morphogenesis genes indicates an involvement of cell differentiation pathways in the decision phase leading up to the commitment to antibiotic biosynthesis.

Conclusions

Our study provides the first detailed insights into the complex sequence of early regulatory events during and preceding the major metabolic switch in S. coelicolor, which will form the starting point for future attempts at engineering antibiotic production in a biotechnological setting.

Keywords


Article
Global transcriptional analysis of nitrogen fixation and ammonium repression in root-associated Pseudomonas stutzeri A1501

Authors: Yan Yongliang --- Ping Shuzhen --- Peng Junping --- Han Yunlei --- et al.
DOI: 10.1186/1471-2164-11-11
Pages: 11
Loading...
Loading...
Abstract

Abstract

Background

Biological nitrogen fixation is highly controlled at the transcriptional level by regulatory networks that respond to the availability of fixed nitrogen. In many diazotrophs, addition of excess ammonium in the growth medium results in immediate repression of nif gene transcription. Although the regulatory cascades that control the transcription of the nif genes in proteobacteria have been well investigated, there are limited data on the kinetics of ammonium-dependent repression of nitrogen fixation.

Results

Here we report a global transcriptional profiling analysis of nitrogen fixation and ammonium repression in Pseudomonas stutzeri A1501, a root-associated and nitrogen-fixing bacterium. A total of 166 genes, including those coding for the global nitrogen regulation (Ntr) and Nif-specific regulatory proteins, were upregulated under nitrogen fixation conditions but rapidly downregulated as early as 10 min after ammonium shock. Among these nitrogen fixation-inducible genes, 95 have orthologs in each of Azoarcus sp. BH72 and Azotobacter vinelandii AvoP. In particular, a 49-kb expression island containing nif and other associated genes was markedly downregulated by ammonium shock. Further functional characterization of pnfA, a new NifA-σ54-dependent gene chromosomally linked to nifHDK, is reported. This gene encodes a protein product with an amino acid sequence similar to that of five hypothetical proteins found only in diazotrophic strains. No noticeable differences in the transcription of nifHDK were detected between the wild type strain and pnfA mutant. However, the mutant strain exhibited a significant decrease in nitrogenase activity under microaerobic conditions and lost its ability to use nitrate as a terminal electron acceptor for the support of nitrogen fixation under anaerobic conditions.

Conclusions

Based on our results, we conclude that transcriptional regulation of nif gene expression in A1501 is mediated by the nif-specific and ntr gene regulatory systems. Furthermore, microarray and mutational analyses revealed that many genes of unknown function may play some essential roles in controlling the expression or activity of nitrogenase. The findings presented here establish the foundation for further studies on the physiological function of nitrogen fixation-inducible genes.

Keywords


Article
The Zur regulon of Corynebacterium glutamicum ATCC 13032

Authors: Schröder Jasmin --- Jochmann Nina --- Rodionov Dmitry A --- Tauch Andreas
DOI: 10.1186/1471-2164-11-12
Pages: 12
Loading...
Loading...
Abstract

Abstract

Background

Zinc is considered as an essential element for all living organisms, but it can be toxic at large concentrations. Bacteria therefore tightly regulate zinc metabolism. The Cg2502 protein of Corynebacterium glutamicum was a candidate to control zinc metabolism in this species, since it was classified as metalloregulator of the zinc uptake regulator (Zur) subgroup of the ferric uptake regulator (Fur) family of DNA-binding transcription regulators.

Results

The cg2502 (zur) gene was deleted in the chromosome of C. glutamicum ATCC 13032 by an allelic exchange procedure to generate the zur-deficient mutant C. glutamicum JS2502. Whole-genome DNA microarray hybridizations and real-time RT-PCR assays comparing the gene expression in C. glutamicum JS2502 with that of the wild-type strain detected 18 genes with enhanced expression in the zur mutant. The expression data were combined with results from cross-genome comparisons of shared regulatory sites, revealing the presence of candidate Zur-binding sites in the mapped promoter regions of five transcription units encoding components of potential zinc ABC-type transporters (cg0041-cg0042/cg0043; cg2911-cg2912-cg2913), a putative secreted protein (cg0040), a putative oxidoreductase (cg0795), and a putative P-loop GTPase of the COG0523 protein family (cg0794). Enhanced transcript levels of the respective genes in C. glutamicum JS2502 were verified by real-time RT-PCR, and complementation of the mutant with a wild-type zur gene reversed the effect of differential gene expression. The zinc-dependent expression of the putative cg0042 and cg2911 operons was detected in vivo with a gfp reporter system. Moreover, the zinc-dependent binding of purified Zur protein to double-stranded 40-mer oligonucleotides containing candidate Zur-binding sites was demonstrated in vitro by DNA band shift assays.

Conclusion

Whole-genome expression profiling and DNA band shift assays demonstrated that Zur directly represses in a zinc-dependent manner the expression of nine genes organized in five transcription units. Accordingly, the Zur (Cg2502) protein is the key transcription regulator for genes involved in zinc homeostasis in C. glutamicum.

Keywords


Article
Sex-dimorphic gene expression and ineffective dosage compensation of Z-linked genes in gastrulating chicken embryos

Authors: Zhang Shaobing O --- Mathur Sachin --- Hattem Gaye --- Tassy Olivier --- et al.
DOI: 10.1186/1471-2164-11-13
Pages: 13
Loading...
Loading...
Abstract

Abstract

Background

Considerable progress has been made in our understanding of sex determination and dosage compensation mechanisms in model organisms such as C. elegans, Drosophila and M. musculus. Strikingly, the mechanism involved in sex determination and dosage compensation are very different among these three model organisms. Birds present yet another situation where the heterogametic sex is the female. Sex determination is still poorly understood in birds and few key determinants have so far been identified. In contrast to most other species, dosage compensation of bird sex chromosomal genes appears rather ineffective.

Results

By comparing microarrays from microdissected primitive streak from single chicken embryos, we identified a large number of genes differentially expressed between male and female embryos at a very early stage (Hamburger and Hamilton stage 4), long before any sexual differentiation occurs. Most of these genes are located on the Z chromosome, which indicates that dosage compensation is ineffective in early chicken embryos. Gene ontology analyses, using an enhanced annotation tool for Affymetrix probesets of the chicken genome developed in our laboratory (called Manteia), show that among these male-biased genes found on the Z chromosome, more than 20 genes play a role in sex differentiation.

Conclusions

These results corroborate previous studies demonstrating the rather inefficient dosage compensation for Z chromosome in birds and show that this sexual dimorphism in gene regulation is observed long before the onset of sexual differentiation. These data also suggest a potential role of non-compensated Z-linked genes in somatic sex differentiation in birds.

Keywords


Article
Snake mitochondrial genomes: phylogenetic relationships and implications of extended taxon sampling for interpretations of mitogenomic evolution

Authors: Douglas Desirée A --- Gower David J
DOI: 10.1186/1471-2164-11-14
Pages: 14
Loading...
Loading...
Abstract

Abstract

Background

Snake mitochondrial genomes are of great interest in understanding mitogenomic evolution because of gene duplications and rearrangements and the fast evolutionary rate of their genes compared to other vertebrates. Mitochondrial gene sequences have also played an important role in attempts to resolve the contentious phylogenetic relationships of especially the early divergences among alethinophidian snakes. Two recent innovative studies found dramatic gene- and branch-specific relative acceleration in snake protein-coding gene evolution, particularly along internal branches leading to Serpentes and Alethinophidia. It has been hypothesized that some of these rate shifts are temporally (and possibly causally) associated with control region duplication and/or major changes in ecology and anatomy.

Results

The near-complete mitochondrial (mt) genomes of three henophidian snakes were sequenced: Anilius scytale, Rhinophis philippinus, and Charina trivirgata. All three genomes share a duplicated control region and translocated tRNALEU, derived features found in all alethinophidian snakes studied to date. The new sequence data were aligned with mt genome data for 21 other species of snakes and used in phylogenetic analyses. Phylogenetic results agreed with many other studies in recovering several robust clades, including Colubroidea, Caenophidia, and Cylindrophiidae+Uropeltidae. Nodes within Henophidia that have been difficult to resolve robustly in previous analyses remained uncompellingly resolved here. Comparisons of relative rates of evolution of rRNA vs. protein-coding genes were conducted by estimating branch lengths across the tree. Our expanded sampling revealed dramatic acceleration along the branch leading to Typhlopidae, particularly long rRNA terminal branches within Scolecophidia, and that most of the dramatic acceleration in protein-coding gene rate along Serpentes and Alethinophidia branches occurred before Anilius diverged from other alethinophidians.

Conclusions

Mitochondrial gene sequence data alone may not be able to robustly resolve basal divergences among alethinophidian snakes. Taxon sampling plays an important role in identifying mitogenomic evolutionary events within snakes, and in testing hypotheses explaining their origin. Dramatic rate shifts in mitogenomic evolution occur within Scolecophidia as well as Alethinophidia, thus falsifying the hypothesis that these shifts in snakes are associated exclusively with evolution of a non-burrowing lifestyle, macrostomatan feeding ecology and/or duplication of the control region, both restricted to alethinophidians among living snakes.

Keywords


Article
Comparative analysis of missing value imputation methods to improve clustering and interpretation of microarray experiments

Authors: Celton Magalie --- Malpertuy Alain --- Lelandais Gaëlle --- de Brevern Alexandre G
DOI: 10.1186/1471-2164-11-15
Pages: 15
Loading...
Loading...
Abstract

Abstract

Background

Microarray technologies produced large amount of data. In a previous study, we have shown the interest of k-Nearest Neighbour approach for restoring the missing gene expression values, and its positive impact of the gene clustering by hierarchical algorithm. Since, numerous replacement methods have been proposed to impute missing values (MVs) for microarray data. In this study, we have evaluated twelve different usable methods, and their influence on the quality of gene clustering. Interestingly we have used several datasets, both kinetic and non kinetic experiments from yeast and human.

Results

We underline the excellent efficiency of approaches proposed and implemented by Bo and co-workers and especially one based on expected maximization (EM_array). These improvements have been observed also on the imputation of extreme values, the most difficult predictable values. We showed that the imputed MVs have still important effects on the stability of the gene clusters. The improvement on the clustering obtained by hierarchical clustering remains limited and, not sufficient to restore completely the correct gene associations. However, a common tendency can be found between the quality of the imputation method and the gene cluster stability. Even if the comparison between clustering algorithms is a complex task, we observed that k-means approach is more efficient to conserve gene associations.

Conclusions

More than 6.000.000 independent simulations have assessed the quality of 12 imputation methods on five very different biological datasets. Important improvements have so been done since our last study. The EM_array approach constitutes one efficient method for restoring the missing expression gene values, with a lower estimation error level. Nonetheless, the presence of MVs even at a low rate is a major factor of gene cluster instability. Our study highlights the need for a systematic assessment of imputation methods and so of dedicated benchmarks. A noticeable point is the specific influence of some biological dataset.

Keywords


Article
Analysis of the heat shock response in mouse liver reveals transcriptional dependence on the nuclear receptor peroxisome proliferator-activated receptor α (PPARα)

Loading...
Loading...
Abstract

Abstract

Background

The nuclear receptor peroxisome proliferator-activated receptor alpha (PPARα) regulates responses to chemical or physical stress in part by altering expression of genes involved in proteome maintenance. Many of these genes are also transcriptionally regulated by heat shock (HS) through activation by HS factor-1 (HSF1). We hypothesized that there are interactions on a genetic level between PPARα and the HS response mediated by HSF1.

Results

Wild-type and PPARα-null mice were exposed to HS, the PPARα agonist WY-14,643 (WY), or both; gene and protein expression was examined in the livers of the mice 4 or 24 hrs after HS. Gene expression profiling identified a number of Hsp family members that were altered similarly in both mouse strains. However, most of the targets of HS did not overlap between strains. A subset of genes was shown by microarray and RT-PCR to be regulated by HS in a PPARα-dependent manner. HS also down-regulated a large set of mitochondrial genes specifically in PPARα-null mice that are known targets of PPARγ co-activator-1 (PGC-1) family members. Pretreatment of PPARα-null mice with WY increased expression of PGC-1β and target genes and prevented the down-regulation of the mitochondrial genes by HS. A comparison of HS genes regulated in our dataset with those identified in wild-type and HSF1-null mouse embryonic fibroblasts indicated that although many HS genes are regulated independently of both PPARα and HSF1, a number require both factors for HS responsiveness.

Conclusions

These findings demonstrate that the PPARα genotype has a dramatic effect on the transcriptional targets of HS and support an expanded role for PPARα in the regulation of proteome maintenance genes after exposure to diverse forms of environmental stress including HS.

Keywords


Article
Glutamate, aspartate and nucleotide transporters in the SLC17 family form four main phylogenetic clusters: evolution and tissue expression

Authors: Sreedharan Smitha --- Shaik Jafar HA --- Olszewski Pawel K --- Levine Allen S --- et al.
DOI: 10.1186/1471-2164-11-17
Pages: 17
Loading...
Loading...
Abstract

Abstract

Background

The SLC17 family of transporters transports the amino acids: glutamate and aspartate, and, as shown recently, also nucleotides. Vesicular glutamate transporters are found in distinct species, such as C. elegans, but the evolutionary origin of most of the genes in this family has been obscure.

Results

Our phylogenetic analysis shows that the SLC17 family consists of four main phylogenetic clades which were all present before the divergence of the insect lineage. One of these clades has not been previously described and it is not found in vertebrates. The clade containing Slc17a9 had the most restricted evolutionary history with only one member in most species. We detected expression of Slc17a1-17a4 only in the peripheral tissues but not in the CNS, while Slc17a5- Slc17a9 are highly expressed in both the CNS and periphery.

Conclusions

The in situ hybridization studies on vesicular nucleotide transporter revealed high expression throughout the cerebral cortex, certain areas in the hippocampus and in specific nuclei of the hypothalamus and thalamus. Some of the regions with high expression, such as the medial habenula and the dentate gyrus of the hippocampus, are important sites for purinergic neurotransmission. Noteworthy, other areas relying on purine-mediated signaling, such as the molecular layer of the dentate gyrus and the periaqueductal gray, lack or have a very low expression of Slc17a9, suggesting that there could be another nucleotide transporter in these regions.

Keywords


Article
Comparative transcriptomic analysis of follicle-enclosed oocyte maturational and developmental competence acquisition in two non-mammalian vertebrates

Authors: Gohin Maella --- Bobe Julien --- Chesnel Franck
DOI: 10.1186/1471-2164-11-18
Pages: 18
Loading...
Loading...
Abstract

Abstract

Background

In vertebrates, late oogenesis is a key period during which the oocyte acquires its ability to resume meiosis (i.e. maturational competence) and to develop, once fertilized, into a normal embryo (i.e. developmental competence). However, the molecular mechanisms involved in these key biological processes are far from being fully understood. In order to identify key mechanisms conserved among teleosts and amphibians, we performed a comparative analysis using ovarian tissue sampled at successive steps of the maturational competence acquisition process in the rainbow trout (Oncorhynchus mykiss) and in the clawed toad (Xenopus laevis). Our study aimed at identifying common differentially expressed genes during late oogenesis in both species. Using an existing transcriptomic analysis that had previously been carried out in rainbow trout, candidate genes were selected for subsequent quantitative PCR-based comparative analysis.

Results

Among the 1200 differentially expressed clones in rainbow trout, twenty-six candidate genes were selected for further analysis by real-time PCR in both species during late oogenesis. Among these genes, eight had similar expression profiles in trout and Xenopus. Six genes were down-regulated during oocyte maturation (cyp19a1, cyp17a1, tescalcin, tfr1, cmah, hsd11b3) while two genes exhibited an opposite pattern (apoc1, star). In order to document possibly conserved molecular mechanisms, four genes (star, cyp19a1, cyp17a1 and hsd11b3) were further studied due to their known or suspected role in steroidogenesis after characterization of the orthology relationships between rainbow trout and Xenopus genes. Apoc1 was also selected for further analysis because of its reported function in cholesterol transport, which may modulate steroidogenesis by regulating cholesterol bioavailability in the steroidogenic cells.

Conclusions

We have successfully identified orthologous genes exhibiting conserved expression profiles in the ovarian follicle during late oogenesis in both trout and Xenopus. While some identified genes were previously uncharacterized during Xenopus late oogenesis, the nature of these genes has pointed out molecular mechanisms possibly conserved in amphibians and teleosts. It should also be stressed that in addition to the already suspected importance of steroidogenesis in maturational competence acquisition, our approach has shed light on other regulatory pathways which may be involved in maturational and developmental competence acquisitions that will require further studies.

Keywords


Article
Genome-wide cloning and sequence analysis of leucine-rich repeat receptor-like protein kinase genes in Arabidopsis thaliana

Authors: Gou Xiaoping --- He Kai --- Yang Hui --- Yuan Tong --- et al.
DOI: 10.1186/1471-2164-11-19
Pages: 19
Loading...
Loading...
Abstract

Abstract

Background

Transmembrane receptor kinases play critical roles in both animal and plant signaling pathways regulating growth, development, differentiation, cell death, and pathogenic defense responses. In Arabidopsis thaliana, there are at least 223 Leucine-rich repeat receptor-like kinases (LRR-RLKs), representing one of the largest protein families. Although functional roles for a handful of LRR-RLKs have been revealed, the functions of the majority of members in this protein family have not been elucidated.

Results

As a resource for the in-depth analysis of this important protein family, the complementary DNA sequences (cDNAs) of 194 LRR-RLKs were cloned into the GatewayR donor vector pDONR/ZeoR and analyzed by DNA sequencing. Among them, 157 clones showed sequences identical to the predictions in the Arabidopsis sequence resource, TAIR8. The other 37 cDNAs showed gene structures distinct from the predictions of TAIR8, which was mainly caused by alternative splicing of pre-mRNA. Most of the genes have been further cloned into GatewayR destination vectors with GFP or FLAG epitope tags and have been transformed into Arabidopsis for in planta functional analysis. All clones from this study have been submitted to the Arabidopsis Biological Resource Center (ABRC) at Ohio State University for full accessibility by the Arabidopsis research community.

Conclusions

Most of the Arabidopsis LRR-RLK genes have been isolated and the sequence analysis showed a number of alternatively spliced variants. The generated resources, including cDNA entry clones, expression constructs and transgenic plants, will facilitate further functional analysis of the members of this important gene family.

Keywords


Article
Gene expression profiling in C57BL/6J and A/J mouse inbred strains reveals gene networks specific for brain regions independent of genetic background

Authors: de Jong S Simone --- Fuller Tova F --- Janson Esther --- Strengman Eric --- et al.
DOI: 10.1186/1471-2164-11-20
Pages: 20
Loading...
Loading...
Abstract

Abstract

Background

We performed gene expression profiling of the amygdala and hippocampus taken from inbred mouse strains C57BL/6J and A/J. The selected brain areas are implicated in neurobehavioral traits while these mouse strains are known to differ widely in behavior. Consequently, we hypothesized that comparing gene expression profiles for specific brain regions in these strains might provide insight into the molecular mechanisms of human neuropsychiatric traits. We performed a whole-genome gene expression experiment and applied a systems biology approach using weighted gene co-expression network analysis.

Results

We were able to identify modules of co-expressed genes that distinguish a strain or brain region. Analysis of the networks that are most informative for hippocampus and amygdala revealed enrichment in neurologically, genetically and psychologically related pathways. Close examination of the strain-specific gene expression profiles, however, revealed no functional relevance but a significant enrichment of single nucleotide polymorphisms in the probe sequences used for array hybridization. This artifact was not observed for the modules of co-expressed genes that distinguish amygdala and hippocampus.

Conclusions

The brain-region specific modules were found to be independent of genetic background and are therefore likely to represent biologically relevant molecular networks that can be studied to complement our knowledge about pathways in neuropsychiatric disease.

Keywords


Article
Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates

Loading...
Loading...
Abstract

Abstract

Background

The approaches for shotgun-based sequencing of vertebrate genomes are now well-established, and have resulted in the generation of numerous draft whole-genome sequence assemblies. In contrast, the process of refining those assemblies to improve contiguity and increase accuracy (known as 'sequence finishing') remains tedious, labor-intensive, and expensive. As a result, the vast majority of vertebrate genome sequences generated to date remain at a draft stage.

Results

To date, our genome sequencing efforts have focused on comparative studies of targeted genomic regions, requiring sequence finishing of large blocks of orthologous sequence (average size 0.5-2 Mb) from various subsets of 75 vertebrates. This experience has provided a unique opportunity to compare the relative effort required to finish shotgun-generated genome sequence assemblies from different species, which we report here. Importantly, we found that the sequence assemblies generated for the same orthologous regions from various vertebrates show substantial variation with respect to misassemblies and, in particular, the frequency and characteristics of sequence gaps. As a consequence, the work required to finish different species' sequences varied greatly. Application of the same standardized methods for finishing provided a novel opportunity to "assay" characteristics of genome sequences among many vertebrate species. It is important to note that many of the problems we have encountered during sequence finishing reflect unique architectural features of a particular vertebrate's genome, which in some cases may have important functional and/or evolutionary implications. Finally, based on our analyses, we have been able to improve our procedures to overcome some of these problems and to increase the overall efficiency of the sequence-finishing process, although significant challenges still remain.

Conclusion

Our findings have important implications for the eventual finishing of the draft whole-genome sequences that have now been generated for a large number of vertebrates.

Keywords


Article
Acetylation of p65 at lysine 314 is important for late NF-κB-dependent gene expression

Authors: Rothgiesser Karin M --- Fey Monika --- Hottiger Michael O
DOI: 10.1186/1471-2164-11-22
Pages: 22
Loading...
Loading...
Abstract

Abstract

Background

NF-κB regulates the expression of a large number of target genes involved in the immune and inflammatory response, apoptosis, cell proliferation, differentiation and survival. We have earlier reported that p65, a subunit of NF-κB, is acetylated in vitro and in vivo at three different lysines (K310, K314 and K315) by the histone acetyltransferase p300.

Results

In this study, we describe that site-specific mutation of p65 at lysines 314 and 315 enhances gene expression of a subset of NF-κB target genes including Mmp10 and Mmp13. Increased gene expression was mainly observed three hours after TNFα stimulation. Chromatin immunoprecipitation (ChIP) experiments with an antibody raised against acetylated lysine 314 revealed that chromatin-bound p65 is indeed acetylated at lysine 314.

Conclusions

Together, our results establish acetylation of K314 as an important regulatory modification of p65 and subsequently of NF-κB-dependent gene expression.

Keywords


Article
Transcriptomic profiling of bovine IVF embryos revealed candidate genes and pathways involved in early embryonic development

Authors: Huang Wen --- Yandell Brian S --- Khatib Hasan
DOI: 10.1186/1471-2164-11-23
Pages: 23
Loading...
Loading...
Abstract

Abstract

Background

Early embryonic loss is a large contributor to infertility in cattle. Although genetic factors are known to affect early embryonic development, the discovery of such factors has been a serious challenge. The objective of this study was to identify genes differentially expressed between blastocysts and degenerative embryos at early stages of development.

Results

Using microarrays, genome-wide RNA expression was profiled and compared for in vitro fertilization (IVF) - derived blastocysts and embryos undergoing degenerative development up to the same time point. Surprisingly similar transcriptomic profiles were found in degenerative embryos and blastocysts. Nonetheless, we identified 67 transcripts that significantly differed between these two groups of embryos at a 15% false discovery rate, including 33 transcripts showing at least a two-fold difference. Several signaling and metabolic pathways were found to be associated with the developmental status of embryos, among which were previously known important steroid biosynthesis and cell communication pathways in early embryonic development.

Conclusions

This study presents the first direct and comprehensive comparison of transcriptomes between IVF blastocysts and degenerative embryos, providing important information for potential genes and pathways associated with early embryonic development.

Keywords


Article
Comparison of clastogen-induced gene expression profiles in wild-type and DNA repair-deficient Rad54/Rad54B cells

Authors: Mahabir Anuska G --- Schaap Mirjam M --- Pennings Jeroen LA --- van Benthem Jan --- et al.
DOI: 10.1186/1471-2164-11-24
Pages: 24
Loading...
Loading...
Abstract

Abstract

Background

Previously we found that Rad54/Rad54B cells are more sensitive towards mitomycin C (MMC) as compared to wild-type (WT) cells. This difference in sensitivity was absent upon exposure to other clastogens like bleomycin (BLM) and γ-radiation. In order to get further insight into possible underlying mechanisms, gene expression changes in WT and Rad54/Rad54B MEFs (mouse embryonic fibroblasts) after exposure to the clastogens MMC and BLM were investigated. Exposures of these cells to mutagens (N-ac-AAF and ENU) and vehicle were taken as controls.

Results

Most exposures resulted in an induction of DNA damage signaling and apoptosis genes and a reduced expression of cell division genes in cells of both genotypes. As expected, responses to N-ac-AAF were very similar in both genotypes. ENU exposure did not lead to significant gene expression changes in cells of both genotypes, presumably due to its short half-life. Gene expression responses to clastogens, however, showed a genotype-dependent effect for BLM and MMC. MMC treated Rad54/Rad54B MEFs showed no induction of p53-signaling, DNA damage response and apoptosis as seen for all the other treatments.

Conclusion

These data support our finding that different types of clastogens exist and that responses to these types depend on the DNA repair status of the cells.

Keywords


Article
High-content siRNA screening of the kinome identifies kinases involved in Alzheimer's disease-related tau hyperphosphorylation

Loading...
Loading...
Abstract

Abstract

Background

Neurofibrillary tangles (NFT), a cardinal neuropathological feature of Alzheimer's disease (AD) that is highly correlated with synaptic loss and dementia severity, appear to be partly attributable to increased phosphorylation of the microtubule stabilizing protein tau at certain AD-related residues. Identifying the kinases involved in the pathologic phosphorylation of tau may provide targets at which to aim new AD-modifying treatments.

Results

We report results from a screen of 572 kinases in the human genome for effects on tau hyperphosphorylation using a loss of function, high-throughput RNAi approach. We confirm effects of three kinases from this screen, the eukaryotic translation initiation factor 2 α kinase 2 (EIF2AK2), the dual-specificity tyrosine-(Y)-phosphorylation regulated kinase 1A (DYRK1A), and the A-kinase anchor protein 13 (AKAP13) on tau phosphorylation at the 12E8 epitope (serine 262/serine 356). We provide evidence that EIF2AK2 effects may result from effects on tau protein expression, whereas DYRK1A and AKAP13 are likely more specifically involved in tau phosphorylation pathways.

Conclusions

These findings identify novel kinases that phosphorylate tau protein and provide a valuable reference data set describing the kinases involved in phosphorylating tau at an AD-relevant epitope.

Keywords


Article
1,25 dihydroxyvitamin D-mediated orchestration of anticancer, transcript-level effects in the immortalized, non-transformed prostate epithelial cell line, RWPE1

Authors: Kovalenko Pavlo L --- Zhang Zhentao --- Cui Min --- Clinton Steve K --- et al.
DOI: 10.1186/1471-2164-11-26
Pages: 26
Loading...
Loading...
Abstract

Abstract

Background

Prostate cancer is the second leading cause of cancer mortality among US men. Epidemiological evidence suggests that high vitamin D status protects men from prostate cancer and the active form of vitamin D, 1α,25 dihydroxyvitamin D3 (1,25(OH)2D) has anti-cancer effects in cultured prostate cells. Still, the molecular mechanisms and the gene targets for vitamin D-mediated prostate cancer prevention are unknown.

Results

We examined the effect of 1,25(OH)2D (+/- 100 nM, 6, 24, 48 h) on the transcript profile of proliferating RWPE1 cells, an immortalized, non-tumorigenic prostate epithelial cell line that is growth arrested by 1,25(OH)2D (Affymetrix U133 Plus 2.0, n = 4/treatment per time and dose). Our analysis revealed many transcript level changes at a 5% false detection rate: 6 h, 1571 (61% up), 24 h, 1816 (60% up), 48 h, 3566 (38% up). 288 transcripts were regulated similarly at all time points (182 up, 80 down) and many of the promoters for these transcripts contained putative vitamin D response elements. Functional analysis by pathway or Gene Set Analysis revealed early suppression of WNT, Notch, NF-kB, and IGF1 signaling. Transcripts related to inflammation were suppressed at 6 h (e.g. IL-1 pathway) and suppression of proinflammatory pathways continued at later time points (e.g. IL-17 and IL-6 pathways). There was also evidence for induction of anti-angiogenic pathways and induction of transcripts for protection from oxidative stress or maintenance of cell redox homeostasis at 6 h.

Conclusions

Our data reveal of large number of potential new, direct vitamin D target genes relevant to prostate cancer prevention. In addition, our data suggests that rather than having a single strong regulatory effect, vitamin D orchestrates a pattern of changes within prostate epithelial cells that limit or slow carcinogenesis.

Keywords


Article
Genomics Portals: integrative web-platform for mining genomics data

Authors: Shinde Kaustubh --- Phatak Mukta --- Johannes Freudenberg M --- Chen Jing --- et al.
DOI: 10.1186/1471-2164-11-27
Pages: 27
Loading...
Loading...
Abstract

Abstract

Background

A large amount of experimental data generated by modern high-throughput technologies is available through various public repositories. Our knowledge about molecular interaction networks, functional biological pathways and transcriptional regulatory modules is rapidly expanding, and is being organized in lists of functionally related genes. Jointly, these two sources of information hold a tremendous potential for gaining new insights into functioning of living systems.

Results

Genomics Portals platform integrates access to an extensive knowledge base and a large database of human, mouse, and rat genomics data with basic analytical visualization tools. It provides the context for analyzing and interpreting new experimental data and the tool for effective mining of a large number of publicly available genomics datasets stored in the back-end databases. The uniqueness of this platform lies in the volume and the diversity of genomics data that can be accessed and analyzed (gene expression, ChIP-chip, ChIP-seq, epigenomics, computationally predicted binding sites, etc), and the integration with an extensive knowledge base that can be used in such analysis.

Conclusion

The integrated access to primary genomics data, functional knowledge and analytical tools makes Genomics Portals platform a unique tool for interpreting results of new genomics experiments and for mining the vast amount of data stored in the Genomics Portals backend databases. Genomics Portals can be accessed and used freely at http://GenomicsPortals.org.

Keywords


Article
Regional regulation of transcription in the chicken genome

Authors: Nie Haisheng --- Crooijmans Richard PMA --- Bastiaansen John WM --- Megens Hendrik-Jan --- et al.
DOI: 10.1186/1471-2164-11-28
Pages: 28
Loading...
Loading...
Abstract

Abstract

Background

Over the past years, the relationship between gene transcription and chromosomal location has been studied in a number of different vertebrate genomes. Regional differences in gene expression have been found in several different species. The chicken genome, as the closest sequenced genome relative to mammals, is an important resource for investigating regional effects on transcription in birds and studying the regional dynamics of chromosome evolution by comparative analysis.

Results

We used gene expression data to survey eight chicken tissues and create transcriptome maps for all chicken chromosomes. The results reveal the presence of two distinct types of chromosomal regions characterized by clusters of highly or lowly expressed genes. Furthermore, these regions correlate highly with a number of genome characteristics. Regions with clusters of highly expressed genes have higher gene densities, shorter genes, shorter average intron and higher GC content compared to regions with clusters of lowly expressed genes. A comparative analysis between the chicken and human transcriptome maps constructed using similar panels of tissues suggests that the regions with clusters of highly expressed genes are relatively conserved between the two genomes.

Conclusions

Our results revealed the presence of a higher order organization of the chicken genome that affects gene expression, confirming similar observations in other species. These results will aid in the further understanding of the regional dynamics of chromosome evolution.

The microarray data used in this analysis have been submitted to NCBI GEO database under accession number GSE17108. The reviewer access link is: http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?token=tjwjpscyceqawjk&acc=GSE17108

Keywords


Article
Transcriptome analysis of the sex pheromone gland of the noctuid moth Heliothis virescens

Authors: Vogel Heiko --- Heidel Andrew J --- Heckel David G --- Groot Astrid T
DOI: 10.1186/1471-2164-11-29
Pages: 29
Loading...
Loading...
Abstract

Abstract

Background

The chemical components of sex pheromones have been determined for more than a thousand moth species, but so far only a handful of genes encoding enzymes responsible for the biosynthesis of these compounds have been identified. For understanding the evolution of moth sexual communication, it is essential to know which genes are involved in the production of specific pheromone components and what controls the variation in their relative frequencies in the pheromone blend. We used a transcriptomic approach to characterize the pheromone gland of the Noctuid moth Heliothis virescens, an important agricultural pest, in order to obtain substantial general sequence information and to identify a range of candidate genes involved in the pheromone biosynthetic pathway.

Results

To facilitate identifying sets of genes involved in a broad range of processes and to capture rare transcripts, we developed our majority of ESTs from a normalized cDNA library of Heliothis virescens pheromone glands (PG). Combining these with a non-normalized library yielded a total of 17,233 ESTs, which assembled into 2,082 contigs and 6,228 singletons. Using BLAST searches of the NR and Swissprot databases we were able to identify a large number of putative unique gene elements (unigenes), which we compared to those derived from previous transcriptomic surveys of the larval stage of Heliothis virescens. The distribution of unigenes among GO Biological Process functional groups shows an overall similarity between PG and larval transcriptomes, but with distinct enrichment of specific pathways in the PG. In addition, we identified a large number of candidate genes in the pheromone biosynthetic pathways.

Conclusion

These data constitute one of the first large-scale EST-projects for Noctuidae, a much-needed resource for exploring these pest species. Our analysis shows a surprisingly complex transcriptome and we identified a large number of potential pheromone biosynthetic pathway and immune-related genes that can be applied to population and systematic studies of Heliothis virescens and other Noctuidae.

Keywords


Article
Variable structure motifs for transcription factor binding sites

Authors: Reid John E --- Evans Kenneth J --- Dyer Nigel --- Wernisch Lorenz --- et al.
DOI: 10.1186/1471-2164-11-30
Pages: 30
Loading...
Loading...
Abstract

Abstract

Background

Classically, models of DNA-transcription factor binding sites (TFBSs) have been based on relatively few known instances and have treated them as sites of fixed length using position weight matrices (PWMs). Various extensions to this model have been proposed, most of which take account of dependencies between the bases in the binding sites. However, some transcription factors are known to exhibit some flexibility and bind to DNA in more than one possible physical configuration. In some cases this variation is known to affect the function of binding sites. With the increasing volume of ChIP-seq data available it is now possible to investigate models that incorporate this flexibility. Previous work on variable length models has been constrained by: a focus on specific zinc finger proteins in yeast using restrictive models; a reliance on hand-crafted models for just one transcription factor at a time; and a lack of evaluation on realistically sized data sets.

Results

We re-analysed binding sites from the TRANSFAC database and found motivating examples where our new variable length model provides a better fit. We analysed several ChIP-seq data sets with a novel motif search algorithm and compared the results to one of the best standard PWM finders and a recently developed alternative method for finding motifs of variable structure. All the methods performed comparably in held-out cross validation tests. Known motifs of variable structure were recovered for p53, Stat5a and Stat5b. In addition our method recovered a novel generalised version of an existing PWM for Sp1 that allows for variable length binding. This motif improved classification performance.

Conclusions

We have presented a new gapped PWM model for variable length DNA binding sites that is not too restrictive nor over-parameterised. Our comparison with existing tools shows that on average it does not have better predictive accuracy than existing methods. However, it does provide more interpretable models of motifs of variable structure that are suitable for follow-up structural studies. To our knowledge, we are the first to apply variable length motif models to eukaryotic ChIP-seq data sets and consequently the first to show their value in this domain. The results include a novel motif for the ubiquitous transcription factor Sp1.

Keywords


Article
Temperature increase prevails over acidification in gene expression modulation of amastigote differentiation in Leishmania infantum

Loading...
Loading...
Abstract

Abstract

Background

The extracellular promastigote and the intracellular amastigote stages alternate in the digenetic life cycle of the trypanosomatid parasite Leishmania. Amastigotes develop inside parasitophorous vacuoles of mammalian phagocytes, where they tolerate extreme environmental conditions. Temperature increase and pH decrease are crucial factors in the multifactorial differentiation process of promastigotes to amastigotes. Although expression profiling approaches for axenic, cell culture- and lesion-derived amastigotes have already been reported, the specific influence of temperature increase and acidification of the environment on developmental regulation of genes has not been previously studied. For the first time, we have used custom L. infantum genomic DNA microarrays to compare the isolated and the combined effects of both factors on the transcriptome.

Results

Immunofluorescence analysis of promastigote-specific glycoprotein gp46 and expression modulation analysis of the amastigote-specific A2 gene have revealed that concomitant exposure to temperature increase and acidification leads to amastigote-like forms. The temperature-induced gene expression profile in the absence of pH variation resembles the profile obtained under combined exposure to both factors unlike that obtained for exposure to acidification alone. In fact, the subsequent fold change-based global iterative hierarchical clustering analysis supports these findings.

Conclusions

The specific influence of temperature and pH on the differential regulation of genes described in this study and the evidence provided by clustering analysis is consistent with the predominant role of temperature increase over extracellular pH decrease in the amastigote differentiation process, which provides new insights into Leishmania physiology.

Keywords


Article
Integrative analysis of the heat shock response in Aspergillus fumigatus

Authors: Albrecht Daniela --- Guthke Reinhard --- Brakhage Axel A --- Kniemeyer Olaf
DOI: 10.1186/1471-2164-11-32
Pages: 32
Loading...
Loading...
Abstract

Abstract

Background

Aspergillus fumigatus is a thermotolerant human-pathogenic mold and the most common cause of invasive aspergillosis (IA) in immunocompromised patients. Its predominance is based on several factors most of which are still unknown. The thermotolerance of A. fumigatus is one of the traits which have been assigned to pathogenicity. It allows the fungus to grow at temperatures up to and above that of a fevered human host. To elucidate the mechanisms of heat resistance, we analyzed the change of the A. fumigatus proteome during a temperature shift from 30°C to 48°C by 2D-fluorescence difference gel electrophoresis (DIGE). To improve 2D gel image analysis results, protein spot quantitation was optimized by missing value imputation and normalization. Differentially regulated proteins were compared to previously published transcriptome data of A. fumigatus. The study was augmented by bioinformatical analysis of transcription factor binding sites (TFBSs) in the promoter region of genes whose corresponding proteins were differentially regulated upon heat shock.

Results

91 differentially regulated protein spots, representing 64 different proteins, were identified by mass spectrometry (MS). They showed a continuous up-, down- or an oscillating regulation. Many of the identified proteins were involved in protein folding (chaperones), oxidative stress response, signal transduction, transcription, translation, carbohydrate and nitrogen metabolism. A correlation between alteration of transcript levels and corresponding proteins was detected for half of the differentially regulated proteins. Interestingly, some previously undescribed putative targets for the heat shock regulator Hsf1 were identified. This provides evidence for Hsf1-dependent regulation of mannitol biosynthesis, translation, cytoskeletal dynamics and cell division in A. fumigatus. Furthermore, computational analysis of promoters revealed putative binding sites for an AP-2alpha-like transcription factor upstream of some heat shock induced genes. Until now, this factor has only been found in vertebrates.

Conclusions

Our newly established DIGE data analysis workflow yields improved data quality and is widely applicable for other DIGE datasets. Our findings suggest that the heat shock response in A. fumigatus differs from already well-studied yeasts and other filamentous fungi.

Keywords


Article
New insights into two distinct nucleosome distributions: comparison of cross-platform positioning datasets in the yeast genome

Authors: Feng Jihua --- Dai Xianhua --- Xiang Qian --- Dai Zhiming --- et al.
DOI: 10.1186/1471-2164-11-33
Pages: 33
Loading...
Loading...
Abstract

Abstract

Background

Recently, a number of high-resolution genome-wide maps of nucleosome locations in S. cerevisiae have been derived experimentally. However, nucleosome positions are determined in vivo by the combined effects of numerous factors. Consequently, nucleosomes are not simple static units, which may explain the discrepancies in reported nucleosome positions as measured by different experiments. In order to more accurately depict the genome-wide nucleosome distribution, we integrated multiple nucleosomal positioning datasets using a multi-angle analysis strategy.

Results

To evaluate the contribution of chromatin structure to transcription, we used the vast amount of available nucleosome analyzed data. Analysis of this data allowed for the comprehensive identification of the connections between promoter nucleosome positioning patterns and various transcription-dependent properties. Further, we characterised the function of nucleosome destabilisation in the context of transcription regulation. Our results indicate that genes with similar nucleosome occupancy patterns share general transcription attributes. We identified the local regulatory correlation (LRC) regions for two distinct types of nucleosomes and we assessed their regulatory properties. We also estimated the nucleosome reproducibility and measurement accuracy for high-confidence transcripts. We found that by maintaining a distance of ~13 bp between the upstream border of the +1 nucleosome and the transcription start sites (TSSs), the stable +1 nucleosome may form a barrier against the accessibility of the TSS and shape an optimum chromatin conformation for gene regulation. An in-depth analysis of nucleosome positioning in normally growing and heat shock cells suggested that the extent and patterns of nucleosome sliding are associated with gene activation.

Conclusions

Our results, which combine different types of data, suggest that cross-platform information, including discrepancy and consistency, reflects the mechanisms of nucleosome packaging in vivo more faithfully than individual studies. Furthermore, nucleosomes can be divided into two classes according to their stable and dynamic characteristics. We found that two different nucleosome-positioning characteristics may significantly impact transcription programs. Besides, some positioned-nucleosomes are involved in the transition from stable state to dynamic state in response to abrupt environmental changes.

Keywords


Article
In silico and biological survey of transcription-associated proteins implicated in the transcriptional machinery during the erythrocytic development of Plasmodium falciparum

Authors: Bischoff Emmanuel --- Vaquero Catherine
DOI: 10.1186/1471-2164-11-34
Pages: 34
Loading...
Loading...
Abstract

Abstract

Background

Malaria is the most important parasitic disease in the world with approximately two million people dying every year, mostly due to Plasmodium falciparum infection. During its complex life cycle in the Anopheles vector and human host, the parasite requires the coordinated and modulated expression of diverse sets of genes involved in epigenetic, transcriptional and post-transcriptional regulation. However, despite the availability of the complete sequence of the Plasmodium falciparum genome, we are still quite ignorant about Plasmodium mechanisms of transcriptional gene regulation. This is due to the poor prediction of nuclear proteins, cognate DNA motifs and structures involved in transcription.

Results

A comprehensive directory of proteins reported to be potentially involved in Plasmodium transcriptional machinery was built from all in silico reports and databanks. The transcription-associated proteins were clustered in three main sets of factors: general transcription factors, chromatin-related proteins (structuring, remodelling and histone modifying enzymes), and specific transcription factors. Only a few of these factors have been molecularly analysed. Furthermore, from transcriptome and proteome data we modelled expression patterns of transcripts and corresponding proteins during the intra-erythrocytic cycle. Finally, an interactome of these proteins based either on in silico or on 2-yeast-hybrid experimental approaches is discussed.

Conclusion

This is the first attempt to build a comprehensive directory of potential transcription-associated proteins in Plasmodium. In addition, all complete transcriptome, proteome and interactome raw data were re-analysed, compared and discussed for a better comprehension of the complex biological processes of Plasmodium falciparum transcriptional regulation during the erythrocytic development.

Keywords


Article
Assessing functional annotation transfers with inter-species conserved coexpression: application to Plasmodium falciparum

Authors: Bréhélin Laurent --- Florent Isabelle --- Gascuel Olivier --- Maréchal Éric
DOI: 10.1186/1471-2164-11-35
Pages: 35
Loading...
Loading...
Abstract

Abstract

Background

Plasmodium falciparum is the main causative agent of malaria. Of the 5 484 predicted genes of P. falciparum, about 57% do not have sufficient sequence similarity to characterized genes in other species to warrant functional assignments. Non-homology methods are thus needed to obtain functional clues for these uncharacterized genes. Gene expression data have been widely used in the recent years to help functional annotation in an intra-species way via the so-called Guilt By Association (GBA) principle.

Results

We propose a new method that uses gene expression data to assess inter-species annotation transfers. Our approach starts from a set of likely orthologs between a reference species (here S. cerevisiae and D. melanogaster) and a query species (P. falciparum). It aims at identifying clusters of coexpressed genes in the query species whose coexpression has been conserved in the reference species. These conserved clusters of coexpressed genes are then used to assess annotation transfers between genes with low sequence similarity, enabling reliable transfers of annotations from the reference to the query species. The approach was used with transcriptomic data sets of P. falciparum, S. cerevisiae and D. melanogaster, and enabled us to propose with high confidence new/refined annotations for several dozens hypothetical/putative P. falciparum genes. Notably, we revised the annotation of genes involved in ribosomal proteins and ribosome biogenesis and assembly, thus highlighting several potential drug targets.

Conclusions

Our approach uses both sequence similarity and gene expression data to help inter-species gene annotation transfers. Experiments show that this strategy improves the accuracy achieved when using solely sequence similarity and outperforms the accuracy of the GBA approach. In addition, our experiments with P. falciparum show that it can infer a function for numerous hypothetical genes.

Keywords


Article
The proteolytic system of lactic acid bacteria revisited: a genomic comparison

Authors: Liu Mengjin --- Bayjanov Jumamurat R --- Renckens Bernadet --- Nauta Arjen --- et al.
DOI: 10.1186/1471-2164-11-36
Pages: 36
Loading...
Loading...
Abstract

Abstract

Background

Lactic acid bacteria (LAB) are a group of gram-positive, lactic acid producing Firmicutes. They have been extensively used in food fermentations, including the production of various dairy products. The proteolytic system of LAB converts proteins to peptides and then to amino acids, which is essential for bacterial growth and also contributes significantly to flavor compounds as end-products. Recent developments in high-throughput genome sequencing and comparative genomics hybridization arrays provide us with opportunities to explore the diversity of the proteolytic system in various LAB strains.

Results

We performed a genome-wide comparative genomics analysis of proteolytic system components, including cell-wall bound proteinase, peptide transporters and peptidases, in 22 sequenced LAB strains. The peptidase families PepP/PepQ/PepM, PepD and PepI/PepR/PepL are described as examples of our in silico approach to refine the distinction of subfamilies with different enzymatic activities. Comparison of protein 3D structures of proline peptidases PepI/PepR/PepL and esterase A allowed identification of a conserved core structure, which was then used to improve phylogenetic analysis and functional annotation within this protein superfamily.

The diversity of proteolytic system components in 39 Lactococcus lactis strains was explored using pangenome comparative genome hybridization analysis. Variations were observed in the proteinase PrtP and its maturation protein PrtM, in one of the Opp transport systems and in several peptidases between strains from different Lactococcus subspecies or from different origin.

Conclusions

The improved functional annotation of the proteolytic system components provides an excellent framework for future experimental validations of predicted enzymatic activities. The genome sequence data can be coupled to other "omics" data e.g. transcriptomics and metabolomics for prediction of proteolytic and flavor-forming potential of LAB strains. Such an integrated approach can be used to tune the strain selection process in food fermentations.

Keywords


Article
A manual collection of Syt, Esyt, Rph3a, Rph3al, Doc2, and Dblc2 genes from 46 metazoan genomes - an open access resource for neuroscience and evolutionary biology

Authors: Craxton Molly
DOI: 10.1186/1471-2164-11-37
Pages: 37
Loading...
Loading...
Abstract

Abstract

Background

Synaptotagmin proteins were first identified in nervous tissue, residing in synaptic vesicles. Synaptotagmins were subsequently found to form a large family, some members of which play important roles in calcium triggered exocytic events. These members have been investigated intensively, but other family members are not well understood, making it difficult to grasp the meaning of family membership in functional terms. Further difficulty arises as families are defined quite legitimately in different ways: by common descent or by common possession of distinguishing features. One definition does not necessarily imply the other. The evolutionary range of genome sequences now available, can shed more light on synaptotagmin gene phylogeny and clarify family relationships. The aim of compiling this open access collection of synaptotagmin and synaptotagmin-like sequences, is that its use may lead to greater understanding of the biological function of these proteins in an evolutionary context.

Results

46 metazoan genomes were examined and their complement of Syt, Esyt, Rph3a, Rph3al, Doc2 and Dblc2 genes identified. All of the sequences were compared, named, then examined in detail. Esyt genes were formerly named Fam62. The species in this collection are Trichoplax, Nematostella, Capitella, Helobdella, Lottia, Ciona, Strongylocentrotus, Branchiostoma, Ixodes, Daphnia, Acyrthosiphon, Tribolium, Nasonia, Apis, Anopheles, Drosophila, Caenorhabditis, Takifugu, Tetraodon, Gasterosteus, Oryzias, Danio, Xenopus, Anolis, Gallus, Taeniopygia,Ornithorhynchus, Monodelphis, Mus and Homo. All of the data described in this paper is available as additional files.

Conclusions

Only a subset of synaptotagmin proteins appear able to function as calcium triggers. Syt1, Syt7 and Syt9 are ancient conserved synaptotagmins of this type. Some animals carry extensive repertoires of synaptotagmin genes. Other animals of no less complexity, carry only a small repertoire. Current understanding does not explain why this is so. The biological roles of many synaptotagmins remain to be understood. This collection of genes offers prospects for fruitful speculation about the functional roles of the synaptotagmin repertoires of different animals and includes a great range of biological complexity. With reference to this gene collection, functional relationships among Syt, Esyt, Rph3a, Rph3al, Doc2 and Dblc2 genes, which encode similar proteins, can better be assessed in future.

Keywords


Article
High-throughput SNP discovery through deep resequencing of a reduced representation library to anchor and orient scaffolds in the soybean whole genome sequence

Loading...
Loading...
Abstract

Abstract

Background

The Soybean Consensus Map 4.0 facilitated the anchoring of 95.6% of the soybean whole genome sequence developed by the Joint Genome Institute, Department of Energy, but its marker density was only sufficient to properly orient 66% of the sequence scaffolds. The discovery and genetic mapping of more single nucleotide polymorphism (SNP) markers were needed to anchor and orient the remaining genome sequence. To that end, next generation sequencing and high-throughput genotyping were combined to obtain a much higher resolution genetic map that could be used to anchor and orient most of the remaining sequence and to help validate the integrity of the existing scaffold builds.

Results

A total of 7,108 to 25,047 predicted SNPs were discovered using a reduced representation library that was subsequently sequenced by the Illumina sequence-by-synthesis method on the clonal single molecule array platform. Using multiple SNP prediction methods, the validation rate of these SNPs ranged from 79% to 92.5%. A high resolution genetic map using 444 recombinant inbred lines was created with 1,790 SNP markers. Of the 1,790 mapped SNP markers, 1,240 markers had been selectively chosen to target existing unanchored or un-oriented sequence scaffolds, thereby increasing the amount of anchored sequence to 97%.

Conclusion

We have demonstrated how next generation sequencing was combined with high-throughput SNP detection assays to quickly discover large numbers of SNPs. Those SNPs were then used to create a high resolution genetic map that assisted in the assembly of scaffolds from the 8× whole genome shotgun sequences into pseudomolecules corresponding to chromosomes of the organism.

Keywords


Article
Gene expression profiles in Atlantic salmon adipose-derived stromo-vascular fraction during differentiation into adipocytes

Authors: Todorčević Marijana --- Škugor Stanko --- Krasnov Aleksei --- Ruyter Bente
DOI: 10.1186/1471-2164-11-39
Pages: 39
Loading...
Loading...
Abstract

Abstract

Background

Excessive fat deposition is one of the largest problems faced by salmon aquaculture industries, leading to production losses due to high volume of adipose tissue offal. In addition, increased lipid accumulation may impose considerable stress on adipocytes leading to adipocyte activation and production and secretion of inflammatory mediators, as observed in mammals.

Results

Microarray and qPCR analyses were performed to follow transcriptome changes during adipogenesis in the primary culture of adipose stromo-vascular fraction (aSVF) of Atlantic salmon. Cellular heterogeneity decreased by confluence as evidenced by the down-regulation of markers of osteo/chondrogenic, myogenic, immune and vasculature lineages. Transgelin (TAGLN), a marker of the multipotent pericyte, was prominently expressed around confluence while adipogenic PPARγ was up-regulated already in subconfluent cells. Proliferative activity and subsequent cell cycle arrest were reflected in the fluctuations of pro- and anti-mitotic regulators. Marked regulation of genes involved in lipid and glucose metabolism and pathways producing NADPH and glycerol-3-phosphate (G3P) was seen during the terminal differentiation, also characterised by diverse stress responses. Activation of the glutathione and thioredoxin antioxidant systems and changes in the iron metabolism suggested the need for protection against oxidative stress. Signs of endoplasmic reticulum (ER) stress and unfolded protein response (UPR) occured in parallel with the increased lipid droplet (LD) formation and production of secretory proteins (adipsin, visfatin). The UPR markers XBP1 and ATF6 were induced together with genes involved in ubiquitin-proteasome and lysosomal proteolysis. Concurrently, translation was suppressed as evidenced by the down-regulation of genes encoding elongation factors and components of the ribosomal machinery. Notably, expression changes of a panel of genes that belong to different immune pathways were seen throughout adipogenesis. The induction of AP1 (Jun, Fos), which is a master regulator of stress responses, culminated by the end of adipogenesis, concurrent with the maximal observed lipid deposition.

Conclusions

Our data point to an intimate relationship between metabolic regulation and immune responses in white adipocytes of a cold-blooded vertebrate. Stress imposed on adipocytes by LD formation and expansion is prominently reflected in the ER compartment and the activated UPR response could have an important role at visceral obesity in fish.

Keywords


Article
Evolution of electron transfer out of the cell: comparative genomics of six Geobacter genomes

Authors: Butler Jessica E --- Young Nelson D --- Lovley Derek R
DOI: 10.1186/1471-2164-11-40
Pages: 40
Loading...
Loading...
Abstract

Abstract

Background

Geobacter species grow by transferring electrons out of the cell - either to Fe(III)-oxides or to man-made substances like energy-harvesting electrodes. Study of Geobacter sulfurreducens has shown that TCA cycle enzymes, inner-membrane respiratory enzymes, and periplasmic and outer-membrane cytochromes are required. Here we present comparative analysis of six Geobacter genomes, including species from the clade that predominates in the subsurface. Conservation of proteins across the genomes was determined to better understand the evolution of Geobacter species and to create a metabolic model applicable to subsurface environments.

Results

The results showed that enzymes for acetate transport and oxidation, and for proton transport across the inner membrane were well conserved. An NADH dehydrogenase, the ATP synthase, and several TCA cycle enzymes were among the best conserved in the genomes. However, most of the cytochromes required for Fe(III)-reduction were not, including many of the outer-membrane cytochromes. While conservation of cytochromes was poor, an abundance and diversity of cytochromes were found in every genome, with duplications apparent in several species.

Conclusions

These results indicate there is a common pathway for acetate oxidation and energy generation across the family and in the last common ancestor. They also suggest that while cytochromes are important for extracellular electron transport, the path of electrons across the periplasm and outer membrane is variable. This combination of abundant cytochromes with weak sequence conservation suggests they may not be specific terminal reductases, but rather may be important in their heme-bearing capacity, as sinks for electrons between the inner-membrane electron transport chain and the extracellular acceptor.

Keywords


Article
Deletions of Immunoglobulin heavy chain and T cell receptor gene regions are uniquely associated with lymphoid blast transformation of chronic myeloid leukemia

Loading...
Loading...
Abstract

Abstract

Background

Chronic myelogenous leukemia (CML) results from the neoplastic transformation of a haematopoietic stem cell. The hallmark genetic abnormality of CML is a chimeric BCR/ABL1 fusion gene resulting from the Philadelphia chromosome rearrangement t(9;22)(q34;q11). Clinical and laboratory studies indicate that the BCR/ABL1 fusion protein is essential for initiation, maintenance and progression of CML, yet the event(s) driving the transformation from chronic phase to blast phase are poorly understood.

Results

Here we report multiple genome aberrations in a collection of 78 CML and 14 control samples by oligonucleotide array comparative genomic hybridization. We found a unique signature of genome deletions within the immunoglobulin heavy chain (IGH) and T cell receptor regions (TCR), frequently accompanied by concomitant loss of sequences within the short arm regions of chromosomes 7 and 9, including IKZF1, HOXA7, CDKN2A/2B, MLLT3, IFNA/B, RNF38, PAX5, JMJD2C and PDCD1LG2 genes.

Conclusions

None of these genome losses were detected in any of the CML samples with myeloid transformation, chronic phase or controls, indicating that their presence is obligatory for the development of a malignant clone with a lymphoid phenotype. Notably, the coincidental deletions at IGH and TCR regions appear to precede the loss of IKZF1 and/or p16 genes in CML indicating a possible involvement of RAG in these deletions.

Keywords


Article
Evolution and diversity of secretome genes in the apicomplexan parasite Theileria annulata

Authors: Weir William --- Karagenç Tülin --- Baird Margaret --- Tait Andy --- et al.
DOI: 10.1186/1471-2164-11-42
Pages: 42
Loading...
Loading...
Abstract

Abstract

Background

Little is known about how apicomplexan parasites have evolved to infect different host species and cell types. Theileria annulata and Theileria parva invade and transform bovine leukocytes but each species favours a different host cell lineage. Parasite-encoded proteins secreted from the intracellular macroschizont stage within the leukocyte represent a critical interface between host and pathogen systems. Genome sequencing has revealed that several Theileria-specific gene families encoding secreted proteins are positively selected at the inter-species level, indicating diversification between the species. We extend this analysis to the intra-species level, focusing on allelic diversity of two major secretome families. These families represent a well-characterised group of genes implicated in control of the host cell phenotype and a gene family of unknown function. To gain further insight into their evolution and function, this study investigates whether representative genes of these two families are diversifying or constrained within the T. annulata population.

Results

Strong evidence is provided that the sub-telomerically encoded SVSP family and the host-nucleus targeted TashAT family have evolved under contrasting pressures within natural T. annulata populations. SVSP genes were found to possess atypical codon usage and be evolving neutrally, with high levels of nucleotide substitutions and multiple indels. No evidence of geographical sub-structuring of allelic sequences was found. In contrast, TashAT family genes, implicated in control of host cell gene expression, are strongly conserved at the protein level and geographically sub-structured allelic sequences were identified among Tunisian and Turkish isolates. Although different copy numbers of DNA binding motifs were identified in alleles of TashAT proteins, motif periodicity was strongly maintained, implying conserved functional activity of these sites.

Conclusions

This analysis provides evidence that two distinct secretome genes families have evolved under contrasting selective pressures. The data supports current hypotheses regarding the biological role of TashAT family proteins in the management of host cell phenotype that may have evolved to allow adaptation of T. annulata to a specific host cell lineage. We provide new evidence of extensive allelic diversity in representative members of the enigmatic SVSP gene family, which supports a putative role for the encoded products in subversion of the host immune response.

Keywords


Article
Proteomic analysis of peach fruit mesocarp softening and chilling injury using difference gel electrophoresis (DIGE)

Loading...
Loading...
Abstract

Abstract

Background

Peach fruit undergoes a rapid softening process that involves a number of metabolic changes. Storing fruit at low temperatures has been widely used to extend its postharvest life. However, this leads to undesired changes, such as mealiness and browning, which affect the quality of the fruit. In this study, a 2-D DIGE approach was designed to screen for differentially accumulated proteins in peach fruit during normal softening as well as under conditions that led to fruit chilling injury.

Results

The analysis allowed us to identify 43 spots -representing about 18% of the total number analyzed- that show statistically significant changes. Thirty-nine of the proteins could be identified by mass spectrometry. Some of the proteins that changed during postharvest had been related to peach fruit ripening and cold stress in the past. However, we identified other proteins that had not been linked to these processes. A graphical display of the relationship between the differentially accumulated proteins was obtained using pairwise average-linkage cluster analysis and principal component analysis. Proteins such as endopolygalacturonase, catalase, NADP-dependent isocitrate dehydrogenase, pectin methylesterase and dehydrins were found to be very important for distinguishing between healthy and chill injured fruit. A categorization of the differentially accumulated proteins was performed using Gene Ontology annotation. The results showed that the 'response to stress', 'cellular homeostasis', 'metabolism of carbohydrates' and 'amino acid metabolism' biological processes were affected the most during the postharvest.

Conclusions

Using a comparative proteomic approach with 2-D DIGE allowed us to identify proteins that showed stage-specific changes in their accumulation pattern. Several proteins that are related to response to stress, cellular homeostasis, cellular component organization and carbohydrate metabolism were detected as being differentially accumulated. Finally, a significant proportion of the proteins identified had not been associated with softening, cold storage or chilling injury-altered fruit before; thus, comparative proteomics has proven to be a valuable tool for understanding fruit softening and postharvest.

Keywords


Article
Identification and characterization of repetitive extragenic palindromes (REP)-associated tyrosine transposases: implications for REP evolution and dynamics in bacterial genomes

Authors: Nunvar Jaroslav --- Huckova Tereza --- Licha Irena
DOI: 10.1186/1471-2164-11-44
Pages: 44
Loading...
Loading...
Abstract

Abstract

Background

Bacterial repetitive extragenic palindromes (REPs) compose a distinct group of genomic repeats. They usually occur in high abundance (>100 copies/genome) and are often arranged in composite repetitive structures - bacterial interspersed mosaic elements (BIMEs). In BIMEs, regularly spaced REPs are present in alternating orientations. BIMEs and REPs have been shown to serve as binding sites for several proteins and suggested to play role in chromosome organization and transcription termination. Their origins are, at present, unknown.

Results

In this report, we describe a novel class of putative transposases related to IS200/IS605 transposase family and we demonstrate that they are obligately associated with bacterial REPs. Open reading frames coding for these REP-associated tyrosine transposases (RAYTs) are always flanked by two REPs in inverted orientation and thus constitute a unit reminiscent of typical transposable elements. Besides conserved residues involved in catalysis of DNA cleavage, RAYTs carry characteristic structural motifs that are absent in typical IS200/IS605 transposases. DNA sequences flanking rayt genes are in one third of examined cases arranged in modular BIMEs. RAYTs and their flanking REPs apparently coevolve with each other. The rayt genes themselves are subject to rapid evolution, substantially exceeding the substitution rate of neighboring genes. Strong correlation was found between the presence of a particular rayt in a genome and the abundance of its cognate REPs.

Conclusions

In light of our findings, we propose that RAYTs are responsible for establishment of REPs and BIMEs in bacterial genomes, as well as for their exceptional dynamics and species-specifity. Conversely, we suggest that BIMEs are in fact a special type of nonautonomous transposable elements, mobilizable by RAYTs.

Keywords


Article
Novel OBP genes similar to hamster Aphrodisin in the bank vole, Myodes glareolus

Authors: Stopková Romana --- Zdráhal Zbyněk --- Ryba Štěpán --- Šedo Ondřej --- et al.
DOI: 10.1186/1471-2164-11-45
Pages: 45
Loading...
Loading...
Abstract

Abstract

Background

Chemical communication in mammals involves globular lipocalins that protect and transport pheromones during their passage out of the body. Efficient communication via this protein - pheromone complex is essential for triggering multiple responses including aggression, mate choice, copulatory behaviour, and onset and synchronization of oestrus. The roles of lipocalins in communication were studied in many organisms and especially in mice (i.e. Mus musculus domesticus) which excrete Major Urinary Proteins (Mup) in excessive amounts in saliva and urine. Other mammals, however, often lack the genes for Mups or their expression is very low. Therefore, we aimed at characterization of candidate lipocalins in Myodes glareolus which are potentially linked to chemical communication. One of them is Aphrodisin which is a unique lipocalin that was previously described from hamster vaginal discharge and is known to carry pheromones stimulating copulatory behaviour in males.

Results

Here we show that Aphrodisin-like proteins exist in other species, belong to a group of Odorant Binding Proteins (Obp), and contrary to the expression of Aphrodisin only in hamster genital tract and parotid glands of females, we have detected these transcripts in both sexes of M. glareolus with the expression confirmed in various tissues including prostate, prepucial and salivary glands, liver and uterus. On the level of mRNA, we have detected three different gene variants. To assess their relevance for chemical communication we investigated the occurrence of particular proteins in saliva, urine and vaginal discharge. On the protein level we confirmed the presence of Obp2 and Obp3 in both saliva and urine. Appropriate bands in the range of 17-20 kDa from vaginal discharge were, however, beyond the MS detection limits.

Conclusion

Our results demonstrate that three novel Obps (Obp1, Obp2, and Obp3) are predominant lipocalins in Myodes urine and saliva. On the protein level we have detected further variants and thus we assume that similarly as Major Urinary Proteins in mice, these proteins may be important in chemical communication in this Cricetid rodent.

Keywords


Article
Comparative metagenomic analysis of plasmid encoded functions in the human gut microbiome

Authors: Jones Brian V --- Sun Funing --- Marchesi Julian R
DOI: 10.1186/1471-2164-11-46
Pages: 46
Loading...
Loading...
Abstract

Abstract

Background

Little is known regarding the pool of mobile genetic elements associated with the human gut microbiome. In this study we employed the culture independent TRACA system to isolate novel plasmids from the human gut microbiota, and a comparative metagenomic analysis to investigate the distribution and relative abundance of functions encoded by these plasmids in the human gut microbiome.

Results

Novel plasmids were acquired from the human gut microbiome, and homologous nucleotide sequences with high identity (>90%) to two plasmids (pTRACA10 and pTRACA22) were identified in the multiple human gut microbiomes analysed here. However, no homologous nucleotide sequences to these plasmids were identified in the murine gut or environmental metagenomes. Functions encoded by the plasmids pTRACA10 and pTRACA22 were found to be more prevalent in the human gut microbiome when compared to microbial communities from other environments. Among the most prevalent functions identified was a putative RelBE toxin-antitoxin (TA) addiction module, and subsequent analysis revealed that this was most closely related to putative TA modules from gut associated bacteria belonging to the Firmicutes. A broad phylogenetic distribution of RelE toxin genes was observed in gut associated bacterial species (Firmicutes, Bacteroidetes, Actinobacteria and Proteobacteria), but no RelE homologues were identified in gut associated archaeal species. We also provide indirect evidence for the horizontal transfer of these genes between bacterial species belonging to disparate phylogenetic divisions, namely Gram negative Proteobacteria and Gram positive species from the Firmicutes division.

Conclusions

The application of a culture independent system to capture novel plasmids from the human gut mobile metagenome, coupled with subsequent comparative metagenomic analysis, highlighted the unexpected prevalence of plasmid encoded functions in the gut microbial ecosystem. In particular the increased relative abundance and broad phylogenetic distribution was identified for a putative RelBE toxin/antitoxin addiction module, a putative phosphohydrolase/phosphoesterase, and an ORF of unknown function. Our analysis also indicates that some plasmids or plasmid families are present in the gut microbiomes of geographically isolated human hosts with a broad global distribution (America, Japan and Europe), and are potentially unique to the human gut microbiome. Further investigation of the plasmid population associated with the human gut is likely to provide important insights into the development, functioning and evolution of the human gut microbiota.

Keywords


Article
Expression profiling of prospero in the Drosophila larval chemosensory organ: Between growth and outgrowth

Authors: Guenin Laure --- Raharijaona Mahatsangy --- Houlgatte Rémi --- Baba-Aissa Fawzia
DOI: 10.1186/1471-2164-11-47
Pages: 47
Loading...
Loading...
Abstract

Abstract

Background

The antenno-maxilary complex (AMC) forms the chemosensory system of the Drosophila larva and is involved in gustatory and olfactory perception. We have previously shown that a mutant allele of the homeodomain transcription factor Prospero (prosVoila1, V1), presents several developmental defects including abnormal growth and altered taste responses. In addition, many neural tracts connecting the AMC to the central nervous system (CNS) were affected. Our earlier reports on larval AMC did not argue in favour of a role of pros in cell fate decision, but strongly suggested that pros could be involved in the control of other aspect of neuronal development. In order to identify these functions, we used microarray analysis of larval AMC and CNS tissue isolated from the wild type, and three other previously characterised prospero alleles, including the V1 mutant, considered as a null allele for the AMC.

Results

A total of 17 samples were first analysed with hierarchical clustering. To determine those genes affected by loss of pros function, we calculated a discriminating score reflecting the differential expression between V1 mutant and other pros alleles. We identified a total of 64 genes in the AMC. Additional manual annotation using all the computed information on the attributed role of these genes in the Drosophila larvae nervous system, enabled us to identify one functional category of potential Prospero target genes known to be involved in neurite outgrowth, synaptic transmission and more specifically in neuronal connectivity remodelling. The second category of genes found to be differentially expressed between the null mutant AMC and the other alleles concerned the development of the sensory organs and more particularly the larval olfactory system. Surprisingly, a third category emerged from our analyses and suggests an association of pros with the genes that regulate autophagy, growth and insulin pathways. Interestingly, EGFR and Notch pathways were represented in all of these three functional categories. We now propose that Pros could perform all of these different functions through the modulation of these two antagonistic and synergic pathways.

Conclusions

The current data contribute to the clarification of the prospero function in the larval AMC and show that pros regulates different function in larvae as compared to those controlled by this gene in embryos. In the future, the possible mechanism by which Pros could achieve its function in the AMC will be explored in detail.

Keywords


Article
Intergenic, gene terminal, and intragenic CpG islands in the human genome

Loading...
Loading...
Abstract

Abstract

Background

Recently, it has been discovered that the human genome contains many transcription start sites for non-coding RNA. Regulatory regions related to transcription of this non-coding RNAs are poorly studied. Some of these regulatory regions may be associated with CpG islands located far from transcription start-sites of any protein coding gene. The human genome contains many such CpG islands; however, until now their properties were not systematically studied.

Results

We studied CpG islands located in different regions of the human genome using methods of bioinformatics and comparative genomics. We have observed that CpG islands have a preference to overlap with exons, including exons located far from transcription start site, but usually extend well into introns. Synonymous substitution rate of CpG-containing codons becomes substantially reduced in regions where CpG islands overlap with protein-coding exons, even if they are located far downstream from transcription start site. CAGE tag analysis displayed frequent transcription start sites in all CpG islands, including those found far from transcription start sites of protein coding genes. Computational prediction and analysis of published ChIP-chip data revealed that CpG islands contain an increased number of sites recognized by Sp1 protein. CpG islands containing more CAGE tags usually also contain more Sp1 binding sites. This is especially relevant for CpG islands located in 3' gene regions. Various examples of transcription, confirmed by mRNAs or ESTs, but with no evidence of protein coding genes, were found in CAGE-enriched CpG islands located far from transcription start site of any known protein coding gene.

Conclusions

CpG islands located far from transcription start sites of protein coding genes have transcription initiation activity and display Sp1 binding properties. In exons, overlapping with these islands, the synonymous substitution rate of CpG containing codons is decreased. This suggests that these CpG islands are involved in transcription initiation, possibly of some non-coding RNAs.

Keywords


Article
Identification of interacting transcription factors regulating tissue gene expression in human

Authors: Hu Zihua --- Gallo Steven M
DOI: 10.1186/1471-2164-11-49
Pages: 49
Loading...
Loading...
Abstract

Abstract

Background

Tissue gene expression is generally regulated by multiple transcription factors (TFs). A major first step toward understanding how tissues achieve their specificity is to identify, at the genome scale, interacting TFs regulating gene expression in different tissues. Despite previous discoveries, the mechanisms that control tissue gene expression are not fully understood.

Results

We have integrated a function conservation approach, which is based on evolutionary conservation of biological function, and genes with highest expression level in human tissues to predict TF pairs controlling tissue gene expression. To this end, we have identified 2549 TF pairs associated with a certain tissue. To find interacting TFs controlling tissue gene expression in a broad spatial and temporal manner, we looked for TF pairs common to the same type of tissues and identified 379 such TF pairs, based on which TF-TF interaction networks were further built. We also found that tissue-specific TFs may play an important role in recruiting non-tissue-specific TFs to the TF-TF interaction network, offering the potential for coordinating and controlling tissue gene expression across a variety of conditions.

Conclusion

The findings from this study indicate that tissue gene expression is regulated by large sets of interacting TFs either on the same promoter of a gene or through TF-TF interaction networks.

Keywords


Article
Integrating multiple genome annotation databases improves the interpretation of microarray gene expression data

Authors: Yin Jun --- McLoughlin Sarah --- Jeffery Ian B --- Glaviano Antonino --- et al.
DOI: 10.1186/1471-2164-11-50
Pages: 50
Loading...
Loading...
Abstract

Abstract

Background

The Affymetrix GeneChip is a widely used gene expression profiling platform. Since the chips were originally designed, the genome databases and gene definitions have been considerably updated. Thus, more accurate interpretation of microarray data requires parallel updating of the specificity of GeneChip probes. We propose a new probe remapping protocol, using the zebrafish GeneChips as an example, by removing nonspecific probes, and grouping the probes into transcript level probe sets using an integrated zebrafish genome annotation. This genome annotation is based on combining transcript information from multiple databases. This new remapping protocol, especially the new genome annotation, is shown here to be an important factor in improving the interpretation of gene expression microarray data.

Results

Transcript data from the RefSeq, GenBank and Ensembl databases were downloaded from the UCSC genome browser, and integrated to generate a combined zebrafish genome annotation. Affymetrix probes were filtered and remapped according to the new annotation. The influence of transcript collection and gene definition methods was tested using two microarray data sets. Compared to remapping using a single database, this new remapping protocol results in up to 20% more probes being retained in the remapping, leading to approximately 1,000 more genes being detected. The differentially expressed gene lists are consequently increased by up to 30%. We are also able to detect up to three times more alternative splicing events. A small number of the bioinformatics predictions were confirmed using real-time PCR validation.

Conclusions

By combining gene definitions from multiple databases, it is possible to greatly increase the numbers of genes and splice variants that can be detected in microarray gene expression experiments.

Keywords


Article
An insight into the sialotranscriptome of the West Nile mosquito vector, Culex tarsalis

Authors: Calvo Eric --- Sanchez-Vargas Irma --- Favreau Amanda J --- Barbian Kent D --- et al.
DOI: 10.1186/1471-2164-11-51
Pages: 51
Loading...
Loading...
Abstract

Abstract

Background

Saliva of adult female mosquitoes help sugar and blood feeding by providing enzymes and polypeptides that help sugar digestion, control microbial growth and counteract their vertebrate host hemostasis and inflammation. Mosquito saliva also potentiates the transmission of vector borne pathogens, including arboviruses. Culex tarsalis is a bird feeding mosquito vector of West Nile Virus closely related to C. quinquefasciatus, a mosquito relatively recently adapted to feed on humans, and the only mosquito of the genus Culex to have its sialotranscriptome so far described.

Results

A total of 1,753 clones randomly selected from an adult female C. tarsalis salivary glands (SG) cDNA library were sequenced and used to assemble a database that yielded 809 clusters of related sequences, 675 of which were singletons. Primer extension experiments were performed in selected clones to further extend sequence coverage, allowing for the identification of 283 protein sequences, 80 of which code for putative secreted proteins.

Conclusion

Comparison of the C. tarsalis sialotranscriptome with that of C. quinquefasciatus reveals accelerated evolution of salivary proteins as compared to housekeeping proteins. The average amino acid identity among salivary proteins is 70.1%, while that for housekeeping proteins is 91.2% (P < 0.05), and the codon volatility of secreted proteins is significantly higher than those of housekeeping proteins. Several protein families previously found exclusive of mosquitoes, including only in the Aedes genus have been identified in C. tarsalis. Interestingly, a protein family so far unique to C. quinquefasciatus, with 30 genes, is also found in C. tarsalis, indicating it was not a specific C. quinquefasciatus acquisition in its evolution to optimize mammal blood feeding.

Keywords


Article
Deep sequencing of small RNA libraries reveals dynamic regulation of conserved and novel microRNAs and microRNA-stars during silkworm development

Authors: Jagadeeswaran Guru --- Zheng Yun --- Sumathipala Niranji --- Jiang Haobo --- et al.
DOI: 10.1186/1471-2164-11-52
Pages: 52
Loading...
Loading...
Abstract

Abstract

Background

In eukaryotes, microRNAs (miRNAs) have emerged as critical regulators of gene expression. The Silkworm (Bombyx mori L.) is one of the most suitable lepidopteran insects for studying the molecular aspects of metamorphosis because of its large size, availability of mutants and genome sequence. Besides, this insect also has been amply studied from a physiological and biochemical perspective. Deep sequencing of small RNAs isolated from different stages of silkworm is a powerful tool not only for measuring the changes in miRNA profile but also for discovering novel miRNAs.

Results

We generated small RNA libraries from feeding larvae, spinning larvae, pupae and adults of B. mori and obtained ~2.5 million reads of 18-30 nt. Sequence analysis identified 14 novel and 101 conserved miRNAs. Most novel miRNAs are preferentially expressed in pupae, whereas more than 95% of the conserved miRNAs are dynamically regulated during different developmental stages. Remarkably, the miRNA-star (miR*) of four miRNAs are expressed at much higher levels than their corresponding miRNAs, and their expression profiles are distinct from their corresponding miRNA profiles during different developmental stages. Additionally, we detected two antisense miRNA loci (miR-263-S and miR-263-AS; miR-306-S and miR-306-AS) that are expressed in sense and antisense directions. Interestingly, miR-263 and miR-306 are preferentially and abundantly expressed in pupae and adults, respectively.

Conclusions

We identified 101 homologs of conserved miRNAs, 14 species-specific and two antisense miRNAs in the silkworm. Our results provided deeper insights into changes in conserved and novel miRNA and miRNA* accumulation during development.

Keywords


Article
Genome-wide detection of predicted non-coding RNAs in Rhizobium etli expressed during free-living and host-associated growth using a high-resolution tiling array

Authors: Vercruysse Maarten --- Fauvart Maarten --- Cloots Lore --- Engelen Kristof --- et al.
DOI: 10.1186/1471-2164-11-53
Pages: 53
Loading...
Loading...
Abstract

Abstract

Background

Non-coding RNAs (ncRNAs) play a crucial role in the intricate regulation of bacterial gene expression, allowing bacteria to quickly adapt to changing environments. In the past few years, a growing number of regulatory RNA elements have been predicted by computational methods, mostly in well-studied γ-proteobacteria but lately in several α-proteobacteria as well. Here, we have compared an extensive compilation of these non-coding RNA predictions to intergenic expression data of a whole-genome high-resolution tiling array in the soil-dwelling α-proteobacterium Rhizobium etli.

Results

Expression of 89 candidate ncRNAs was detected, both on the chromosome and on the six megaplasmids encompassing the R. etli genome. Of these, 11 correspond to functionally well characterized ncRNAs, 12 were previously identified in other α-proteobacteria but are as yet uncharacterized and 66 were computationally predicted earlier but had not been experimentally identified and were therefore classified as novel ncRNAs. The latter comprise 17 putative sRNAs and 49 putative cis-regulatory ncRNAs. A selection of these candidate ncRNAs was validated by RT-qPCR, Northern blotting and 5' RACE, confirming the existence of 4 ncRNAs. Interestingly, individual transcript levels of numerous ncRNAs varied during free-living growth and during interaction with the eukaryotic host plant, pointing to possible ncRNA-dependent regulation of these specialized processes.

Conclusions

Our data support the practical value of previous ncRNA prediction algorithms and significantly expand the list of candidate ncRNAs encoded in the intergenic regions of R. etli and, by extension, of α-proteobacteria. Moreover, we show high-resolution tiling arrays to be suitable tools for studying intergenic ncRNA transcription profiles across the genome. The differential expression levels of some of these ncRNAs may indicate a role in adaptation to changing environmental conditions.

Keywords


Article
DIGAP - a Database of Improved Gene Annotation for Phytopathogens

Authors: Gao Na --- Chen Ling-Ling --- Ji Hong-Fang --- Wang Wei --- et al.
DOI: 10.1186/1471-2164-11-54
Pages: 54
Loading...
Loading...
Abstract

Abstract

Background

Bacterial plant pathogens are very harmful to their host plants, which can cause devastating agricultural losses in the world. With the development of microbial genome sequencing, many strains of phytopathogens have been sequenced. However, some misannotations exist in these phytopathogen genomes. Our objective is to improve these annotations and store them in a central database DIGAP.

Description

DIGAP includes the following improved information on phytopathogen genomes. (i) All the 'hypothetical proteins' were checked, and non-coding ORFs recognized by the Z curve method were removed. (ii) The translation initiation sites (TISs) of 20% ~ 25% of all the protein-coding genes have been corrected based on the NCBI RefSeq, ProTISA database and an ab initio program, GS-Finder. (iii) Potential functions of about 10% 'hypothetical proteins' have been predicted using sequence alignment tools. (iv) Two theoretical gene expression indices, the codon adaptation index (CAI) and the E(g) index, were calculated to predict the gene expression levels. (v) Potential agricultural bactericide targets and their homology-modeled 3D structures are provided in the database, which is of significance for agricultural antibiotic discovery.

Conclusion

The results in DIGAP provide useful information for understanding the pathogenetic mechanisms of phytopathogens and for finding agricultural bactericides. DIGAP is freely available at http://ibi.hzau.edu.cn/digap/.

Keywords


Article
Identification and characterization of microRNAs and endogenous siRNAs in Schistosoma japonicum

Authors: Hao Lili --- Cai Pengfei --- Jiang Ning --- Wang Heng --- et al.
DOI: 10.1186/1471-2164-11-55
Pages: 55
Loading...
Loading...
Abstract

Abstract

Background

Small endogenous non-coding RNAs (sncRNAs) such as small interfering RNA (siRNA), microRNA and other small RNA transcripts are derived from distinct loci in the genome and play critical roles in RNA-mediated gene silencing mechanisms in plants and metazoa. They are approximately 22 nucleotides long; regulate mRNA stability through perfect or imperfect match to the targets. The biological activities of sncRNAs have been related to many biological events, from resistance to microbe infections to cellular differentiation. The development of the zoonotic parasite Schistosoma japonicum parasite includes multiple steps of morphological alterations and biological differentiations, which provide a unique model for studies on the functions of small RNAs. Characterization of the genome-wide transcription of the sncRNAs will be a major step in understanding of the parasite biology. The objective of this study is to investigate the transcriptional profile and potential function of the small non-coding RNAs in the development of S. japanicum.

Results

The endogenous siRNAs were found mainly derived from transposable elements (TE) or transposons and the natural antisense transcripts (NAT). In contrast to other organisms, the TE-derived siRNAs in S. japonicum were more predominant than other sncRNAs including microRNAs (miRNAs). Further, there were distinct length and 3'end variations in the sncRNAs, which were associated with the developmental differentiation of the parasite. Among the identified miRNA transcripts, there were 38 unique to S. japonicum and 16 that belonged to 13 miRNA families are common to other metazoan lineages. These miRNAs were either ubiquitously expressed, or they exhibited specific expression patterns related to the developmental stages or sex. Genes that encoded miRNAs are mainly located in clusters within the genome of S. japonicum. However, genes within one cluster could be differentially transcribed, which suggested that individual genes might be regulated by distinct mechanisms during parasite development.

Conclusions

Many miRNA and endogenous siRNA transcripts were identified in S. japonicum and the amount of siRNA was at least 4.4 and 1.6 times more than that of miRNA in both schistosomulum and adult worm stages respectively. SiRNAs are mainly derived from transposable elements (or transposons); while natural antisense transcripts (NAT)-derived siRNAs were much less. A majority of miRNA transcripts identified in the parasite were species-specific and the expression of certain miRNAs was found developmentally regulated. Both miRNA and siRNAs are potentially important regulators in the development of schistosomal parasites.

Keywords


Article
Genome-wide computational prediction of tandem gene arrays: application in yeasts

Authors: Despons Laurence --- Baret Philippe V --- Frangeul Lionel --- Louis Véronique --- et al.
DOI: 10.1186/1471-2164-11-56
Pages: 56
Loading...
Loading...
Abstract

Abstract

Background

This paper describes an efficient in silico method for detecting tandem gene arrays (TGAs) in fully sequenced and compact genomes such as those of prokaryotes or unicellular eukaryotes. The originality of this method lies in the search of protein sequence similarities in the vicinity of each coding sequence, which allows the prediction of tandem duplicated gene copies independently of their functionality.

Results

Applied to nine hemiascomycete yeast genomes, this method predicts that 2% of the genes are involved in TGAs and gene relics are present in 11% of TGAs. The frequency of TGAs with degenerated gene copies means that a significant fraction of tandem duplicated genes follows the birth-and-death model of evolution. A comparison of sequence identity distributions between sets of homologous gene pairs shows that the different copies of tandem arrayed paralogs are less divergent than copies of dispersed paralogs in yeast genomes. It suggests that paralogs included in tandem structures are more recent or more subject to the gene conversion mechanism than other paralogs.

Conclusion

The method reported here is a useful computational tool to provide a database of TGAs composed of functional or nonfunctional gene copies. Such a database has obvious applications in the fields of structural and comparative genomics. Notably, a detailed study of the TGA catalog will make it possible to tackle the fundamental questions of the origin and evolution of tandem gene clusters.

Keywords


Article
Gene expression profiling to identify eggshell proteins involved in physical defense of the chicken egg

Loading...
Loading...
Abstract

Abstract

Background

As uricoletic animals, chickens produce cleidoic eggs, which are self-contained bacteria-resistant biological packages for extra-uterine development of the chick embryo. The eggshell constitutes a natural physical barrier against bacterial penetration if it forms correctly and remains intact. The eggshell's remarkable mechanical properties are due to interactions among mineral components and the organic matrix proteins. The purpose of our study was to identify novel eggshell proteins by examining the transcriptome of the uterus during calcification of the eggshell. An extensive bioinformatic analysis on genes over-expressed in the uterus allowed us to identify novel eggshell proteins that contribute to the egg's natural defenses.

Results

Our 14 K Del-Mar Chicken Integrated Systems microarray was used for transcriptional profiling in the hen's uterus during eggshell deposition. A total of 605 transcripts were over-expressed in the uterus compared with the magnum or white isthmus across a wide range of abundance (1.1- to 79.4-fold difference). The 605 highly-expressed uterine transcripts correspond to 469 unique genes, which encode 437 different proteins. Gene Ontology (GO) analysis was used for interpretation of protein function. The most over-represented GO terms are related to genes encoding ion transport proteins, which provide eggshell mineral precursors. Signal peptide sequence was found for 54 putative proteins secreted by the uterus during eggshell formation. Many functional proteins are involved in calcium binding or biomineralization--prerequisites for interacting with the mineral phase during eggshell fabrication. While another large group of proteins could be involved in proper folding of the eggshell matrix. Many secreted uterine proteins possess antibacterial properties, which would protect the egg against microbial invasion. A final group includes proteases and protease inhibitors that regulate protein activity in the acellular uterine fluid where eggshell formation takes place.

Conclusions

Our original study provides the first detailed description of the chicken uterus transcriptome during formation of the eggshell. We have discovered a cache of about 600 functional genes and identified a large number of encoded proteins secreted into uterine fluid for fabrication of the eggshell and chemical protection of the egg. Some of these uterine genes could prove useful as biological markers for genetic improvement of phenotypic traits (i.e., egg and eggshell quality).

Keywords


Article
Does replication groups scoring reduce false positive rate in SNP interaction discovery?

Authors: Toplak Marko --- Curk Tomaz --- Demsar Janez --- Zupan Blaz
DOI: 10.1186/1471-2164-11-58
Pages: 58
Loading...
Loading...
Abstract

Abstract

Background

Computational methods that infer single nucleotide polymorphism (SNP) interactions from phenotype data may uncover new biological mechanisms in non-Mendelian diseases. However, practical aspects of such analysis face many problems. Present experimental studies typically use SNP arrays with hundreds of thousands of SNPs but record only hundreds of samples. Candidate SNP pairs inferred by interaction analysis may include a high proportion of false positives. Recently, Gayan et al. (2008) proposed to reduce the number of false positives by combining results of interaction analysis performed on subsets of data (replication groups), rather than analyzing the entire data set directly. If performing as hypothesized, replication groups scoring could improve interaction analysis and also any type of feature ranking and selection procedure in systems biology. Because Gayan et al. do not compare their approach to the standard interaction analysis techniques, we here investigate if replication groups indeed reduce the number of reported false positive interactions.

Results

A set of simulated and false interaction-imputed experimental SNP data sets were used to compare the inference of SNP-SNP interactions by means of replication groups to the standard approach where the entire data set was directly used to score all candidate SNP pairs. In all our experiments, the inference of interactions from the entire data set (e.g. without using the replication groups) reported fewer false positives.

Conclusions

With respect to the direct scoring approach the utility of replication groups does not reduce false positive rates, and may, depending on the data set, often perform worse.

Keywords


Article
Expression profiling of S. pombe acetyltransferase mutants identifies redundant pathways of gene regulation

Authors: Nugent Rebecca L --- Johnsson Anna --- Fleharty Brian --- Gogol Madelaine --- et al.
DOI: 10.1186/1471-2164-11-59
Pages: 59
Loading...
Loading...
Abstract

Abstract

Background

Histone acetyltransferase enzymes (HATs) are implicated in regulation of transcription. HATs from different families may overlap in target and substrate specificity.

Results

We isolated the elp3+ gene encoding the histone acetyltransferase subunit of the Elongator complex in fission yeast and characterized the phenotype of an Δelp3 mutant. We examined genetic interactions between Δelp3 and two other HAT mutants, Δmst2 and Δgcn5 and used whole genome microarray analysis to analyze their effects on gene expression.

Conclusions

Comparison of phenotypes and expression profiles in single, double and triple mutants indicate that these HAT enzymes have overlapping functions. Consistent with this, overlapping specificity in histone H3 acetylation is observed. However, there is no evidence for overlap with another HAT enzyme, encoded by the essential mst1+ gene.

Keywords


Article
Identical repeated backbone of the human genome

Loading...
Loading...
Abstract

Abstract

Background

Identical sequences with a minimal length of about 300 base pairs (bp) have been involved in the generation of various meiotic/mitotic genomic rearrangements through non-allelic homologous recombination (NAHR) events. Genomic disorders and structural variation, together with gene remodelling processes have been associated with many of these rearrangements. Based on these observations, we identified and integrated all the 100% identical repeats of at least 300 bp in the NCBI version 36.2 human genome reference assembly into non-overlapping regions, thus defining the Identical Repeated Backbone (IRB) of the reference human genome.

Results

The IRB sequences are distributed all over the genome in 66,600 regions, which correspond to ~2% of the total NCBI human genome reference assembly. Important structural and functional elements such as common repeats, segmental duplications, and genes are contained in the IRB. About 80% of the IRB bp overlap with known copy-number variants (CNVs). By analyzing the genes embedded in the IRB, we were able to detect some identical genes not previously included in the Ensembl release 50 annotation of human genes. In addition, we found evidence of IRB gene copy-number polymorphisms in raw sequence reads of two diploid sequenced genomes.

Conclusions

In general, the IRB offers new insight into the complex organization of the identical repeated sequences of the human genome. It provides an accurate map of potential NAHR sites which could be used in targeting the study of novel CNVs, predicting DNA copy-number variation in newly sequenced genomes, and improve genome annotation.

Keywords


Article
Systematic identification and evolutionary features of rhesus monkey small nucleolar RNAs

Authors: Zhang Yong --- Liu Jun --- Jia Chunshi --- Li Tingting --- et al.
DOI: 10.1186/1471-2164-11-61
Pages: 61
Loading...
Loading...
Abstract

Abstract

Background

Recent studies have demonstrated that non-protein-coding RNAs (npcRNAs/ncRNAs) play important roles during eukaryotic development, species evolution, and in the etiology of disease. Rhesus macaques are the most widely used primate model in both biomedical research and primate evolutionary studies. However, most reports on these animals focus on the functional roles of protein-coding sequences, whereas very little is known about macaque ncRNAs.

Results

In the present study, we performed the first systematic profiling of intermediate-size ncRNAs (50 to 500 nt) from the rhesus monkey by constructing a cDNA library. We identified 117 rhesus monkey ncRNAs, including 80 small nucleolar RNAs (snoRNAs), 29 other types of known RNAs (snRNAs, Y RNA, and others), and eight unclassified ncRNAs. Comparative genomic analysis and northern blot hybridizations demonstrated that some snoRNAs were lineage- or species-specific. Paralogous sequences were found for most rhesus monkey snoRNAs, the expression of which might be attributable to extensive duplication within the rhesus monkey genome. Further investigation of snoRNA flanking sequences showed that some rhesus monkey snoRNAs are retrogenes derived from L1-mediated integration. Finally, phylogenetic analysis demonstrated that birds and primates share some snoRNAs and host genes thereof, suggesting that both the relevant host genes and the snoRNAs contained therein may be inherited from a common ancestor. However, some rhesus monkey snoRNAs hosted by non-ribosome-related genes appeared after the evolutionary divergence between birds and mammals.

Conclusions

We provide the first experimentally-derived catalog of rhesus monkey ncRNAs and uncover some interesting genomic and evolutionary features. These findings provide important information for future functional characterization of snoRNAs during primate evolution.

Keywords


Article
Copy number variation in the genomes of twelve natural isolates of Caenorhabditis elegans

Authors: Maydan Jason S --- Lorch Adam --- Edgley Mark L --- Flibotte Stephane --- et al.
DOI: 10.1186/1471-2164-11-62
Pages: 62
Loading...
Loading...
Abstract

Abstract

Background

Copy number variation is an important component of genetic variation in higher eukaryotes. The extent of natural copy number variation in C. elegans is unknown outside of 2 highly divergent wild isolates and the canonical N2 Bristol strain.

Results

We have used array comparative genomic hybridization (aCGH) to detect copy number variation in the genomes of 12 natural isolates of Caenorhabditis elegans. Deletions relative to the canonical N2 strain are more common in these isolates than duplications, and indels are enriched in multigene families on the autosome arms. Among the strains in our study, the Hawaiian and Madeiran strains (CB4856 and JU258) carry the largest number of deletions, followed by the Vancouver strain (KR314). Overall we detected 510 different deletions affecting 1136 genes, or over 5% of the genes in the canonical N2 genome. The indels we identified had a median length of 2.7 kb. Since many deletions are found in multiple isolates, deletion loci were used as markers to derive an unrooted tree to estimate genetic relatedness among the strains.

Conclusion

Copy number variation is extensive in C. elegans, affecting over 5% of the genes in the genome. The deletions we have detected in natural isolates of C. elegans contribute significantly to the number of deletion alleles available to researchers. The relationships between strains are complex and different regions of the genome possess different genealogies due to recombination throughout the natural history of the species, which may not be apparent in studies utilizing smaller numbers of genetic markers.

Keywords


Article
Differences in the transcriptome signatures of two genetically related Entamoeba histolytica cell lines derived from the same isolate with different pathogenic properties

Authors: Biller Laura --- Davis Paul H --- Tillack Manuela --- Matthiesen Jenny --- et al.
DOI: 10.1186/1471-2164-11-63
Pages: 63
Loading...
Loading...
Abstract

Abstract

Background

The availability of two genetically very similar cell lines (A and B) derived from the laboratory isolate Entamoeba histolytica HM-1:IMSS, which differ in their virulence properties, provides a powerful tool for identifying pathogenicity factors of the causative agent of human amoebiasis. Cell line A is incapable inducing liver abscesses in gerbils, whereas interaction with cell line B leads to considerable abscess formation. Phenotypic characterization of both cell lines revealed that trophozoites from the pathogenic cell line B have a larger cell size, an increased growth rate in vitro, an increased cysteine peptidase activity and higher resistance to nitric oxide stress. To find proteins that may serve as virulence factors, the proteomes of both cell lines were previously studied, resulting in the identification of a limited number of differentially synthesized proteins. This study aims to identify additional genes, serving as virulence factors, or virulence markers.

Results

To obtain a comprehensive picture of the differences between the cell lines, we compared their transcriptomes using an oligonucleotide-based microarray and confirmed findings with quantitative real-time PCR. Out of 6242 genes represented on the array, 87 are differentially transcribed (≥two-fold) in the two cell lines. Approximately 50% code for hypothetical proteins. Interestingly, only 19 genes show a five-fold or higher differential expression. These include three rab7 GTPases, which were found with a higher abundance in the non-pathogenic cell line A. The aig1-like GTPasesare of special interest because the majority of them show higher levels of transcription in the pathogenic cell line B. Only two molecules were found to be differentially expressed between the two cell lines in both this study and our previous proteomic approach.

Conclusions

In this study we have identified a defined set of genes that are differentially transcribed between the non-pathogenic cell line A and the pathogenic cell line B of E. histolytica. The identification of transcription profiles unique for amoebic cell lines with pathogenic phenotypes may help to elucidate the transcriptional framework of E. histolytica pathogenicity and serve as a basis for identifying transcriptional markers and virulence factors.

Keywords


Article
Changes in the genomic content of circulating Bordetella pertussis strains isolated from the Netherlands, Sweden, Japan and Australia: adaptive evolution or drift?

Authors: King Audrey J --- van Gorkom Tamara --- van der Heide Han GJ --- Advani Abdolreza --- et al.
DOI: 10.1186/1471-2164-11-64
Pages: 64
Loading...
Loading...
Abstract

Abstract

Background

Bordetella pertussis is the causative agent of human whooping cough (pertussis) and is particularly severe in infants. Despite worldwide vaccinations, whooping cough remains a public health problem. A significant increase in the incidence of whooping cough has been observed in many countries since the 1990s. Several reasons for the re-emergence of this highly contagious disease have been suggested. A particularly intriguing possibility is based on evidence indicating that pathogen adaptation may play a role in this process. In an attempt to gain insight into the genomic make-up of B. pertussis over the last 60 years, we used an oligonucleotide DNA microarray to compare the genomic contents of a collection of 171 strains of B. pertussis isolates from different countries.

Results

The CGH microarray analysis estimated the core genome of B. pertussis, to consist of 3,281 CDSs that are conserved among all B. pertussis strains, and represent 84.8% of all CDSs found in the 171 B. pertussis strains. A total of 64 regions of difference consisting of one or more contiguous CDSs were identified among the variable genes. CGH data also revealed that the genome size of B. pertussis strains is decreasing progressively over the past 60 years. Phylogenetic analysis of microarray data generated a minimum spanning tree that depicted the phylogenetic structure of the strains. B. pertussis strains with the same gene content were found in several different countries. However, geographic specificity of the B. pertussis strains was not observed. The gene content was determined to highly correlate with the ptxP-type of the strains.

Conclusions

An overview of genomic contents of a large collection of isolates from different countries allowed us to derive a core genome and a phylogenetic structure of B. pertussis. Our results show that B. pertussis is a dynamic organism that continues to evolve.

Keywords


Article
Tungsten carbide cobalt nanoparticles exert hypoxia-like effects on the gene expression level in human keratinocytes

Authors: Busch Wibke --- Kühnel Dana --- Schirmer Kristin --- Scholz Stefan
DOI: 10.1186/1471-2164-11-65
Pages: 65
Loading...
Loading...
Abstract

Abstract

Background

Tungsten carbide (WC) and tungsten carbide cobalt (WC-Co) nanoparticles are of occupational health relevance because of the increasing usage in hard metal industries. Earlier studies showed an enhanced toxic potential for WC-Co compared to WC or cobalt ions alone. Therefore, we investigated the impact of these particles, compared to cobalt ions applied as CoCl2, on the global gene expression level in human keratinocytes (HaCaT) in vitro.

Results

WC nanoparticles exerted very little effects on the transcriptomic level after 3 hours and 3 days of exposure. In contrast, WC-Co nanoparticles caused significant transcriptional changes that were similar to those provoked by CoCl2. However, CoCl2 exerted even more pronounced changes in the transcription patterns. Gene set enrichment analyses revealed that the differentially expressed genes were related to hypoxia response, carbohydrate metabolism, endocrine pathways, and targets of several transcription factors. The role of the transcription factor HIF1 (hypoxia inducible factor 1) is particularly highlighted and aspects of downstream events as well as the role of other transcription factors related to cobalt toxicity are considered.

Conclusions

This study provides extensive data useful for the understanding of nanoparticle and cobalt toxicity. It shows that WC nanoparticles caused low transcriptional responses while WC-Co nanoparticles are able to exert responses similar to that of free cobalt ions, particularly the induction of hypoxia-like effects via interactions with HIF1α in human keratinocytes. However, the enhanced toxicity of WC-Co particles compared to CoCl2 could not be explained by differences in gene transcription.

Keywords


Article
Re-examination of chimp protein kinases suggests "novel architectures" are gene prediction artifacts

Authors: Robison Keith
DOI: 10.1186/1471-2164-11-66
Pages: 66
Loading...
Loading...
Abstract

Abstract

Background

Anamika et al1 recently published in this journal a sequence alignment analysis of protein kinases encoded by the chimpanzee genome in comparison to those in the human genome. From this analysis they concluded that several chimpanzee kinases have unusual domain arrangements.

Results

Re-examination of these kinases reveals claimed novel arrangements cannot withstand scrutiny; each is either not novel or represents over-analysis of weakly confident computer generated gene models. Additional sequence evidence available at the time of the paper's submission either directly contradict the gene models or suggest alternate gene models. These alternate models would minimize or eliminate the observed differences between human and chimp kinases.

Conclusion

None of the proposed novel chimpanzee kinase architectures are supported by experiment evidence. Guidelines to prevent such erroneous conclusions in similar papers are proposed.

Keywords


Article
Simple sequence repeats in Helicobacter canadensis and their role in phase variable expression and C-terminal sequence switching

Authors: Snyder Lori AS --- Loman Nicholas J --- Linton James D --- Langdon Rebecca R --- et al.
DOI: 10.1186/1471-2164-11-67
Pages: 67
Loading...
Loading...
Abstract

Abstract

Background

Helicobacter canadensis is an emerging human pathogen and zoonotic agent. The genome of H. canadensis was sequenced previously and determined to contain 29 annotated coding regions associated with homopolymeric tracts.

Results

Twenty-one of the repeat-associated coding regions were determined to be potentially transcriptionally or translationally phase variable. In each case the homopolymeric tract was within the predicted promoter region or at the 5' end of the coding region, respectively. However, eight coding sequences were identified with simple sequence repeats toward the 3' end of the open reading frame. In these cases, the repeat tract would be too far into the coding region to be mediating translational phase variation. All of the 29 coding region-associated homopolymeric tracts display variability in tract length in the sequencing read data.

Conclusions

Twenty-nine coding regions have been identified in the genome sequence of Helicobacter canadensis strain NCTC13241 that show variations in homopolymeric tract length in the bacterial population, indicative of phase variation. Five of these are potentially associated with promoter regions, which would lead to transcriptional phase variation. Translational phase variation usually switches expression of a gene ON and OFF due to the repeat region being located sufficiently close to the initiation codon for the resulting frame-shift to lead to a premature termination codon and stop the translation of the protein. Sixteen of the 29 coding regions have homopolymeric tracts characteristic of translational phase variation. For eight coding sequences with repeats located later in the reading frame, changes in the repeat tract length would alter the protein sequence at the C-terminus but not stop the expression of the protein. This mechanism of C-terminal phase variation has implications for stochastic switching of protein sequence in bacterial species that already undergo transcriptional and translational phase variation.

Keywords


Article
The European sea bass Dicentrarchus labrax genome puzzle: comparative BAC-mapping and low coverage shotgun sequencing

Authors: Kuhl Heiner --- Beck Alfred --- Wozniak Grzegorz --- Canario Adelino VM --- et al.
DOI: 10.1186/1471-2164-11-68
Pages: 68
Loading...
Loading...
Abstract

Abstract

Background

Food supply from the ocean is constrained by the shortage of domesticated and selected fish. Development of genomic models of economically important fishes should assist with the removal of this bottleneck. European sea bass Dicentrarchus labrax L. (Moronidae, Perciformes, Teleostei) is one of the most important fishes in European marine aquaculture; growing genomic resources put it on its way to serve as an economic model.

Results

End sequencing of a sea bass genomic BAC-library enabled the comparative mapping of the sea bass genome using the three-spined stickleback Gasterosteus aculeatus genome as a reference. BAC-end sequences (102,690) were aligned to the stickleback genome. The number of mappable BACs was improved using a two-fold coverage WGS dataset of sea bass resulting in a comparative BAC-map covering 87% of stickleback chromosomes with 588 BAC-contigs. The minimum size of 83 contigs covering 50% of the reference was 1.2 Mbp; the largest BAC-contig comprised 8.86 Mbp. More than 22,000 BAC-clones aligned with both ends to the reference genome. Intra-chromosomal rearrangements between sea bass and stickleback were identified. Size distributions of mapped BACs were used to calculate that the genome of sea bass may be only 1.3 fold larger than the 460 Mbp stickleback genome.

Conclusions

The BAC map is used for sequencing single BACs or BAC-pools covering defined genomic entities by second generation sequencing technologies. Together with the WGS dataset it initiates a sea bass genome sequencing project. This will allow the quantification of polymorphisms through resequencing, which is important for selecting highly performing domesticated fish.

Keywords


Article
Transcriptome analysis reveals absence of unintended effects in drought-tolerant transgenic plants overexpressing the transcription factor ABF3

Authors: Abdeen Ashraf --- Schnell Jaimie --- Miki Brian
DOI: 10.1186/1471-2164-11-69
Pages: 69
Loading...
Loading...
Abstract

Abstract

Background

Plants engineered for abiotic stress tolerance may soon be commercialized. The engineering of these plants typically involves the manipulation of complex multigene networks and may therefore have a greater potential to introduce pleiotropic effects than the simple monogenic traits that currently dominate the plant biotechnology market. While research on unintended effects in transgenic plant systems has been instrumental in demonstrating the substantial equivalence of many transgenic plant systems, it is essential that such analyses be extended to transgenic plants engineered for stress tolerance. Drought-tolerant Arabidopsis thaliana were engineered through overexpression of the transcription factor ABF3 in order to investigate unintended pleiotropic effects. In order to eliminate position effects, the Cre/lox recombination system was used to create control plant lines that contain identical T-DNA insertion sites but with the ABF3 transgene excised. This additionally allowed us to determine if Cre recombinase can cause unintended effects that impact the transcriptome.

Results

Microarray analysis of control plant lines that underwent Cre-mediated excision of the ABF3 transgene revealed only two genes that were differentially expressed in more than one plant line, suggesting that the impact of Cre recombinase on the transcriptome was minimal. In the absence of drought stress, overexpression of ABF3 had no effect on the transcriptome, but following drought stress, differences were observed in the gene expression patterns of plants overexpressing ABF3 relative to control plants. Examination of the functional distribution of the differentially expressed genes revealed strong similarity indicating that unintended pathways were not activated.

Conclusions

The action of ABF3 is tightly controlled in Arabidopsis. In the absence of drought stress, ectopic activation of drought response pathways does not occur. In response to drought stress, overexpression of ABF3 results in a reprogramming of the drought response, which is characterized by changes in the timing or strength of expression of some drought response genes, without activating any unexpected gene networks. These results illustrate that important gene networks are highly regulated in Arabidopsis and that engineering stress tolerance may not necessarily cause extensive changes to the transcriptome.

Keywords


Article
Development of genomic resources for the prairie vole (Microtus ochrogaster): construction of a BAC library and vole-mouse comparative cytogenetic map

Loading...
Loading...
Abstract

Abstract

Background

The prairie vole (Microtus ochrogaster) is a premier animal model for understanding the genetic and neurological basis of social behaviors. Unlike other biomedical models, prairie voles display a rich repertoire of social behaviors including the formation of long-term pair bonds and biparental care. However, due to a lack of genomic resources for this species, studies have been limited to a handful of candidate genes. To provide a substrate for future development of genomic resources for this unique model organism, we report the construction and characterization of a bacterial artificial chromosome (BAC) library from a single male prairie vole and a prairie vole-mouse (Mus musculus) comparative cytogenetic map.

Results

We constructed a prairie vole BAC library (CHORI-232) consisting of 194,267 recombinant clones with an average insert size of 139 kb. Hybridization-based screening of the gridded library at 19 loci established that the library has an average depth of coverage of ~10×. To obtain a small-scale sampling of the prairie vole genome, we generated 3884 BAC end-sequences totaling ~2.8 Mb. One-third of these BAC-end sequences could be mapped to unique locations in the mouse genome, thereby anchoring 1003 prairie vole BAC clones to an orthologous position in the mouse genome. Fluorescence in situ hybridization (FISH) mapping of 62 prairie vole clones with BAC-end sequences mapping to orthologous positions in the mouse genome was used to develop a first-generation genome-wide prairie vole-mouse comparative cytogenetic map. While conserved synteny was observed between this pair of rodent genomes, rearrangements between the prairie vole and mouse genomes were detected, including a minimum of five inversions and 16 inter-chromosomal rearrangements.

Conclusions

The construction of the prairie vole BAC library and the vole-mouse comparative cytogenetic map represent the first genome-wide modern genomic resources developed for this species. The BAC library will support future genomic, genetic and molecular characterization of this genome and species, and the isolation of clones of high interest to the vole research community will allow for immediate characterization of the regulatory and coding sequences of genes known to play important roles in social behaviors. In addition, these resources provide an excellent platform for future higher resolution cytogenetic mapping and full genome sequencing.

Keywords


Article
Relationship between operon preference and functional properties of persistent genes in bacterial genomes

Authors: Bratlie Marit S --- Johansen Jostein --- Drabløs Finn
DOI: 10.1186/1471-2164-11-71
Pages: 71
Loading...
Loading...
Abstract

Abstract

Background

Genes in bacteria may be organised into operons, leading to strict co-expression of the genes that participate in the same operon. However, comparisons between different bacterial genomes have shown that much of the operon structure is dynamic on an evolutionary time scale. This indicates that there are opposing effects influencing the tendency for operon formation, and these effects may be reflected in properties like evolutionary rate, complex formation, metabolic pathways and gene fusion.

Results

We have used multi-species protein-protein comparisons to generate a high-quality set of genes that are persistent in bacterial genomes (i.e. they have close to universal distribution). We have analysed these genes with respect to operon participation and important functional properties, including evolutionary rate and protein-protein interactions.

Conclusions

Genes for ribosomal proteins show a very slow rate of evolution. This is consistent with a strong tendency for the genes to participate in operons and for their proteins to be involved in essential and well defined complexes. Persistent genes for non-ribosomal proteins can be separated into two classes according to tendency to participate in operons. Those with a strong tendency for operon participation make proteins with fewer interaction partners that seem to participate in relatively static complexes and possibly linear pathways. Genes with a weak tendency for operon participation tend to produce proteins with more interaction partners, but possibly in more dynamic complexes and convergent pathways. Genes that are not regulated through operons are therefore more evolutionary constrained than the corresponding operon-associated genes and will on average evolve more slowly.

Keywords


Article
Heat-shock responsive genes identified and validated in Atlantic cod (Gadus morhua) liver, head kidney and skeletal muscle using genomic techniques

Authors: Hori Tiago S --- Gamperl A Kurt --- Afonso Luis OB --- Johnson Stewart C --- et al.
DOI: 10.1186/1471-2164-11-72
Pages: 72
Loading...
Loading...
Abstract

Abstract

Background

Daily and seasonal changes in temperature are challenges that fish within aquaculture settings cannot completely avoid, and are known to elicit complex organismal and cellular stress responses. We conducted a large-scale gene discovery and transcript expression study in order to better understand the genes that are potentially involved in the physiological and cellular aspects of stress caused by heat-shock. We used suppression subtractive hybridization (SSH) cDNA library construction and characterization to identify transcripts that were dysregulated by heat-shock in liver, skeletal muscle and head kidney of Atlantic cod. These tissues were selected due to their roles in metabolic regulation, locomotion and growth, and immune function, respectively. Fish were exposed for 3 hours to an 8°C elevation in temperature, and then allowed to recover for 24 hours at the original temperature (i.e. 10°C). Tissue samples obtained before heat-shock (BHS), at the cessation of heat-shock (CS), and 3, 12, and 24 hours after the cessation of heat-shock (ACS), were used for reciprocal SSH library construction and quantitative reverse transcription - polymerase chain reaction (QPCR) analysis of gene expression using samples from a group that was transferred but not heat-shocked (CT) as controls.

Results

We sequenced and characterized 4394 ESTs (1524 from liver, 1451 from head kidney and 1419 from skeletal muscle) from three "forward subtracted" libraries (enriched for genes up-regulated by heat-shock) and 1586 from the liver "reverse subtracted" library (enriched for genes down-regulated by heat-shock), for a total of 5980 ESTs. Several cDNAs encoding putative chaperones belonging to the heat-shock protein (HSP) family were found in these libraries, and "protein folding" was among the gene ontology (GO) terms with the highest proportion in the libraries. QPCR analysis of HSP90α and HSP70-1 (synonym: HSPA1A) mRNA expression showed significant up-regulation in all three tissues studied. These transcripts were more than 100-fold up-regulated in liver following heat-shock. We also identified HSP47, GRP78 and GRP94-like transcripts, which were significantly up-regulated in all 3 tissues studied. Toll-like receptor 22 (TLR22) transcript, found in the liver reverse SSH library, was shown by QPCR to be significantly down-regulated in the head kidney after heat-shock.

Conclusion

Chaperones are an important part of the cellular response to stress, and genes identified in this work may play important roles in resistance to thermal-stress. Moreover, the transcript for one key immune response gene (TLR22) was down-regulated by heat-shock, and this down-regulation may be a component of heat-induced immunosuppression.

Keywords


Article
Comprehensive expression analysis suggests overlapping and specific roles of rice glutathione S-transferase genes during development and stress responses

Authors: Jain Mukesh --- Ghanashyam Challa --- Bhattacharjee Annapurna
DOI: 10.1186/1471-2164-11-73
Pages: 73
Loading...
Loading...
Abstract

Abstract

Background

Glutathione S-transferases (GSTs) are the ubiquitous enzymes that play a key role in cellular detoxification. Although several GSTs have been identified and characterized in various plant species, the knowledge about their role in developmental processes and response to various stimuli is still very limited. In this study, we report genome-wide identification, characterization and comprehensive expression analysis of members of GST gene family in crop plant rice, to reveal their function(s).

Results

A systematic analysis revealed the presence of at least 79 GST genes in the rice genome. Phylogenetic analysis grouped GST proteins into seven classes. Sequence analysis together with the organization of putative motifs indicated the potential diverse functions of GST gene family members in rice. The tandem gene duplications have contributed a major role in expansion of this gene family. Microarray data analysis revealed tissue-/organ- and developmental stage-specific expression patterns of several rice GST genes. At least 31 GST genes showed response to plant hormones auxin and cytokinin. Furthermore, expression analysis showed the differential expression of quite a large number of GST genes during various abiotic stress (20), arsenate stress (32) and biotic stress (48) conditions. Many of the GST genes were commonly regulated by developmental processes, hormones, abiotic and biotic stresses.

Conclusion

The transcript profiling suggests overlapping and specific role(s) of GSTs during various stages of development in rice. Further, the study provides evidence for the role of GSTs in mediating crosstalk between various stress and hormone response pathways and represents a very useful resource for functional analysis of selected members of this family in rice.

Keywords


Article
Automated design of genomic Southern blot probes

Authors: Croning Mike DR --- Fricker David G --- Komiyama Noboru H --- Grant Seth GN
DOI: 10.1186/1471-2164-11-74
Pages: 74
Loading...
Loading...
Abstract

Abstract

Background

Sothern blotting is a DNA analysis technique that has found widespread application in molecular biology. It has been used for gene discovery and mapping and has diagnostic and forensic applications, including mutation detection in patient samples and DNA fingerprinting in criminal investigations. Southern blotting has been employed as the definitive method for detecting transgene integration, and successful homologous recombination in gene targeting experiments.

The technique employs a labeled DNA probe to detect a specific DNA sequence in a complex DNA sample that has been separated by restriction-digest and gel electrophoresis. Critically for the technique to succeed the probe must be unique to the target locus so as not to cross-hybridize to other endogenous DNA within the sample.

Investigators routinely employ a manual approach to probe design. A genome browser is used to extract DNA sequence from the locus of interest, which is searched against the target genome using a BLAST-like tool. Ideally a single perfect match is obtained to the target, with little cross-reactivity caused by homologous DNA sequence present in the genome and/or repetitive and low-complexity elements in the candidate probe. This is a labor intensive process often requiring several attempts to find a suitable probe for laboratory testing.

Results

We have written an informatic pipeline to automatically design genomic Sothern blot probes that specifically attempts to optimize the resultant probe, employing a brute-force strategy of generating many candidate probes of acceptable length in the user-specified design window, searching all against the target genome, then scoring and ranking the candidates by uniqueness and repetitive DNA element content. Using these in silico measures we can automatically design probes that we predict to perform as well, or better, than our previous manual designs, while considerably reducing design time.

We went on to experimentally validate a number of these automated designs by Southern blotting. The majority of probes we tested performed well confirming our in silico prediction methodology and the general usefulness of the software for automated genomic Southern probe design.

Conclusions

Software and supplementary information are freely available at: http://www.genes2cognition.org/software/southern_blot

Keywords


Article
Normalization with genes encoding ribosomal proteins but not GAPDH provides an accurate quantification of gene expressions in neuronal differentiation of PC12 cells

Authors: Zhou Lihan --- Lim Qing-En --- Wan Guoqiang --- Too Heng-Phon
DOI: 10.1186/1471-2164-11-75
Pages: 75
Loading...
Loading...
Abstract

Abstract

Background

Gene regulation at transcript level can provide a good indication of the complex signaling mechanisms underlying physiological and pathological processes. Transcriptomic methods such as microarray and quantitative real-time PCR require stable reference genes for accurate normalization of gene expression. Some but not all studies have shown that housekeeping genes (HGKs), β-actin (ACTB) and glyceraldehyde-3-phosphate dehydrogenase (GAPDH), which are routinely used for normalization, may vary significantly depending on the cell/tissue type and experimental conditions. It is currently unclear if these genes are stably expressed in cells undergoing drastic morphological changes during neuronal differentiation. Recent meta-analysis of microarray datasets showed that some but not all of the ribosomal protein genes are stably expressed. To test the hypothesis that some ribosomal protein genes can serve as reference genes for neuronal differentiation, a genome-wide analysis was performed and putative reference genes were identified based on stability of expressions. The stabilities of these potential reference genes were then analyzed by reverse transcription quantitative real-time PCR in six differentiation conditions.

Results

Twenty stably expressed genes, including thirteen ribosomal protein genes, were selected from microarray analysis of the gene expression profiles of GDNF and NGF induced differentiation of PC12 cells. The expression levels of these candidate genes as well as ACTB and GAPDH were further analyzed by reverse transcription quantitative real-time PCR in PC12 cells differentiated with a variety of stimuli including NGF, GDNF, Forskolin, KCl and ROCK inhibitor, Y27632. The performances of these candidate genes as stable reference genes were evaluated with two independent statistical approaches, geNorm and NormFinder.

Conclusions

The ribosomal protein genes, RPL19 and RPL29, were identified as suitable reference genes during neuronal differentiation of PC12 cells, regardless of the type of differentiation conditions. The combination of these two novel reference genes, but not the commonly used HKG, GAPDH, allows robust and accurate normalization of differentially expressed genes during PC12 differentiation.

Keywords


Article
Transcriptome analysis of the Cryptocaryon irritans tomont stage identifies potential genes for the detection and control of cryptocaryonosis

Authors: Lokanathan Yogeswaran --- Mohd-Adnan Adura --- Wan Kiew-Lian --- Nathan Sheila
DOI: 10.1186/1471-2164-11-76
Pages: 76
Loading...
Loading...
Abstract

Abstract

Background

Cryptocaryon irritans is a parasitic ciliate that causes cryptocaryonosis (white spot disease) in marine fish. Diagnosis of cryptocaryonosis often depends on the appearance of white spots on the surface of the fish, which are usually visible only during later stages of the disease. Identifying suitable biomarkers of this parasite would aid the development of diagnostic tools and control strategies for C. irritans. The C. irritans genome is virtually unexplored; therefore, we generated and analyzed expressed sequence tags (ESTs) of the parasite to identify genes that encode for surface proteins, excretory/secretory proteins and repeat-containing proteins.

Results

ESTs were generated from a cDNA library of C. irritans tomonts isolated from infected Asian sea bass, Lates calcarifer. Clustering of the 5356 ESTs produced 2659 unique transcripts (UTs) containing 1989 singletons and 670 consensi. BLAST analysis showed that 74% of the UTs had significant similarity (E-value < 10-5) to sequences that are currently available in the GenBank database, with more than 15% of the significant hits showing unknown function. Forty percent of the UTs had significant similarity to ciliates from the genera Tetrahymena and Paramecium. Comparative gene family analysis with related taxa showed that many protein families are conserved among the protozoans. Based on gene ontology annotation, functional groups were successfully assigned to 790 UTs. Genes encoding excretory/secretory proteins and membrane and membrane-associated proteins were identified because these proteins often function as antigens and are good antibody targets. A total of 481 UTs were classified as encoding membrane proteins, 54 were classified as encoding for membrane-bound proteins, and 155 were found to contain excretory/secretory protein-coding sequences. Amino acid repeat-containing proteins and GPI-anchored proteins were also identified as potential candidates for the development of diagnostic and control strategies for C. irritans.

Conclusions

We successfully discovered and examined a large portion of the previously unexplored C. irritans transcriptome and identified potential genes for the development and validation of diagnostic and control strategies for cryptocaryonosis.

Keywords


Article
Identification of novel non-coding RNAs using profiles of short sequence reads from next generation sequencing data

Authors: Jung Chol-Hee --- Hansen Martin A --- Makunin Igor V --- Korbie Darren J --- et al.
DOI: 10.1186/1471-2164-11-77
Pages: 77
Loading...
Loading...
Abstract

Abstract

Background

The increasing interest in small non-coding RNAs (ncRNAs) such as microRNAs (miRNAs), small interfering RNAs (siRNAs) and Piwi-interacting RNAs (piRNAs) and recent advances in sequencing technology have yielded large numbers of short (18-32 nt) RNA sequences from different organisms, some of which are derived from small nucleolar RNAs (snoRNAs) and transfer RNAs (tRNAs). We observed that these short ncRNAs frequently cover the entire length of annotated snoRNAs or tRNAs, which suggests that other loci specifying similar ncRNAs can be identified by clusters of short RNA sequences.

Results

We combined publicly available datasets of tens of millions of short RNA sequence tags from Drosophila melanogaster, and mapped them to the Drosophila genome. Approximately 6 million perfectly mapping sequence tags were then assembled into 521,302 tag-contigs (TCs) based on tag overlap. Most transposon-derived sequences, exons and annotated miRNAs, tRNAs and snoRNAs are detected by TCs, which show distinct patterns of length and tag-depth for different categories. The typical length and tag-depth of snoRNA-derived TCs was used to predict 7 previously unrecognized box H/ACA and 26 box C/D snoRNA candidates. We also identified one snRNA candidate and 86 loci with a high number of tags that are yet to be annotated, 7 of which have a particular 18mer motif and are located in introns of genes involved in development. A subset of new snoRNA candidates and putative ncRNA candidates was verified by Northern blot.

Conclusions

In this study, we have introduced a new approach to identify new members of known classes of ncRNAs based on the features of TCs corresponding to known ncRNAs. A large number of the identified TCs are yet to be examined experimentally suggesting that many more novel ncRNAs remain to be discovered.

Keywords


Article
Genome-wide gene responses in a transgenic rice line carrying the maize resistance gene Rxo1 to the rice bacterial streak pathogen, Xanthomonas oryzae pv. oryzicola

Authors: Zhou Yong-Li --- Xu Mei-Rong --- Zhao Ming-Fu --- Xie Xue-Wen --- et al.
DOI: 10.1186/1471-2164-11-78
Pages: 78
Loading...
Loading...
Abstract

Abstract

Background

Non-host resistance in rice to its bacterial pathogen, Xanthomonas oryzae pv. oryzicola (Xoc), mediated by a maize NBS-LRR type R gene, Rxo1 shows a typical hypersensitive reaction (HR) phenotype, but the molecular mechanism(s) underlying this type of non-host resistance remain largely unknown.

Results

A microarray experiment was performed to reveal the molecular mechanisms underlying HR of rice to Xoc mediated by Rxo1 using a pair of transgenic and non-transgenic rice lines. Our results indicated that Rxo1 appeared to function in the very early step of the interaction between rice and Xoc, and could specifically activate large numbers of genes involved in signaling pathways leading to HR and some basal defensive pathways such as SA and ET pathways. In the former case, Rxo1 appeared to differ from the typical host R genes in that it could lead to HR without activating NDR1. In the latter cases, Rxo1 was able to induce a unique group of WRKY TF genes and a large set of genes encoding PPR and RRM proteins that share the same G-box in their promoter regions with possible functions in post-transcriptional regulation.

Conclusions

In conclusion, Rxo1, like most host R genes, was able to trigger HR against Xoc in the heterologous rice plants by activating multiple defensive pathways related to HR, providing useful information on the evolution of plant resistance genes. Maize non-host resistance gene Rxo1 could trigger the pathogen-specific HR in heterologous rice, and ultimately leading to a localized programmed cell death which exhibits the characteristics consistent with those mediated by host resistance genes, but a number of genes encoding pentatricopeptide repeat and RNA recognition motif protein were found specifically up-regulated in the Rxo1 mediated disease resistance. These results add to our understanding the evolution of plant resistance genes.

Keywords


Article
Comparative evolutionary analysis of protein complexes in E. coli and yeast

Authors: Reid Adam J --- Ranea Juan AG --- Orengo Christine A
DOI: 10.1186/1471-2164-11-79
Pages: 79
Loading...
Loading...
Abstract

Abstract

Background

Proteins do not act in isolation; they frequently act together in protein complexes to carry out concerted cellular functions. The evolution of complexes is poorly understood, especially in organisms other than yeast, where little experimental data has been available.

Results

We generated accurate, high coverage datasets of protein complexes for E. coli and yeast in order to study differences in the evolution of complexes between these two species. We show that substantial differences exist in how complexes have evolved between these organisms. A previously proposed model of complex evolution identified complexes with cores of interacting homologues. We support findings of the relative importance of this mode of evolution in yeast, but find that it is much less common in E. coli. Additionally it is shown that those homologues which do cluster in complexes are involved in eukaryote-specific functions. Furthermore we identify correlated pairs of non-homologous domains which occur in multiple protein complexes. These were identified in both yeast and E. coli and we present evidence that these too may represent complex cores in yeast but not those of E. coli.

Conclusions

Our results suggest that there are differences in the way protein complexes have evolved in E. coli and yeast. Whereas some yeast complexes have evolved by recruiting paralogues, this is not apparent in E. coli. Furthermore, such complexes are involved in eukaryotic-specific functions. This implies that the increase in gene family sizes seen in eukaryotes in part reflects multiple family members being used within complexes. However, in general, in both E. coli and yeast, homologous domains are used in different complexes.

Keywords


Article
Tracking chromosomal positions of oligomers - a case study with Illumina's BovineSNP50 beadchip

Authors: Schmitt Armin O --- Bortfeldt Ralf H --- Brockmann Gudrun A
DOI: 10.1186/1471-2164-11-80
Pages: 80
Loading...
Loading...
Abstract

Abstract

Background

High density genotyping arrays have become established as a valuable research tool in human genetics. Currently, more than 300 genome wide association studies were published for human reporting about 1,000 SNPs that are associated with a phenotype. Also in animal sciences high density genotyping arrays are harnessed to analyse genetic variation. To exploit the full potential of this technology single nucleotide polymorphisms (SNPs) on the chips should be well characterized and their chromosomal position should be precisely known. This, however, is a challenge if the genome sequence is still subject to changes.

Results

We have developed a mapping strategy and a suite of software scripts to update the chromosomal positions of oligomer sequences used for SNP genotyping on high density arrays. We describe the mapping procedure in detail so that scientists with moderate bioinformatics skills can reproduce it. We furthermore present a case study in which we re-mapped 54,001 oligomer sequences from Ilumina's BovineSNP50 beadchip to the bovine genome sequence. We found in 992 cases substantial discrepancies between the manufacturer's annotations and our results. The software scripts in the Perl and R programming languages are provided as supplements.

Conclusions

The positions of oligomer sequences in the genome are volatile even within one build of the genome. To facilitate the analysis of data from a GWAS or from an expression study, especially with species whose genome assembly is still unstable, it is recommended to update the oligomer positions before data analysis.

Keywords


Article
FUNGIpath: a tool to assess fungal metabolic pathways predicted by orthology

Authors: Grossetête Sandrine --- Labedan Bernard --- Lespinet Olivier
DOI: 10.1186/1471-2164-11-81
Pages: 81
Loading...
Loading...
Abstract

Abstract

Background

More and more completely sequenced fungal genomes are becoming available and many more sequencing projects are in progress. This deluge of data should improve our knowledge of the various primary and secondary metabolisms of Fungi, including their synthesis of useful compounds such as antibiotics or toxic molecules such as mycotoxins. Functional annotation of many fungal genomes is imperfect, especially of genes encoding enzymes, so we need dedicated tools to analyze their metabolic pathways in depth.

Description

FUNGIpath is a new tool built using a two-stage approach. Groups of orthologous proteins predicted using complementary methods of detection were collected in a relational database. Each group was further mapped on to steps in the metabolic pathways published in the public databases KEGG and MetaCyc. As a result, FUNGIpath allows the primary and secondary metabolisms of the different fungal species represented in the database to be compared easily, making it possible to assess the level of specificity of various pathways at different taxonomic distances. It is freely accessible at http://www.fungipath.u-psud.fr.

Conclusions

As more and more fungal genomes are expected to be sequenced during the coming years, FUNGIpath should help progressively to reconstruct the ancestral primary and secondary metabolisms of the main branches of the fungal tree of life and to elucidate the evolution of these ancestral fungal metabolisms to various specific derived metabolisms.

Keywords


Article
Transcriptional changes in response to X chromosome dosage in the mouse: implications for X inactivation and the molecular basis of Turner Syndrome

Authors: Lopes Alexandra M --- Burgoyne Paul S --- Ojarikre Andrew --- Bauer Julien --- et al.
DOI: 10.1186/1471-2164-11-82
Pages: 82
Loading...
Loading...
Abstract

Abstract

Background

X monosomic mice (39,XO) have a remarkably mild phenotype when compared to women with Turner syndrome (45,XO). The generally accepted hypothesis to explain this discrepancy is that the number of genes on the mouse X chromosome which escape X inactivation, and thus are expressed at higher levels in females, is very small. However this hypothesis has never been tested and only a small number of genes have been assayed for their X-inactivation status in the mouse. We performed a global expression analysis in four somatic tissues (brain, liver, kidney and muscle) of adult 40,XX and 39,XO mice using the Illumina Mouse WG-6 v1_1 Expression BeadChip and an extensive validation by quantitative real time PCR, in order to identify which genes are expressed from both X chromosomes.

Results

We identified several genes on the X chromosome which are overexpressed in XX females, including those previously reported as escaping X inactivation, as well as new candidates. However, the results obtained by microarray and qPCR were not fully concordant, illustrating the difficulty in ascertaining modest fold changes, such as those expected for genes escaping X inactivation. Remarkably, considerable variation was observed between tissues, suggesting that inactivation patterns may be tissue-dependent. Our analysis also exposed several autosomal genes involved in mitochondrial metabolism and in protein translation which are differentially expressed between XX and XO mice, revealing secondary transcriptional changes to the alteration in X chromosome dosage.

Conclusions

Our results support the prediction that the mouse inactive X chromosome is largely silent, while providing a list of the genes potentially escaping X inactivation in rodents. Although the lower expression of X-linked genes in XO mice may not be relevant in the particular tissues/systems which are affected in human X chromosome monosomy, genes deregulated in XO mice are good candidates for further study in an involvement in Turner Syndrome phenotype.

Keywords


Article
Comparative genomics reveals selective distribution and domain organization of FYVE and PX domain proteins across eukaryotic lineages

Authors: Banerjee Sumana --- Basu Soumalee --- Sarkar Srimonti
DOI: 10.1186/1471-2164-11-83
Pages: 83
Loading...
Loading...
Abstract

Abstract

Background

Phosphatidylinositol 3-phosphate is involved in regulation of several key cellular processes, mainly endocytosis, signaling, nuclear processes, cytoskeletal remodelling, cell survival, membrane trafficking, phagosome maturation and autophagy. In most cases effector proteins bind to this lipid, using either FYVE or PX domain. These two domains are distributed amongst varied life forms such as virus, protists, fungi, viridiplantae and metazoa. As the binding ligand is identical for both domains, the goal of this study was to understand if there is any selectivity for either of these domains in different taxa. Further, to understand the different cellular functions that these domains may be involved in, we analyzed the taxonomic distribution of additional domains that associate with FYVE and PX.

Results

There is selectivity for either FYVE or PX in individual genomes where both domains are present. Fungi and metazoa encode more PX, whereas streptophytes in viridiplantae encode more FYVE. Excess of FYVE in streptophytes results from proteins containing RCC1and DZC domains and FYVE domains in these proteins have a non-canonical ligand-binding site. Within a taxonomic group the selected domain associates with a higher number of other domains and is thus expected to discharge a larger number of cellular functions. Also, while certain associated domains are present in all taxonomic groups, most of them are unique to a specific group indicating that while certain common functions are discharged by these domains in all taxonomic groups, some functions appear to be group specific.

Conclusions

Although both FYVE and PX bind to PtdIns(3)P, genomes of different taxa show distinct selectivity of encoding either of the two. Higher numbers of taxonomic group specific domains co-occur with the more abundant domain (FYVE/PX) indicating that group-specific rare domain architectures might have emerged to accomplish certain group-specific functions.

Keywords


Article
Gene prioritization in Type 2 Diabetes using domain interactions and network analysis

Authors: Sharma Amitabh --- Chavali Sreenivas --- Tabassum Rubina --- Tandon Nikhil --- et al.
DOI: 10.1186/1471-2164-11-84
Pages: 84
Loading...
Loading...
Abstract

Abstract

Background

Identification of disease genes for Type 2 Diabetes (T2D) by traditional methods has yielded limited success. Based on our previous observation that T2D may result from disturbed protein-protein interactions affected through disrupting modular domain interactions, here we have designed an approach to rank the candidates in the T2D linked genomic regions as plausible disease genes.

Results

Our approach integrates Weight value (Wv) method followed by prioritization using clustering coefficients derived from domain interaction network. Wv for each candidate is calculated based on the assumption that disease genes might be functionally related, mainly facilitated by interactions among domains of the interacting proteins. The benchmarking using a test dataset comprising of both known T2D genes and non-T2D genes revealed that Wv method had a sensitivity and specificity of 0.74 and 0.96 respectively with 9 fold enrichment. The candidate genes having a Wv > 0.5 were called High Weight Elements (HWEs). Further, we ranked HWEs by using the network property-the clustering coefficient (Ci). Each HWE with a Ci < 0.015 was prioritized as plausible disease candidates (HWEc) as previous studies indicate that disease genes tend to avoid dense clustering (with an average Ci of 0.015). This method further prioritized the identified disease genes with a sensitivity of 0.32 and a specificity of 0.98 and enriched the candidate list by 6.8 fold. Thus, from the dataset of 4052 positional candidates the method ranked 435 to be most likely disease candidates. The gene ontology sharing for the candidates showed higher representation of metabolic and signaling processes. The approach also captured genes with unknown functions which were characterized by network motif analysis.

Conclusions

Prioritization of positional candidates is essential for cost-effective and an expedited discovery of disease genes. Here, we demonstrate a novel approach for disease candidate prioritization from numerous loci linked to T2D.

Keywords


Article
MicroRNAs show diverse and dynamic expression patterns in multiple tissues of Bombyx mori

Authors: Liu Shiping --- Gao Song --- Zhang Danyu --- Yin Jiyun --- et al.
DOI: 10.1186/1471-2164-11-85
Pages: 85
Loading...
Loading...
Abstract

Abstract

Background

MicroRNAs (miRNAs) repress target genes at the post-transcriptional level, and function in the development and cell-lineage pathways of host species. Tissue-specific expression of miRNAs is highly relevant to their physiological roles in the corresponding tissues. However, to date, few miRNAs have been spatially identified in the silkworm.

Results

We establish for the first time the spatial expression patterns of nearly 100 miRNAs in multiple normal tissues (organs) of Bombyx mori females and males using microarray and Northern-blotting analyses. In all, only 10 miRNAs were universally distributed (including bmo-let-7 and bmo-bantam), while the majority were expressed exclusively or preferentially in specific tissue types (e.g., bmo-miR-275 and bmo-miR-1). Additionally, we examined the developmental patterns of miRNA expression during metamorphosis of the body wall, silk glands, midgut and fat body. In total, 63 miRNAs displayed significant alterations in abundance in at least 1 tissue during the developmental transition from larvae to pupae (e.g., bmo-miR-263b and bmo-miR-124). Expression patterns of five miRNAs were significantly increased during metamorphosis in all four tissues (e.g., bmo-miR-275 and bmo-miR-305), and two miRNA pairs, bmo-miR-10b-3p/5p and bmo-miR-281-3p/5p, showed coordinate expression.

Conclusions

In this study, we conducted preliminary spatial measurements of several miRNAs in the silkworm. Periods of rapid morphological change were associated with alterations in miRNA expression patterns in the body wall, silk glands, midgut and fat body during metamorphosis. Accordingly, we propose that corresponding ubiquitous or tissue-specific expression of miRNAs supports their critical roles in tissue specification. These results should facilitate future functional analyses.

Keywords


Article
Comparative genomic and proteomic analyses of two Mycoplasma agalactiae strains: clues to the macro- and micro-events that are shaping mycoplasma diversity

Loading...
Loading...
Abstract

Abstract

Background

While the genomic era is accumulating a tremendous amount of data, the question of how genomics can describe a bacterial species remains to be fully addressed. The recent sequencing of the genome of the Mycoplasma agalactiae type strain has challenged our general view on mycoplasmas by suggesting that these simple bacteria are able to exchange significant amount of genetic material via horizontal gene transfer. Yet, events that are shaping mycoplasma genomes and that are underlining diversity within this species have to be fully evaluated. For this purpose, we compared two strains that are representative of the genetic spectrum encountered in this species: the type strain PG2 which genome is already available and a field strain, 5632, which was fully sequenced and annotated in this study.

Results

The two genomes differ by ca. 130 kbp with that of 5632 being the largest (1006 kbp). The make up of this additional genetic material mainly corresponds (i) to mobile genetic elements and (ii) to expanded repertoire of gene families that encode putative surface proteins and display features of highly-variable systems. More specifically, three entire copies of a previously described integrative conjugative element are found in 5632 that accounts for ca. 80 kbp. Other mobile genetic elements, found in 5632 but not in PG2, are the more classical insertion sequences which are related to those found in two other ruminant pathogens, M. bovis and M. mycoides subsp. mycoides SC. In 5632, repertoires of gene families encoding surface proteins are larger due to gene duplication. Comparative proteomic analyses of the two strains indicate that the additional coding capacity of 5632 affects the overall architecture of the surface and suggests the occurrence of new phase variable systems based on single nucleotide polymorphisms.

Conclusion

Overall, comparative analyses of two M. agalactiae strains revealed a very dynamic genome which structure has been shaped by gene flow among ruminant mycoplasmas and expansion-reduction of gene repertoires encoding surface proteins, the expression of which is driven by localized genetic micro-events.

Keywords


Article
Systematic overrepresentation of DNA termini and underrepresentation of subterminal regions among sequencing templates prepared from hydrodynamically sheared linear DNA molecules

Authors: Schwartz Sherri L --- Farman Mark L
DOI: 10.1186/1471-2164-11-87
Pages: 87
Loading...
Loading...
Abstract

Abstract

Background

Analysis of fungal genome sequence assemblies reveals that telomeres are poorly represented even though telomeric reads tend to be superabundant. We surmised that the problem might lie in the DNA shearing conditions used to create clone libraries for genome sequencing.

Results

A shotgun strategy was used to sequence and assemble circular and linear cosmid DNAs sheared using conditions typical for a genome project. The DNA sheared in circular form assembled into a single sequence contig. However, the linearized cosmid produced an incomplete assembly because the two DNA termini, though greatly overrepresented in the clone library used for sequencing, were separated from neighboring sequences by gaps of ~1.4 and 1.8 kb. These gap sizes were reduced, but not eliminated, by shearing the linear cosmid into smaller fragments. Mapping of shearing breakpoints revealed a paucity of breaks in the subterminal regions of the linearized cosmid and also near chromosome ends of the fungus Neurospora crassa.

Conclusion

Together, our data indicate that the ends of linear DNA molecules are recalcitrant to hydrodynamic shearing. We propose that this causes DNA termini to be overrepresented in the resulting fragment population but ultimately prevents their incorporation into sequence assemblies.

Keywords


Article
Whole-genome sequencing of a laboratory-evolved yeast strain

Authors: Araya Carlos L --- Payen Celia --- Dunham Maitreya J --- Fields Stanley
DOI: 10.1186/1471-2164-11-88
Pages: 88
Loading...
Loading...
Abstract

Abstract

Background

Experimental evolution of microbial populations provides a unique opportunity to study evolutionary adaptation in response to controlled selective pressures. However, until recently it has been difficult to identify the precise genetic changes underlying adaptation at a genome-wide scale. New DNA sequencing technologies now allow the genome of parental and evolved strains of microorganisms to be rapidly determined.

Results

We sequenced >93.5% of the genome of a laboratory-evolved strain of the yeast Saccharomyces cerevisiae and its ancestor at >28× depth. Both single nucleotide polymorphisms and copy number amplifications were found, with specific gains over array-based methodologies previously used to analyze these genomes. Applying a segmentation algorithm to quantify structural changes, we determined the approximate genomic boundaries of a 5× gene amplification. These boundaries guided the recovery of breakpoint sequences, which provide insights into the nature of a complex genomic rearrangement.

Conclusions

This study suggests that whole-genome sequencing can provide a rapid approach to uncover the genetic basis of evolutionary adaptations, with further applications in the study of laboratory selections and mutagenesis screens. In addition, we show how single-end, short read sequencing data can provide detailed information about structural rearrangements, and generate predictions about the genomic features and processes that underlie genome plasticity.

Keywords


Article
Highly conserved motifs in non-coding regions of Sirevirus retrotransposons: the key for their pattern of distribution within and across plants?

Authors: Bousios Alexandros --- Darzentas Nikos --- Tsaftaris Athanasios --- Pearce Stephen R
DOI: 10.1186/1471-2164-11-89
Pages: 89
Loading...
Loading...
Abstract

Abstract

Background

Retrotransposons are key players in the evolution of eukaryotic genomes. Moreover, it is now known that some retrotransposon classes, like the abundant and plant-specific Sireviruses, have intriguingly distinctive host preferences. Yet, it is largely unknown if this bias is supported by different genome structures.

Results

We performed sensitive comparative analysis of the genomes of a large set of Ty1/copia retrotransposons. We discovered that Sireviruses are unique among Pseudoviridae in that they constitute an ancient genus characterized by vastly divergent members, which however contain highly conserved motifs in key non-coding regions: multiple polypurine tract (PPT) copies cluster upstream of the 3' long terminal repeat (3'LTR), of which the terminal PPT tethers to a distinctive attachment site and is flanked by a precisely positioned inverted repeat. Their LTRs possess a novel type of repeated motif (RM) defined by its exceptionally high copy number, symmetry and core CGG-CCG signature. These RM boxes form CpG islands and lie a short distance upstream of a conserved promoter region thus hinting towards regulatory functions. Intriguingly, in the envelope-containing Sireviruses additional boxes cluster at the 5' vicinity of the envelope. The 5'LTR/internal domain junction and a polyC-rich integrase signal are also highly conserved domains of the Sirevirus genome.

Conclusions

Our comparative analysis of retrotransposon genomes using advanced in silico methods highlighted the unique genome organization of Sireviruses. Their structure may dictate a life cycle with different regulation and transmission strategy compared to other Pseudoviridae, which may contribute towards their pattern of distribution within and across plants.

Keywords


Article
Transcriptional response of Burkholderia cenocepacia J2315 sessile cells to treatments with high doses of hydrogen peroxide and sodium hypochlorite

Authors: Peeters Elke --- Sass Andrea --- Mahenthiralingam Eshwar --- Nelis Hans --- et al.
DOI: 10.1186/1471-2164-11-90
Pages: 90
Loading...
Loading...
Abstract

Abstract

Background

Burkholderia cepacia complex bacteria are opportunistic pathogens, which can cause severe respiratory tract infections in patients with cystic fibrosis (CF). As treatment of infected CF patients is problematic, multiple preventive measures are taken to reduce the infection risk. Besides a stringent segregation policy to prevent patient-to-patient transmission, clinicians also advise patients to clean and disinfect their respiratory equipment on a regular basis. However, problems regarding the efficacy of several disinfection procedures for the removal and/or killing of B. cepacia complex bacteria have been reported. In order to unravel the molecular mechanisms involved in the resistance of biofilm-grown Burkholderia cenocepacia cells against high concentrations of reactive oxygen species (ROS), the present study focussed on the transcriptional response in sessile B. cenocepacia J2315 cells following exposure to high levels of H2O2 or NaOCl.

Results

The exposure to H2O2 and NaOCl resulted in an upregulation of the transcription of 315 (4.4%) and 386 (5.4%) genes, respectively. Transcription of 185 (2.6%) and 331 (4.6%) genes was decreased in response to the respective treatments. Many of the upregulated genes in the NaOCl- and H2O2-treated biofilms are involved in oxidative stress as well as general stress response, emphasizing the importance of the efficient neutralization and scavenging of ROS. In addition, multiple upregulated genes encode proteins that are necessary to repair ROS-induced cellular damage. Unexpectedly, a prolonged treatment with H2O2 also resulted in an increased transcription of multiple phage-related genes. A closer inspection of hybridisation signals obtained with probes targeting intergenic regions led to the identification of a putative 6S RNA.

Conclusion

Our results reveal that the transcription of a large fraction of B. cenocepacia J2315 genes is altered upon exposure of sessile cells to ROS. These observations have highlighted that B. cenocepacia may alter several pathways in response to exposure to ROS and they have led to the identification of many genes not previously implicated in the stress response of this pathogen.

Keywords


Article
Complete genome sequence and lifestyle of black-pigmented Corynebacterium aurimucosum ATCC 700975 (formerly C. nigricans CN-1) isolated from a vaginal swab of a woman with spontaneous abortion

Loading...
Loading...
Abstract

Abstract

Background

Corynebacterium aurimucosum is a slightly yellowish, non-lipophilic, facultative anaerobic member of the genus Corynebacterium and predominantly isolated from human clinical specimens. Unusual black-pigmented variants of C. aurimucosum (originally named as C. nigricans) continue to be recovered from the female urogenital tract and they are associated with complications during pregnancy. C. aurimucosum ATCC 700975 (C. nigricans CN-1) was originally isolated from a vaginal swab of a 34-year-old woman who experienced a spontaneous abortion during month six of pregnancy. For a better understanding of the physiology and lifestyle of this potential urogenital pathogen, the complete genome sequence of C. aurimucosum ATCC 700975 was determined.

Results

Sequencing and assembly of the C. aurimucosum ATCC 700975 genome yielded a circular chromosome of 2,790,189 bp in size and the 29,037-bp plasmid pET44827. Specific gene sets associated with the central metabolism of C. aurimucosum apparently provide enhanced metabolic flexibility and adaptability in aerobic, anaerobic and low-pH environments, including gene clusters for the uptake and degradation of aromatic amines, L-histidine and L-tartrate as well as a gene region for the formation of selenocysteine and its incorporation into formate dehydrogenase. Plasmid pET44827 codes for a non-ribosomal peptide synthetase that plays the pivotal role in the synthesis of the characteristic black pigment of C. aurimucosum ATCC 700975.

Conclusions

The data obtained by the genome project suggest that C. aurimucosum could be both a resident of the human gut and possibly a pathogen in the female genital tract causing complications during pregnancy. Since hitherto all black-pigmented C. aurimucosum strains have been recovered from female genital source, biosynthesis of the pigment is apparently required for colonization by protecting the bacterial cells against the high hydrogen peroxide concentration in the vaginal environment. The location of the corresponding genes on plasmid pET44827 explains why black-pigmented (formerly C. nigricans) and non-pigmented C. aurimucosum strains were isolated from clinical specimens.

Keywords


Article
Identification of arginine- and lysine-methylation in the proteome of Saccharomyces cerevisiae and its functional implications

Authors: Pang Chi --- Gasteiger Elisabeth --- Wilkins Marc R
DOI: 10.1186/1471-2164-11-92
Pages: 92
Loading...
Loading...
Abstract

Abstract

Background

The methylation of eukaryotic proteins has been proposed to be widespread, but this has not been conclusively shown to date. In this study, we examined 36,854 previously generated peptide mass spectra from 2,607 Saccharomyces cerevisiae proteins for the presence of arginine and lysine methylation. This was done using the FindMod tool and 5 filters that took advantage of the high number of replicate analysis per protein and the presence of overlapping peptides.

Results

A total of 83 high-confidence lysine and arginine methylation sites were found in 66 proteins. Motif analysis revealed many methylated sites were associated with MK, RGG/RXG/RGX or WXXXR motifs. Functionally, methylated proteins were significantly enriched for protein translation, ribosomal biogenesis and assembly and organellar organisation and were predominantly found in the cytoplasm and ribosome. Intriguingly, methylated proteins were seen to have significantly longer half-life than proteins for which no methylation was found. Some 43% of methylated lysine sites were predicted to be amenable to ubiquitination, suggesting methyl-lysine might block the action of ubiquitin ligase.

Conclusions

This study suggests protein methylation to be quite widespread, albeit associated with specific functions. Large-scale tandem mass spectroscopy analyses will help to further confirm the modifications reported here.

Keywords


Article
Comparative transcriptome analysis coupled to X-ray CT reveals sucrose supply and growth velocity as major determinants of potato tuber starch biosynthesis

Authors: Ferreira Stephanus J --- Senning Melanie --- Sonnewald Sophia --- Keßling Petra-Maria --- et al.
DOI: 10.1186/1471-2164-11-93
Pages: 93
Loading...
Loading...
Abstract

Abstract

Background

Even though the process of potato tuber starch biosynthesis is well understood, mechanisms regulating biosynthesis are still unclear. Transcriptome analysis provides valuable information as to how genes are regulated. Therefore, this work aimed at investigating transcriptional regulation of starch biosynthetic genes in leaves and tubers of potato plants under various conditions. More specifically we looked at gene expression diurnally in leaves and tubers, during tuber induction and in tubers growing at different velocities. To determine velocity of potato tuber growth a new method based on X-ray Computed Tomography (X-ray CT) was established.

Results

Comparative transcriptome analysis between leaves and tubers revealed striking similarities with the same genes being differentially expressed in both tissues. In tubers, oscillation of granule bound starch synthase (GBSS) expression) was observed which could be linked to sucrose supply from source leaves. X-ray CT was used to determine time-dependent changes in tuber volume and the growth velocity was calculated. Although there is not a linear correlation between growth velocity and expression of starch biosynthetic genes, there are significant differences between growing and non-growing tubers. Co-expression analysis was used to identify transcription factors positively correlating with starch biosynthetic genes possibly regulating starch biosynthesis.

Conclusion

Most starch biosynthetic enzymes are encoded by gene families. Co-expression analysis revealed that the same members of these gene families are co-regulated in leaves and tubers. This suggests that regulation of transitory and storage starch biosynthesis in leaves and tubers, respectively, is surprisingly similar. X-ray CT can be used to monitor growth and development of belowground organs and allows to link tuber growth to changes in gene expression. Comparative transcriptome analysis provides a useful tool to identify transcription factors possibly involved in the regulation of starch biosynthesis.

Keywords


Article
Development of a EST dataset and characterization of EST-SSRs in a traditional Chinese medicinal plant, Epimedium sagittatum (Sieb. Et Zucc.) Maxim

Authors: Zeng Shaohua --- Xiao Gong --- Guo Juan --- Fei Zhangjun --- et al.
DOI: 10.1186/1471-2164-11-94
Pages: 94
Loading...
Loading...
Abstract

Abstract

Background

Epimedium sagittatum (Sieb. Et Zucc.) Maxim, a traditional Chinese medicinal plant species, has been used extensively as genuine medicinal materials. Certain Epimedium species are endangered due to commercial overexploition, while sustainable application studies, conservation genetics, systematics, and marker-assisted selection (MAS) of Epimedium is less-studied due to the lack of molecular markers. Here, we report a set of expressed sequence tags (ESTs) and simple sequence repeats (SSRs) identified in these ESTs for E. sagittatum.

Results

cDNAs of E. sagittatum are sequenced using 454 GS-FLX pyrosequencing technology. The raw reads are cleaned and assembled into a total of 76,459 consensus sequences comprising of 17,231 contigs and 59,228 singlets. About 38.5% (29,466) of the consensus sequences significantly match to the non-redundant protein database (E-value < 1e-10), 22,295 of which are further annotated using Gene Ontology (GO) terms. A total of 2,810 EST-SSRs is identified from the Epimedium EST dataset. Trinucleotide SSR is the dominant repeat type (55.2%) followed by dinucleotide (30.4%), tetranuleotide (7.3%), hexanucleotide (4.9%), and pentanucleotide (2.2%) SSR. The dominant repeat motif is AAG/CTT (23.6%) followed by AG/CT (19.3%), ACC/GGT (11.1%), AT/AT (7.5%), and AAC/GTT (5.9%). Thirty-two SSR-ESTs are randomly selected and primer pairs are synthesized for testing the transferability across 52 Epimedium species. Eighteen primer pairs (85.7%) could be successfully transferred to Epimedium species and sixteen of those show high genetic diversity with 0.35 of observed heterozygosity (Ho) and 0.65 of expected heterozygosity (He) and high number of alleles per locus (11.9).

Conclusion

A large EST dataset with a total of 76,459 consensus sequences is generated, aiming to provide sequence information for deciphering secondary metabolism, especially for flavonoid pathway in Epimedium. A total of 2,810 EST-SSRs is identified from EST dataset and ~1580 EST-SSR markers are transferable. E. sagittatum EST-SSR transferability to the major Epimedium germplasm is up to 85.7%. Therefore, this EST dataset and EST-SSRs will be a powerful resource for further studies such as taxonomy, molecular breeding, genetics, genomics, and secondary metabolism in Epimedium species.

Keywords


Article
Genome-wide analysis of rice ClpB/HSP100, ClpC and ClpD genes

Authors: Singh Amanjot --- Singh Upasana --- Mittal Dheeraj --- Grover Anil
DOI: 10.1186/1471-2164-11-95
Pages: 95
Loading...
Loading...
Abstract

Abstract

Background

ClpB-cyt/HSP100 protein acts as chaperone, mediating disaggregation of denatured proteins. Previous studies have shown that ClpB-cyt/HSP100 gene belongs to the group class I Clp ATPase proteins and ClpB-cyt/HSP100 transcript is regulated by heat stress and developmental cues.

Results

Nine ORFs were noted to constitute rice class I Clp ATPases in the following manner: 3 ClpB proteins (ClpB-cyt, Os05g44340; ClpB-m, Os02g08490; ClpB-c, Os03g31300), 4 ClpC proteins (ClpC1, Os04g32560; ClpC2, Os12g12580; ClpC3, Os11g16590; ClpC4, Os11g16770) and 2 ClpD proteins (ClpD1, Os02g32520; ClpD2, Os04g33210). Using the respective signal sequences cloned upstream to GFP/CFP reporter proteins and transient expression studies with onion epidermal cells, evidence is provided that rice ClpB-m and Clp-c proteins are indeed localized to their respective cell locations mitochondria and chloroplasts, respectively. Associated with their diverse cell locations, domain structures of OsClpB-c, OsClpB-m and OsClpB-cyt proteins are noted to possess a high-level conservation. OsClpB-cyt transcript is shown to be enriched at milk and dough stages of seed development. While expression of OsClpB-m was significantly less as compared to its cytoplasmic and chloroplastic counterparts in different tissues, this transcript showed highest heat-induced expression amongst the 3 ClpB proteins. OsClpC1 and OsClpC2 are predicted to be chloroplast-localized as is the case with all known plant ClpC proteins. However, the fact that OsClpC3 protein appears mitochondrial/chloroplastic with equal probability and OsClpC4 a plasma membrane protein reflects functional diversity of this class. Different class I Clp ATPase transcripts were noted to be cross-induced by a host of different abiotic stress conditions. Complementation assays of Δhsp104 mutant yeast cells showed that OsClpB-cyt, OsClpB-m, OsClpC1 and OsClpD1 have significantly positive effects. Remarkably, OsClpD1 gene imparted appreciably high level tolerance to the mutant yeast cells.

Conclusions

Rice class I Clp ATPase gene family is constituted of 9 members. Of these 9, only 3 belonging to ClpB group are heat stress regulated. Distribution of ClpB proteins to different cell organelles indicates that their functioning might be critical in different cell locations. From the complementation assays, OsClpD1 appears to be more effective than OsClpB-cyt protein in rescuing the thermosensitive defect of the yeast ScΔhsp104 mutant cells.

Keywords


Article
Variability of gene expression profiles in human blood and lymphoblastoid cell lines

Loading...
Loading...
Abstract

Abstract

Background

Readily accessible samples such as peripheral blood or cell lines are increasingly being used in large cohorts to characterise gene expression differences between a patient group and healthy controls. However, cell and RNA isolation procedures and the variety of cell types that make up whole blood can affect gene expression measurements. We therefore systematically investigated global gene expression profiles in peripheral blood from six individuals collected during two visits by comparing five of the following cell and RNA isolation methods: whole blood (PAXgene), peripheral blood mononuclear cells (PBMCs), lymphoblastoid cell lines (LCLs), CD19 and CD20 specific B-cell subsets.

Results

Gene expression measurements were clearly discriminated by isolation method although the reproducibility was high for all methods (range ρ = 0.90-1.00). The PAXgene samples showed a decrease in the number of expressed genes (P < 1*10-16) with higher variability (P < 1*10-16) compared to the other methods. Differentially expressed probes between PAXgene and PBMCs were correlated with the number of monocytes, lymphocytes, neutrophils or erythrocytes. The correlations (ρ = 0.83; ρ = 0.79) of the expression levels of detected probes between LCLs and B-cell subsets were much lower compared to the two B-cell isolation methods (ρ = 0.98). Gene ontology analysis of detected genes showed that genes involved in inflammatory responses are enriched in B-cells CD19 and CD20 whereas genes involved in alcohol metabolic process and the cell cycle were enriched in LCLs.

Conclusion

Gene expression profiles in blood-based samples are strongly dependent on the predominant constituent cell type(s) and RNA isolation method. It is crucial to understand the differences and variability of gene expression measurements between cell and RNA isolation procedures, and their relevance to disease processes, before application in large clinical studies.

Keywords


Article
Homeolog loss and expression changes in natural populations of the recently and repeatedly formed allotetraploid Tragopogon mirus (Asteraceae)

Authors: Koh Jin --- Soltis Pamela S --- Soltis Douglas E
DOI: 10.1186/1471-2164-11-97
Pages: 97
Loading...
Loading...
Abstract

Abstract

Background

Although polyploidy has long been recognized as a major force in the evolution of plants, most of what we know about the genetic consequences of polyploidy comes from the study of crops and model systems. Furthermore, although many polyploid species have formed repeatedly, patterns of genome evolution and gene expression are largely unknown for natural polyploid populations of independent origin. We therefore examined patterns of loss and expression in duplicate gene pairs (homeologs) in multiple individuals from seven natural populations of independent origin of Tragopogon mirus (Asteraceae), an allopolyploid that formed repeatedly within the last 80 years from the diploids T. dubius and T. porrifolius.

Results

Using cDNA-AFLPs, we found differential band patterns that could be attributable to gene silencing, novel expression, and/or maternal/paternal effects between T. mirus and its diploid parents. Subsequent cleaved amplified polymorphic sequence (CAPS) analyses of genomic DNA and cDNA revealed that 20 of the 30 genes identified through cDNA-AFLP analysis showed additivity, whereas nine of the 30 exhibited the loss of one parental homeolog in at least one individual. Homeolog loss (versus loss of a restriction site) was confirmed via sequencing. The remaining gene (ADENINE-DNA GLYCOSYLASE) showed ambiguous patterns in T. mirus because of polymorphism in the diploid parent T. dubius. Most (63.6%) of the homeolog loss events were of the T. dubius parental copy. Two genes, NUCLEAR RIBOSOMAL DNA and GLYCERALDEHYDE-3-PHOSPHATE DEHYDROGENASE, showed differential expression of the parental homeologs, with the T. dubius copy silenced in some individuals of T. mirus.

Conclusions

Genomic and cDNA CAPS analyses indicated that plants representing multiple populations of this young natural allopolyploid have experienced frequent and preferential elimination of homeologous loci. Comparable analyses of synthetic F1 hybrids showed only additivity. These results suggest that loss of homeologs and changes in gene expression are not the immediate result of hybridization, but are processes that occur following polyploidization, occurring during the early (<40) generations of the young polyploid. Both T. mirus and a second recently formed allopolyploid, T. miscellus, exhibit more homeolog losses than gene silencing events. Furthermore, both allotetraploids undergo biased loss of homeologs contributed by their shared diploid parent, T. dubius. Further studies are required to assess whether the results for the 30 genes so far examined are representative of the entire genome.

Keywords


Article
Transcriptional profiling of Actinobacillus pleuropneumoniae during the acute phase of a natural infection in pigs

Authors: Deslandes Vincent --- Denicourt Martine --- Girard Christiane --- Harel Josée --- et al.
DOI: 10.1186/1471-2164-11-98
Pages: 98
Loading...
Loading...
Abstract

Abstract

Background

Actinobacillus pleuropneumoniae is the etiological agent of porcine pleuropneumonia, a respiratory disease which causes great economic losses worldwide. Many virulence factors are involved in the pathogenesis, namely capsular polysaccharides, RTX toxins, LPS and many iron acquisition systems. In order to identify genes that are expressed in vivo during a natural infection, we undertook transcript profiling experiments with an A. pleuropneumoniae DNA microarray, after recovery of bacterial mRNAs from serotype 5b-infected porcine lungs. AppChip2 contains 2033 PCR amplicons based on the genomic sequence of App serotype 5b strain L20, representing more than 95% of ORFs greater than 160 bp in length.

Results

Transcriptional profiling of A. pleuropneumoniae recovered from the lung of a pig suffering from a natural infection or following growth of the bacterial isolate in BHI medium was performed. An RNA extraction protocol combining beadbeating and hot-acid-phenol was developed in order to maximize bacterial mRNA yields and quality following total RNA extraction from lung lesions. Nearly all A. pleuropneumoniae transcripts could be detected on our microarrays, and 150 genes were deemed differentially expressed in vivo during the acute phase of the infection. Our results indicate that, for example, gene apxIVA from an operon coding for RTX toxin ApxIV is highly up-regulated in vivo, and that two genes from the operon coding for type IV fimbriae (APL_0878 and APL_0879) were also up-regulated. These transcriptional profiling data, combined with previous comparative genomic hybridizations performed by our group, revealed that 66 out of the 72 up-regulated genes are conserved amongst all serotypes and that 3 of them code for products that are predicted outer membrane proteins (genes irp and APL_0959, predicted to code for a TonB-dependent receptor and a filamentous hemagglutinin/adhesin respectively) or lipoproteins (gene APL_0920). Only 4 of 72 up-regulated genes had previously been identified in controled experimental infections.

Conclusions

These genes that we have identified as up-regulated in vivo, conserved across serotypes and coding for potential outer membrane proteins represent potential candidates for the development of a cross-protective vaccine against porcine pleuropneumonia.

Keywords


Article
Trichomonas vaginalis vast BspA-like gene family: evidence for functional diversity from structural organisation and transcriptomics

Authors: Noël Christophe J --- Diaz Nicia --- Sicheritz-Ponten Thomas --- Safarikova Lucie --- et al.
DOI: 10.1186/1471-2164-11-99
Pages: 99
Loading...
Loading...
Abstract

Abstract

Background

Trichomonas vaginalis is the most common non-viral human sexually transmitted pathogen and importantly, contributes to facilitating the spread of HIV. Yet very little is known about its surface and secreted proteins mediating interactions with, and permitting the invasion and colonisation of, the host mucosa. Initial annotations of T. vaginalis genome identified a plethora of candidate extracellular proteins.

Results

Data mining of the T. vaginalis genome identified 911 BspA-like entries (TvBspA) sharing TpLRR-like leucine-rich repeats, which represent the largest gene family encoding potential extracellular proteins for the pathogen. A broad range of microorganisms encoding BspA-like proteins was identified and these are mainly known to live on mucosal surfaces, among these T. vaginalis is endowed with the largest gene family. Over 190 TvBspA proteins with inferred transmembrane domains were characterised by a considerable structural diversity between their TpLRR and other types of repetitive sequences and two subfamilies possessed distinct classic sorting signal motifs for endocytosis. One TvBspA subfamily also shared a glycine-rich protein domain with proteins from Clostridium difficile pathogenic strains and C. difficile phages. Consistent with the hypothesis that TvBspA protein structural diversity implies diverse roles, we demonstrated for several TvBspA genes differential expression at the transcript level in different growth conditions. Identified variants of repetitive segments between several TvBspA paralogues and orthologues from two clinical isolates were also consistent with TpLRR and other repetitive sequences to be functionally important. For one TvBspA protein cell surface expression and antibody responses by both female and male T. vaginalis infected patients were also demonstrated.

Conclusions

The biased mucosal habitat for microbial species encoding BspA-like proteins, the characterisation of a vast structural diversity for the TvBspA proteins, differential expression of a subset of TvBspA genes and the cellular localisation and immunological data for one TvBspA; all point to the importance of the TvBspA proteins to various aspects of T. vaginalis pathobiology at the host-pathogen interface.

Keywords


Article
Annotation and classification of the bovine T cell receptor delta genes

Authors: Herzig Carolyn TA --- Lefranc Marie-Paule --- Baldwin Cynthia L
DOI: 10.1186/1471-2164-11-100
Pages: 100
Loading...
Loading...
Abstract

Abstract

Background

γδ T cells differ from αβ T cells with regard to the types of antigen with which their T cell receptors interact; γδ T cell antigens are not necessarily peptides nor are they presented on MHC. Cattle are considered a "γδ T cell high" species indicating they have an increased proportion of γδ T cells in circulation relative to that in "γδ T cell low" species such as humans and mice. Prior to the onset of the studies described here, there was limited information regarding the genes that code for the T cell receptor delta chains of this γδ T cell high species.

Results

By annotating the bovine (Bos taurus) genome Btau_3.1 assembly the presence of 56 distinct T cell receptor delta (TRD) variable (V) genes were found, 52 of which belong to the TRDV1 subgroup and were co-mingled with the T cell receptor alpha variable (TRAV) genes. In addition, two genes belonging to the TRDV2 subgroup and single TRDV3 and TRDV4 genes were found. We confirmed the presence of five diversity (D) genes, three junctional (J) genes and a single constant (C) gene and describe the organization of the TRD locus. The TRDV4 gene is found downstream of the C gene and in an inverted orientation of transcription, consistent with its orthologs in humans and mice. cDNA evidence was assessed to validate expression of the variable genes and showed that one to five D genes could be incorporated into a single transcript. Finally, we grouped the bovine and ovine TRDV1 genes into sets based on their relatedness.

Conclusions

The bovine genome contains a large and diverse repertoire of TRD genes when compared to the genomes of "γδ T cell low" species. This suggests that in cattle γδ T cells play a more important role in immune function since they would be predicted to bind a greater variety of antigens.

Keywords


Article
Computational prediction and experimental validation of evolutionarily conserved microRNA target genes in bilaterian animals

Authors: Takane Kahori --- Fujishima Kosuke --- Watanabe Yuka --- Sato Asako --- et al.
DOI: 10.1186/1471-2164-11-101
Pages: 101
Loading...
Loading...
Abstract

Abstract

Background

In many eukaryotes, microRNAs (miRNAs) bind to complementary sites in the 3'-untranslated regions (3'-UTRs) of target messenger RNAs (mRNAs) and regulate their expression at the stage of translation. Recent studies have revealed that many miRNAs are evolutionarily conserved; however, the evolution of their target genes has yet to be systematically characterized. We sought to elucidate a set of conserved miRNA/target-gene pairs and to analyse the mechanism underlying miRNA-mediated gene regulation in the early stage of bilaterian evolution.

Results

Initially, we extracted five evolutionarily conserved miRNAs (let-7, miR-1, miR-124, miR-125/lin-4, and miR-34) among five diverse bilaterian animals. Subsequently, we designed a procedure to predict evolutionarily conserved miRNA/target-gene pairs by introducing orthologous gene information. As a result, we extracted 31 orthologous miRNA/target-gene pairs that were conserved among at least four diverse bilaterian animals; the prediction set showed prominent enrichment of orthologous miRNA/target-gene pairs that were verified experimentally. Approximately 84% of the target genes were regulated by three miRNAs (let-7, miR-1, and miR-124) and their function was classified mainly into the following categories: development, muscle formation, cell adhesion, and gene regulation. We used a reporter gene assay to experimentally verify the downregulation of six candidate pairs (out of six tested pairs) in HeLa cells.

Conclusions

The application of our new method enables the identification of 31 miRNA/target-gene pairs that were expected to have been regulated from the era of the common bilaterian ancestor. The downregulation of all six candidate pairs suggests that orthologous information contributed to the elucidation of the primordial set of genes that has been regulated by miRNAs; it was also an efficient tool for the elimination of false positives from the predicted candidates. In conclusion, our study identified potentially important miRNA-target pairs that were evolutionarily conserved throughout diverse bilaterian animals and that may provide new insights into early-stage miRNA functions.

Keywords


Article
Homopolymeric tracts represent a general regulatory mechanism in prokaryotes

Authors: Orsi Renato H --- Bowen Barbara M --- Wiedmann Martin
DOI: 10.1186/1471-2164-11-102
Pages: 102
Loading...
Loading...
Abstract

Abstract

Background

While, traditionally, regulation of gene expression can be grouped into transcriptional, translational, and post-translational mechanisms, some mechanisms of rapid genetic variation can also contribute to regulation of gene expression, e.g., phase variation.

Results

We show here that prokaryotes evolved to include homopolymeric tracts (HTs) within coding genes as a system that allows for efficient gene inactivation. Analyses of 81 bacterial and 18 archaeal genomes showed that poly(A) and poly(T) HTs are overrepresented in these genomes and preferentially located at the 5' end of coding genes. Location of HTs at the 5' end is not driven by a preferential placement of aminoacids encoded by the AAA and TTT codons at the N-terminal of proteins. The inlA gene of the pathogen L. monocytogenes was used as a model to further study the role of HTs in reversible gene inactivation. In a number of L. monocytogenes strains, inlA harbors a 5' poly(A) HT, which regularly shows frameshift mutation leading to expression of a truncated 8 aa InlA protein. Translational fusions of the inlA 5' end allowed us to estimate that the frequency of variation in this HT is about 1,000 fold higher than the estimated average point mutation frequency.

Conclusions

As frameshift mutations in HTs can occur at high frequencies and enable efficient gene inactivation, hypermutable HTs appear to represent a universal system for regulation of gene expression in prokaryotes. Combined with other studies indicating that HTs also enable rapid diversification of both coding and regulatory genetic sequences in eukaryotes, our data suggest that hypermutable HTs represent a general and rapid evolutionary mechanism facilitating adaptation and gene regulation across diverse organisms.

Keywords


Article
Distinct, ecotype-specific genome and proteome signatures in the marine cyanobacteria Prochlorococcus

Authors: Paul Sandip --- Dutta Anirban --- Bag Sumit K --- Das Sabyasachi --- et al.
DOI: 10.1186/1471-2164-11-103
Pages: 103
Loading...
Loading...
Abstract

Abstract

Background

The marine cyanobacterium Prochlorococcus marinus, having multiple ecotypes of distinct genotypic/phenotypic traits and being the first documented example of genome shrinkage in free-living organisms, offers an ideal system for studying niche-driven molecular micro-diversity in closely related microbes. The present study, through an extensive comparative analysis of various genomic/proteomic features of 6 high light (HL) and 6 low light (LL) adapted strains, makes an attempt to identify molecular determinants associated with their vertical niche partitioning.

Results

Pronounced strand-specific asymmetry in synonymous codon usage is observed exclusively in LL strains. Distinct dinucleotide abundance profiles are exhibited by 2 LL strains with larger genomes and G+C-content ≈ 50% (group LLa), 4 LL strains having reduced genomes and G+C-content ≈ 35-37% (group LLb), and 6 HL strains. Taking into account the emergence of LLa, LLb and HL strains (based on 16S rRNA phylogeny), a gradual increase in average aromaticity, pI values and beta- & coil-forming propensities and a decrease in mean hydrophobicity, instability indices and helix-forming propensities of core proteins are observed. Greater variations in orthologous gene repertoire are found between LLa and LLb strains, while higher number of positively selected genes exist between LL and HL strains.

Conclusion

Strains of different Prochlorococcus groups are characterized by distinct compositional, physicochemical and structural traits that are not mere remnants of a continuous genetic drift, but are potential outcomes of a grand scheme of niche-oriented stepwise diversification, that might have driven them chronologically towards greater stability/fidelity and invoked upon them a special ability to inhabit diverse oceanic environments.

Keywords


Article
Identification and analysis of in planta expressed genes of Magnaporthe oryzae

Authors: Kim Soonok --- Park Jongsun --- Park Sook-Young --- Mitchell Thomas K --- et al.
DOI: 10.1186/1471-2164-11-104
Pages: 104
Loading...
Loading...
Abstract

Abstract

Background

Infection of plants by pathogens and the subsequent disease development involves substantial changes in the biochemistry and physiology of both partners. Analysis of genes that are expressed during these interactions represents a powerful strategy to obtain insights into the molecular events underlying these changes. We have employed expressed sequence tag (EST) analysis to identify rice genes involved in defense responses against infection by the blast fungus Magnaporthe oryzae and fungal genes involved in infectious growth within the host during a compatible interaction.

Results

A cDNA library was constructed with RNA from rice leaves (Oryza sativa cv. Hwacheong) infected with M. oryzae strain KJ201. To enrich for fungal genes, subtraction library using PCR-based suppression subtractive hybridization was constructed with RNA from infected rice leaves as a tester and that from uninfected rice leaves as the driver. A total of 4,148 clones from two libraries were sequenced to generate 2,302 non-redundant ESTs. Of these, 712 and 1,562 ESTs could be identified to encode fungal and rice genes, respectively. To predict gene function, Gene Ontology (GO) analysis was applied, with 31% and 32% of rice and fungal ESTs being assigned to GO terms, respectively. One hundred uniESTs were found to be specific to fungal infection EST. More than 80 full-length fungal cDNA sequences were used to validate ab initio annotated gene model of M. oryzae genome sequence.

Conclusion

This study shows the power of ESTs to refine genome annotation and functional characterization. Results of this work have advanced our understanding of the molecular mechanisms underpinning fungal-plant interactions and formed the basis for new hypothesis.

Keywords


Article
Fungal Secretome Database: Integrated platform for annotation of fungal secretomes

Authors: Choi Jaeyoung --- Park Jongsun --- Kim Donghan --- Jung Kyongyong --- et al.
DOI: 10.1186/1471-2164-11-105
Pages: 105
Loading...
Loading...
Abstract

Abstract

Background

Fungi secrete various proteins that have diverse functions. Prediction of secretory proteins using only one program is unsatisfactory. To enhance prediction accuracy, we constructed Fungal Secretome Database (FSD).

Description

A three-layer hierarchical identification rule based on nine prediction programs was used to identify putative secretory proteins in 158 fungal/oomycete genomes (208,883 proteins, 15.21% of the total proteome). The presence of putative effectors containing known host targeting signals such as RXLX [EDQ] and RXLR was investigated, presenting the degree of bias along with the species. The FSD's user-friendly interface provides summaries of prediction results and diverse web-based analysis functions through Favorite, a personalized repository.

Conclusions

The FSD can serve as an integrated platform supporting researches on secretory proteins in the fungal kingdom. All data and functions described in this study can be accessed on the FSD web site at http://fsd.snu.ac.kr/.

Keywords


Article
U12 type introns were lost at multiple occasions during evolution

Authors: Bartschat Sebastian --- Samuelsson Tore
DOI: 10.1186/1471-2164-11-106
Pages: 106
Loading...
Loading...
Abstract

Abstract

Background

Two categories of introns are known, a common U2 type and a rare U12 type. These two types of introns are removed by distinct spliceosomes. The phylogenetic distribution of spliceosomal RNAs that are characteristic of the U12 spliceosome, i.e. the U11, U12, U4atac and U6atac RNAs, suggest that U12 spliceosomes were lost in many phylogenetic groups. We have now examined the distribution of U2 and U12 introns in many of these groups.

Results

U2 and U12 introns were predicted by making use of available EST and genomic sequences. The results show that in species or branches where U12 spliceosomal components are missing, also U12 type of introns are lacking. Examples are the choanoflagellate Monosiga brevicollis, Entamoeba histolytica, green algae, diatoms, and the fungal lineage Basidiomycota. Furthermore, whereas U12 splicing does not occur in Caenorhabditis elegans, U12 introns as well as U12 snRNAs are present in Trichinella spiralis, which is deeply branching in the nematode tree. A comparison of homologous genes in T. spiralis and C. elegans revealed different mechanisms whereby U12 introns were lost.

Conclusions

The phylogenetic distribution of U12 introns and spliceosomal RNAs give further support to an early origin of U12 dependent splicing. In addition, this distribution identifies a large number of instances during eukaryotic evolution where such splicing was lost.

Keywords


Article
Mapping main, epistatic and sex-specific QTL for body composition in a chicken population divergently selected for low or high growth rate

Loading...
Loading...
Abstract

Abstract

Background

Delineating the genetic basis of body composition is important to agriculture and medicine. In addition, the incorporation of gene-gene interactions in the statistical model provides further insight into the genetic factors that underlie body composition traits. We used Bayesian model selection to comprehensively map main, epistatic and sex-specific QTL in an F2 reciprocal intercross between two chicken lines divergently selected for high or low growth rate.

Results

We identified 17 QTL with main effects across 13 chromosomes and several sex-specific and sex-antagonistic QTL for breast meat yield, thigh + drumstick yield and abdominal fatness. Different sets of QTL were found for both breast muscles [Pectoralis (P) major and P. minor], which suggests that they could be controlled by different regulatory mechanisms. Significant interactions of QTL by sex allowed detection of sex-specific and sex-antagonistic QTL for body composition and abdominal fat. We found several female-specific P. major QTL and sex-antagonistic P. minor and abdominal fatness QTL. Also, several QTL on different chromosomes interact with each other to affect body composition and abdominal fatness.

Conclusions

The detection of main effects, epistasis and sex-dimorphic QTL suggest complex genetic regulation of somatic growth. An understanding of such regulatory mechanisms is key to mapping specific genes that underlie QTL controlling somatic growth in an avian model.

Keywords


Article
Genomics based analysis of interactions between developing B-lymphocytes and stromal cells reveal complex interactions and two-way communication

Authors: Zetterblad Jenny --- Qian Hong --- Zandi Sasan --- Månsson Robert --- et al.
DOI: 10.1186/1471-2164-11-108
Pages: 108
Loading...
Loading...
Abstract

Abstract

Background

The use of functional genomics has largely increased our understanding of cell biology and promises to help the development of systems biology needed to understand the complex order of events that regulates cellular differentiation in vivo. One model system clearly dependent on the integration of extra and intra cellular signals is the development of B-lymphocytes from hematopoietic stem cells in the bone marrow. This developmental pathway involves several defined differentiation stages associated with specific expression of genes including surface markers that can be used for the prospective isolation of the progenitor cells directly from the bone marrow to allow for ex vivo gene expression analysis. The developmental process can be simulated in vitro making it possible to dissect information about cell/cell communication as well as to address the relevance of communication pathways in a rather direct manner. Thus we believe that B-lymphocyte development represents a useful model system to take the first steps towards systems biology investigations in the bone marrow.

Results

In order to identify extra cellular signals that promote B lymphocyte development we created a database with approximately 400 receptor ligand pairs and software matching gene expression data from two cell populations to obtain information about possible communication pathways. Using this database and gene expression data from NIH3T3 cells (unable to support B cell development), OP-9 cells (strongly supportive of B cell development), pro-B and pre-B cells as well as mature peripheral B-lineage cells, we were able to identify a set of potential stage and stromal cell restricted communication pathways. Functional analysis of some of these potential ways of communication allowed us to identify BMP-4 as a potent stimulator of B-cell development in vitro. Further, the analysis suggested that there existed possibilities for progenitor B cells to send signals to the stroma. The functional consequences of this were investigated by co-culture experiments revealing that the co-incubation of stromal cells with B cell progenitors altered both the morphology and the gene expression pattern in the stromal cells.

Conclusions

We believe that this gene expression data analysis method allows for the identification of functionally relevant interactions and therefore could be applied to other data sets to unravel novel communication pathways.

Keywords


Article
Correction: High throughput approaches reveal splicing of primary microRNA transcripts and tissue specific expression of mature microRNAs in Vitis vinifera

Loading...
Loading...
Abstract

Abstract

The version of this article published in BMC Genomics 2009, 10:558, contains data in Table 1 which are now known to be unreliable, and an illustration, in Figure 1, of unusual miRNA processing events predicted by these unreliable data. In this full-length correction, new data replace those found to be unreliable, leading to a more straightforward interpretation without altering the principle conclusions of the study. Table 1 and associated methods have been corrected, Figure 1 deleted, supplementary file 1 added, and modifications made to the sections "Deep sequencing of small RNAs from grapevine leaf tissue" and "Microarray analysis of miRNA expression". The editors and authors regret the inconvenience caused to readers by premature publication of the original paper.

Background

MicroRNAs are short (~21 base) single stranded RNAs that, in plants, are generally coded by specific genes and cleaved specifically from hairpin precursors. MicroRNAs are critical for the regulation of multiple developmental, stress related and other physiological processes in plants. The recent annotation of the genome of the grapevine (Vitis vinifera L.) allowed the identification of many putative conserved microRNA precursors, grouped into multiple gene families.

Results

Here we use oligonucleotide arrays to provide the first indication that many of these microRNAs show differential expression patterns between tissues and during the maturation of fruit in the grapevine. Furthermore we demonstrate that whole transcriptome sequencing and deep-sequencing of small RNA fractions can be used both to identify which microRNA precursors are expressed in different tissues and to estimate genomic coordinates and patterns of splicing and alternative splicing for many primary miRNA transcripts.

Conclusions

Our results show that many microRNAs are differentially expressed in different tissues and during fruit maturation in the grapevine. Furthermore, the demonstration that whole transcriptome sequencing can be used to identify candidate splicing events and approximate primary microRNA transcript coordinates represents a significant step towards the large-scale elucidation of mechanisms regulating the expression of microRNAs at the transcriptional and post-transcriptional levels.

Keywords


Article
Analysis of a normalised expressed sequence tag (EST) library from a key pollinator, the bumblebee Bombus terrestris

Authors: Sadd Ben M --- Kube Michael --- Klages Sven --- Reinhardt Richard --- et al.
DOI: 10.1186/1471-2164-11-110
Pages: 110
Loading...
Loading...
Abstract

Abstract

Background

The bumblebee, Bombus terrestris (Order Hymenoptera), is of widespread importance. This species is extensively used for commercial pollination in Europe, and along with other Bombus spp. is a key member of natural pollinator assemblages. Furthermore, the species is studied in a wide variety of biological fields. The objective of this project was to create a B. terrestris EST resource that will prove to be valuable in obtaining a deeper understanding of this significant social insect.

Results

A normalised cDNA library was constructed from the thorax and abdomen of B. terrestris workers in order to enhance the discovery of rare genes. A total of 29'428 ESTs were sequenced. Subsequent clustering resulted in 13'333 unique sequences. Of these, 58.8 percent had significant similarities to known proteins, with 54.5 percent having a "best-hit" to existing Hymenoptera sequences. Comparisons with the honeybee and other insects allowed the identification of potential candidates for gene loss, pseudogene evolution, and possible incomplete annotation in the honeybee genome. Further, given the focus of much basic research and the perceived threat of disease to natural and commercial populations, the immune system of bumblebees is a particularly relevant component. Although the library is derived from unchallenged bees, we still uncover transcription of a number of immune genes spanning the principally described insect immune pathways. Additionally, the EST library provides a resource for the discovery of genetic markers that can be used in population level studies. Indeed, initial screens identified 589 simple sequence repeats and 854 potential single nucleotide polymorphisms.

Conclusion

The resource that these B. terrestris ESTs represent is valuable for ongoing work. The ESTs provide direct evidence of transcriptionally active regions, but they will also facilitate further functional genomics, gene discovery and future genome annotation. These are important aspects in obtaining a greater understanding of this key pollinator species.

Keywords


Article
Identification of candidates for cyclotide biosynthesis and cyclisation by expressed sequence tag analysis of Oldenlandia affinis

Authors: Qin Qiaoping --- McCallum Emily J --- Kaas Quentin --- Suda Jan --- et al.
DOI: 10.1186/1471-2164-11-111
Pages: 111
Loading...
Loading...
Abstract

Abstract

Background

Cyclotides are a family of circular peptides that exhibit a range of biological activities, including anti-bacterial, cytotoxic, anti-HIV activities, and are proposed to function in plant defence. Their high stability has motivated their development as scaffolds for the stabilisation of peptide drugs. Oldenlandia affinis is a member of the Rubiaceae (coffee) family from which 18 cyclotides have been sequenced to date, but the details of their processing from precursor proteins have only begun to be elucidated. To increase the speed at which genes involved in cyclotide biosynthesis and processing are being discovered, an expressed sequence tag (EST) project was initiated to survey the transcript profile of O. affinis and to propose some future directions of research on in vivo protein cyclisation.

Results

Using flow cytometry the holoploid genome size (1C-value) of O. affinis was estimated to be 4,210 - 4,284 Mbp, one of the largest genomes of the Rubiaceae family. High-quality ESTs were identified, 1,117 in total, from leaf cDNAs and assembled into 502 contigs, comprising 202 consensus sequences and 300 singletons. ESTs encoding the cyclotide precursors for kalata B1 (Oak1) and kalata B2 (Oak4) were among the 20 most abundant ESTs. In total, 31 ESTs encoded cyclotide precursors, representing a distinct commitment of 2.8% of the O. affinis transcriptome to cyclotide biosynthesis. The high expression levels of cyclotide precursor transcripts are consistent with the abundance of mature cyclic peptides in O. affinis. A new cyclotide precursor named Oak5 was isolated and represents the first cDNA for the bracelet class of cyclotides in O. affinis. Clones encoding enzymes potentially involved in processing cyclotides were also identified and include enzymes involved in oxidative folding and proteolytic processing.

Conclusion

The EST library generated in this study provides a valuable resource for the study of the cyclisation of plant peptides. Further analysis of the candidates for cyclotide processing discovered in this work will increase our understanding and aid in reconstructing cyclotide production using transgenic systems and will benefit their development in pharmaceutical applications and insect-resistant crop plants.

Keywords


Article
Intensity-based analysis of dual-color gene expression data as an alternative to ratio-based analysis to enhance reproducibility

Authors: Bossers Koen --- Ylstra Bauke --- Brakenhoff Ruud H --- Smeets Serge J --- et al.
DOI: 10.1186/1471-2164-11-112
Pages: 112
Loading...
Loading...
Abstract

Abstract

Background

Ratio-based analysis is the current standard for the analysis of dual-color microarray data. Indeed, this method provides a powerful means to account for potential technical variations such as differences in background signal, spot size and spot concentration. However, current high density dual-color array platforms are of very high quality, and inter-array variance has become much less pronounced. We therefore raised the question whether it is feasible to use an intensity-based analysis rather than ratio-based analysis of dual-color microarray datasets. Furthermore, we compared performance of both ratio- and intensity-based analyses in terms of reproducibility and sensitivity for differential gene expression.

Results

By analyzing three distinct and technically replicated datasets with either ratio- or intensity-based models, we determined that, when applied to the same dataset, intensity-based analysis of dual-color gene expression experiments yields 1) more reproducible results, and 2) is more sensitive in the detection of differentially expressed genes. These effects were most pronounced in experiments with large biological variation and complex hybridization designs. Furthermore, a power analysis revealed that for direct two-group comparisons above a certain sample size, ratio-based models have higher power, although the difference with intensity-based models is very small.

Conclusions

Intensity-based analysis of dual-color datasets results in more reproducible results and increased sensitivity in the detection of differential gene expression than the analysis of the same dataset with ratio-based analysis. Complex dual-color setups such as interwoven loop designs benefit most from ignoring the array factor. The applicability of our approach to array platforms other than dual-color needs to be further investigated.

Keywords


Article
SoyTEdb: a comprehensive database of transposable elements in the soybean genome

Authors: Du Jianchang --- Grant David --- Tian Zhixi --- Nelson Rex T --- et al.
DOI: 10.1186/1471-2164-11-113
Pages: 113
Loading...
Loading...
Abstract

Abstract

Background

Transposable elements are the most abundant components of all characterized genomes of higher eukaryotes. It has been documented that these elements not only contribute to the shaping and reshaping of their host genomes, but also play significant roles in regulating gene expression, altering gene function, and creating new genes. Thus, complete identification of transposable elements in sequenced genomes and construction of comprehensive transposable element databases are essential for accurate annotation of genes and other genomic components, for investigation of potential functional interaction between transposable elements and genes, and for study of genome evolution. The recent availability of the soybean genome sequence has provided an unprecedented opportunity for discovery, and structural and functional characterization of transposable elements in this economically important legume crop.

Description

Using a combination of structure-based and homology-based approaches, a total of 32,552 retrotransposons (Class I) and 6,029 DNA transposons (Class II) with clear boundaries and insertion sites were structurally annotated and clearly categorized, and a soybean transposable element database, SoyTEdb, was established. These transposable elements have been anchored in and integrated with the soybean physical map and genetic map, and are browsable and visualizable at any scale along the 20 soybean chromosomes, along with predicted genes and other sequence annotations. BLAST search and other infrastracture tools were implemented to facilitate annotation of transposable elements or fragments from soybean and other related legume species. The majority (> 95%) of these elements (particularly a few hundred low-copy-number families) are first described in this study.

Conclusion

SoyTEdb provides resources and information related to transposable elements in the soybean genome, representing the most comprehensive and the largest manually curated transposable element database for any individual plant genome completely sequenced to date. Transposable elements previously identified in legumes, the third largest family of flowering plants, are relatively scarce. Thus this database will facilitate structural, evolutionary, functional, and epigenetic analyses of transposable elements in soybean and other legume species.

Keywords


Article
Genome-wide analysis of alternative splicing in Chlamydomonas reinhardtii

Authors: Labadorf Adam --- Link Alicia --- Rogers Mark F --- Thomas Julie --- et al.
DOI: 10.1186/1471-2164-11-114
Pages: 114
Loading...
Loading...
Abstract

Abstract

Background

Genome-wide computational analysis of alternative splicing (AS) in several flowering plants has revealed that pre-mRNAs from about 30% of genes undergo AS. Chlamydomonas, a simple unicellular green alga, is part of the lineage that includes land plants. However, it diverged from land plants about one billion years ago. Hence, it serves as a good model system to study alternative splicing in early photosynthetic eukaryotes, to obtain insights into the evolution of this process in plants, and to compare splicing in simple unicellular photosynthetic and non-photosynthetic eukaryotes. We performed a global analysis of alternative splicing in Chlamydomonas reinhardtii using its recently completed genome sequence and all available ESTs and cDNAs.

Results

Our analysis of AS using BLAT and a modified version of the Sircah tool revealed AS of 498 transcriptional units with 611 events, representing about 3% of the total number of genes. As in land plants, intron retention is the most prevalent form of AS. Retained introns and skipped exons tend to be shorter than their counterparts in constitutively spliced genes. The splice site signals in all types of AS events are weaker than those in constitutively spliced genes. Furthermore, in alternatively spliced genes, the prevalent splice form has a stronger splice site signal than the non-prevalent form. Analysis of constitutively spliced introns revealed an over-abundance of motifs with simple repetitive elements in comparison to introns involved in intron retention. In almost all cases, AS results in a truncated ORF, leading to a coding sequence that is around 50% shorter than the prevalent splice form. Using RT-PCR we verified AS of two genes and show that they produce more isoforms than indicated by EST data. All cDNA/EST alignments and splice graphs are provided in a website at http://combi.cs.colostate.edu/as/chlamy.

Conclusions

The extent of AS in Chlamydomonas that we observed is much smaller than observed in land plants, but is much higher than in simple unicellular heterotrophic eukaryotes. The percentage of different alternative splicing events is similar to flowering plants. Prevalence of constitutive and alternative splicing in Chlamydomonas, together with its simplicity, many available public resources, and well developed genetic and molecular tools for this organism make it an excellent model system to elucidate the mechanisms involved in regulated splicing in photosynthetic eukaryotes.

Keywords


Article
Transcriptomic changes arising during light-induced sporulation in Physarum polycephalum

Authors: Barrantes Israel --- Glockner Gernot --- Meyer Sonja --- Marwan Wolfgang
DOI: 10.1186/1471-2164-11-115
Pages: 115
Loading...
Loading...
Abstract

Abstract

Background

Physarum polycephalum is a free-living amoebozoan protist displaying a complex life cycle, including alternation between single- and multinucleate stages through sporulation, a simple form of cell differentiation. Sporulation in Physarum can be experimentally induced by several external factors, and Physarum displays many biochemical features typical for metazoan cells, including metazoan-type signaling pathways, which makes this organism a model to study cell cycle, cell differentiation and cellular reprogramming.

Results

In order to identify the genes associated to the light-induced sporulation in Physarum, especially those related to signal transduction, we isolated RNA before and after photoinduction from sporulation- competent cells, and used these RNAs to synthesize cDNAs, which were then analyzed using the 454 sequencing technology. We obtained 16,669 cDNAs that were annotated at every computational level. 13,169 transcripts included hit count data, from which 2,772 displayed significant differential expression (upregulated: 1,623; downregulated: 1,149). Transcripts with valid annotations and significant differential expression were later integrated into putative networks using interaction information from orthologs.

Conclusions

Gene ontology analysis suggested that most significantly downregulated genes are linked to DNA repair, cell division, inhibition of cell migration, and calcium release, while highly upregulated genes were involved in cell death, cell polarization, maintenance of integrity, and differentiation. In addition, cell death- associated transcripts were overrepresented between the upregulated transcripts. These changes are associated to a network of actin-binding proteins encoded by genes that are differentially regulated before and after light induction.

Keywords


Article
Genomic sequencing and analyses of Lymantria xylina multiple nucleopolyhedrovirus

Authors: Nai Yu-Shin --- Wu Chih-Yu --- Wang Tai-Chuan --- Chen Yun-Ru --- et al.
DOI: 10.1186/1471-2164-11-116
Pages: 116
Loading...
Loading...
Abstract

Abstract

Background

Outbreaks of the casuarina moth, Lymantria xylina Swinehoe (Lepidoptera: Lymantriidae), which is a very important forest pest in Taiwan, have occurred every five to 10 years. This moth has expanded its range of host plants to include more than 65 species of broadleaf trees. LyxyMNPV (L. xylina multiple nucleopolyhedrovirus) is highly virulent to the casuarina moth and has been investigated as a possible biopesticide for controlling this moth. LdMNPV-like virus has also been isolated from Lymantria xylina larvae but LyxyMNPV was more virulent than LdMNPV-like virus both in NTU-LY and IPLB-LD-652Y cell lines. To better understand LyxyMNPV, the nucleotide sequence of the LyxyMNPV DNA genome was determined and analysed.

Results

The genome of LyxyMNPV consists of 156,344 bases, has a G+C content of 53.4% and contains 157 putative open reading frames (ORFs). The gene content and gene order of LyxyMNPV were similar to those of LdMNPV, with 151 ORFs identified as homologous to those reported in the LdMNPV genome. Two genes (Lyxy49 and Lyxy123) were homologous to other baculoviruses, and four unique LyxyMNPV ORFs (Lyxy11, Lyxy19, Lyxy130 and Lyxy131) were identified in the LyxyMNPV genome, including a gag-like gene that was not reported in baculoviruses. LdMNPV contains 23 ORFs that are absent in LyxyMNPV. Readily identifiable homologues of the gene host range factor-1 (hrf-1), which appears to be involved in the susceptibility of L. dispar to NPV infection, were not present in LyxyMNPV. Additionally, two putative odv-e27 homologues were identified in LyxyMNPV. The LyxyMNPV genome encoded 14 bro genes compared with 16 in LdMNPV, which occupied more than 8% of the LyxyMNPV genome. Thirteen homologous regions (hrs) were identified containing 48 repeated sequences composed of 30-bp imperfect palindromes. However, they differed in the relative positions, number of repeats and orientation in the genome compared to LdMNPV.

Conclusion

The gene parity plot analysis, percent identity of the gene homologues and a phylogenetic analysis suggested that LyxyMNPV is a Group II NPV that is most closely related to LdMNPV but with a highly distinct genomic organisation.

Keywords


Article
General and species-specific transcriptional responses to downy mildew infection in a susceptible (Vitis vinifera) and a resistant (V. riparia) grapevine species

Loading...
Loading...
Abstract

Abstract

Background

Downy mildew is a destructive grapevine disease caused by Plasmopara viticola (Berk. and Curt.) Berl. and de Toni, which can only be controlled by intensive fungicide treatments. Natural sources of resistance from wild grapevine (Vitis) species are used in conventional breeding approaches, but the signals and effectors involved in resistance in this important crop species are not well understood.

Results

Early transcriptional changes associated with P. viticola infection in susceptible V. vinifera and resistant V. riparia plants were analyzed using the Combimatrix microarray platform. Transcript levels were measured 12 and 24 h post-inoculation, reflecting the time points immediately preceding the onset of resistance in V. riparia, as determined by microscopic analysis. Our data indicate that resistance in V. riparia is induced after infection, and is not based on differences in basal gene expression between the two species. The strong and rapid transcriptional reprogramming involves the induction of pathogenesis-related proteins and enzymes required for the synthesis of phenylpropanoid-derived compounds, many of which are also induced, albeit to a lesser extent, in V. vinifera. More interestingly, resistance in V. riparia also involves the specific modulation of numerous transcripts encoding components of signal transduction cascades, hypersensitive reaction markers and genes involved in jasmonate biosynthesis. The limited transcriptional modulation in V. vinifera represents a weak attempted defense response rather than the activation of compatibility-specific pathways.

Conclusions

Several candidate resistance genes were identified that could be exploited in future biotechnological approaches to increase disease resistance in susceptible grapevine species. Measurements of jasmonic acid and methyl jasmonate in infected leaves suggest that this hormone may also be involved in V. riparia resistance to P. viticola.

Keywords


Article
Increased incidence of rare codon clusters at 5' and 3' gene termini:implications for function

Authors: Clarke Thomas F --- Clark Patricia L
DOI: 10.1186/1471-2164-11-118
Pages: 118
Loading...
Loading...
Abstract

Abstract

Background

The process of translation can be affected by the use of rare versus common codons within the mRNA transcript.

Results

Here, we show that rare codons are enriched at the 5' and 3' termini of genes from E. coli and other prokaryotes. Genes predicted to be secreted show significant enrichment in 5' rare codon clusters, but not 3' rare codon clusters. Surprisingly, no correlation between 5' mRNA structure and rare codon usage was observed.

Conclusions

Potential functional roles for the enrichment of rare codons at terminal positions are explored.

Keywords


Article
Identification of microRNAs expressed in two mosquito vectors, Aedes albopictus and Culex quinquefasciatus

Authors: Skalsky Rebecca L --- Vanlandingham Dana L --- Scholle Frank --- Higgs Stephen --- et al.
DOI: 10.1186/1471-2164-11-119
Pages: 119
Loading...
Loading...
Abstract

Abstract

Background

MicroRNAs (miRNAs) are small non-coding RNAs that post-transcriptionally regulate gene expression in a variety of organisms, including insects, vertebrates, and plants. miRNAs play important roles in cell development and differentiation as well as in the cellular response to stress and infection. To date, there are limited reports of miRNA identification in mosquitoes, insects that act as essential vectors for the transmission of many human pathogens, including flaviviruses. West Nile virus (WNV) and dengue virus, members of the Flaviviridae family, are primarily transmitted by Aedes and Culex mosquitoes. Using high-throughput deep sequencing, we examined the miRNA repertoire in Ae. albopictus cells and Cx. quinquefasciatus mosquitoes.

Results

We identified a total of 65 miRNAs in the Ae. albopictus C7/10 cell line and 77 miRNAs in Cx. quinquefasciatus mosquitoes, the majority of which are conserved in other insects such as Drosophila melanogaster and Anopheles gambiae. The most highly expressed miRNA in both mosquito species was miR-184, a miRNA conserved from insects to vertebrates. Several previously reported Anopheles miRNAs, including miR-1890 and miR-1891, were also found in Culex and Aedes, and appear to be restricted to mosquitoes. We identified seven novel miRNAs, arising from nine different precursors, in C7/10 cells and Cx. quinquefasciatus mosquitoes, two of which have predicted orthologs in An. gambiae. Several of these novel miRNAs reside within a ~350 nt long cluster present in both Aedes and Culex. miRNA expression was confirmed by primer extension analysis. To determine whether flavivirus infection affects miRNA expression, we infected female Culex mosquitoes with WNV. Two miRNAs, miR-92 and miR-989, showed significant changes in expression levels following WNV infection.

Conclusions

Aedes and Culex mosquitoes are important flavivirus vectors. Recent advances in both mosquito genomics and high-throughput sequencing technologies enabled us to interrogate the miRNA profile in these two species. Here, we provide evidence for over 60 conserved and seven novel mosquito miRNAs, expanding upon our current understanding of insect miRNAs. Undoubtedly, some of the miRNAs identified will have roles not only in mosquito development, but also in mediating viral infection in the mosquito host.

Keywords


Article
High-throughput genome sequencing of two Listeria monocytogenes clinical isolates during a large foodborne outbreak

Authors: Gilmour Matthew W --- Graham Morag --- Van Domselaar Gary --- Tyler Shaun --- et al.
DOI: 10.1186/1471-2164-11-120
Pages: 120
Loading...
Loading...
Abstract

Abstract

Background

A large, multi-province outbreak of listeriosis associated with ready-to-eat meat products contaminated with Listeria monocytogenes serotype 1/2a occurred in Canada in 2008. Subtyping of outbreak-associated isolates using pulsed-field gel electrophoresis (PFGE) revealed two similar but distinct AscI PFGE patterns. High-throughput pyrosequencing of two L. monocytogenes isolates was used to rapidly provide the genome sequence of the primary outbreak strain and to investigate the extent of genetic diversity associated with a change of a single restriction enzyme fragment during PFGE.

Results

The chromosomes were collinear, but differences included 28 single nucleotide polymorphisms (SNPs) and three indels, including a 33 kbp prophage that accounted for the observed difference in AscI PFGE patterns. The distribution of these traits was assessed within further clinical, environmental and food isolates associated with the outbreak, and this comparison indicated that three distinct, but highly related strains may have been involved in this nationwide outbreak. Notably, these two isolates were found to harbor a 50 kbp putative mobile genomic island encoding translocation and efflux functions that has not been observed in other Listeria genomes.

Conclusions

High-throughput genome sequencing provided a more detailed real-time assessment of genetic traits characteristic of the outbreak strains than could be achieved with routine subtyping methods. This study confirms that the latest generation of DNA sequencing technologies can be applied during high priority public health events, and laboratories need to prepare for this inevitability and assess how to properly analyze and interpret whole genome sequences in the context of molecular epidemiology.

Keywords


Article
Massive gene losses in Asian cultivated rice unveiled by comparative genome analysis

Authors: Sakai Hiroaki --- Itoh Takeshi
DOI: 10.1186/1471-2164-11-121
Pages: 121
Loading...
Loading...
Abstract

Abstract

Background

Rice is one of the most important food crops in the world. With increasing world demand for food crops, there is an urgent need to develop new cultivars that have enhanced performance with regard to yield, disease resistance, and so on. Wild rice is expected to provide useful genetic resources that could improve the present cultivated species. However, the quantity and quality of these unexplored resources remain unclear. Recent accumulation of the genomic information of both cultivated and wild rice species allows for their comparison at the molecular level. Here, we compared the genome sequence of Oryza sativa ssp. japonica with sets of bacterial artificial chromosome end sequences (BESs) from two wild rice species, O. rufipogon and O. nivara, and an African rice species, O. glaberrima.

Results

We found that about four to five percent of the BESs of the two wild rice species and about seven percent of the African rice could not be mapped to the japonica genome, suggesting that a substantial number of genes have been lost in the japonica rice lineage; however, their close relatives still possess their counterpart genes. We estimated that during evolution, O. sativa has lost at least one thousand genes that are still preserved in the genomes of the other species. In addition, our BLASTX searches against the non-redundant protein sequence database showed that disease resistance-related proteins were significantly overrepresented in the close relative-specific genomic portions. In total, 235 unmapped BESs of the three relatives matched 83 non-redundant proteins that contained a disease resistance protein domain, most of which corresponded to an NBS-LRR domain.

Conclusion

We found that the O. sativa lineage appears to have recently experienced massive gene losses following divergence from its wild ancestor. Our results imply that the domestication process accelerated large-scale genomic deletions in the lineage of Asian cultivated rice and that the close relatives of cultivated rice have the potential to restore the lost traits.

Keywords


Article
Feasibility of physical map construction from fingerprinted bacterial artificial chromosome libraries of polyploid plant species

Authors: Luo Ming-Cheng --- Ma Yaqin --- You Frank M --- Anderson Olin D --- et al.
DOI: 10.1186/1471-2164-11-122
Pages: 122
Loading...
Loading...
Abstract

Abstract

Background

The presence of closely related genomes in polyploid species makes the assembly of total genomic sequence from shotgun sequence reads produced by the current sequencing platforms exceedingly difficult, if not impossible. Genomes of polyploid species could be sequenced following the ordered-clone sequencing approach employing contigs of bacterial artificial chromosome (BAC) clones and BAC-based physical maps. Although BAC contigs can currently be constructed for virtually any diploid organism with the SNaPshot high-information-content-fingerprinting (HICF) technology, it is currently unknown if this is also true for polyploid species. It is possible that BAC clones from orthologous regions of homoeologous chromosomes would share numerous restriction fragments and be therefore included into common contigs. Because of this and other concerns, physical mapping utilizing the SNaPshot HICF of BAC libraries of polyploid species has not been pursued and the possibility of doing so has not been assessed. The sole exception has been in common wheat, an allohexaploid in which it is possible to construct single-chromosome or single-chromosome-arm BAC libraries from DNA of flow-sorted chromosomes and bypass the obstacles created by polyploidy.

Results

The potential of the SNaPshot HICF technology for physical mapping of polyploid plants utilizing global BAC libraries was evaluated by assembling contigs of fingerprinted clones in an in silico merged BAC library composed of single-chromosome libraries of two wheat homoeologous chromosome arms, 3AS and 3DS, and complete chromosome 3B. Because the chromosome arm origin of each clone was known, it was possible to estimate the fidelity of contig assembly. On average 97.78% or more clones, depending on the library, were from a single chromosome arm. A large portion of the remaining clones was shown to be library contamination from other chromosomes, a feature that is unavoidable during the construction of single-chromosome BAC libraries.

Conclusions

The negligibly low level of incorporation of clones from homoeologous chromosome arms into a contig during contig assembly suggested that it is feasible to construct contigs and physical maps using global BAC libraries of wheat and almost certainly also of other plant polyploid species with genome sizes comparable to that of wheat. Because of the high purity of the resulting assembled contigs, they can be directly used for genome sequencing. It is currently unknown but possible that equally good BAC contigs can be also constructed for polyploid species containing smaller, more gene-rich genomes.

Keywords


Article
Structural classification by the Lipase Engineering Database: a case study of Candida antarctica lipase A

Authors: Widmann Michael --- Juhl P Benjamin --- Pleiss Jürgen
DOI: 10.1186/1471-2164-11-123
Pages: 123
Loading...
Loading...
Abstract

Abstract

Background

The Lipase Engineering Database (LED) integrates information on sequence, structure and function of lipases, esterases and related proteins with the α/β hydrolase fold. A new superfamily for Candida antarctica lipase A (CALA) was introduced including the recently published crystal structure of CALA. Since CALA has a highly divergent sequence in comparison to other α/β hydrolases, the Lipase Engineering Database was used to classify CALA in the frame of the already established classification system. This involved the comparison of CALA to similar structures as well as sequence-based comparisons against the content of the LED.

Results

The new release 3.0 (December 2009) of the Lipase Engineering Database contains 24783 sequence entries for 18585 proteins as well as 656 experimentally determined protein structures, including the structure of CALA. In comparison to the previous release 1 with 4322 protein and 167 structure entries this update represents a significant increase in data volume. By comparing CALA to representative structures from all superfamilies, a structure from the deacetylase superfamily was found to be most similar to the structure of CALA. While the α/β hydrolase fold is conserved in both proteins, the major difference is found in the cap region. Sequence alignments between both proteins show a sequence similarity of only 15%. A multisequence alignment of both protein families was used to create hidden Markov models for the cap region of CALA and showed that the cap region of CALA is unique among all other proteins of the α/β hydrolase fold. By specifically comparing the substrate binding pocket of CALA to other binding pockets of α/β hydrolases, the binding pocket of Candida rugosa lipase was identified as being highly similar. This similarity also applied to the lid of Candida rugosa lipase in comparison to the potential lid of CALA.

Conclusion

The LED serves as a valuable tool for the systematic analysis of single proteins or protein families. The updated release 3.0 was used for the evaluation of α/β hydrolases. The HTML version of the database with new features is available at http://www.led.uni-stuttgart.de and provides sequences, structures and a set of analysis tools including phylogenetic trees and HMM profiles

Keywords


Article
Digital gene expression analysis of two life cycle stages of the human-infective parasite, Trypanosoma brucei gambiense reveals differentially expressed clusters of co-regulated genes

Authors: Veitch Nicola J --- Johnson Paul CD --- Trivedi Urmi --- Terry Sandra --- et al.
DOI: 10.1186/1471-2164-11-124
Pages: 124
Loading...
Loading...
Abstract

Abstract

Background

The evolutionarily ancient parasite, Trypanosoma brucei, is unusual in that the majority of its genes are regulated post-transcriptionally, leading to the suggestion that transcript abundance of most genes does not vary significantly between different life cycle stages despite the fact that the parasite undergoes substantial cellular remodelling and metabolic changes throughout its complex life cycle. To investigate this in the clinically relevant sub-species, Trypanosoma brucei gambiense, which is the causative agent of the fatal human disease African sleeping sickness, we have compared the transcriptome of two different life cycle stages, the potentially human-infective bloodstream forms with the non-human-infective procyclic stage using digital gene expression (DGE) analysis.

Results

Over eleven million unique tags were generated, producing expression data for 7360 genes, covering 81% of the genes in the genome. Compared to microarray analysis of the related T. b. brucei parasite, approximately 10 times more genes with a 2.5-fold change in expression levels were detected. The transcriptome analysis revealed the existence of several differentially expressed gene clusters within the genome, indicating that contiguous genes, presumably from the same polycistronic unit, are co-regulated either at the level of transcription or transcript stability.

Conclusions

DGE analysis is extremely sensitive for detecting gene expression differences, revealing firstly that a far greater number of genes are stage-regulated than had previously been identified and secondly and more importantly, this analysis has revealed the existence of several differentially expressed clusters of genes present on what appears to be the same polycistronic units, a phenomenon which had not previously been observed in microarray studies. These differentially regulated clusters of genes are in addition to the previously identified RNA polymerase I polycistronic units of variant surface glycoproteins and procyclin expression sites, which encode the major surface proteins of the parasite. This raises a number of questions regarding the function and regulation of the gene clusters that clearly warrant further study.

Keywords


Article
Comparative gene expression profiling between human cultured myotubes and skeletal muscle tissue

Loading...
Loading...
Abstract

Abstract

Background

A high-sensitivity DNA microarray platform requiring nanograms of RNA input facilitates the application of transcriptome analysis to individual skeletal muscle (SM) tissue samples. Culturing myotubes from SM-biopsies enables investigating transcriptional defects and assaying therapeutic strategies. This study compares the transcriptome of aneurally cultured human SM cells versus that of tissue biopsies.

Results

We used the Illumina expression BeadChips to determine the transcriptomic differences between tissue and cultured SM samples from five individuals. Changes in the expression of several genes were confirmed by QuantiGene Plex assay or reverse transcription real-time PCR. In cultured myotubes compared to the tissue, 1216 genes were regulated: 583 down and 633 up. Gene ontology analysis showed that downregulated genes were mainly associated with cytoplasm, particularly mitochondria, and involved in metabolism and the muscle-system/contraction process. Upregulated genes were predominantly related to cytoplasm, endoplasmic reticulum, and extracellular matrix. The most significantly regulated pathway was mitochondrial dysfunction. Apoptosis genes were also modulated. Among the most downregulated genes detected in this study were genes encoding metabolic proteins AMPD1, PYGM, CPT1B and UCP3, muscle-system proteins TMOD4, MYBPC1, MYOZ1 and XIRP2, the proteolytic CAPN3 and the myogenic regulator MYF6. Coordinated reduced expression of five members of the GIMAP gene family, which form a cluster on chromosome 7, was shown, and the GIMAP4-reduction was validated. Within the most upregulated group were genes encoding senescence/apoptosis-related proteins CDKN1A and KIAA1199 and potential regulatory factors HIF1A, TOP2A and CCDC80.

Conclusions

Cultured muscle cells display reductive metabolic and muscle-system transcriptome adaptations as observed in muscle atrophy and they activate tissue-remodeling and senescence/apoptosis processes.

Keywords


Article
Transcriptome screen for fast evolving genes by Inter-Specific Selective Hybridization (ISSH)

Authors: Montoya-Burgos Juan I --- Foulon Aurélia --- Bahechar Ilham
DOI: 10.1186/1471-2164-11-126
Pages: 126
Loading...
Loading...
Abstract

Abstract

Background

Fast evolving genes are targets of an increasing panel of biological studies, from cancer research to population genetics and species specific adaptations. Yet, their identification and isolation are still laborious, particularly for non-model organisms. We developed a method, named the Inter-Specific Selective Hybridization (ISSH) method, for generating cDNA libraries enriched in fast evolving genes. It utilizes transcripts of homologous tissues of distinct yet related species. Experimental hybridization conditions are monitored in order to discard transcripts that do not find their homologous counterparts in the two species sets as well as transcripts that display a strong complementarity between the two species. Only heteroduplexes that disanneal at low stringency are used for constructing the resulting cDNA library.

Results

We demonstrate the efficiency of the ISSH method by generating a brain cDNA library enriched in fast evolving transcripts of a non-model catfish species as well as a control, non-enriched library. Our results indicate that the enriched library contains effectively more fast evolving sequences than the control library. Gene annotation analyses also indicate enrichment in genes with low expression levels and non-ubiquitously expressed genes, both categories encompassing the majority of fast evolving genes. Furthermore, most of the identified transcripts show higher sequence divergence between two closely related catfish species as compared to recognized fast evolving DNA markers.

Conclusions

The ISSH method offers a simple, inexpensive and efficient way to screen the transcriptome for isolating fast evolving genes. This method opens new opportunities in the investigation of biological mechanisms that include fast evolving genes, such as the evolution of lineage specific processes and traits responsible for species adaptation to their environment.

Keywords


Article
Expression profiling of mouse embryonic fibroblasts with a deletion in the helicase domain of the Werner Syndrome gene homologue treated with hydrogen peroxide

Authors: Labbé Adam --- Turaga Ramachander VN --- Paquet Éric R --- Garand Chantal --- et al.
DOI: 10.1186/1471-2164-11-127
Pages: 127
Loading...
Loading...
Abstract

Abstract

Background

Werner Syndrome (WS) is a rare disorder characterized by the premature onset of a number of age-related diseases. The gene responsible for WS encodes a DNA helicase/exonuclease protein believed to affect different aspects of transcription, replication, and/or DNA repair. In addition to genomic instability, human WS cells exhibit oxidative stress. In this report, we have examined the impact of exogenous hydrogen peroxide on the expression profile of mouse embryonic fibroblasts lacking part of the helicase domain of the WRN homologue (here referred to as WrnΔhelhel).

Results

WrnΔhelhel mutant mouse embryonic fibroblasts exhibit increased oxidative stress. This was reflected by increased intracellular reactive oxygen species (ROS), increased oxidative damage in genomic DNA, changes in ATP/ADP ratios, and a disruption of the inner mitochondrial transmembrane potential when compared to wild type mouse embryonic fibroblasts. Expression profile analyses of hydrogen peroxide-treated wild type cells have indicated significant decreases in the expression of genes involved in mitosis, glycolysis, fatty acid metabolism, nucleic acid metabolism, and cell cycle control, as well as protein modification and stability. Such decreases in these biological processes were not observed in hydrogen peroxide-treated WrnΔhelhel cells. Importantly, untreated WrnΔhelhel cells already exhibited down regulation of several biological processes decreased in wild type cells that had been treated with hydrogen peroxide.

Conclusion

Expression profiling of WrnΔhelhel mutant cells revealed a very different response to exogenous addition of hydrogen peroxide in culture compared to wild type cells. This is due in part to the fact that WrnΔhelhel mutant cells already exhibited a modest chronic intracellular oxidative stress.

Keywords


Article
Defining species specific genome differences in malaria parasites

Authors: Liew Kingsley JL --- Hu Guangan --- Bozdech Zbynek --- Peter Preiser R
DOI: 10.1186/1471-2164-11-128
Pages: 128
Loading...
Loading...
Abstract

Abstract

Background

In recent years a number of genome sequences for different plasmodium species have become available. This has allowed the identification of numerous conserved genes across the different species and has significantly enhanced our understanding of parasite biology. In contrast little is known about species specific differences between the different genomes partly due to the lower sequence coverage and therefore relatively poor annotation of some of the draft genomes particularly the rodent malarias parasite species.

Results

To improve the current annotation and gene identification status of the draft genomes of P. berghei, P. chabaudi and P. yoelii, we performed genome-wide comparisons between these three species. Through analyses via comparative genome hybridizations using a newly designed pan-rodent array as well as in depth bioinformatics analysis, we were able to improve on the coverage of the draft rodent parasite genomes by detecting orthologous genes between these related rodent parasite species. More than 1,000 orthologs for P. yoelii were now newly associated with a P. falciparum gene. In addition to extending the current core gene set for all plasmodium species this analysis also for the first time identifies a relatively small number of genes that are unique to the primate malaria parasites while a larger gene set is uniquely conserved amongst the rodent malaria parasites.

Conclusions

These findings allow a more thorough investigation of the genes that are important for host specificity in malaria.

Keywords


Article
High-density linkage mapping and evolution of paralogs and orthologs in Salix and Populus

Authors: Berlin Sofia --- Lagercrantz Ulf --- von Arnold Sara --- Öst Torbjörn --- et al.
DOI: 10.1186/1471-2164-11-129
Pages: 129
Loading...
Loading...
Abstract

Abstract

Background

Salix (willow) and Populus (poplar) are members of the Salicaceae family and they share many ecological as well as genetic and genomic characteristics. The interest of using willow for biomass production is growing, which has resulted in increased pressure on breeding of high yielding and resistant clones adapted to different environments. The main purpose of this work was to develop dense genetic linkage maps for mapping of traits related to yield and resistance in willow. We used the Populus trichocarpa genome to extract evenly spaced markers and mapped the orthologous loci in the willow genome. The marker positions in the two genomes were used to study genome evolution since the divergence of the two lineages some 45 mya.

Results

We constructed two linkage maps covering the 19 linkage groups in willow. The most detailed consensus map, S1, contains 495 markers with a total genetic distance of 2477 cM and an average distance of 5.0 cM between the markers. The S3 consensus map contains 221 markers and has a total genetic distance of 1793 cM and an average distance of 8.1 cM between the markers. We found high degree of synteny and gene order conservation between willow and poplar. There is however evidence for two major interchromosomal rearrangements involving poplar LG I and XVI and willow LG Ib, suggesting a fission or a fusion in one of the lineages, as well as five intrachromosomal inversions. The number of silent substitutions were three times lower (median: 0.12) between orthologs than between paralogs (median: 0.37 - 0.41).

Conclusions

The relatively slow rates of genomic change between willow and poplar mean that the genomic resources in poplar will be most useful in genomic research in willow, such as identifying genes underlying QTLs of important traits. Our data suggest that the whole-genome duplication occurred long before the divergence of the two genera, events which have until now been regarded as contemporary. Estimated silent substitution rates were 1.28 × 10-9 and 1.68 × 10-9 per site and year, which are close to rates found in other perennials but much lower than rates in annuals.

Keywords


Article
Identification and characterization of Tc1/mariner-like DNA transposons in genomes of the pathogenic fungi of the Paracoccidioides species complex

Loading...
Loading...
Abstract

Abstract

Background

Paracoccidioides brasiliensis (Eukaryota, Fungi, Ascomycota) is a thermodimorphic fungus, the etiological agent of paracoccidioidomycosis, the most important systemic mycoses in Latin America. Three isolates corresponding to distinct phylogenetic lineages of the Paracoccidioides species complex had their genomes sequenced. In this study the identification and characterization of class II transposable elements in the genomes of these fungi was carried out.

Results

A genomic survey for DNA transposons in the sequence assemblies of Paracoccidioides, a genus recently proposed to encompass species P. brasiliensis (harboring phylogenetic lineages S1, PS2, PS3) and P. lutzii (Pb01-like isolates), has been completed. Eight new Tc1/mariner families, referred to as Trem (Transposable element mariner), labeled A through H were identified. Elements from each family have 65-80% sequence similarity with other Tc1/mariner elements. They are flanked by 2-bp TA target site duplications and different termini. Encoded DDD-transposases, some of which have complete ORFs, indicated that they could be functionally active. The distribution of Trem elements varied between the genomic sequences characterized as belonging to P. brasiliensis (S1 and PS2) and P. lutzii. TremC and H elements would have been present in a hypothetical ancestor common to P. brasiliensis and P. lutzii, while TremA, B and F elements were either acquired by P. brasiliensis or lost by P. lutzii after speciation. Although TremD and TremE share about 70% similarity, they are specific to P. brasiliensis and P. lutzii, respectively. This suggests that these elements could either have been present in a hypothetical common ancestor and have evolved divergently after the split between P. brasiliensis and P. Lutzii, or have been independently acquired by horizontal transfer.

Conclusions

New families of Tc1/mariner DNA transposons in the genomic assemblies of the Paracoccidioides species complex are described. Families were distinguished based on significant BLAST identities between transposases and/or TIRs. The expansion of Trem in a putative ancestor common to the species P. brasiliensis and P. lutzii would have given origin to TremC and TremH, while other elements could have been acquired or lost after speciation had occurred. The results may contribute to our understanding of the organization and architecture of genomes in the genus Paracoccidioides.

Keywords


Article
Genomic sequence of a mutant strain of Caenorhabditis elegans with an altered recombination pattern

Loading...
Loading...
Abstract

Abstract

Background

The original sequencing and annotation of the Caenorhabditis elegans genome along with recent advances in sequencing technology provide an exceptional opportunity for the genomic analysis of wild-type and mutant strains. Using the Illumina Genome Analyzer, we sequenced the entire genome of Rec-1, a strain that alters the distribution of meiotic crossovers without changing the overall frequency. Rec-1 was derived from ethylmethane sulfonate (EMS)-treated strains, one of which had a high level of transposable element mobility. Sequencing of this strain provides an opportunity to examine the consequences on the genome of altering the distribution of meiotic recombination events.

Results

Using Illumina sequencing and MAQ software, 83% of the base pair sequence reads were aligned to the reference genome available at Wormbase, providing a 21-fold coverage of the genome. Using the software programs MAQ and Slider, we observed 1124 base pair differences between Rec-1 and the reference genome in Wormbase (WS190), and 441 between the mutagenized Rec-1 (BC313) and the wild-type N2 strain (VC2010). The most frequent base-substitution was G:C to A:T, 141 for the entire genome most of which were on chromosomes I or X, 55 and 31 respectively. With this data removed, no obvious pattern in the distribution of the base differences along the chromosomes was apparent. No major chromosomal rearrangements were observed, but additional insertions of transposable elements were detected. There are 11 extra copies of Tc1, and 8 of Tc2 in the Rec-1 genome, most likely the remains of past high-hopper activity in a progenitor strain.

Conclusion

Our analysis of high-throughput sequencing was able to detect regions of direct repeat sequences, deletions, insertions of transposable elements, and base pair differences. A subset of sequence alterations affecting coding regions were confirmed by an independent approach using oligo array comparative genome hybridization. The major phenotype of the Rec-1 strain is an alteration in the preferred position of the meiotic recombination event with no other significant phenotypic consequences. In this study, we observed no evidence of a mutator effect at the nucleotide level attributable to the Rec-1 mutation.

Keywords


Article
Pediatric primary central nervous system germ cell tumors of different prognosis groups show characteristic miRNome traits and chromosome copy number variations

Authors: Wang Hsei-Wei --- Wu Yu-Hsuan --- Hsieh Jui-Yu --- Liang Muh-Lii --- et al.
DOI: 10.1186/1471-2164-11-132
Pages: 132
Loading...
Loading...
Abstract

Abstract

Background

Intracranial pediatric germ cell tumors (GCTs) are rare and heterogeneous neoplasms and vary in histological differentiation, prognosis and clinical behavior. Germinoma and mature teratoma are GCTs that have a good prognosis, while other types of GCTs, termed nongerminomatous malignant germ cell tumors (NGMGCTs), are tumors with an intermediate or poor prognosis. The second group of tumors requires more extensive drug and irradiation treatment regimens. The mechanisms underlying the differences in incidence and prognosis of the various GCT subgroups are unclear.

Results

We identified a distinct mRNA profile correlating with GCT histological differentiation and prognosis, and also present in this study the first miRNA profile of pediatric primary intracranial GCTs. Most of the differentially expressed miRNAs were downregulated in germinomas, but miR-142-5p and miR-146a were upregulated. Genes responsible for self-renewal (such as POU5F1 (OCT4), NANOG and KLF4) and the immune response were abundant in germinomas, while genes associated with neuron differentiation, Wnt/β-catenin pathway, invasiveness and epithelial-mesenchymal transition (including SNAI2 (SLUG) and TWIST2) were abundant in NGMGCTs. Clear transcriptome segregation based on patient survival was observed, with malignant NGMGCTs being closest to embryonic stem cells. Chromosome copy number variations (CNVs) at cytobands 4q13.3-4q28.3 and 9p11.2-9q13 correlated with GCT malignancy and clinical risk. Six genes (BANK1, CXCL9, CXCL11, DDIT4L, ELOVL6 and HERC5) within 4q13.3-4q28.3 were more abundant in germinomas.

Conclusions

Our results integrate molecular profiles with clinical observations and provide insights into the underlying mechanisms causing GCT malignancy. The genes, pathways and microRNAs identified have the potential to be novel therapeutic targets.

Keywords


Article
Comparative analysis of fungal protein kinases and associated domains

Authors: Kosti Idit --- Mandel-Gutfreund Yael --- Glaser Fabian --- Horwitz Benjamin A
DOI: 10.1186/1471-2164-11-133
Pages: 133
Loading...
Loading...
Abstract

Abstract

Background

Protein phosphorylation is responsible for a large portion of the regulatory functions of eukaryotic cells. Although the list of sequenced genomes of filamentous fungi has grown rapidly, the kinomes of recently sequenced species have not yet been studied in detail. The objective of this study is to apply a comparative analysis of the kinase distribution in different fungal phyla, and to explore its relevance to understanding the evolution of fungi and their taxonomic classification. We have analyzed in detail 12 subgroups of kinases and their distribution over 30 species, as well as their potential use as a classifier for members of the fungal kingdom.

Results

Our findings show that despite the similarity of the kinase distribution in all fungi, their domain distributions and kinome density can potentially be used to classify them and give insight into their evolutionary origin. In general, we found that the overall representation of kinase groups is similar across fungal genomes, the only exception being a large number of tyrosine kinase-like (TKL) kinases predicted in Laccaria bicolor. This unexpected finding underscores the need to continue to sequence fungal genomes, since many species or lineage-specific properties may remain to be discovered. Furthermore, we found that the domain organization significantly varies between the fungal species. Our results suggest that protein kinases and their functional domains strongly reflect fungal taxonomy.

Conclusions

Comparison of the predicted kinomes of sequenced fungi suggests essential signaling functions common to all species, but also specific adaptations of the signal transduction networks to particular species.

Keywords


Article
Correcting for intra-experiment variation in Illumina BeadChip data is necessary to generate robust gene-expression profiles

Loading...
Loading...
Abstract

Abstract

Background

Microarray technology is a popular means of producing whole genome transcriptional profiles, however high cost and scarcity of mRNA has led many studies to be conducted based on the analysis of single samples. We exploit the design of the Illumina platform, specifically multiple arrays on each chip, to evaluate intra-experiment technical variation using repeated hybridisations of universal human reference RNA (UHRR) and duplicate hybridisations of primary breast tumour samples from a clinical study.

Results

A clear batch-specific bias was detected in the measured expressions of both the UHRR and clinical samples. This bias was found to persist following standard microarray normalisation techniques. However, when mean-centering or empirical Bayes batch-correction methods (ComBat) were applied to the data, inter-batch variation in the UHRR and clinical samples were greatly reduced. Correlation between replicate UHRR samples improved by two orders of magnitude following batch-correction using ComBat (ranging from 0.9833-0.9991 to 0.9997-0.9999) and increased the consistency of the gene-lists from the duplicate clinical samples, from 11.6% in quantile normalised data to 66.4% in batch-corrected data. The use of UHRR as an inter-batch calibrator provided a small additional benefit when used in conjunction with ComBat, further increasing the agreement between the two gene-lists, up to 74.1%.

Conclusion

In the interests of practicalities and cost, these results suggest that single samples can generate reliable data, but only after careful compensation for technical bias in the experiment. We recommend that investigators appreciate the propensity for such variation in the design stages of a microarray experiment and that the use of suitable correction methods become routine during the statistical analysis of the data.

Keywords


Article
A consensus linkage map of the grass carp (Ctenopharyngodon idella) based on microsatellites and SNPs

Authors: Xia Jun --- Liu Feng --- Zhu Ze --- Fu Jianjun --- et al.
DOI: 10.1186/1471-2164-11-135
Pages: 135
Loading...
Loading...
Abstract

Abstract

Background

Grass carp (Ctenopharyngodon idella) belongs to the family Cyprinidae which includes more than 2000 fish species. It is one of the most important freshwater food fish species in world aquaculture. A linkage map is an essential framework for mapping traits of interest and is often the first step towards understanding genome evolution. The aim of this study is to construct a first generation genetic map of grass carp using microsatellites and SNPs to generate a new resource for mapping QTL for economically important traits and to conduct a comparative mapping analysis to shed new insights into the evolution of fish genomes.

Results

We constructed a first generation linkage map of grass carp with a mapping panel containing two F1 families including 192 progenies. Sixteen SNPs in genes and 263 microsatellite markers were mapped to twenty-four linkage groups (LGs). The number of LGs was corresponding to the haploid chromosome number of grass carp. The sex-specific map was 1149.4 and 888.8 cM long in females and males respectively whereas the sex-averaged map spanned 1176.1 cM. The average resolution of the map was 4.2 cM/locus. BLAST searches of sequences of mapped markers of grass carp against the whole genome sequence of zebrafish revealed substantial macrosynteny relationship and extensive colinearity of markers between grass carp and zebrafish.

Conclusions

The linkage map of grass carp presented here is the first linkage map of a food fish species based on co-dominant markers in the family Cyprinidae. This map provides a valuable resource for mapping phenotypic variations and serves as a reference to approach comparative genomics and understand the evolution of fish genomes and could be complementary to grass carp genome sequencing project.

Keywords


Article
Flux of transcript patterns during soybean seed development

Authors: Jones Sarah I --- Gonzalez Delkin O --- Vodkin Lila O
DOI: 10.1186/1471-2164-11-136
Pages: 136
Loading...
Loading...
Abstract

Abstract

Background

To understand gene expression networks leading to functional properties of the soybean seed, we have undertaken a detailed examination of soybean seed development during the stages of major accumulation of oils, proteins, and starches, as well as the desiccating and mature stages, using microarrays consisting of up to 27,000 soybean cDNAs. A subset of these genes on a highly-repetitive 70-mer oligonucleotide microarray was also used to support the results.

Results

It was discovered that genes related to cell growth and maintenance processes, as well as energy processes like photosynthesis, decreased in expression levels as the cotyledons approached the mature, dry stage. Genes involved with some storage proteins had their highest expression levels at the stage of highest fresh weight. However, genes encoding many transcription factors and DNA binding proteins showed higher expression levels in the desiccating and dry seeds than in most of the green stages.

Conclusions

Data on 27,000 cDNAs have been obtained over five stages of soybean development, including the stages of major accumulation of agronomically-important products, using two different types of microarrays. Of particular interest are the genes found to peak in expression at the desiccating and dry seed stages, such as those annotated as transcription factors, which may indicate the preparation of pathways that will be needed later in the early stages of imbibition and germination.

Keywords


Article
Genome-wide analysis of aberrant methylation in human breast cancer cells using methyl-DNA immunoprecipitation combined with high-throughput sequencing

Authors: Ruike Yoshinao --- Imanaka Yukako --- Sato Fumiaki --- Shimizu Kazuharu --- et al.
DOI: 10.1186/1471-2164-11-137
Pages: 137
Loading...
Loading...
Abstract

Abstract

Background

Cancer cells undergo massive alterations to their DNA methylation patterns that result in aberrant gene expression and malignant phenotypes. However, the mechanisms that underlie methylome changes are not well understood nor is the genomic distribution of DNA methylation changes well characterized.

Results

Here, we performed methylated DNA immunoprecipitation combined with high-throughput sequencing (MeDIP-seq) to obtain whole-genome DNA methylation profiles for eight human breast cancer cell (BCC) lines and for normal human mammary epithelial cells (HMEC). The MeDIP-seq analysis generated non-biased DNA methylation maps by covering almost the entire genome with sufficient depth and resolution. The most prominent feature of the BCC lines compared to HMEC was a massively reduced methylation level particularly in CpG-poor regions. While hypomethylation did not appear to be associated with particular genomic features, hypermethylation preferentially occurred at CpG-rich gene-related regions independently of the distance from transcription start sites. We also investigated methylome alterations during epithelial-to-mesenchymal transition (EMT) in MCF7 cells. EMT induction was associated with specific alterations to the methylation patterns of gene-related CpG-rich regions, although overall methylation levels were not significantly altered. Moreover, approximately 40% of the epithelial cell-specific methylation patterns in gene-related regions were altered to those typical of mesenchymal cells, suggesting a cell-type specific regulation of DNA methylation.

Conclusions

This study provides the most comprehensive analysis to date of the methylome of human mammary cell lines and has produced novel insights into the mechanisms of methylome alteration during tumorigenesis and the interdependence between DNA methylome alterations and morphological changes.

Keywords


Article
Escherichia coli infection induces distinct local and systemic transcriptome responses in the mammary gland

Authors: Mitterhuemer Simone --- Petzl Wolfram --- Krebs Stefan --- Mehne Daniel --- et al.
DOI: 10.1186/1471-2164-11-138
Pages: 138
Loading...
Loading...
Abstract

Abstract

Background

Coliform bacteria are the most common etiologic agents in severe mastitis of cows. Escherichia coli infections are mostly restricted to a single udder quarter whereas neighboring quarters stay clinically inapparent, implicating the presence of a systemic defense reaction. To address its underlying mechanism, we performed a transcriptome study of mammary tissue from udder quarters inoculated with E. coli (6 h and 24 h post infection), from neighboring quarters of the same animals, and from untreated control animals.

Results

After 6 h 13 probe sets of differentially expressed genes (DEG) were detected in infected quarters versus control animals. Eighteen hours later 2154 and 476 DEG were found in infected and in neighboring quarters vs. control animals. Cluster analysis revealed DEG found only in infected quarters (local response) and DEG detected in both infected and neighboring quarters (systemic response). The first group includes genes mainly involved in immune response and inflammation, while the systemic reaction comprises antigen processing and presentation, cytokines, protein degradation and apoptosis. Enhanced expression of antimicrobial genes (S100A8, S100A9, S100A12, CXCL2, GNLY), acute phase genes (LBP, SAA3, CP, BF, C6, C4BPA, IF), and indicators of oxidative stress (GPX3, MT1A, MT2A, SOD2) point to an active defense reaction in infected and neighboring healthy quarters. Its early onset is indicated by increased transcription of NFIL3 at 6 h. NFIL3 is a predicted regulator of many genes of the systemic response at 24 h. The significance of our transcriptome study was evidenced by some recent findings with candidate gene based approaches.

Conclusions

The discovery and holistic analysis of an extensive systemic reaction in the mammary gland significantly expands the knowledge of host-pathogen interactions in mastitis which may be relevant for the development of novel therapies and for genetic selection towards mastitis resistance.

Keywords


Article
Comparative assessment of methods for estimating individual genome-wide homozygosity-by-descent from human genomic data

Loading...
Loading...
Abstract

Abstract

Background

Genome-wide homozygosity estimation from genomic data is becoming an increasingly interesting research topic. The aim of this study was to compare different methods for estimating individual homozygosity-by-descent based on the information from human genome-wide scans rather than genealogies. We considered the four most commonly used methods and investigated their applicability to single-nucleotide polymorphism (SNP) data in both a simulation study and by using the human genotyped data. A total of 986 inhabitants from the isolated Island of Vis, Croatia (where inbreeding is present, but no pedigree-based inbreeding was observed at the level of F > 0.0625) were included in this study. All individuals were genotyped with the Illumina HumanHap300 array with 317,503 SNP markers.

Results

Simulation data suggested that multi-point FEstim is the method most strongly correlated to true homozygosity-by-descent. Correlation coefficients between the homozygosity-by-descent estimates were high but only for inbred individuals, with nearly absolute correlation between single-point measures.

Conclusions

Deciding who is really inbred is a methodological challenge where multi-point approaches can be very helpful once the set of SNP markers is filtered to remove linkage disequilibrium. The use of several different methodological approaches and hence different homozygosity measures can help to distinguish between homozygosity-by-state and homozygosity-by-descent in studies investigating the effects of genomic autozygosity on human health.

Keywords


Article
Gene-specific FACS sorting method for target selection in high-throughput amplicon sequencing

Authors: Sandberg Julia --- Neiman Marten --- Ahmadian Afshin --- Lundeberg Joakim
DOI: 10.1186/1471-2164-11-140
Pages: 140
Loading...
Loading...
Abstract

Abstract

Background

In addition to shotgun sequencing, next generation sequencing has been shown to be suitable for deep sequencing of many specific PCR-amplified target genes in parallel. However, unspecific product formation is a common problem in amplicon sequencing since these fragments are difficult to fully remove by gel purification, and their presence inevitably reduces the number of mappable sequence reads that can be obtained in each sequencing run.

Results

We have used a novel flow cytometric sorting approach to specifically enrich Roche/454 DNA Capture beads carrying target DNA sequences on their surface, and reject beads carrying unspecific sequences. This procedure gives a nearly three-fold increase in the fraction of informative sequences obtained. Presented results also show that there are no significant differences in the distribution or presence of different genotypes between a FACS-enriched sample and a standard-enriched control sample.

Conclusions

Target-specific FACS enrichment prior to Roche/454 sequencing provides a quick, inexpensive way of increasing the amount of high quality data obtained in a single sequencing run, without introducing any sequence bias.

Keywords


Article
Hyper-expansion of large DNA segments in the genome of kuruma shrimp, Marsupenaeus japonicus

Loading...
Loading...
Abstract

Abstract

Background

Higher crustaceans (class Malacostraca) represent the most species-rich and morphologically diverse group of non-insect arthropods and many of its members are commercially important. Although the crustacean DNA sequence information is growing exponentially, little is known about the genome organization of Malacostraca. Here, we constructed a bacterial artificial chromosome (BAC) library and performed BAC-end sequencing to provide genomic information for kuruma shrimp (Marsupenaeus japonicus), one of the most widely cultured species among crustaceans, and found the presence of a redundant sequence in the BAC library. We examined the BAC clone that includes the redundant sequence to further analyze its length, copy number and location in the kuruma shrimp genome.

Results

Mj024A04 BAC clone, which includes one redundant sequence, contained 27 putative genes and seemed to display a normal genomic DNA structure. Notably, of the putative genes, 3 genes encode homologous proteins to the inhibitor of apoptosis protein and 7 genes encode homologous proteins to white spot syndrome virus, a virulent pathogen known to affect crustaceans. Colony hybridization and PCR analysis of 381 BAC clones showed that almost half of the BAC clones maintain DNA segments whose sequences are homologous to the representative BAC clone Mj024A04. The Mj024A04 partial sequence was detected multiple times in the kuruma shrimp nuclear genome with a calculated copy number of at least 100. Microsatellites based BAC genotyping clearly showed that Mj024A04 homologous sequences were cloned from at least 48 different chromosomal loci. The absence of micro-syntenic relationships with the available genomic sequences of Daphnia and Drosophila suggests the uniqueness of these fragments in kuruma shrimp from current arthropod genome sequences.

Conclusions

Our results demonstrate that hyper-expansion of large DNA segments took place in the kuruma shrimp genome. Although we analyzed only a part of the duplicated DNA segments, our result suggested that it is difficult to analyze the shrimp genome following normal analytical methodology. Hence, it is necessary to avoid repetitive sequence (such as segmental duplications) when studying the other unique structures in the shrimp genome.

Keywords


Article
TobEA: an atlas of tobacco gene expression from seed to senescence

Authors: Edwards Kieron D --- Bombarely Aureliano --- Story Geraint W --- Allen Fraser --- et al.
DOI: 10.1186/1471-2164-11-142
Pages: 142
Loading...
Loading...
Abstract

Abstract

Background

Transcriptomics has resulted in the development of large data sets and tools for the progression of functional genomics and systems biology in many model organisms. Currently there is no commercially available microarray to allow such expression studies in Nicotiana tabacum (tobacco).

Results

A custom designed Affymetrix tobacco expression microarray was generated from a set of over 40k unigenes and used to measure gene expression in 19 different tobacco samples to produce the Tobacco Expression Atlas (TobEA). TobEA provides a snap shot of the transcriptional activity for thousands of tobacco genes in different tissues throughout the lifecycle of the plant and enables the identification of the biological processes occurring in these different tissues. 772 of 2513 transcription factors previously identified in tobacco were mapped to the array, with 87% of them being expressed in at least one tissue in the atlas. Putative transcriptional networks were identified based on the co-expression of these transcription factors. Several interactions in a floral identity transcription factor network were consistent with previous results from other plant species. To broaden access and maximise the benefit of TobEA a set of tools were developed to provide researchers with expression information on their genes of interest via the Solanaceae Genomics Network (SGN) web site. The array has also been made available for public use via the Nottingham Arabidopsis Stock Centre microarray service.

Conclusions

The generation of a tobacco expression microarray is an important development for research in this model plant. The data provided by TobEA represents a valuable resource for plant functional genomics and systems biology research and can be used to identify gene targets for both fundamental and applied scientific applications in tobacco.

Keywords


Article
Chloroplast genome sequence of the moss Tortula ruralis: gene content, polymorphism, and structural arrangement relative to other green plant chloroplast genomes

Loading...
Loading...
Abstract

Abstract

Background

Tortula ruralis, a widely distributed species in the moss family Pottiaceae, is increasingly used as a model organism for the study of desiccation tolerance and mechanisms of cellular repair. In this paper, we present the chloroplast genome sequence of T. ruralis, only the second published chloroplast genome for a moss, and the first for a vegetatively desiccation-tolerant plant.

Results

The Tortula chloroplast genome is ~123,500 bp, and differs in a number of ways from that of Physcomitrella patens, the first published moss chloroplast genome. For example, Tortula lacks the ~71 kb inversion found in the large single copy region of the Physcomitrella genome and other members of the Funariales. Also, the Tortula chloroplast genome lacks petN, a gene found in all known land plant plastid genomes. In addition, an unusual case of nucleotide polymorphism was discovered.

Conclusions

Although the chloroplast genome of Tortula ruralis differs from that of the only other sequenced moss, Physcomitrella patens, we have yet to determine the biological significance of the differences. The polymorphisms we have uncovered in the sequencing of the genome offer a rare possibility (for mosses) of the generation of DNA markers for fine-level phylogenetic studies, or to investigate individual variation within populations.

Keywords


Article
Whole genome analysis of p38 SAPK-mediated gene expression upon stress

Loading...
Loading...
Abstract

Abstract

Background

Cells have the ability to respond and adapt to environmental changes through activation of stress-activated protein kinases (SAPKs). Although p38 SAPK signalling is known to participate in the regulation of gene expression little is known on the molecular mechanisms used by this SAPK to regulate stress-responsive genes and the overall set of genes regulated by p38 in response to different stimuli.

Results

Here, we report a whole genome expression analyses on mouse embryonic fibroblasts (MEFs) treated with three different p38 SAPK activating-stimuli, namely osmostress, the cytokine TNFα and the protein synthesis inhibitor anisomycin. We have found that the activation kinetics of p38α SAPK in response to these insults is different and also leads to a complex gene pattern response specific for a given stress with a restricted set of overlapping genes. In addition, we have analysed the contribution of p38α the major p38 family member present in MEFs, to the overall stress-induced transcriptional response by using both a chemical inhibitor (SB203580) and p38α deficient (p38α-/-) MEFs. We show here that p38 SAPK dependency ranged between 60% and 88% depending on the treatments and that there is a very good overlap between the inhibitor treatment and the ko cells. Furthermore, we have found that the dependency of SAPK varies depending on the time the cells are subjected to osmostress.

Conclusions

Our genome-wide transcriptional analyses shows a selective response to specific stimuli and a restricted common response of up to 20% of the stress up-regulated early genes that involves an important set of transcription factors, which might be critical for either cell adaptation or preparation for continuous extra-cellular changes. Interestingly, up to 85% of the up-regulated genes are under the transcriptional control of p38 SAPK. Thus, activation of p38 SAPK is critical to elicit the early gene expression program required for cell adaptation to stress.

Keywords


Article
Genome-wide transcription factor binding site/promoter databases for the analysis of gene sets and co-occurrence of transcription factor binding motifs

Authors: Veerla Srinivas --- Ringnér Markus --- Höglund Mattias
DOI: 10.1186/1471-2164-11-145
Pages: 145
Loading...
Loading...
Abstract

Abstract

Background

The use of global gene expression profiling is a well established approach to understand biological processes. One of the major goals of these investigations is to identify sets of genes with similar expression patterns. Such gene signatures may be very informative and reveal new aspects of particular biological processes. A logical and systematic next step is to reduce the identified gene signatures to the regulatory components that induce the relevant gene expression changes. A central issue in this context is to identify transcription factors, or transcription factor binding sites (TFBS), likely to be of importance for the expression of the gene signatures.

Results

We develop a strategy that efficiently produces TFBS/promoter databases based on user-defined criteria. The resulting databases constitute all genes in the Santa Cruz database and the positions for all TFBS provided by the user as position weight matrices. These databases are then used for two purposes, to identify significant TFBS in the promoters in sets of genes and to identify clusters of co-occurring TFBS. We use two criteria for significance, significantly enriched TFBS in terms of total number of binding sites for the promoters, and significantly present TFBS in terms of the fraction of promoters with binding sites. Significant TFBS are identified by a re-sampling procedure in which the query gene set is compared with typically 105 gene lists of similar size randomly drawn from the TFBS/promoter database. We apply this strategy to a large number of published ChIP-Chip data sets and show that the proposed approach faithfully reproduces ChIP-Chip results. The strategy also identifies relevant TFBS when analyzing gene signatures obtained from the MSigDB database. In addition, we show that several TFBS are highly correlated and that co-occurring TFBS define functionally related sets of genes.

Conclusions

The presented approach of promoter analysis faithfully reproduces the results from several ChIP-Chip and MigDB derived gene sets and hence may prove to be an important method in the analysis of gene signatures obtained through ChIP-Chip or global gene expression experiments. We show that TFBS are organized in clusters of co-occurring TFBS that together define highly coherent sets of genes.

Keywords


Article
Distribution of candidate genes for experimentally induced arthritis in rats

Authors: Andersson Lars --- Ståhl Fredrik
DOI: 10.1186/1471-2164-11-146
Pages: 146
Loading...
Loading...
Abstract

Abstract

Background

Rat models are frequently used to link genomic regions to experimentally induced arthritis in quantitative trait locus (QTL) analyses. To facilitate the search for candidate genes within such regions, we have previously developed an application (CGC) that uses weighted keywords to rank genes based on their descriptive text. In this study, CGC is used for analyzing the localization of candidate genes from two viewpoints: distribution over the rat genome and functional connections between arthritis QTLs.

Methods

To investigate if candidate genes identified by CGC are more likely to be found inside QTLs, we ranked 2403 genes genome wide in rat. The number of genes within different ranges of CGC scores localized inside and outside QTLs was then calculated. Furthermore, we investigated if candidate genes within certain QTLs share similar functions, and if these functions could be connected to genes within other QTLs. Based on references between genes in OMIM, we created connections between genes in QTLs identified in two distinct rat crosses. In this way, QTL pairs with one QTL from each cross that share an unexpectedly high number of gene connections were identified. The genes that were found to connect a pair of QTLs were then functionally analysed using a publicly available classification tool.

Results

Out of the 2403 genes ranked by the CGC application, 1160 were localized within QTL regions. No difference was observed between highly and lowly rated genes. Hence, highly rated candidate genes for arthritis seem to be distributed randomly inside and outside QTLs. Furthermore, we found five pairs of QTLs that shared a significantly high number of interconnected genes. When functionally analyzed, most genes connecting two QTLs could be included in a single functional cluster. Thus, the functional connections between these genes could very well be involved in the development of an arthritis phenotype.

Conclusions

From the genome wide CGC search, we conclude that candidate genes for arthritis in rat are randomly distributed between QTL and non-QTL regions. We do however find certain pairs of QTLs that share a large number of functionally connected candidate genes, suggesting that these QTLs contain a number of genes involved in similar functions contributing to the arthritis phenotype.

Keywords


Article
Conservation and divergence of known apicomplexan transcriptional regulons

Authors: Essien Kobby --- Stoeckert Christian J
DOI: 10.1186/1471-2164-11-147
Pages: 147
Loading...
Loading...
Abstract

Abstract

Background

The apicomplexans are a diverse phylum of parasites causing an assortment of diseases including malaria in a wide variety of animals and lymphoproliferation in cattle. Little is known about how these varied parasites regulate their transcriptional regulons. Even less is known about how regulon systems, consisting of transcription factors and target genes together with their associated biological process, evolve in these diverse parasites.

Results

In order to obtain insights into the differences in transcriptional regulation between these parasites we compared the orthology profiles of putative malaria transcription factors across species and examined the enrichment patterns of four binding sites across eleven apicomplexans.

About three-fifths of the factors are broadly conserved in several phylogenetic orders of sequenced apicomplexans. This observation suggests the existence of regulons whose regulation is conserved across this ancient phylum. Transcription factors not broadly conserved across the phylum are possibly involved in regulon systems that have diverged between species. Examining binding site enrichment patterns in light of transcription factor conservation patterns suggests a second mode via which regulon systems may diverge - rewiring of existing transcription factors and their associated binding sites in specific ways. Integrating binding sites with transcription factor conservation patterns also facilitated prediction of putative regulators for one of the binding sites.

Conclusions

Even though transcription factors are underrepresented in apicomplexans, the distribution of these factors and their associated regulons reflect common and family-specific transcriptional regulatory processes.

Keywords


Article
MicroRNAs of Bombyx mori identified by Solexa sequencing

Authors: Liu Shiping --- Li Dong --- Li Qibin --- Zhao Ping --- et al.
DOI: 10.1186/1471-2164-11-148
Pages: 148
Loading...
Loading...
Abstract

Abstract

Background

MicroRNA (miRNA) and other small regulatory RNAs contribute to the modulation of a large number of cellular processes. We sequenced three small RNA libraries prepared from the whole body, and the anterior-middle and posterior silk glands of Bombyx mori, with a view to expanding the repertoire of silkworm miRNAs and exploring transcriptional differences in miRNAs between segments of the silk gland.

Results

With the aid of large-scale Solexa sequencing technology, we validated 257 unique miRNA genes, including 202 novel and 55 previously reported genes, corresponding to 324 loci in the silkworm genome. Over 30 known silkworm miRNAs were further corrected in their sequence constitutes and length. A number of reads originated from the loop regions of the precursors of two previously reported miRNAs (bmo-miR-1920 and miR-1921). Interestingly, the majority of the newly identified miRNAs were silkworm-specific, 23 unique miRNAs were widely conserved from invertebrates to vertebrates, 13 unique miRNAs were limited to invertebrates, and 32 were confined to insects. We identified 24 closely positioned clusters and 45 paralogs of miRNAs in the silkworm genome. However, sequence tags showed that paralogs or clusters were not prerequisites for coordinated transcription and accumulation. The majority of silkworm-specific miRNAs were located in transposable elements, and displayed significant differences in abundance between the anterior-middle and posterior silk gland.

Conclusions

Conservative analysis revealed that miRNAs can serve as phylogenetic markers and function in evolutionary signaling. The newly identified miRNAs greatly enrich the repertoire of insect miRNAs, and provide insights into miRNA evolution, biogenesis, and expression in insects. The differential expression of miRNAs in the anterior-middle and posterior silk glands supports their involvement as new levels in the regulation of the silkworm silk gland.

Keywords


Article
Transcribed-ultra conserved region expression profiling from low-input total RNA

Authors: Scaruffi Paola --- Stigliani Sara --- Coco Simona --- Valdora Franscesca --- et al.
DOI: 10.1186/1471-2164-11-149
Pages: 149
Loading...
Loading...
Abstract

Abstract

Background

Ultra Conserved Regions (UCRs) are a class of 481 noncoding sequences located in both intra- and inter-genic regions of the genome. The recent findings that they are significantly altered in adult chronic lymphocytic leukemias, carcinomas, and pediatric neuroblastomas lead to the hypothesis that UCRs may play a role in tumorigenesis.

Results

We present a novel application of Ribo-SPIA™ isothermal linear amplification of minute RNA quantities for quantifying Transcribed-UCR (T-UCR) expression by quantitative PCR. Direct comparison of non-amplified with amplified cDNA in two neuroblastoma cell lines showed that the amplification approach increases sensitivity and repeatability in T-UCR quantification. It is noteworthy that the Ribo-SPIA™ step allowed us to analyze all 481 T-UCRs by using 150 ng of RNA, while introducing a minimal bias and preserving the magnitude of relative expression. Only the less abundant T-UCRs have high intra-assay variability, consistently with the Poisson distribution statistics and stochastic effects on PCR repeatability.

Conclusions

We demonstrated that the quantification procedure shown here is an accurate and reliable technique for genome-wide non coding gene (i.e., UCRs) profiling using small amounts of RNA. This issue is particularly important because studies of transcription regulation are increasingly conducted in small homogeneous samples, such as laser capture microdissected or sorted cell populations.

Keywords


Article
Genome-wide transcriptome analysis of the transition from primary to secondary stem development in Populus trichocarpa

Authors: Dharmawardhana Palitha --- Brunner Amy M --- Strauss Steven H
DOI: 10.1186/1471-2164-11-150
Pages: 150
Loading...
Loading...
Abstract

Abstract

Background

With its genome sequence and other experimental attributes, Populus trichocarpa has become the model species for genomic studies of wood development. Wood is derived from secondary growth of tree stems, and begins with the development of a ring of vascular cambium in the young developing stem. The terminal region of the developing shoot provides a steep developmental gradient from primary to secondary growth that facilitates identification of genes that play specialized functions during each of these phases of growth.

Results

Using a genomic microarray representing the majority of the transcriptome, we profiled gene expression in stem segments that spanned primary to secondary growth. We found 3,016 genes that were differentially expressed during stem development (Q-value ≤ 0.05; >2-fold expression variation), and 15% of these genes encode proteins with no significant identities to known genes. We identified all gene family members putatively involved in secondary growth for carbohydrate active enzymes, tubulins, actins, actin depolymerizing factors, fasciclin-like AGPs, and vascular development-associated transcription factors. Almost 70% of expressed transcription factors were upregulated during the transition to secondary growth. The primary shoot elongation region of the stem contained specific carbohydrate active enzyme and expansin family members that are likely to function in primary cell wall synthesis and modification. Genes involved in plant defense and protective functions were also dominant in the primary growth region.

Conclusion

Our results describe the global patterns of gene expression that occur during the transition from primary to secondary stem growth. We were able to identify three major patterns of gene expression and over-represented gene ontology categories during stem development. The new regulatory factors and cell wall biogenesis genes that we identified provide candidate genes for further functional characterization, as well as new tools for molecular breeding and biotechnology aimed at improvement of tree growth rate, crown form, and wood quality.

Keywords


Article
Promiscuity of enhancer, coding and non-coding transcription functions in ultraconserved elements

Authors: Licastro Danilo --- Gennarino Vincenzo A --- Petrera Francesca --- Sanges Remo --- et al.
DOI: 10.1186/1471-2164-11-151
Pages: 151
Loading...
Loading...
Abstract

Abstract

Background

Ultraconserved elements (UCEs) are highly constrained elements of mammalian genomes, whose functional role has not been completely elucidated yet. Previous studies have shown that some of them act as enhancers in mouse, while some others are expressed in both normal and cancer-derived human tissues. Only one UCE element so far was shown to present these two functions concomitantly, as had been observed in other isolated instances of single, non ultraconserved enhancer elements.

Results

We used a custom microarray to assess the levels of UCE transcription during mouse development and integrated these data with published microarray and next-generation sequencing datasets as well as with newly produced PCR validation experiments. We show that a large fraction of non-exonic UCEs is transcribed across all developmental stages examined from only one DNA strand. Although the nature of these transcripts remains a mistery, our meta-analysis of RNA-Seq datasets indicates that they are unlikely to be short RNAs and that some of them might encode nuclear transcripts. In the majority of cases this function overlaps with the already established enhancer function of these elements during mouse development. Utilizing several next-generation sequencing datasets, we were further able to show that the level of expression observed in non-exonic UCEs is significantly higher than in random regions of the genome and that this is also seen in other regions which act as enhancers.

Conclusion

Our data shows that the concurrent presence of enhancer and transcript function in non-exonic UCE elements is more widespread than previously shown. Moreover through our own experiments as well as the use of next-generation sequencing datasets, we were able to show that the RNAs encoded by non-exonic UCEs are likely to be long RNAs transcribed from only one DNA strand.

Keywords


Article
Genome dynamics of Bartonella grahamii in micro-populations of woodland rodents

Loading...
Loading...
Abstract

Abstract

Background

Rodents represent a high-risk reservoir for the emergence of new human pathogens. The recent completion of the 2.3 Mb genome of Bartonella grahamii, one of the most prevalent blood-borne bacteria in wild rodents, revealed a higher abundance of genes for host-cell interaction systems than in the genomes of closely related human pathogens. The sequence variability within the global B. grahamii population was recently investigated by multi locus sequence typing, but no study on the variability of putative host-cell interaction systems has been performed.

Results

To study the population dynamics of B. grahamii, we analyzed the genomic diversity on a whole-genome scale of 27 B. grahamii strains isolated from four different species of wild rodents in three geographic locations separated by less than 30 km. Even using highly variable spacer regions, only 3 sequence types were identified. This low sequence diversity contrasted with a high variability in genome content. Microarray comparative genome hybridizations identified genes for outer surface proteins, including a repeated region containing the fha gene for filamentous hemaggluttinin and a plasmid that encodes a type IV secretion system, as the most variable. The estimated generation times in liquid culture medium for a subset of strains ranged from 5 to 22 hours, but did not correlate with sequence type or presence/absence patterns of the fha gene or the plasmid.

Conclusion

Our study has revealed a geographic microstructure of B. grahamii in wild rodents. Despite near-identity in nucleotide sequence, major differences were observed in gene presence/absence patterns that did not segregate with host species. This suggests that genetically similar strains can infect a range of different hosts.

Keywords


Article
Chemogenomic and transcriptome analysis identifies mode of action of the chemosensitizing agent CTBT (7-chlorotetrazolo[5,1-c]benzo[1,2,4]triazine)

Authors: Batova Monika --- Klobucnikova Vlasta --- Oblasova Zuzana --- Gregan Juraj --- et al.
DOI: 10.1186/1471-2164-11-153
Pages: 153
Loading...
Loading...
Abstract

Abstract

Background

CTBT (7-chlorotetrazolo [5,1-c]benzo[1,2,4]triazine) increases efficacy of commonly used antifungal agents by an unknown mechanism. It increases the susceptibility of Saccharomyces cerevisiae, Candida albicans and Candida glabrata cells to cycloheximide, 5-fluorocytosine and azole antimycotic drugs. Here we elucidate CTBT mode of action with a combination of systematic genetic and transcriptome analysis.

Results

To identify the cellular processes affected by CTBT, we screened the systematic haploid deletion mutant collection for CTBT sensitive mutants. We identified 169 hypersensitive deletion mutants. The deleted genes encode proteins mainly involved in mitochondrial functions, DNA repair, transcription and chromatin remodeling, and oxidative stress response. We found that the susceptibility of yeast cells to CTBT depends on molecular oxygen. Transcriptome analysis of the immediate early response to CTBT revealed rapid induction of oxidant and stress response defense genes. Many of these genes depend on the transcription factors Yap1 and Cin5. Yap1 accumulates rapidly in the nucleus in CTBT treated cells suggesting acute oxidative stress. Moreover, molecular calculations supported a superoxide generating activity of CTBT. Superoxide production in vivo by CTBT was found associated to mitochondria as indicated by oxidation of MitoSOX Red.

Conclusion

We conclude that CTBT causes intracellular superoxide production and oxidative stress in fungal cells and is thus enhancing antimycotic drug effects by a secondary stress.

Keywords


Article
Comprehensive analysis of MHC class I genes from the U-, S-, and Z-lineages in Atlantic salmon

Authors: Lukacs Morten F --- Harstad Håvard --- Bakke Hege G --- Beetz-Sargent Marianne --- et al.
DOI: 10.1186/1471-2164-11-154
Pages: 154
Loading...
Loading...
Abstract

Abstract

Background

We have previously sequenced more than 500 kb of the duplicated MHC class I regions in Atlantic salmon. In the IA region we identified the loci for the MHC class I gene Sasa-UBA in addition to a soluble MHC class I molecule, Sasa-ULA. A pseudolocus for Sasa-UCA was identified in the nonclassical IB region. Both regions contained genes for antigen presentation, as wells as orthologues to other genes residing in the human MHC region.

Results

The genomic localisation of two MHC class I lineages (Z and S) has been resolved. 7 BACs were sequenced using a combination of standard Sanger and 454 sequencing. The new sequence data extended the IA region with 150 kb identifying the location of one Z-lineage locus, ZAA. The IB region was extended with 350 kb including three new Z-lineage loci, ZBA, ZCA and ZDA in addition to a UGA locus. An allelic version of the IB region contained a functional UDA locus in addition to the UCA pseudolocus. Additionally a BAC harbouring two MHC class I genes (UHA) was placed on linkage group 14, while a BAC containing the S-lineage locus SAA (previously known as UAA) was placed on LG10. Gene expression studies showed limited expression range for all class I genes with exception of UBA being dominantly expressed in gut, spleen and gills, and ZAA with high expression in blood.

Conclusion

Here we describe the genomic organization of MHC class I loci from the U-, Z-, and S-lineages in Atlantic salmon. Nine of the described class I genes are located in the extension of the duplicated IA and IB regions, while three class I genes are found on two separate linkage groups. The gene organization of the two regions indicates that the IB region is evolving at a different pace than the IA region. Expression profiling, polymorphic content, peptide binding properties and phylogenetic relationship show that Atlantic salmon has only one MHC class Ia gene (UBA), in addition to a multitude of nonclassical MHC class I genes from the U-, S- and Z-lineages.

Keywords


Article
Unified translation repression mechanism for microRNAs and upstream AUGs

Authors: Ajay Subramanian S --- Athey Brian D --- Lee Inhan
DOI: 10.1186/1471-2164-11-155
Pages: 155
Loading...
Loading...
Abstract

Abstract

Background

MicroRNAs (miRNAs) are endogenous small RNAs that modulate gene expression at the post-transcriptional level by binding complementary sites in the 3'-UTR. In a recent genome-wide study reporting a new miRNA target class (miBridge), we identified and validated interactions between 5'-UTRs and miRNAs. Separately, upstream AUGs (uAUGs) in 5'-UTRs are known to regulate genes translationally without affecting mRNA levels, one of the mechanisms for miRNA-mediated repression.

Results

Using sequence data from whole-genome cDNA alignments we identified 1418 uAUG sequences on the 5'-UTR that specifically interact with 3'-ends of conserved miRNAs. We computationally identified miRNAs that can target six genes through their uAUGs that were previously reported to suppress translation. We extended this meta-analysis by confirming expression of these miRNAs in cell-lines used in the uAUG studies. Similarly, seven members of the KLF family of genes containing uAUGs were computationally identified as interacting with several miRNAs. Using KLF9 as an example (whose protein expression is limited to brain tissue despite the mRNA being expressed ubiquitously), we show computationally that miRNAs expressed only in HeLa cells and not in neuroblastoma (N2A) cells can bind the uAUGs responsible for translation inhibition. Our computed results demonstrate that tissue- or cell-line specific repression of protein translation by uAUGs can be explained by the presence or absence of miRNAs that target these uAUG sequences. We propose that these uAUGs represent a subset of miRNA interaction sites on 5'-UTRs in miBridge, whereby a miRNA binding a uAUG hinders the progression of ribosome scanning the mRNA before it reaches the open reading frame (ORF).

Conclusions

While both miRNAs and uAUGs are separately known to down-regulate protein expression, we show that they may be functionally related by identifying potential interactions through a sequence-specific binding mechanism. Using prior experimental evidence that shows uAUG effects on translation repression together with miRNA expression data specific to cell lines, we demonstrate through computational analysis that cell-specific down-regulation of protein expression (while maintaining mRNA levels) correlates well with the simultaneous presence of miRNA and target uAUG sequences in one cell type and not others, suggesting tissue-specific translation repression by miRNAs through uAUGs.

Keywords


Article
Discovery and application of insertion-deletion (INDEL) polymorphisms for QTL mapping of early life-history traits in Atlantic salmon

Authors: Vasemägi Anti --- Gross Riho --- Palm Daniel --- Paaver Tiit --- et al.
DOI: 10.1186/1471-2164-11-156
Pages: 156
Loading...
Loading...
Abstract

Abstract

Background

For decades, linkage mapping has been one of the most powerful and widely used approaches for elucidating the genetic architecture of phenotypic traits of medical, agricultural and evolutionary importance. However, successful mapping of Mendelian and quantitative phenotypic traits depends critically on the availability of fast and preferably high-throughput genotyping platforms. Several array-based single nucleotide polymorphism (SNP) genotyping platforms have been developed for genetic model organisms during recent years but most of these methods become prohibitively expensive for screening large numbers of individuals. Therefore, inexpensive, simple and flexible genotyping solutions that enable rapid screening of intermediate numbers of loci (~75-300) in hundreds to thousands of individuals are still needed for QTL mapping applications in a broad range of organisms.

Results

Here we describe the discovery of and application of insertion-deletion (INDEL) polymorphisms for cost-efficient medium throughput genotyping that enables analysis of >75 loci in a single automated sequencer electrophoresis column with standard laboratory equipment. Genotyping of INDELs requires low start-up costs, includes few standard sample handling steps and is applicable to a broad range of species for which expressed sequence tag (EST) collections are available. As a proof of principle, we generated a partial INDEL linkage map in Atlantic salmon (Salmo salar) and rapidly identified a number of quantitative trait loci (QTLs) affecting early life-history traits that are expected to have important fitness consequences in the natural environment.

Conclusions

The INDEL genotyping enabled fast coarse-mapping of chromosomal regions containing QTL, thus providing an efficient means for characterization of genetic architecture in multiple crosses and large pedigrees. This enables not only the discovery of larger number of QTLs with relatively smaller phenotypic effect but also provides a cost-effective means for evaluation of the frequency of segregating QTLs in outbred populations which is important for further understanding how genetic variation underlying phenotypic traits is maintained in the wild.

Keywords


Article
Transcriptome profiling of a Sinorhizobium meliloti fadD mutant reveals the role of rhizobactin 1021 biosynthesis and regulation genes in the control of swarming

Loading...
Loading...
Abstract

Abstract

Background

Swarming is a multicellular phenomenom characterized by the coordinated and rapid movement of bacteria across semisolid surfaces. In Sinorhizobium meliloti this type of motility has been described in a fadD mutant. To gain insights into the mechanisms underlying the process of swarming in rhizobia, we compared the transcriptome of a S. meliloti fadD mutant grown under swarming inducing conditions (semisolid medium) to those of cells grown under non-swarming conditions (broth and solid medium).

Results

More than a thousand genes were identified as differentially expressed in response to growth on agar surfaces including genes for several metabolic activities, iron uptake, chemotaxis, motility and stress-related genes. Under swarming-specific conditions, the most remarkable response was the up-regulation of iron-related genes. We demonstrate that the pSymA plasmid and specifically genes required for the biosynthesis of the siderophore rhizobactin 1021 are essential for swarming of a S. meliloti wild-type strain but not in a fadD mutant. Moreover, high iron conditions inhibit swarming of the wild-type strain but not in mutants lacking either the iron limitation response regulator RirA or FadD.

Conclusions

The present work represents the first transcriptomic study of rhizobium growth on surfaces including swarming inducing conditions. The results have revealed major changes in the physiology of S. meliloti cells grown on a surface relative to liquid cultures. Moreover, analysis of genes responding to swarming inducing conditions led to the demonstration that iron and genes involved in rhizobactin 1021 synthesis play a role in the surface motility shown by S. meliloti which can be circumvented in a fadD mutant. This work opens a way to the identification of new traits and regulatory networks involved in swarming by rhizobia.

Keywords


Article
From QTL to candidate gene: Genetical genomics of simple and complex traits in potato using a pooling strategy

Authors: Kloosterman Bjorn --- Oortwijn Marian --- uitdeWilligen Jan --- America Twan --- et al.
DOI: 10.1186/1471-2164-11-158
Pages: 158
Loading...
Loading...
Abstract

Abstract

Background

Utilization of the natural genetic variation in traditional breeding programs remains a major challenge in crop plants. The identification of candidate genes underlying, or associated with, phenotypic trait QTLs is desired for effective marker assisted breeding. With the advent of high throughput -omics technologies, screening of entire populations for association of gene expression with targeted traits is becoming feasible but remains costly. Here we present the identification of novel candidate genes for different potato tuber quality traits by employing a pooling approach reducing the number of hybridizations needed. Extreme genotypes for a quantitative trait are collected and the RNA from contrasting bulks is then profiled with the aim of finding differentially expressed genes.

Results

We have successfully implemented the pooling strategy for potato quality traits and identified candidate genes associated with potato tuber flesh color and tuber cooking type. Elevated expression level of a dominant allele of the β-carotene hydroxylase (bch) gene was associated with yellow flesh color through mapping of the gene under a major QTL for flesh color on chromosome 3. For a second trait, a candidate gene with homology to a tyrosine-lysine rich protein (TLRP) was identified based on allele specificity of the probe on the microarray. TLRP was mapped on chromosome 9 in close proximity to a QTL for potato cooking type strengthening its significance as a candidate gene. Furthermore, we have performed a profiling experiment targeting a polygenic trait, by pooling individual genotypes based both on phenotypic and marker data, allowing the identification of candidate genes associated with the two different linkage groups.

Conclusions

A pooling approach for RNA-profiling with the aim of identifying novel candidate genes associated with tuber quality traits was successfully implemented. The identified candidate genes for tuber flesh color (bch) and cooking type (tlrp) can provide useful markers for breeding schemes in the future. Strengths and limitations of the approach are discussed.

Keywords


Article
Recombinational landscape of porcine X chromosome and individual variation in female meiotic recombination associated with haplotypes of Chinese pigs

Authors: Ma Junwu --- Iannuccelli Nathalie --- Duan Yanyu --- Huang Weibing --- et al.
DOI: 10.1186/1471-2164-11-159
Pages: 159
Loading...
Loading...
Abstract

Abstract

Background

Variations in recombination fraction (θ) among chromosomal regions, individuals and families have been observed and have an important impact on quantitative trait loci (QTL) mapping studies. Such variations on porcine chromosome X (SSC-X) and on other mammalian chromosome X are rarely explored. The emerging assembly of pig sequence provides exact physical location of many markers, facilitating the study of a fine-scale recombination landscape of the pig genome by comparing a clone-based physical map to a genetic map. Using large offspring of F1 females from two large-scale resource populations (Large White ♂ × Chinese Meishan ♀, and White Duroc ♂ × Chinese Erhualian ♀), we were able to evaluate the heterogeneity in θ for a specific interval among individual F1 females.

Results

Alignments between the cytogenetic map, radiation hybrid (RH) map, genetic maps and clone map of SSC-X with the physical map of human chromosome X (HSA-X) are presented. The most likely order of 60 markers on SSC-X is inferred. The average recombination rate across SSC-X is of ~1.27 cM/Mb. However, almost no recombination occurred in a large region of ~31 Mb extending from the centromere to Xq21, whereas in the surrounding regions and in the Xq telomeric region a recombination rate of 2.8-3.3 cM/Mb was observed, more than twice the chromosome-wide average rate. Significant differences in θ among F1 females within each population were observed for several chromosomal intervals. The largest variation was observed in both populations in the interval UMNP71-SW1943, or more precisely in the subinterval UMNP891-UMNP93. The individual variation in θ over this subinterval was found associated with F1 females' maternal haplotypes (Chinese pig haplotypes) and independent of paternal haplotype (European pig haplotypes). The θ between UMNP891 and UMNP93 for haplotype 1122 and 4311 differed by more than fourteen-fold (10.3% vs. 0.7%).

Conclusions

This study reveals marked regional, individual and haplotype-specific differences in recombination rate on SSC-X. Lack of recombination in such a large region makes it impossible to narrow QTL interval using traditional fine-mapping approaches. The relationship between recombination variation and haplotype polymorphism is shown for the first time in pigs.

Keywords


Article
Transcriptome analysis of reproductive tissue and intrauterine developmental stages of the tsetse fly (Glossina morsitans morsitans)

Authors: Attardo Geoffrey M --- Ribeiro José MC --- Wu Yineng --- Berriman Matthew --- et al.
DOI: 10.1186/1471-2164-11-160
Pages: 160
Loading...
Loading...
Abstract

Abstract

Background

Tsetse flies, vectors of African trypanosomes, undergo viviparous reproduction (the deposition of live offspring). This reproductive strategy results in a large maternal investment and the deposition of a small number of progeny during a female's lifespan. The reproductive biology of tsetse has been studied on a physiological level; however the molecular analysis of tsetse reproduction requires deeper investigation. To build a foundation from which to base molecular studies of tsetse reproduction, a cDNA library was generated from female tsetse (Glossina morsitans morsitans) reproductive tissues and the intrauterine developmental stages. 3438 expressed sequence tags were sequenced and analyzed.

Results

Analysis of a nonredundant catalogue of 1391 contigs resulted in 520 predicted proteins. 475 of these proteins were full length. We predict that 412 of these represent cytoplasmic proteins while 57 are secreted. Comparison of these proteins with other tissue specific tsetse cDNA libraries (salivary gland, fat body/milk gland, and midgut) identified 51 that are unique to the reproductive/immature cDNA library. 11 unique proteins were homologus to uncharacterized putative proteins within the NR database suggesting the identification of novel genes associated with reproductive functions in other insects (hypothetical conserved). The analysis also yielded seven putative proteins without significant homology to sequences present in the public database (unknown genes). These proteins may represent unique functions associated with tsetse's viviparous reproductive cycle. RT-PCR analysis of hypothetical conserved and unknown contigs was performed to determine basic tissue and stage specificity of the expression of these genes.

Conclusion

This paper identifies 51 putative proteins specific to a tsetse reproductive/immature EST library. 11 of these proteins correspond to hypothetical conserved genes and 7 proteins are tsetse specific.

Keywords


Article
Functional annotation of novel lineage-specific genes using co-expression and promoter analysis

Authors: Kumar Charu G --- Everts Robin E --- Loor Juan J --- Lewin Harris A
DOI: 10.1186/1471-2164-11-161
Pages: 161
Loading...
Loading...
Abstract

Abstract

Background

The diversity of placental architectures within and among mammalian orders is believed to be the result of adaptive evolution. Although, the genetic basis for these differences is unknown, some may arise from rapidly diverging and lineage-specific genes. Previously, we identified 91 novel lineage-specific transcripts (LSTs) from a cow term-placenta cDNA library, which are excellent candidates for adaptive placental functions acquired by the ruminant lineage. The aim of the present study was to infer functions of previously uncharacterized lineage-specific genes (LSGs) using co-expression, promoter, pathway and network analysis.

Results

Clusters of co-expressed genes preferentially expressed in liver, placenta and thymus were found using 49 previously uncharacterized LSTs as seeds. Over-represented composite transcription factor binding sites (TFBS) in promoters of clustered LSGs and known genes were then identified computationally. Functions were inferred for nine previously uncharacterized LSGs using co-expression analysis and pathway analysis tools. Our results predict that these LSGs may function in cell signaling, glycerophospholipid/fatty acid metabolism, protein trafficking, regulatory processes in the nucleus, and processes that initiate parturition and immune system development.

Conclusions

The placenta is a rich source of lineage-specific genes that function in the adaptive evolution of placental architecture and functions. We have shown that co-expression, promoter, and gene network analyses are useful methods to infer functions of LSGs with heretofore unknown functions. Our results indicate that many LSGs are involved in cellular recognition and developmental processes. Furthermore, they provide guidance for experimental approaches to validate the functions of LSGs and to study their evolution.

Keywords


Article
Transcriptional and pathway analysis in the hypothalamus of newly hatched chicks during fasting and delayed feeding

Authors: Higgins Stacy E --- Ellestad Laura E --- Trakooljul Nares --- McCarthy Fiona --- et al.
DOI: 10.1186/1471-2164-11-162
Pages: 162
Loading...
Loading...
Abstract

Abstract

Background

The hypothalamus plays a central role in regulating appetite and metabolism. However, the gene networks within the hypothalamus that regulate feed intake and metabolism, and the effects of fasting on those pathways are not completely understood in any species. The present experiment evaluated global hypothalamic gene expression in newly hatched chicks using microarray analysis to elucidate genes and pathways regulated by feeding, fasting, and delayed feeding. Ten groups of chicks were sampled over four days post-hatch, including fed, fasted, and 48 h fasted followed by access to feed for 4 h, 24 h, and 48 h. Hypothalamic samples were collected for microarray analysis (n = 4). Expression patterns of selected genes were confirmed by quantitative real-time PCR. Pathway analysis of the microarray results predicted a network of genes involved in neuropeptide or neurotransmitter signaling. To confirm the functionality of this predicted gene network, hypothalamic neurons from fed and fasted chicks were isolated and cultured in the presence of neuropeptide Y, somatostatin, α-melanocyte stimulating hormone, norepinephrine, and L-phospho-serine. Results confirmed functional relationships among members of the predicted gene network. Moreover, the effects observed were dependant upon the nutritional state of the animals (fed vs. fasted).

Results

Differences in gene expression (≥ 1.6 fold) were detected in 1,272 genes between treatments, and of those, 119 genes were significantly (P < 0.05) different. Pathway Miner analysis revealed that six genes (SSTR5, NPY5R, POMC, ADRB2, GRM8, and RLN3) were associated within a gene network. In vitro experiments with primary hypothalamic neurons confirmed that receptor agonists involved in this network regulated expression of other genes in the predicted network, and this regulation within the network was influenced by the nutritional status and age of the chick.

Conclusions

Microarray analysis of the hypothalamus during different nutritional states revealed that many genes are differentially regulated. We found that functional interactions exist among six differentially regulated genes associated within a putative gene network from this experiment. Considering that POMC, an important gene in controlling metabolism, was central to this network, this gene network may play an important role in regulation of feeding and metabolism in birds.

Keywords


Article
Improvement of tissue preparation for laser capture microdissection: application for cell type-specific miRNA expression profiling in colorectal tumors

Authors: Wang Shuyang --- Wang Lei --- Zhu Tengfang --- Gao Xue --- et al.
DOI: 10.1186/1471-2164-11-163
Pages: 163
Loading...
Loading...
Abstract

Abstract

Background

Laser capture microdissection (LCM) has successfully isolated pure cell populations from tissue sections and the combination of LCM with standard genomic and proteomic methods has revolutionized molecular analysis of complex tissue. However, the quantity and quality of material recovered after LCM is often still limited for analysis by using whole genomic and proteomic approaches. To procure high quality and quantity of RNA after LCM, we optimized the procedures on tissue preparations and applied the approach for cell type-specific miRNA expression profiling in colorectal tumors.

Results

We found that the ethanol fixation of tissue sections for 2 hours had the maximum improvement of RNA quality (1.8 fold, p = 0.0014) and quantity (1.5 fold, p = 0.066). Overall, the quality (RNA integrity number, RIN) for the microdissected colorectal tissues was 5.2 ± 1.5 (average ± SD) for normal (n = 43), 5.7 ± 1.1 for adenomas (n = 14) and 7.2 ± 1.2 for carcinomas (n = 44). We then compared miRNA expression profiles of 18 colorectal tissues (6 normal, 6 adenomas and 6 carcinomas) between LCM selected epithelial cells versus stromal cells using Agilent miRNA microarrays. We identified 51 differentially expressed miRNAs (p <= 0.001) between these two cell types. We found that the miRNAs in the epithelial cells could differentiate adenomas from normal and carcinomas. However, the miRNAs in the stromal and mixed cells could not separate adenomas from normal tissues. Finally, we applied quantitative RT-PCR to cross-verify the expression patterns of 7 different miRNAs using 8 LCM-selected epithelial cells and found the excellent correlation of the fold changes between the two platforms (R = 0.996).

Conclusions

Our study demonstrates the feasibility and potential power of discovering cell type-specific miRNA biomarkers in complex tissue using combination of LCM with genome-wide miRNA analysis.

Keywords


Article
Comparative genomics and proteomics of Helicobacter mustelae, an ulcerogenic and carcinogenic gastric pathogen

Loading...
Loading...
Abstract

Abstract

Background

Helicobacter mustelae causes gastritis, ulcers and gastric cancer in ferrets and other mustelids. H. mustelae remains the only helicobacter other than H. pylori that causes gastric ulceration and cancer in its natural host. To improve understanding of H. mustelae pathogenesis, and the ulcerogenic and carcinogenic potential of helicobacters in general, we sequenced the H. mustelae genome, and identified 425 expressed proteins in the envelope and cytosolic proteome.

Results

The H. mustelae genome lacks orthologs of major H. pylori virulence factors including CagA, VacA, BabA, SabA and OipA. However, it encodes ten autotransporter surface proteins, seven of which were detected in the expressed proteome, and which, except for the Hsr protein, are of unknown function. There are 26 putative outer membrane proteins in H. mustelae, some of which are most similar to the Hof proteins of H. pylori. Although homologs of putative virulence determinants of H. pylori (NapA, plasminogen adhesin, collagenase) and Campylobacter jejuni (CiaB, Peb4a) are present in the H. mustelae genome, it also includes a distinct complement of virulence-related genes including a haemagglutinin/haemolysin protein, and a glycosyl transferase for producing blood group A/B on its lipopolysaccharide. The most highly expressed 264 proteins in the cytosolic proteome included many corresponding proteins from H. pylori, but the rank profile in H. mustelae was distinctive. Of 27 genes shown to be essential for H. pylori colonization of the gerbil, all but three had orthologs in H. mustelae, identifying a shared set of core proteins for gastric persistence.

Conclusions

The determination of the genome sequence and expressed proteome of the ulcerogenic species H mustelae provides a comparative model for H. pylori to investigate bacterial gastric carcinogenesis in mammals, and to suggest ways whereby cag minus H. pylori strains might cause ulceration and cancer.

The genome sequence was deposited in EMBL/GenBank/DDBJ under accession number FN555004.

Keywords


Article
Depletion of the non-coding regulatory 6S RNA in E. coli causes a surprising reduction in the expression of the translation machinery

Authors: Neusser Thomas --- Polen Tino --- Geissen René --- Wagner Rolf
DOI: 10.1186/1471-2164-11-165
Pages: 165
Loading...
Loading...
Abstract

Abstract

Background

6S RNA from E. coli is known to bind to RNA polymerase interfering with transcription initiation. Because 6S RNA concentrations are maximal at stationary phase and binding occurs preferentially to the holoenzyme associated with σ70 (Eσ70) it is believed that 6S RNA supports adjustment to stationary phase transcription. Previous studies have also suggested that inhibition is specific for σ70-dependent promoters characterized by a weak -35 recognition motif or extended -10 promoters. There are many exceptions to this precept, showing that other types of promoters, including stationary phase-specific (σ38-dependent) promoters are inhibited.

Results

To solve this apparent ambiguity and to better understand the role of 6S RNA in stationary phase transition we have performed a genome-wide transcriptional analysis of wild-type and 6S RNA deficient cells growing to mid-log or early stationary phase. We found 245 genes at the exponential growth phase and 273 genes at the early stationary phase to be ≥ 1.5-fold differentially expressed. Up- and down-regulated genes include many transcriptional regulators, stress-related proteins, transporters and several enzymes involved in purine metabolism. As the most striking result during stationary phase, however, we obtained in the 6S RNA deficient strain a concerted expression reduction of genes constituting the translational apparatus. In accordance, primer extension analysis showed that transcription of ribosomal RNAs, representing the key molecules for ribosome biogenesis, is also significantly reduced under the same conditions. Consistent with this finding biochemical analysis of the 6S RNA deficient strain indicates that the lack of 6S RNA is apparently compensated by an increase of the basal ppGpp concentration, known to affect growth adaptation and ribosome biogenesis.

Conclusions

The analysis demonstrated that the effect of 6S RNA on transcription is not strictly confined to σ70-dependent promoters. Moreover, the results indicate that 6S RNA is embedded in stationary phase adaptation, which is governed by the capacity of the translational machinery.

Keywords


Article
TC-motifs at the TATA-box expected position in plant genes: a novel class of motifs involved in the transcription regulation

Authors: Bernard Virginie --- Brunaud Véronique --- Lecharny Alain
DOI: 10.1186/1471-2164-11-166
Pages: 166
Loading...
Loading...
Abstract

Abstract

Background

The TATA-box and TATA-variants are regulatory elements involved in the formation of a transcription initiation complex. Both have been conserved throughout evolution in a restricted region close to the Transcription Start Site (TSS). However, less than half of the genes in model organisms studied so far have been found to contain either one of these elements. Indeed different core-promoter elements are involved in the recruitment of the TATA-box-binding protein. Here we assessed the possibility of identifying novel functional motifs in plant genes, sharing the TATA-box topological constraints.

Results

We developed an ab-initio approach considering the preferential location of motifs relative to the TSS. We identified motifs observed at the TATA-box expected location and conserved in both Arabidopsis thaliana and Oryza sativa promoters. We identified TC-elements within non-TA-rich promoters 30 bases upstream of the TSS. As with the TATA-box and TATA-variant sequences, it was possible to construct a unique distance graph with the TC-element sequences. The structural and functional features of TC-element-containing genes were distinct from those of TATA-box- or TATA-variant-containing genes. Arabidopsis thaliana transcriptome analysis revealed that TATA-box-containing genes were generally those showing relatively high levels of expression and that TC-element-containing genes were generally those expressed in specific conditions.

Conclusions

Our observations suggest that the TC-elements might constitute a class of novel regulatory elements participating towards the complex modulation of gene expression in plants.

Keywords


Article
Large-scale transcriptome data reveals transcriptional activity of fission yeast LTR retrotransposons

Authors: Mourier Tobias --- Willerslev Eske
DOI: 10.1186/1471-2164-11-167
Pages: 167
Loading...
Loading...
Abstract

Abstract

Background

Retrotransposons are transposable elements that proliferate within eukaryotic genomes through a process involving reverse transcription. The numbers of retrotransposons within genomes and differences between closely related species may yield insight into the evolutionary history of the elements. Less is known about the ongoing dynamics of retrotransposons, as analysis of genome sequences will only reveal insertions of retrotransposons that are fixed - or near fixation - in the population or strain from which genetic material has been extracted for sequencing. One pre-requisite for retrotransposition is transcription of the elements. Given their intrinsic sequence redundancy, transcriptome-level analyses of transposable elements are scarce. We have used recently published transcriptome data from the fission yeast Schizosaccharomyces pombe to assess the ability to detect and describe transcriptional activity from Long Terminal Repeat (LTR) retrotransposons. LTR retrotransposons are normally flanked by two LTR sequences. However, the majority of LTR sequences in S. pombe exist as solitary LTRs, i.e. as single terminal repeat sequences not flanking a retrotransposon. Transcriptional activity was analysed for both full-length LTR retrotransposons and solitary LTRs.

Results

Two independent sets of transcriptome data reveal the presence of full-length, polyadenylated transcripts from LTR retrotransposons in S. pombe during growth phase in rich medium. The redundancy of retrotransposon sequences makes it difficult to assess which elements are transcriptionally active, but data strongly indicates that only a subset of the LTR retrotransposons contribute significantly to the detected transcription. A considerable level of reverse strand transcription is also detected. Equal levels of transcriptional activity are observed from both strands of solitary LTR sequences. Transcriptome data collected during meiosis suggests that transcription of solitary LTRs is correlated with the transcription of nearby protein-coding genes.

Conclusions

Presumably, the host organism negatively regulates proliferation of LTR retrotransposons. The finding of considerable transcriptional activity of retrotransposons suggests that part of this regulation is likely to take place at a post-transcriptional level. Alternatively, the transcriptional activity may signify a hitherto unrecognized activity level of retrotransposon proliferation. Our findings underline the usefulness of transcriptome data in elucidating dynamics in retrotransposon transcription.

Keywords


Article
Transcriptome survey of the anhydrobiotic tardigrade Milnesium tardigradum in comparison with Hypsibius dujardini and Richtersius coronifer

Authors: Mali Brahim --- Grohme Markus A --- Förster Frank --- Dandekar Thomas --- et al.
DOI: 10.1186/1471-2164-11-168
Pages: 168
Loading...
Loading...
Abstract

Abstract

Background

The phenomenon of desiccation tolerance, also called anhydrobiosis, involves the ability of an organism to survive the loss of almost all cellular water without sustaining irreversible damage. Although there are several physiological, morphological and ecological studies on tardigrades, only limited DNA sequence information is available. Therefore, we explored the transcriptome in the active and anhydrobiotic state of the tardigrade Milnesium tardigradum which has extraordinary tolerance to desiccation and freezing. In this study, we present the first overview of the transcriptome of M. tardigradum and its response to desiccation and discuss potential parallels to stress responses in other organisms.

Results

We sequenced a total of 9984 expressed sequence tags (ESTs) from two cDNA libraries from the eutardigrade M. tardigradum in its active and inactive, anhydrobiotic (tun) stage. Assembly of these ESTs resulted in 3283 putative unique transcripts, whereof ~50% showed significant sequence similarity to known genes. The resulting unigenes were functionally annotated using the Gene Ontology (GO) vocabulary. A GO term enrichment analysis revealed several GOs that were significantly underrepresented in the inactive stage. Furthermore we compared the putative unigenes of M. tardigradum with ESTs from two other eutardigrade species that are available from public sequence databases, namely Richtersius coronifer and Hypsibius dujardini. The processed sequences of the three tardigrade species revealed similar functional content and the M. tardigradum dataset contained additional sequences from tardigrades not present in the other two.

Conclusions

This study describes novel sequence data from the tardigrade M. tardigradum, which significantly contributes to the available tardigrade sequence data and will help to establish this extraordinary tardigrade as a model for studying anhydrobiosis. Functional comparison of active and anhydrobiotic tardigrades revealed a differential distribution of Gene Ontology terms associated with chromatin structure and the translation machinery, which are underrepresented in the inactive animals. These findings imply a widespread metabolic response of the animals on dehydration. The collective tardigrade transcriptome data will serve as a reference for further studies and support the identification and characterization of genes involved in the anhydrobiotic response.

Keywords


Article
A sequence motif enriched in regions bound by the Drosophila dosage compensation complex

Authors: Gallach Miguel --- Arnau Vicente --- Aldecoa Rodrigo --- Marín Ignacio
DOI: 10.1186/1471-2164-11-169
Pages: 169
Loading...
Loading...
Abstract

Abstract

Background

In Drosophila melanogaster, dosage compensation is mediated by the action of the dosage compensation complex (DCC). How the DCC recognizes the fly X chromosome is still poorly understood. Characteristic sequence signatures at all DCC binding sites have not hitherto been found.

Results

In this study, we compare the known binding sites of the DCC with oligonucleotide profiles that measure the specificity of the sequences of the D. melanogaster X chromosome. We show that the X chromosome regions bound by the DCC are enriched for a particular type of short, repetitive sequences. Their distribution suggests that these sequences contribute to chromosome recognition, the generation of DCC binding sites and/or the local spreading of the complex. Comparative data indicate that the same sequences may be involved in dosage compensation in other Drosophila species.

Conclusions

These results offer an explanation for the wild-type binding of the DCC along the Drosophila X chromosome, contribute to delineate the forces leading to the establishment of dosage compensation and suggest new experimental approaches to understand the precise biochemical features of the dosage compensation system.

Keywords


Article
The RST and PARP-like domain containing SRO protein family: analysis of protein structure, function and conservation in land plants

Authors: Jaspers Pinja --- Overmyer Kirk --- Wrzaczek Michael --- Vainonen Julia P --- et al.
DOI: 10.1186/1471-2164-11-170
Pages: 170
Loading...
Loading...
Abstract

Abstract

Background

The SROs (SIMILAR TO RCD-ONE) are a group of plant-specific proteins which have important functions in stress adaptation and development. They contain the catalytic core of the poly(ADP-ribose) polymerase (PARP) domain and a C-terminal RST (RCD-SRO-TAF4) domain. In addition to these domains, several, but not all, SROs contain an N-terminal WWE domain.

Results

SROs are present in all analyzed land plants and sequence analysis differentiates between two structurally distinct groups; cryptogams and monocots possess only group I SROs whereas eudicots also contain group II. Group I SROs possess an N-terminal WWE domain (PS50918) but the WWE domain is lacking in group II SROs. Group I domain structure is widely represented in organisms as distant as humans (for example, HsPARP11). We propose a unified nomenclature for the SRO family. The SROs are able to interact with transcription factors through the C-terminal RST domain but themselves are generally not regulated at the transcriptional level. The most conserved feature of the SROs is the catalytic core of the poly(ADP-ribose) polymerase (PS51059) domain. However, bioinformatic analysis of the SRO PARP domain fold-structure and biochemical assays of AtRCD1 suggested that SROs do not possess ADP-ribosyl transferase activity.

Conclusions

The SROs are a highly conserved family of plant specific proteins. Sequence analysis of the RST domain implicates a highly preserved protein structure in that region. This might have implications for functional conservation. We suggest that, despite the presence of the catalytic core of the PARP domain, the SROs do not possess ADP-ribosyl transferase activity. Nevertheless, the function of SROs is critical for plants and might be related to transcription factor regulation and complex formation.

Keywords


Article
Whole genome evaluation of horizontal transfers in the pathogenic fungus Aspergillus fumigatus

Authors: Mallet Ludovic V --- Becq Jennifer --- Deschavanne Patrick
DOI: 10.1186/1471-2164-11-171
Pages: 171
Loading...
Loading...
Abstract

Abstract

Background

Numerous cases of horizontal transfers (HTs) have been described for eukaryote genomes, but in contrast to prokaryote genomes, no whole genome evaluation of HTs has been carried out. This is mainly due to a lack of parametric methods specially designed to take the intrinsic heterogeneity of eukaryote genomes into account. We applied a simple and tested method based on local variations of genomic signatures to analyze the genome of the pathogenic fungus Aspergillus fumigatus.

Results

We detected 189 atypical regions containing 214 genes, accounting for about 1 Mb of DNA sequences. However, the fraction of atypical DNA detected was smaller than the average amount detected in the same conditions in prokaryote genomes (3.1% vs 5.6%). It appeared that about one third of these regions contained no annotated genes, a proportion far greater than in prokaryote genomes. When analyzing the origin of these HTs by comparing their signatures to a home made database of species signatures, 3 groups of donor species emerged: bacteria (40%), fungi (25%), and viruses (22%). It is to be noticed that though inter-domain exchanges are confirmed, we only put in evidence very few exchanges between eukaryotic kingdoms.

Conclusions

In conclusion, we demonstrated that HTs are not negligible in eukaryote genomes, bearing in mind that in our stringent conditions this amount is a floor value, though of a lesser extent than in prokaryote genomes. The biological mechanisms underlying those transfers remain to be elucidated as well as the biological functions of the transferred genes.

Keywords


Article
Myosin individualized: single nucleotide polymorphisms in energy transduction

Authors: Burghardt Thomas P --- Neff Kevin L --- Wieben Eric D --- Ajtai Katalin
DOI: 10.1186/1471-2164-11-172
Pages: 172
Loading...
Loading...
Abstract

Abstract

Background

Myosin performs ATP free energy transduction into mechanical work in the motor domain of the myosin heavy chain (MHC). Energy transduction is the definitive systemic feature of the myosin motor performed by coordinating in a time ordered sequence: ATP hydrolysis at the active site, actin affinity modulation at the actin binding site, and the lever-arm rotation of the power stroke. These functions are carried out by several conserved sub-domains within the motor domain. Single nucleotide polymorphisms (SNPs) affect the MHC sequence of many isoforms expressed in striated muscle, smooth muscle, and non-muscle tissue. The purpose of this work is to provide a rationale for using SNPs as a functional genomics tool to investigate structurefunction relationships in myosin. In particular, to discover SNP distribution over the conserved sub-domains and surmise what it implies about sub-domain stability and criticality in the energy transduction mechanism.

Results

An automated routine identifying human nonsynonymous SNP amino acid missense substitutions for any MHC gene mined the NCBI SNP data base. The routine tested 22 MHC genes coding muscle and non-muscle isoforms and identified 89 missense mutation positions in the motor domain with 10 already implicated in heart disease and another 8 lacking sequence homology with a skeletal MHC isoform for which a crystallographic model is available. The remaining 71 SNP substitutions were found to be distributed over MHC with 22 falling outside identified functional sub-domains and 49 in or very near to myosin sub-domains assigned specific crucial functions in energy transduction. The latter includes the active site, the actin binding site, the rigid lever-arm, and regions facilitating their communication. Most MHC isoforms contained SNPs somewhere in the motor domain.

Conclusions

Several functional-crucial sub-domains are infiltrated by a large number of SNP substitution sites suggesting these domains are engineered by evolution to be too-robust to be disturbed by otherwise intrusive sequence changes. Two functional sub-domains are SNP-free or relatively SNP-deficient but contain many disease implicated mutants. These sub-domains are apparently highly sensitive to any missense substitution suggesting they have failed to evolve a robust sequence paradigm for performing their function.

Keywords


Article
Expression profile of cuticular genes of silkworm, Bombyx mori

Authors: Liang Jiubo --- Zhang Liang --- Xiang Zhonghuai --- He Ningjia
DOI: 10.1186/1471-2164-11-173
Pages: 173
Loading...
Loading...
Abstract

Abstract

Background

Insect cuticle plays essential roles in many physiological functions. During molting and metamorphosis tremendous changes occur in silkworm cuticle where multiple proteins exist and genes encoding them constitute about 1.5% of all Bombyx mori genes.

Results

In an effort to determine their expression profiles, a microarray-based investigation was carried out using mRNA collected from larvae to pupae. The results showed that a total of 6676 genes involved in various functions and physiological pathways were activated. The vast majority (93%) of cuticular protein genes were expressed in selected stages with varying expression patterns. There was no correlation between expression patterns and the presence of conserved motifs. Twenty-six RR genes distributed in chromosome 22 were co-expressed at the larval and wandering stages. The 2 kb upstream regions of these genes were further analyzed and three putative elements were identified.

Conclusions

Data from the present study provide, for the first time, a comprehensive expression profile of genes in silkworm epidermal tissues and evidence that putative elements exist to allow massive production of mRNAs from specific cuticular protein genes.

Keywords


Article
QTL mapping for Mediterranean corn borer resistance in European flint germplasm using recombinant inbred lines

Authors: Ordas Bernardo --- Malvar Rosa A --- Santiago Rogelio --- Butron Ana
DOI: 10.1186/1471-2164-11-174
Pages: 174
Loading...
Loading...
Abstract

Abstract

Background

Ostrinia nubilalis (ECB) and Sesamia nonagrioides (MCB) are two maize stem borers which cause important losses in temperate maize production, but QTL analyses for corn borer resistance were mostly restricted to ECB resistance and maize materials genetically related (mapping populations derived from B73). Therefore, the objective of this work was to identify and characterize QTLs for MCB resistance and agronomic traits in a RILs population derived from European flint inbreds.

Results

Three QTLs were detected for stalk tunnel length at bins 1.02, 3.05 and 8.05 which explained 7.5% of the RILs genotypic variance. The QTL at bin 3.05 was co-located to a QTL related to plant height and grain humidity and the QTL at bin 8.05 was located near a QTL related to yield.

Conclusions

Our results, when compared with results from other authors, suggest the presence of genes involved in cell wall biosynthesis or fortification with effects on resistance to different corn borer species and digestibility for dairy cattle. Particularly, we proposed five candidate genes related to cell wall characteristics which could explain the QTL for stalk tunnelling in the region 3.05. However, the small proportion of genotypic variance explained by the QTLs suggest that there are also many other genes of small effect regulating MCB resistance and we conclude that MAS seems not promising for this trait. Two QTLs detected for stalk tunnelling overlap with QTLs for agronomic traits, indicating the presence of pleitropism or linkage between genes affecting resistance and agronomic traits.

Keywords


Article
Systematic analysis of off-target effects in an RNAi screen reveals microRNAs affecting sensitivity to TRAIL-induced apoptosis

Authors: Sudbery Ian --- Enright Anton J --- Fraser Andrew G --- Dunham Ian
DOI: 10.1186/1471-2164-11-175
Pages: 175
Loading...
Loading...
Abstract

Abstract

Background

RNA inhibition by siRNAs is a frequently used approach to identify genes required for specific biological processes. However RNAi screening using siRNAs is hampered by non-specific or off target effects of the siRNAs, making it difficult to separate genuine hits from false positives. It is thought that many of the off-target effects seen in RNAi experiments are due to siRNAs acting as microRNAs (miRNAs), causing a reduction in gene expression of unintended targets via matches to the 6 or 7 nt 'seed' sequence. We have conducted a careful examination of off-target effects during an siRNA screen for novel regulators of the TRAIL apoptosis induction pathway(s).

Results

We identified 3 hexamers and 3 heptamer seed sequences that appeared multiple times in the top twenty siRNAs in the TRAIL apoptosis screen. Using a novel statistical enrichment approach, we systematically identified a further 17 hexamer and 13 heptamer seed sequences enriched in high scoring siRNAs. The presence of one of these seeds sequences (which could explain 6 of 8 confirmed off-target effects) is sufficient to elicit a phenotype. Three of these seed sequences appear in the human miRNAs miR-26a, miR-145 and miR-384. Transfection of mimics of these miRNAs protects several cell types from TRAIL-induced cell death.

Conclusions

We have demonstrated a role for miR-26a, miR-145 and miR-26a in TRAIL-induced apoptosis. Further these results show that RNAi screening enriches for siRNAs with relevant off-target effects. Some of these effects can be identified by the over-representation of certain seed sequences in high-scoring siRNAs and we demonstrate the usefulness of such systematic analysis of enriched seed sequences.

Keywords


Article
The embryonic genes Dkk3, Hoxd8, Hoxd9 and Tbx1 identify muscle types in a diet-independent and fiber-type unrelated way

Authors: de Wilde Janneke --- Hulshof Martijn FM --- Boekschoten Mark V --- de Groot Philip --- et al.
DOI: 10.1186/1471-2164-11-176
Pages: 176
Loading...
Loading...
Abstract

Abstract

Background

The mouse skeletal muscle is composed of four distinct fiber types that differ in contractile function, number of mitochondria and metabolism. Every muscle type has a specific composition and distribution of the four fiber types. To find novel genes involved in specifying muscle types, we used microarray analysis to compare the gastrocnemius with the quadriceps from mice fed a low fat diet (LFD) or high fat diet (HFD) for 8 weeks. Additional qPCR analysis were performed in the gastrocnemius, quadriceps and soleus muscle from mice fed an LFD or HFD for 20 weeks.

Results

In mice fed the 8-week LFD 162 genes were differentially expressed in the gastrocnemius vs. the quadriceps. Genes with the strongest differences in expression were markers for oxidative fiber types (e.g. Tnni1) and genes which are known to be involved in embryogenesis (Dkk3, Hoxd8,Hoxd9 and Tbx1). Also Dkk2, Hoxa5, Hoxa10, Hoxc9, Hoxc10, Hoxc6 and Tbx15 were detectably, but not differentially expressed in adult muscle tissue. Expression of differentially expressed genes was not influenced by an 8-week or 20-week HFD. Comparing gastrocnemius, quadriceps and soleus, expression of Hoxd8 and Hoxd9 was not related with expression of markers for the four different fiber types. We found that the expression of both Hoxd8 and Hoxd9 was much higher in the gastrocnemius than in the quadriceps or soleus, whereas the expression of Dkk3 was high in quadriceps, but low in both gastrocnemius and soleus. Finally, expression of Tbx1 was high in quadriceps, intermediate in soleus and low in gastrocnemius.

Conclusions

We found that genes from the Dkk family, Hox family and Tbx family are detectably expressed in adult mouse muscle. Interestingly, expression of Dkk3, Hoxd8, Hoxd9 and Tbx1 was highly different between gastrocnemius, quadriceps and soleus. In fact, every muscle type showed a unique combination of expression of these four genes which was not influenced by diet. Altogether, we conclude that genes important for embryogenesis identify mouse muscle types in a diet-independent and fiber type-unrelated manner.

Keywords


Article
Phylogenetic analysis of fungal ABC transporters

Authors: Kovalchuk Andriy --- Driessen Arnold JM
DOI: 10.1186/1471-2164-11-177
Pages: 177
Loading...
Loading...
Abstract

Abstract

Background

The superfamily of ABC proteins is among the largest known in nature. Its members are mainly, but not exclusively, involved in the transport of a broad range of substrates across biological membranes. Many contribute to multidrug resistance in microbial pathogens and cancer cells. The diversity of ABC proteins in fungi is comparable with those in multicellular animals, but so far fungal ABC proteins have barely been studied.

Results

We performed a phylogenetic analysis of the ABC proteins extracted from the genomes of 27 fungal species from 18 orders representing 5 fungal phyla thereby covering the most important groups. Our analysis demonstrated that some of the subfamilies of ABC proteins remained highly conserved in fungi, while others have undergone a remarkable group-specific diversification. Members of the various fungal phyla also differed significantly in the number of ABC proteins found in their genomes, which is especially reduced in the yeast S. cerevisiae and S. pombe.

Conclusions

Data obtained during our analysis should contribute to a better understanding of the diversity of the fungal ABC proteins and provide important clues about their possible biological functions.

Keywords


Article
Co-expression of neighbouring genes in Arabidopsis: separating chromatin effects from direct interactions

Authors: Chen Wei-Hua --- de Meaux Juliette --- Lercher Martin J
DOI: 10.1186/1471-2164-11-178
Pages: 178
Loading...
Loading...
Abstract

Abstract

Background

In all eukaryotic species examined, genes that are chromosomal neighbours are more similar in their expression than random gene pairs. Currently, it is still unclear how much of this local co-expression is caused by direct transcriptional interactions, and how much is due to shared chromatin environments.

Results

We analysed neighbouring genes in Arabidopsis thaliana. At large intergenic distances (>400 bp), divergently and convergently transcribed gene pairs show very similar levels of co-expression, mediated most likely by shared chromatin environments. At gene distances below 400 bp, co-expression is strongly enhanced only for divergently transcribed gene pairs, indicating bi-directional transcription from a single promoter. Conversely, co-expression is suppressed for short convergently or uni-directionally transcribed pairs. This suppression points to transcriptional interference concentrated at the 3' end, e.g., in the context of transcription termination.

Conclusions

Classifying linked gene pairs by their orientation, we are able to partially tease apart the different levels of regional expression modulation. (i) Regional chromatin characteristics modulate the accessibility for regulation and transcription, regardless of gene orientation; the strength of this chromatin effect can be assessed from divergently or convergently transcribed distant neighbours. (ii) Shared promoter regions up to 400 bp in length enhance the co-expression of close bi-directional neighbours. (iii) Transcriptional interference of close neighbours is concentrated at the 3' ends of genes, and reduces co-expression on average by 40%.

Keywords


Article
Characterization of the ovine ribosomal protein SA gene and its pseudogenes

Authors: Van den Broeke Alice --- Van Poucke Mario --- Marcos-Carcavilla Ane --- Hugot Karine --- et al.
DOI: 10.1186/1471-2164-11-179
Pages: 179
Loading...
Loading...
Abstract

Abstract

Background

The ribosomal protein SA (RPSA), previously named 37-kDa laminin receptor precursor/67-kDa laminin receptor (LRP/LR) is a multifunctional protein that plays a role in a number of pathological processes, such as cancer and prion diseases. In all investigated species, RPSA is a member of a multicopy gene family consisting of one full length functional gene and several pseudogenes. Therefore, for studies on RPSA related pathways/pathologies, it is important to characterize the whole family and to address the possible function of the other RPSA family members. The present work aims at deciphering the RPSA family in sheep.

Results

In addition to the full length functional ovine RPSA gene, 11 other members of this multicopy gene family, all processed pseudogenes, were identified. Comparison between the RPSA transcript and these pseudogenes shows a large variety in sequence identities ranging from 99% to 74%. Only one of the 11 pseudogenes, i.e. RPSAP7, shares the same open reading frame (ORF) of 295 amino acids with the RPSA gene, differing in only one amino acid. All members of the RPSA family were annotated by comparative mapping and fluorescence in situ hybridization (FISH) localization. Transcription was investigated in the cerebrum, cerebellum, spleen, muscle, lymph node, duodenum and blood, and transcripts were detected for 6 of the 11 pseudogenes in some of these tissues.

Conclusions

In the present work we have characterized the ovine RPSA family. Our results have revealed the existence of 11 ovine RPSA pseudogenes and provide new data on their structure and sequence. Such information will facilitate molecular studies of the functional RPSA gene taking into account the existence of these pseudogenes in the design of experiments. It remains to be investigated if the transcribed members are functional as regulatory non-coding RNA or as functional proteins.

Keywords


Article
Transcriptome sequencing in an ecologically important tree species: assembly, annotation, and marker discovery

Authors: Parchman Thomas L --- Geist Katherine S --- Grahnen Johan A --- Benkman Craig W --- et al.
DOI: 10.1186/1471-2164-11-180
Pages: 180
Loading...
Loading...
Abstract

Abstract

Background

Massively parallel sequencing of cDNA is now an efficient route for generating enormous sequence collections that represent expressed genes. This approach provides a valuable starting point for characterizing functional genetic variation in non-model organisms, especially where whole genome sequencing efforts are currently cost and time prohibitive. The large and complex genomes of pines (Pinus spp.) have hindered the development of genomic resources, despite the ecological and economical importance of the group. While most genomic studies have focused on a single species (P. taeda), genomic level resources for other pines are insufficiently developed to facilitate ecological genomic research. Lodgepole pine (P. contorta) is an ecologically important foundation species of montane forest ecosystems and exhibits substantial adaptive variation across its range in western North America. Here we describe a sequencing study of expressed genes from P. contorta, including their assembly and annotation, and their potential for molecular marker development to support population and association genetic studies.

Results

We obtained 586,732 sequencing reads from a 454 GS XLR70 Titanium pyrosequencer (mean length: 306 base pairs). A combination of reference-based and de novo assemblies yielded 63,657 contigs, with 239,793 reads remaining as singletons. Based on sequence similarity with known proteins, these sequences represent approximately 17,000 unique genes, many of which are well covered by contig sequences. This sequence collection also included a surprisingly large number of retrotransposon sequences, suggesting that they are highly transcriptionally active in the tissues we sampled. We located and characterized thousands of simple sequence repeats and single nucleotide polymorphisms as potential molecular markers in our assembled and annotated sequences. High quality PCR primers were designed for a substantial number of the SSR loci, and a large number of these were amplified successfully in initial screening.

Conclusions

This sequence collection represents a major genomic resource for P. contorta, and the large number of genetic markers characterized should contribute to future research in this and other pines. Our results illustrate the utility of next generation sequencing as a basis for marker development and population genomics in non-model species.

Keywords


Article
Legionella pneumophila pangenome reveals strain-specific virulence factors

Authors: D'Auria Giuseppe --- Jiménez-Hernández Nuria --- Peris-Bondia Francesc --- Moya Andrés --- et al.
DOI: 10.1186/1471-2164-11-181
Pages: 181
Loading...
Loading...
Abstract

Abstract

Background

Legionella pneumophila subsp. pneumophila is a gram-negative γ-Proteobacterium and the causative agent of Legionnaires' disease, a form of epidemic pneumonia. It has a water-related life cycle. In industrialized cities L. pneumophila is commonly encountered in refrigeration towers and water pipes. Infection is always via infected aerosols to humans. Although many efforts have been made to eradicate Legionella from buildings, it still contaminates the water systems. The town of Alcoy (Valencian Region, Spain) has had recurrent outbreaks since 1999. The strain "Alcoy 2300/99" is a particularly persistent and recurrent strain that was isolated during one of the most significant outbreaks between the years 1999-2000.

Results

We have sequenced the genome of the particularly persistent L. pneumophila strain Alcoy 2300/99 and have compared it with four previously sequenced strains known as Philadelphia (USA), Lens (France), Paris (France) and Corby (England).

Pangenome analysis facilitated the identification of strain-specific features, as well as some that are shared by two or more strains. We identified: (1) three islands related to anti-drug resistance systems; (2) a system for transport and secretion of heavy metals; (3) three systems related to DNA transfer; (4) two CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) systems, known to provide resistance against phage infections, one similar in the Lens and Alcoy strains, and another specific to the Paris strain; and (5) seven islands of phage-related proteins, five of which seem to be strain-specific and two shared.

Conclusions

The dispensable genome disclosed by the pangenomic analysis seems to be a reservoir of new traits that have mainly been acquired by horizontal gene transfer and could confer evolutionary advantages over strains lacking them.

Keywords


Article
Chromosomal evolution in the plant family Solanaceae

Authors: Wu Feinan --- Tanksley Steven D
DOI: 10.1186/1471-2164-11-182
Pages: 182
Loading...
Loading...
Abstract

Abstract

Background

Over the past decades, extensive comparative mapping research has been performed in the plant family Solanaceae. The recent identification of a large set of single-copy conserved orthologous (COSII) markers has greatly accelerated comparative mapping studies among major solanaceous species including tomato, potato, eggplant, pepper and diploid Nicotiana species (as well as tetraploid tobacco). The large amount of comparative data now available for these species provides the opportunity to describe the overall patterns of chromosomal evolution in this important plant family. The results of this investigation are described herein.

Results

We combined data from multiple COSII studies, and other comparative mapping studies performed in tomato, potato, eggplant, pepper and diploid Nicotiana species, to deduce the features and outcomes of chromosomal evolution in the Solanaceae over the past 30 million years. This includes estimating the rates and timing of chromosomal changes (inversions and translocations) as well as deducing the age of ancestral progenitor species and predicting their genome configurations.

Conclusions

The Solanaceae has experienced chromosomal changes at a modest rate compared with other families and the rates are likely conserved across different lineages of the family. Chromosomal inversions occur at a consistently higher rate than do translocations. Further, we find evidences for non-random positioning of the chromosomal rearrangement breakpoints. This finding is consistent with the similar finding in mammals, where hot spots for chromosomal breakages have apparently played a significant role in shaping genome evolution. Finally, by utilizing multiple genome comparisons we were able to reconstruct the most likely genome configuration for a number of now-extinct progenitor species that gave rise to the extant solanaceous species used in this research. The results from this study provide the first broad overview of chromosomal evolution in the family Solanaceae, and one of the most detailed thus far for any family of plants.

Keywords


Article
Complementary transcriptomic, lipidomic, and targeted functional genetic analyses in cultured Drosophila cells highlight the role of glycerophospholipid metabolism in Flock House virus RNA replication

Authors: Castorena Kathryn M --- Stapleford Kenneth A --- Miller David J
DOI: 10.1186/1471-2164-11-183
Pages: 183
Loading...
Loading...
Abstract

Abstract

Background

Cellular membranes are crucial host components utilized by positive-strand RNA viruses for replication of their genomes. Published studies have suggested that the synthesis and distribution of membrane lipids are particularly important for the assembly and function of positive-strand RNA virus replication complexes. However, the impact of specific lipid metabolism pathways in this process have not been well defined, nor have potential changes in lipid expression associated with positive-strand RNA virus replication been examined in detail.

Results

In this study we used parallel and complementary global and targeted approaches to examine the impact of lipid metabolism on the replication of the well-studied model alphanodavirus Flock House virus (FHV). We found that FHV RNA replication in cultured Drosophila S2 cells stimulated the transcriptional upregulation of several lipid metabolism genes, and was also associated with increased phosphatidylcholine accumulation with preferential increases in lipid molecules with longer and unsaturated acyl chains. Furthermore, targeted RNA interference-mediated downregulation of candidate glycerophospholipid metabolism genes revealed a functional role of several genes in virus replication. In particular, we found that downregulation of Cct1 or Cct2, which encode essential enzymes for phosphatidylcholine biosynthesis, suppressed FHV RNA replication.

Conclusion

These results indicate that glycerophospholipid metabolism, and in particular phosphatidylcholine biosynthesis, plays an important role in FHV RNA replication. Furthermore, they provide a framework in which to further explore the impact of specific steps in lipid metabolism on FHV replication, and potentially identify novel cellular targets for the development of drugs to inhibit positive-strand RNA viruses.

Keywords


Article
Synteny mapping between common bean and soybean reveals extensive blocks of shared loci

Authors: McClean Phillip E --- Mamidi Sujan --- McConnell Melody --- Chikara Shireen --- et al.
DOI: 10.1186/1471-2164-11-184
Pages: 184
Loading...
Loading...
Abstract

Abstract

Background

Understanding syntentic relationship between two species is critical to assessing the potential for comparative genomic analysis. Common bean (Phaseolus vulgaris L.) and soybean (Glycine max L.), the two most important members of the Phaseoleae legumes, appear to have a diploid and polyploidy recent past, respectively. Determining the syntentic relationship between these two species will allow researchers to leverage not only genomic resources but also genetic data for important agronomic traits to improve both of these species.

Results

Genetically-positioned transcript loci of common bean were mapped relative to the recent soybean 1.01 pseudochromosome assembly. In nearly every case, each common bean locus mapped to two loci in soybean, a result consistent with the duplicate polyploidy history of soybean. Blocks of synteny averaging 32 cM in common bean and 4.9 Mb in soybean were observed for all 11 common bean linkage groups, and these blocks mapped to all 20 soybean pseudochromosomes. The median physical-to-genetic distance ratio in common bean (based on soybean physical distances) was ~120 kb/cM. ~15,000 common bean sequences (primarily EST contigs and EST singletons) were electronically positioned onto the common bean map using the shared syntentic blocks as references points.

Conclusion

The collected evidence from this mapping strongly supports the duplicate history of soybean. It further provides evidence that the soybean genome was fractionated and reassembled at some point following the duplication event. These well mapped syntentic relationships between common bean and soybean will enable researchers to target specific genomic regions to discover genes or loci that affect phenotypic expression in both species.

Keywords


Article
wDBTF: an integrated database resource for studying wheat transcription factor families

Authors: Romeuf Isabelle --- Tessier Dominique --- Dardevet Mireille --- Branlard Gérard --- et al.
DOI: 10.1186/1471-2164-11-185
Pages: 185
Loading...
Loading...
Abstract

Abstract

Background

Transcription factors (TFs) regulate gene expression by interacting with promoters of their target genes and are classified into families based on their DNA-binding domains. Genes coding for TFs have been identified in the sequences of model plant genomes. The rice (Oryza sativa spp. japonica) genome contains 2,384 TF gene models, which represent the mRNA transcript of a locus, classed into 63 families.

Results

We have created an extensive list of wheat (Triticum aestivum L) TF sequences based on sequence homology with rice TFs identified and classified in the Database of Rice Transcription Factors (DRTF). We have identified 7,112 wheat sequences (contigs and singletons) from a dataset of 1,033,960 expressed sequence tag and mRNA (ET) sequences available. This number is about three times the number of TFs in rice so proportionally is very similar if allowance is made for the hexaploidy of wheat. Of these sequences 3,820 encode gene products with a DNA-binding domain and thus were confirmed as potential regulators. These 3,820 sequences were classified into 40 families and 84 subfamilies and some members defined orphan families. The results were compiled in the Database of Wheat Transcription Factor (wDBTF), an inventory available on the web http://wwwappli.nantes.inra.fr:8180/wDBFT/. For each accession, a link to its library source and its Affymetrix identification number is provided. The positions of Pfam (protein family database) motifs were given when known.

Conclusions

wDBTF collates 3,820 wheat TF sequences validated by the presence of a DNA-binding domain out of 7,112 potential TF sequences identified from publicly available gene expression data. We also incorporated in silico expression data on these TFs into the database. Thus this database provides a major resource for systematic studies of TF families and their expression in wheat as illustrated here in a study of DOF family members expressed during seed development.

Keywords


Article
Differential expression of genes in salivary glands of male Rhipicephalus (Boophilus)microplus in response to infection with Anaplasma marginale

Authors: Zivkovic Zorica --- Esteves Eliane --- Almazán Consuelo --- Daffre Sirlei --- et al.
DOI: 10.1186/1471-2164-11-186
Pages: 186
Loading...
Loading...
Abstract

Abstract

Background

Bovine anaplasmosis, caused by the rickettsial tick-borne pathogen Anaplasma marginale (Rickettsiales: Anaplasmataceae), is vectored by Rhipicephalus (Boophilus)microplus in many tropical and subtropical regions of the world. A. marginale undergoes a complex developmental cycle in ticks which results in infection of salivary glands from where the pathogen is transmitted to cattle. In previous studies, we reported modification of gene expression in Dermacentor variabilis and cultured Ixodes scapularis tick cells in response to infection with A. marginale. In these studies, we extended these findings by use of a functional genomics approach to identify genes differentially expressed in R. microplus male salivary glands in response to A. marginale infection. Additionally, a R. microplus-derived cell line, BME26, was used for the first time to also study tick cell gene expression in response to A. marginale infection.

Results

Suppression subtractive hybridization libraries were constructed from infected and uninfected ticks and used to identify genes differentially expressed in male R. microplus salivary glands infected with A. marginale. A total of 279 ESTs were identified as candidate differentially expressed genes. Of these, five genes encoding for putative histamine-binding protein (22Hbp), von Willebrand factor (94Will), flagelliform silk protein (100Silk), Kunitz-like protease inhibitor precursor (108Kunz) and proline-rich protein BstNI subfamily 3 precursor (7BstNI3) were confirmed by real-time RT-PCR to be down-regulated in tick salivary glands infected with A. marginale. The impact of selected tick genes on A. marginale infections in tick salivary glands and BME26 cells was characterized by RNA interference. Silencing of the gene encoding for putative flagelliform silk protein (100Silk) resulted in reduced A. marginale infection in both tick salivary glands and cultured BME26 cells, while silencing of the gene encoding for subolesin (4D8) significantly reduced infection only in cultured BME26 cells. The knockdown of the gene encoding for putative metallothionein (93 Meth), significantly up-regulated in infected cultured BME26 cells, resulted in higher A. marginale infection levels in tick cells.

Conclusions

Characterization of differential gene expression in salivary glands of R. microplus in response to A. marginale infection expands our understanding of the molecular mechanisms at the tick-pathogen interface. Functional studies suggested that differentially expressed genes encoding for subolesin, putative von Willebrand factor and flagelliform silk protein could play a role in A. marginale infection and multiplication in ticks. These tick genes found to be functionally relevant for tick-pathogen interactions will likely be candidates for development of vaccines designed for control of both ticks and tick-borne pathogens.

Keywords


Article
Antimicrobial peptide-like genes in Nasonia vitripennis: a genomic perspective

Authors: Tian Caihuan --- Gao Bin --- Fang Qi --- Ye Gongyin --- et al.
DOI: 10.1186/1471-2164-11-187
Pages: 187
Loading...
Loading...
Abstract

Abstract

Background

Antimicrobial peptides (AMPs) are an essential component of innate immunity which can rapidly respond to diverse microbial pathogens. Insects, as a rich source of AMPs, attract great attention of scientists in both understanding of the basic biology of the immune system and searching molecular templates for anti-infective drug design. Despite a large number of AMPs have been identified from different insect species, little information in terms of these peptides is available from parasitic insects.

Results

By using integrated computational approaches to systemically mining the Hymenopteran parasitic wasp Nasonia vitripennis genome, we establish the first AMP repertoire whose members exhibit extensive sequence and structural diversity and can be distinguished into multiple molecular types, including insect and fungal defensin-like peptides (DLPs) with the cysteine-stabilized α-helical and β-sheet (CSαβ) fold; Pro- or Gly-rich abaecins and hymenoptaecins; horseshoe crab tachystatin-type AMPs with the inhibitor cystine knot (ICK) fold; and a linear α-helical peptide. Inducible expression pattern of seven N. vitripennis AMP genes were verified, and two representative peptides were synthesized and functionally identified to be antibacterial. In comparison with Apis mellifera (Hymenoptera) and several non-Hymenopteran model insects, N. vitripennis has evolved a complex antimicrobial immune system with more genes and larger protein precursors. Three classical strategies that are likely responsible for the complexity increase have been recognized: 1) Gene duplication; 2) Exon duplication; and 3) Exon-shuffling.

Conclusion

The present study established the N. vitripennis peptidome associated with antimicrobial immunity by using a combined computational and experimental strategy. As the first AMP repertoire of a parasitic wasp, our results offer a basic platform for further studying the immunological and evolutionary significances of these newly discovered AMP-like genes in this class of insects.

Keywords


Article
Single feature polymorphism (SFP)-based selective sweep identification and association mapping of growth-related metabolic traits in Arabidopsis thaliana