ISSN/EISSN: 14712164
Subject:
Genetics
Publisher: BioMed Central
Country: United Kingdom
Language: English
Start year 2000
Publication fee:
Yes
--- Further Information
Journal homepage at publisher site

Loading...
Loading...
Abstract
Background
Methylotrophic yeast species (e.g.
In this study, changes in the transcriptional profiles during adaptation of
Results
Two hours after the shift of cells from glucose to methanol nearly 20% (1184 genes) of the approximately 6000 annotated
A moderate increase in transcriptional levels (up to 4-fold) was observed for several
Approximately 20% (1246 genes) of the genes were downregulated. These included glycolytic genes as well as genes involved in transcription and translation.
Conclusion
Transcriptional profiling of

Loading...
Loading...
Abstract
Background
Results
The genome of the type strain of
Conclusions
The

Loading...
Loading...
Abstract
Background
In most species of mammals, the
Results
In this paper we focused our attention on the analysis of a large number of sheep TR β-chain transcripts derived from four different lymphoid tissues of three diverse sheep breed animals to certify the use and frequency of the three gene clusters in the β-chain repertoire. As the sheep
Conclusion
All together our data demonstrate that a disparate set of molecular mechanisms operate to perform a diversified repertoire in the sheep β-chain and this could confer some special biological properties to the corresponding αβ T cells in the ruminant lineage.

Loading...
Loading...
Abstract
Background
The newt
Results
A cDNA library of 100,000 cDNA clones was generated from newt hearts 14 days after ventricular injury. Sequencing of 11520 cDNA clones resulted in 2894 assembled contigs. BLAST searches revealed 1695 sequences with potential homology to sequences from the NCBI database. BLAST searches to TrEMBL and Swiss-Prot databases assigned 1116 proteins to Gene Ontology terms. We also identified a relatively large set of 174 ORFs, which are likely to be unique for urodele amphibians. Expression analysis of newt-zebrafish homologues confirmed the deregulation of selected genes during heart regeneration. Sequences, BLAST results and GO annotations were visualized in a relational web based database followed by grouping of identified proteins into clusters of GO Terms. Comparison of data from regenerating zebrafish hearts identified biological processes, which were uniformly overrepresented during cardiac regeneration in newt and zebrafish.
Conclusion
We concluded that heart regeneration in newts and zebrafish led to the activation of similar sets of genes, which suggests that heart regeneration in both species might follow similar principles. The design of the newly established newt EST database allows identification of molecular pathways important for heart regeneration.

Loading...
Loading...
Abstract
Background
Studies using microarray analysis of colorectal cancer have been generally beleaguered by the lack of a normal cell population of the same lineage as the tumor cell. One of the main objectives of this study was to generate a reference gene expression data set for normal colonic epithelium which can be used in comparisons with diseased tissues, as well as to provide a dataset that could be used as a baseline for studies in alternative splicing.
Results
We present a dependable expression reference data set for non-neoplastic colonic epithelial cells. An enriched population of fresh colon epithelial cells were obtained from non-neoplastic, colectomy specimens and analyzed using Affymetrix GeneChip EXON 1.0 ST arrays. For demonstration purposes, we have compared the data derived from these cells to a publically available set of tumor and matched normal colon data. This analysis allowed an assessment of global gene expression alterations and demonstrated that adjacent normal tissues, with a high degree of cellular heterogeneity, are not always representative of normal cells for comparison to tumors which arise from the colon epithelium. We also examined alternative splicing events in tumors compared to normal colon epithelial cells.
Conclusions
The findings from this study represent the first comprehensive expression profile for non-neoplastic colonic epithelial cells reported. Our analysis of splice variants illustrate that this is a very labor intensive procedure, requiring vigilant examination of the data. It is projected that the contribution of this set of data derived from pure colonic epithelial cells will enhance studies in colon-related disease and offer a vital baseline for studies aimed at elucidating the mechanisms of alternative splicing.

Loading...
Loading...
Abstract
Background
X chromosomes are subject to dosage compensation in
Results
We find that the general chromatin structure of female X chromosomes is distinct from autosomes. Additionally, specific histone marks associated with dosage compensation and active chromatin marks on the male X chromosome are also enriched on the X chromosomes of females, albeit to a lesser degree.
Conclusions
Our data indicate that X chromatin structure is fundamentally different from autosome structure in both sexes. We suggest that the differences between the X chromosomes and autosomes in females are a consequence of mechanisms that have evolved to ensure sufficient X chromosome expression in the soma of males.

Loading...
Loading...
Abstract
Background
RNAi screens via pooled short hairpin RNAs (shRNAs) have recently become a powerful tool for the identification of essential genes in mammalian cells. In the past years, several pooled large-scale shRNA screens have identified a variety of genes involved in cancer cell proliferation. All of those studies employed microarray analysis, utilizing either the shRNA's half hairpin sequence or an additional shRNA-associated 60 nt barcode sequence as a molecular tag. Here we describe a novel method to decode pooled RNAi screens, namely barcode tiling array analysis, and demonstrate how this approach can be used to precisely quantify the abundance of individual shRNAs from a pool.
Results
We synthesized DNA microarrays with six overlapping 25 nt long tiling probes complementary to each unique 60 nt molecular barcode sequence associated with every shRNA expression construct. By analyzing dilution series of expression constructs we show how our approach allows quantification of shRNA abundance from a pool and how it clearly outperforms the commonly used analysis via the shRNA's half hairpin sequences. We further demonstrate how barcode tiling arrays can be used to predict anti-proliferative effects of individual shRNAs from pooled negative selection screens. Out of a pool of 305 shRNAs, we identified 28 candidate shRNAs to fully or partially impair the viability of the breast carcinoma cell line MDA-MB-231. Individual validation of a subset of eleven shRNA expression constructs with potential inhibitory, as well as non-inhibitory, effects on the cell line proliferation provides further evidence for the accuracy of the barcode tiling approach.
Conclusions
In summary, we present an improved method for the rapid, quantitative and statistically robust analysis of pooled RNAi screens. Our experimental approach, coupled with commercially available lentiviral vector shRNA libraries, has the potential to greatly facilitate the discovery of putative targets for cancer therapy as well as sensitizers of drug toxicity.

Loading...
Loading...
Abstract
Background
Though a variety of linkage disequilibrium tests have recently been introduced to measure the signal of recent positive selection, the statistical properties of the various methods have not been directly compared. While most applications of these tests have suggested that positive selection has played an important role in recent human history, the results of these tests have varied dramatically.
Results
Here, we evaluate the performance of three statistics designed to detect incomplete selective sweeps, LRH and iHS, and ALnLH. To analyze the properties of these tests, we introduce a new computational method that can model complex population histories with migration and changing population sizes to simulate gene trees influenced by recent positive selection. We demonstrate that iHS performs substantially better than the other two statistics, with power of up to 0.74 at the 0.01 level for the variation best suited for full genome scans and a power of over 0.8 at the 0.01 level for the variation best suited for candidate gene tests. The performance of the iHS statistic was robust to complex demographic histories and variable recombination rates. Genome scans involving the other two statistics suffer from low power and high false positive rates, with false discovery rates of up to 0.96 for ALnLH. The difference in performance between iHS and ALnLH, did not result from the properties of the statistics, but instead from the different methods for mitigating the multiple comparison problem inherent in full genome scans.
Conclusions
We introduce a new method for simulating genealogies influenced by positive selection with complex demographic scenarios. In a power analysis based on this method, iHS outperformed LRH and ALnLH in detecting incomplete selective sweeps. We also show that the single-site iHS statistic is more powerful in a candidate gene test than the multi-site statistic, but that the multi-site statistic maintains a low false discovery rate with only a minor loss of power when applied to a scan of the entire genome. Our results highlight the need for careful consideration of multiple comparison problems when evaluating and interpreting the results of full genome scans for positive selection.

Loading...
Loading...
Abstract
Background
Non-genotoxic carcinogens are notoriously difficult to identify as they do not damage DNA directly and have diverse modes of action, necessitating long term in vivo studies. The early effects of the classic rodent non-genotoxic hepatocarcinogen phenobarbital have been investigated in the Fisher rat using a combination of metabolomics and transcriptomics, to investige early stage mechanistic changes that are predictive of longer term pathology.
Results
Liver and blood plasma were profiled across 14 days, and multivariate statistics used to identify perturbed pathways. Both metabolomics and transcriptomics detected changes in the liver which were dose dependent, even after one day of exposure. Integration of the two datasets associated perturbations with specific pathways. Hepatic glycogen was decreased due to a decrease in synthesis, and plasma triglycerides were decreased due to an increase in fatty acid uptake by the liver. Hepatic succinate was increased and this was associated with increased heme biosynthesis. Glutathione synthesis was also increased, presumably in response to oxidative stress. Liquid Chromatography Mass Spectrometry demonstrated a remodeling of lipid species, possibly resulting from proliferation of the smooth endoplasmic reticulum.
Conclusions
The data fusion of metabolomic and transcriptomic changes proved to be a highly sensitive approach for monitoring early stage changes in altered hepatic metabolism, oxidative stress and cytochrome P450 induction simultaneously. This approach is particularly useful in interpreting changes in metabolites such as succinate which are hubs of metabolism.

Loading...
Loading...
Abstract
Background
During the lifetime of a fermenter culture, the soil bacterium
Results
Surprisingly, we find that the metabolic switch actually consists of multiple finely orchestrated switching events. Strongly coherent clusters of genes show drastic changes in gene expression already many hours before the classically defined transition phase where the switch from primary to secondary metabolism was expected. The main switch in gene expression takes only 2 hours, and changes in antibiotic biosynthesis genes are delayed relative to the metabolic rearrangements. Furthermore, global variation in morphogenesis genes indicates an involvement of cell differentiation pathways in the decision phase leading up to the commitment to antibiotic biosynthesis.
Conclusions
Our study provides the first detailed insights into the complex sequence of early regulatory events during and preceding the major metabolic switch in

Loading...
Loading...
Abstract
Background
Biological nitrogen fixation is highly controlled at the transcriptional level by regulatory networks that respond to the availability of fixed nitrogen. In many diazotrophs, addition of excess ammonium in the growth medium results in immediate repression of
Results
Here we report a global transcriptional profiling analysis of nitrogen fixation and ammonium repression in
Conclusions
Based on our results, we conclude that transcriptional regulation of

Loading...
Loading...
Abstract
Background
Zinc is considered as an essential element for all living organisms, but it can be toxic at large concentrations. Bacteria therefore tightly regulate zinc metabolism. The Cg2502 protein of
Results
The
Conclusion
Whole-genome expression profiling and DNA band shift assays demonstrated that Zur directly represses in a zinc-dependent manner the expression of nine genes organized in five transcription units. Accordingly, the Zur (Cg2502) protein is the key transcription regulator for genes involved in zinc homeostasis in

Loading...
Loading...
Abstract
Background
Considerable progress has been made in our understanding of sex determination and dosage compensation mechanisms in model organisms such as
Results
By comparing microarrays from microdissected primitive streak from single chicken embryos, we identified a large number of genes differentially expressed between male and female embryos at a very early stage (Hamburger and Hamilton stage 4), long before any sexual differentiation occurs. Most of these genes are located on the Z chromosome, which indicates that dosage compensation is ineffective in early chicken embryos. Gene ontology analyses, using an enhanced annotation tool for Affymetrix probesets of the chicken genome developed in our laboratory (called Manteia), show that among these male-biased genes found on the Z chromosome, more than 20 genes play a role in sex differentiation.
Conclusions
These results corroborate previous studies demonstrating the rather inefficient dosage compensation for Z chromosome in birds and show that this sexual dimorphism in gene regulation is observed long before the onset of sexual differentiation. These data also suggest a potential role of non-compensated Z-linked genes in somatic sex differentiation in birds.

Loading...
Loading...
Abstract
Background
Snake mitochondrial genomes are of great interest in understanding mitogenomic evolution because of gene duplications and rearrangements and the fast evolutionary rate of their genes compared to other vertebrates. Mitochondrial gene sequences have also played an important role in attempts to resolve the contentious phylogenetic relationships of especially the early divergences among alethinophidian snakes. Two recent innovative studies found dramatic gene- and branch-specific relative acceleration in snake protein-coding gene evolution, particularly along internal branches leading to Serpentes and Alethinophidia. It has been hypothesized that some of these rate shifts are temporally (and possibly causally) associated with control region duplication and/or major changes in ecology and anatomy.
Results
The near-complete mitochondrial (mt) genomes of three henophidian snakes were sequenced:
Conclusions
Mitochondrial gene sequence data alone may not be able to robustly resolve basal divergences among alethinophidian snakes. Taxon sampling plays an important role in identifying mitogenomic evolutionary events within snakes, and in testing hypotheses explaining their origin. Dramatic rate shifts in mitogenomic evolution occur within Scolecophidia as well as Alethinophidia, thus falsifying the hypothesis that these shifts in snakes are associated exclusively with evolution of a non-burrowing lifestyle, macrostomatan feeding ecology and/or duplication of the control region, both restricted to alethinophidians among living snakes.

Loading...
Loading...
Abstract
Background
Microarray technologies produced large amount of data. In a previous study, we have shown the interest of
Results
We underline the excellent efficiency of approaches proposed and implemented by Bo and co-workers and especially one based on expected maximization (
Conclusions
More than 6.000.000 independent simulations have assessed the quality of 12 imputation methods on five very different biological datasets. Important improvements have so been done since our last study. The

Loading...
Loading...
Abstract
Background
The nuclear receptor peroxisome proliferator-activated receptor alpha (PPARα) regulates responses to chemical or physical stress in part by altering expression of genes involved in proteome maintenance. Many of these genes are also transcriptionally regulated by heat shock (HS) through activation by HS factor-1 (HSF1). We hypothesized that there are interactions on a genetic level between PPARα and the HS response mediated by HSF1.
Results
Wild-type and PPARα-null mice were exposed to HS, the PPARα agonist WY-14,643 (WY), or both; gene and protein expression was examined in the livers of the mice 4 or 24 hrs after HS. Gene expression profiling identified a number of
Conclusions
These findings demonstrate that the PPARα genotype has a dramatic effect on the transcriptional targets of HS and support an expanded role for PPARα in the regulation of proteome maintenance genes after exposure to diverse forms of environmental stress including HS.

Loading...
Loading...
Abstract
Background
The SLC17 family of transporters transports the amino acids: glutamate and aspartate, and, as shown recently, also nucleotides. Vesicular glutamate transporters are found in distinct species, such as
Results
Our phylogenetic analysis shows that the SLC17 family consists of four main phylogenetic clades which were all present before the divergence of the insect lineage. One of these clades has not been previously described and it is not found in vertebrates. The clade containing Slc17a9 had the most restricted evolutionary history with only one member in most species. We detected expression of Slc17a1-17a4 only in the peripheral tissues but not in the CNS, while Slc17a5- Slc17a9 are highly expressed in both the CNS and periphery.
Conclusions
The

Loading...
Loading...
Abstract
Background
In vertebrates, late oogenesis is a key period during which the oocyte acquires its ability to resume meiosis (
Results
Among the 1200 differentially expressed clones in rainbow trout, twenty-six candidate genes were selected for further analysis by real-time PCR in both species during late oogenesis. Among these genes, eight had similar expression profiles in trout and
Conclusions
We have successfully identified orthologous genes exhibiting conserved expression profiles in the ovarian follicle during late oogenesis in both trout and

Loading...
Loading...
Abstract
Background
Transmembrane receptor kinases play critical roles in both animal and plant signaling pathways regulating growth, development, differentiation, cell death, and pathogenic defense responses. In
Results
As a resource for the in-depth analysis of this important protein family, the complementary DNA sequences (cDNAs) of 194 LRR-RLKs were cloned into the GatewayR donor vector pDONR/ZeoR and analyzed by DNA sequencing. Among them, 157 clones showed sequences identical to the predictions in the Arabidopsis sequence resource, TAIR8. The other 37 cDNAs showed gene structures distinct from the predictions of TAIR8, which was mainly caused by alternative splicing of pre-mRNA. Most of the genes have been further cloned into GatewayR destination vectors with GFP or FLAG epitope tags and have been transformed into Arabidopsis for
Conclusions
Most of the Arabidopsis LRR-RLK genes have been isolated and the sequence analysis showed a number of alternatively spliced variants. The generated resources, including cDNA entry clones, expression constructs and transgenic plants, will facilitate further functional analysis of the members of this important gene family.

Loading...
Loading...
Abstract
Background
We performed gene expression profiling of the amygdala and hippocampus taken from inbred mouse strains C57BL/6J and A/J. The selected brain areas are implicated in neurobehavioral traits while these mouse strains are known to differ widely in behavior. Consequently, we hypothesized that comparing gene expression profiles for specific brain regions in these strains might provide insight into the molecular mechanisms of human neuropsychiatric traits. We performed a whole-genome gene expression experiment and applied a systems biology approach using weighted gene co-expression network analysis.
Results
We were able to identify modules of co-expressed genes that distinguish a strain or brain region. Analysis of the networks that are most informative for hippocampus and amygdala revealed enrichment in neurologically, genetically and psychologically related pathways. Close examination of the strain-specific gene expression profiles, however, revealed no functional relevance but a significant enrichment of single nucleotide polymorphisms in the probe sequences used for array hybridization. This artifact was not observed for the modules of co-expressed genes that distinguish amygdala and hippocampus.
Conclusions
The brain-region specific modules were found to be independent of genetic background and are therefore likely to represent biologically relevant molecular networks that can be studied to complement our knowledge about pathways in neuropsychiatric disease.

Loading...
Loading...
Abstract
Background
The approaches for shotgun-based sequencing of vertebrate genomes are now well-established, and have resulted in the generation of numerous draft whole-genome sequence assemblies. In contrast, the process of refining those assemblies to improve contiguity and increase accuracy (known as 'sequence finishing') remains tedious, labor-intensive, and expensive. As a result, the vast majority of vertebrate genome sequences generated to date remain at a draft stage.
Results
To date, our genome sequencing efforts have focused on comparative studies of targeted genomic regions, requiring sequence finishing of large blocks of orthologous sequence (average size 0.5-2 Mb) from various subsets of 75 vertebrates. This experience has provided a unique opportunity to compare the relative effort required to finish shotgun-generated genome sequence assemblies from different species, which we report here. Importantly, we found that the sequence assemblies generated for the same orthologous regions from various vertebrates show substantial variation with respect to misassemblies and, in particular, the frequency and characteristics of sequence gaps. As a consequence, the work required to finish different species' sequences varied greatly. Application of the same standardized methods for finishing provided a novel opportunity to "assay" characteristics of genome sequences among many vertebrate species. It is important to note that many of the problems we have encountered during sequence finishing reflect unique architectural features of a particular vertebrate's genome, which in some cases may have important functional and/or evolutionary implications. Finally, based on our analyses, we have been able to improve our procedures to overcome some of these problems and to increase the overall efficiency of the sequence-finishing process, although significant challenges still remain.
Conclusion
Our findings have important implications for the eventual finishing of the draft whole-genome sequences that have now been generated for a large number of vertebrates.

Loading...
Loading...
Abstract
Background
NF-κB regulates the expression of a large number of target genes involved in the immune and inflammatory response, apoptosis, cell proliferation, differentiation and survival. We have earlier reported that p65, a subunit of NF-κB, is acetylated
Results
In this study, we describe that site-specific mutation of p65 at lysines 314 and 315 enhances gene expression of a subset of NF-κB target genes including
Conclusions
Together, our results establish acetylation of K314 as an important regulatory modification of p65 and subsequently of NF-κB-dependent gene expression.

Loading...
Loading...
Abstract
Background
Early embryonic loss is a large contributor to infertility in cattle. Although genetic factors are known to affect early embryonic development, the discovery of such factors has been a serious challenge. The objective of this study was to identify genes differentially expressed between blastocysts and degenerative embryos at early stages of development.
Results
Using microarrays, genome-wide RNA expression was profiled and compared for in vitro fertilization (IVF) - derived blastocysts and embryos undergoing degenerative development up to the same time point. Surprisingly similar transcriptomic profiles were found in degenerative embryos and blastocysts. Nonetheless, we identified 67 transcripts that significantly differed between these two groups of embryos at a 15% false discovery rate, including 33 transcripts showing at least a two-fold difference. Several signaling and metabolic pathways were found to be associated with the developmental status of embryos, among which were previously known important steroid biosynthesis and cell communication pathways in early embryonic development.
Conclusions
This study presents the first direct and comprehensive comparison of transcriptomes between IVF blastocysts and degenerative embryos, providing important information for potential genes and pathways associated with early embryonic development.

Loading...
Loading...
Abstract
Background
Previously we found that
Results
Most exposures resulted in an induction of DNA damage signaling and apoptosis genes and a reduced expression of cell division genes in cells of both genotypes. As expected, responses to N-ac-AAF were very similar in both genotypes. ENU exposure did not lead to significant gene expression changes in cells of both genotypes, presumably due to its short half-life. Gene expression responses to clastogens, however, showed a genotype-dependent effect for BLM and MMC. MMC treated
Conclusion
These data support our finding that different types of clastogens exist and that responses to these types depend on the DNA repair status of the cells.

Loading...
Loading...
Abstract
Background
Neurofibrillary tangles (NFT), a cardinal neuropathological feature of Alzheimer's disease (AD) that is highly correlated with synaptic loss and dementia severity, appear to be partly attributable to increased phosphorylation of the microtubule stabilizing protein tau at certain AD-related residues. Identifying the kinases involved in the pathologic phosphorylation of tau may provide targets at which to aim new AD-modifying treatments.
Results
We report results from a screen of 572 kinases in the human genome for effects on tau hyperphosphorylation using a loss of function, high-throughput RNAi approach. We confirm effects of three kinases from this screen, the eukaryotic translation initiation factor 2 α kinase 2 (EIF2AK2), the dual-specificity tyrosine-(Y)-phosphorylation regulated kinase 1A (DYRK1A), and the A-kinase anchor protein 13 (AKAP13) on tau phosphorylation at the 12E8 epitope (serine 262/serine 356). We provide evidence that EIF2AK2 effects may result from effects on tau protein expression, whereas DYRK1A and AKAP13 are likely more specifically involved in tau phosphorylation pathways.
Conclusions
These findings identify novel kinases that phosphorylate tau protein and provide a valuable reference data set describing the kinases involved in phosphorylating tau at an AD-relevant epitope.

Loading...
Loading...
Abstract
Background
Prostate cancer is the second leading cause of cancer mortality among US men. Epidemiological evidence suggests that high vitamin D status protects men from prostate cancer and the active form of vitamin D, 1α,25 dihydroxyvitamin D3 (1,25(OH)2D) has anti-cancer effects in cultured prostate cells. Still, the molecular mechanisms and the gene targets for vitamin D-mediated prostate cancer prevention are unknown.
Results
We examined the effect of 1,25(OH)2D (+/- 100 nM, 6, 24, 48 h) on the transcript profile of proliferating RWPE1 cells, an immortalized, non-tumorigenic prostate epithelial cell line that is growth arrested by 1,25(OH)2D (Affymetrix U133 Plus 2.0, n = 4/treatment per time and dose). Our analysis revealed many transcript level changes at a 5% false detection rate: 6 h, 1571 (61% up), 24 h, 1816 (60% up), 48 h, 3566 (38% up). 288 transcripts were regulated similarly at all time points (182 up, 80 down) and many of the promoters for these transcripts contained putative vitamin D response elements. Functional analysis by pathway or Gene Set Analysis revealed early suppression of WNT, Notch, NF-kB, and IGF1 signaling. Transcripts related to inflammation were suppressed at 6 h (e.g. IL-1 pathway) and suppression of proinflammatory pathways continued at later time points (e.g. IL-17 and IL-6 pathways). There was also evidence for induction of anti-angiogenic pathways and induction of transcripts for protection from oxidative stress or maintenance of cell redox homeostasis at 6 h.
Conclusions
Our data reveal of large number of potential new, direct vitamin D target genes relevant to prostate cancer prevention. In addition, our data suggests that rather than having a single strong regulatory effect, vitamin D orchestrates a pattern of changes within prostate epithelial cells that limit or slow carcinogenesis.

Loading...
Loading...
Abstract
Background
A large amount of experimental data generated by modern high-throughput technologies is available through various public repositories. Our knowledge about molecular interaction networks, functional biological pathways and transcriptional regulatory modules is rapidly expanding, and is being organized in lists of functionally related genes. Jointly, these two sources of information hold a tremendous potential for gaining new insights into functioning of living systems.
Results
Genomics Portals platform integrates access to an extensive knowledge base and a large database of human, mouse, and rat genomics data with basic analytical visualization tools. It provides the context for analyzing and interpreting new experimental data and the tool for effective mining of a large number of publicly available genomics datasets stored in the back-end databases. The uniqueness of this platform lies in the volume and the diversity of genomics data that can be accessed and analyzed (gene expression, ChIP-chip, ChIP-seq, epigenomics, computationally predicted binding sites, etc), and the integration with an extensive knowledge base that can be used in such analysis.
Conclusion
The integrated access to primary genomics data, functional knowledge and analytical tools makes Genomics Portals platform a unique tool for interpreting results of new genomics experiments and for mining the vast amount of data stored in the Genomics Portals backend databases. Genomics Portals can be accessed and used freely at

Loading...
Loading...
Abstract
Background
Over the past years, the relationship between gene transcription and chromosomal location has been studied in a number of different vertebrate genomes. Regional differences in gene expression have been found in several different species. The chicken genome, as the closest sequenced genome relative to mammals, is an important resource for investigating regional effects on transcription in birds and studying the regional dynamics of chromosome evolution by comparative analysis.
Results
We used gene expression data to survey eight chicken tissues and create transcriptome maps for all chicken chromosomes. The results reveal the presence of two distinct types of chromosomal regions characterized by clusters of highly or lowly expressed genes. Furthermore, these regions correlate highly with a number of genome characteristics. Regions with clusters of highly expressed genes have higher gene densities, shorter genes, shorter average intron and higher GC content compared to regions with clusters of lowly expressed genes. A comparative analysis between the chicken and human transcriptome maps constructed using similar panels of tissues suggests that the regions with clusters of highly expressed genes are relatively conserved between the two genomes.
Conclusions
Our results revealed the presence of a higher order organization of the chicken genome that affects gene expression, confirming similar observations in other species. These results will aid in the further understanding of the regional dynamics of chromosome evolution.
The microarray data used in this analysis have been submitted to NCBI GEO database under accession number GSE17108. The reviewer access link is:

Loading...
Loading...
Abstract
Background
The chemical components of sex pheromones have been determined for more than a thousand moth species, but so far only a handful of genes encoding enzymes responsible for the biosynthesis of these compounds have been identified. For understanding the evolution of moth sexual communication, it is essential to know which genes are involved in the production of specific pheromone components and what controls the variation in their relative frequencies in the pheromone blend. We used a transcriptomic approach to characterize the pheromone gland of the Noctuid moth
Results
To facilitate identifying sets of genes involved in a broad range of processes and to capture rare transcripts, we developed our majority of ESTs from a normalized cDNA library of
Conclusion
These data constitute one of the first large-scale EST-projects for Noctuidae, a much-needed resource for exploring these pest species. Our analysis shows a surprisingly complex transcriptome and we identified a large number of potential pheromone biosynthetic pathway and immune-related genes that can be applied to population and systematic studies of

Loading...
Loading...
Abstract
Background
Classically, models of DNA-transcription factor binding sites (TFBSs) have been based on relatively few known instances and have treated them as sites of fixed length using position weight matrices (PWMs). Various extensions to this model have been proposed, most of which take account of dependencies between the bases in the binding sites. However, some transcription factors are known to exhibit some flexibility and bind to DNA in more than one possible physical configuration. In some cases this variation is known to affect the function of binding sites. With the increasing volume of ChIP-seq data available it is now possible to investigate models that incorporate this flexibility. Previous work on variable length models has been constrained by: a focus on specific zinc finger proteins in yeast using restrictive models; a reliance on hand-crafted models for just one transcription factor at a time; and a lack of evaluation on realistically sized data sets.
Results
We re-analysed binding sites from the TRANSFAC database and found motivating examples where our new variable length model provides a better fit. We analysed several ChIP-seq data sets with a novel motif search algorithm and compared the results to one of the best standard PWM finders and a recently developed alternative method for finding motifs of variable structure. All the methods performed comparably in held-out cross validation tests. Known motifs of variable structure were recovered for p53, Stat5a and Stat5b. In addition our method recovered a novel generalised version of an existing PWM for Sp1 that allows for variable length binding. This motif improved classification performance.
Conclusions
We have presented a new gapped PWM model for variable length DNA binding sites that is not too restrictive nor over-parameterised. Our comparison with existing tools shows that on average it does not have better predictive accuracy than existing methods. However, it does provide more interpretable models of motifs of variable structure that are suitable for follow-up structural studies. To our knowledge, we are the first to apply variable length motif models to eukaryotic ChIP-seq data sets and consequently the first to show their value in this domain. The results include a novel motif for the ubiquitous transcription factor Sp1.

Loading...
Loading...
Abstract
Background
The extracellular promastigote and the intracellular amastigote stages alternate in the digenetic life cycle of the trypanosomatid parasite
Results
Immunofluorescence analysis of promastigote-specific glycoprotein gp46 and expression modulation analysis of the amastigote-specific A2 gene have revealed that concomitant exposure to temperature increase and acidification leads to amastigote-like forms. The temperature-induced gene expression profile in the absence of pH variation resembles the profile obtained under combined exposure to both factors unlike that obtained for exposure to acidification alone. In fact, the subsequent fold change-based global iterative hierarchical clustering analysis supports these findings.
Conclusions
The specific influence of temperature and pH on the differential regulation of genes described in this study and the evidence provided by clustering analysis is consistent with the predominant role of temperature increase over extracellular pH decrease in the amastigote differentiation process, which provides new insights into

Loading...
Loading...
Abstract
Background
Results
91 differentially regulated protein spots, representing 64 different proteins, were identified by mass spectrometry (MS). They showed a continuous up-, down- or an oscillating regulation. Many of the identified proteins were involved in protein folding (chaperones), oxidative stress response, signal transduction, transcription, translation, carbohydrate and nitrogen metabolism. A correlation between alteration of transcript levels and corresponding proteins was detected for half of the differentially regulated proteins. Interestingly, some previously undescribed putative targets for the heat shock regulator Hsf1 were identified. This provides evidence for Hsf1-dependent regulation of mannitol biosynthesis, translation, cytoskeletal dynamics and cell division in
Conclusions
Our newly established DIGE data analysis workflow yields improved data quality and is widely applicable for other DIGE datasets. Our findings suggest that the heat shock response in

Loading...
Loading...
Abstract
Background
Recently, a number of high-resolution genome-wide maps of nucleosome locations in S. cerevisiae have been derived experimentally. However, nucleosome positions are determined in vivo by the combined effects of numerous factors. Consequently, nucleosomes are not simple static units, which may explain the discrepancies in reported nucleosome positions as measured by different experiments. In order to more accurately depict the genome-wide nucleosome distribution, we integrated multiple nucleosomal positioning datasets using a multi-angle analysis strategy.
Results
To evaluate the contribution of chromatin structure to transcription, we used the vast amount of available nucleosome analyzed data. Analysis of this data allowed for the comprehensive identification of the connections between promoter nucleosome positioning patterns and various transcription-dependent properties. Further, we characterised the function of nucleosome destabilisation in the context of transcription regulation. Our results indicate that genes with similar nucleosome occupancy patterns share general transcription attributes. We identified the local regulatory correlation (LRC) regions for two distinct types of nucleosomes and we assessed their regulatory properties. We also estimated the nucleosome reproducibility and measurement accuracy for high-confidence transcripts. We found that by maintaining a distance of ~13 bp between the upstream border of the +1 nucleosome and the transcription start sites (TSSs), the stable +1 nucleosome may form a barrier against the accessibility of the TSS and shape an optimum chromatin conformation for gene regulation. An in-depth analysis of nucleosome positioning in normally growing and heat shock cells suggested that the extent and patterns of nucleosome sliding are associated with gene activation.
Conclusions
Our results, which combine different types of data, suggest that cross-platform information, including discrepancy and consistency, reflects the mechanisms of nucleosome packaging in vivo more faithfully than individual studies. Furthermore, nucleosomes can be divided into two classes according to their stable and dynamic characteristics. We found that two different nucleosome-positioning characteristics may significantly impact transcription programs. Besides, some positioned-nucleosomes are involved in the transition from stable state to dynamic state in response to abrupt environmental changes.

Loading...
Loading...
Abstract
Background
Malaria is the most important parasitic disease in the world with approximately two million people dying every year, mostly due to
Results
A comprehensive directory of proteins reported to be potentially involved in
Conclusion
This is the first attempt to build a comprehensive directory of potential transcription-associated proteins in

Loading...
Loading...
Abstract
Background
Results
We propose a new method that uses gene expression data to assess inter-species annotation transfers. Our approach starts from a set of likely orthologs between a reference species (here
Conclusions
Our approach uses both sequence similarity and gene expression data to help inter-species gene annotation transfers. Experiments show that this strategy improves the accuracy achieved when using solely sequence similarity and outperforms the accuracy of the GBA approach. In addition, our experiments with

Loading...
Loading...
Abstract
Background
Lactic acid bacteria (LAB) are a group of gram-positive, lactic acid producing Firmicutes. They have been extensively used in food fermentations, including the production of various dairy products. The proteolytic system of LAB converts proteins to peptides and then to amino acids, which is essential for bacterial growth and also contributes significantly to flavor compounds as end-products. Recent developments in high-throughput genome sequencing and comparative genomics hybridization arrays provide us with opportunities to explore the diversity of the proteolytic system in various LAB strains.
Results
We performed a genome-wide comparative genomics analysis of proteolytic system components, including cell-wall bound proteinase, peptide transporters and peptidases, in 22 sequenced LAB strains. The peptidase families PepP/PepQ/PepM, PepD and PepI/PepR/PepL are described as examples of our
The diversity of proteolytic system components in 39
Conclusions
The improved functional annotation of the proteolytic system components provides an excellent framework for future experimental validations of predicted enzymatic activities. The genome sequence data can be coupled to other "omics" data e.g. transcriptomics and metabolomics for prediction of proteolytic and flavor-forming potential of LAB strains. Such an integrated approach can be used to tune the strain selection process in food fermentations.

Loading...
Loading...
Abstract
Background
Synaptotagmin proteins were first identified in nervous tissue, residing in synaptic vesicles. Synaptotagmins were subsequently found to form a large family, some members of which play important roles in calcium triggered exocytic events. These members have been investigated intensively, but other family members are not well understood, making it difficult to grasp the meaning of family membership in functional terms. Further difficulty arises as families are defined quite legitimately in different ways: by common descent or by common possession of distinguishing features. One definition does not necessarily imply the other. The evolutionary range of genome sequences now available, can shed more light on synaptotagmin gene phylogeny and clarify family relationships. The aim of compiling this open access collection of synaptotagmin and synaptotagmin-like sequences, is that its use may lead to greater understanding of the biological function of these proteins in an evolutionary context.
Results
46 metazoan genomes were examined and their complement of
Conclusions
Only a subset of synaptotagmin proteins appear able to function as calcium triggers. Syt1, Syt7 and Syt9 are ancient conserved synaptotagmins of this type. Some animals carry extensive repertoires of synaptotagmin genes. Other animals of no less complexity, carry only a small repertoire. Current understanding does not explain why this is so. The biological roles of many synaptotagmins remain to be understood. This collection of genes offers prospects for fruitful speculation about the functional roles of the synaptotagmin repertoires of different animals and includes a great range of biological complexity. With reference to this gene collection, functional relationships among

Loading...
Loading...
Abstract
Background
The Soybean Consensus Map 4.0 facilitated the anchoring of 95.6% of the soybean whole genome sequence developed by the Joint Genome Institute, Department of Energy, but its marker density was only sufficient to properly orient 66% of the sequence scaffolds. The discovery and genetic mapping of more single nucleotide polymorphism (SNP) markers were needed to anchor and orient the remaining genome sequence. To that end, next generation sequencing and high-throughput genotyping were combined to obtain a much higher resolution genetic map that could be used to anchor and orient most of the remaining sequence and to help validate the integrity of the existing scaffold builds.
Results
A total of 7,108 to 25,047 predicted SNPs were discovered using a reduced representation library that was subsequently sequenced by the Illumina sequence-by-synthesis method on the clonal single molecule array platform. Using multiple SNP prediction methods, the validation rate of these SNPs ranged from 79% to 92.5%. A high resolution genetic map using 444 recombinant inbred lines was created with 1,790 SNP markers. Of the 1,790 mapped SNP markers, 1,240 markers had been selectively chosen to target existing unanchored or un-oriented sequence scaffolds, thereby increasing the amount of anchored sequence to 97%.
Conclusion
We have demonstrated how next generation sequencing was combined with high-throughput SNP detection assays to quickly discover large numbers of SNPs. Those SNPs were then used to create a high resolution genetic map that assisted in the assembly of scaffolds from the 8× whole genome shotgun sequences into pseudomolecules corresponding to chromosomes of the organism.

Loading...
Loading...
Abstract
Background
Excessive fat deposition is one of the largest problems faced by salmon aquaculture industries, leading to production losses due to high volume of adipose tissue offal. In addition, increased lipid accumulation may impose considerable stress on adipocytes leading to adipocyte activation and production and secretion of inflammatory mediators, as observed in mammals.
Results
Microarray and qPCR analyses were performed to follow transcriptome changes during adipogenesis in the primary culture of adipose stromo-vascular fraction (aSVF) of Atlantic salmon. Cellular heterogeneity decreased by confluence as evidenced by the down-regulation of markers of osteo/chondrogenic, myogenic, immune and vasculature lineages. Transgelin (TAGLN), a marker of the multipotent pericyte, was prominently expressed around confluence while adipogenic PPARγ was up-regulated already in subconfluent cells. Proliferative activity and subsequent cell cycle arrest were reflected in the fluctuations of pro- and anti-mitotic regulators. Marked regulation of genes involved in lipid and glucose metabolism and pathways producing NADPH and glycerol-3-phosphate (G3P) was seen during the terminal differentiation, also characterised by diverse stress responses. Activation of the glutathione and thioredoxin antioxidant systems and changes in the iron metabolism suggested the need for protection against oxidative stress. Signs of endoplasmic reticulum (ER) stress and unfolded protein response (UPR) occured in parallel with the increased lipid droplet (LD) formation and production of secretory proteins (adipsin, visfatin). The UPR markers XBP1 and ATF6 were induced together with genes involved in ubiquitin-proteasome and lysosomal proteolysis. Concurrently, translation was suppressed as evidenced by the down-regulation of genes encoding elongation factors and components of the ribosomal machinery. Notably, expression changes of a panel of genes that belong to different immune pathways were seen throughout adipogenesis. The induction of AP1 (Jun, Fos), which is a master regulator of stress responses, culminated by the end of adipogenesis, concurrent with the maximal observed lipid deposition.
Conclusions
Our data point to an intimate relationship between metabolic regulation and immune responses in white adipocytes of a cold-blooded vertebrate. Stress imposed on adipocytes by LD formation and expansion is prominently reflected in the ER compartment and the activated UPR response could have an important role at visceral obesity in fish.

Loading...
Loading...
Abstract
Background
Results
The results showed that enzymes for acetate transport and oxidation, and for proton transport across the inner membrane were well conserved. An NADH dehydrogenase, the ATP synthase, and several TCA cycle enzymes were among the best conserved in the genomes. However, most of the cytochromes required for Fe(III)-reduction were not, including many of the outer-membrane cytochromes. While conservation of cytochromes was poor, an abundance and diversity of cytochromes were found in every genome, with duplications apparent in several species.
Conclusions
These results indicate there is a common pathway for acetate oxidation and energy generation across the family and in the last common ancestor. They also suggest that while cytochromes are important for extracellular electron transport, the path of electrons across the periplasm and outer membrane is variable. This combination of abundant cytochromes with weak sequence conservation suggests they may not be specific terminal reductases, but rather may be important in their heme-bearing capacity, as sinks for electrons between the inner-membrane electron transport chain and the extracellular acceptor.

Loading...
Loading...
Abstract
Background
Chronic myelogenous leukemia (CML) results from the neoplastic transformation of a haematopoietic stem cell. The hallmark genetic abnormality of CML is a chimeric
Results
Here we report multiple genome aberrations in a collection of 78 CML and 14 control samples by oligonucleotide array comparative genomic hybridization. We found a unique signature of genome deletions within the immunoglobulin heavy chain (
Conclusions
None of these genome losses were detected in any of the CML samples with myeloid transformation, chronic phase or controls, indicating that their presence is obligatory for the development of a malignant clone with a lymphoid phenotype. Notably, the coincidental deletions at

Loading...
Loading...
Abstract
Background
Little is known about how apicomplexan parasites have evolved to infect different host species and cell types.
Results
Strong evidence is provided that the sub-telomerically encoded
Conclusions
This analysis provides evidence that two distinct secretome genes families have evolved under contrasting selective pressures. The data supports current hypotheses regarding the biological role of TashAT family proteins in the management of host cell phenotype that may have evolved to allow adaptation of

Loading...
Loading...
Abstract
Background
Peach fruit undergoes a rapid softening process that involves a number of metabolic changes. Storing fruit at low temperatures has been widely used to extend its postharvest life. However, this leads to undesired changes, such as mealiness and browning, which affect the quality of the fruit. In this study, a 2-D DIGE approach was designed to screen for differentially accumulated proteins in peach fruit during normal softening as well as under conditions that led to fruit chilling injury.
Results
The analysis allowed us to identify 43 spots -representing about 18% of the total number analyzed- that show statistically significant changes. Thirty-nine of the proteins could be identified by mass spectrometry. Some of the proteins that changed during postharvest had been related to peach fruit ripening and cold stress in the past. However, we identified other proteins that had not been linked to these processes. A graphical display of the relationship between the differentially accumulated proteins was obtained using pairwise average-linkage cluster analysis and principal component analysis. Proteins such as endopolygalacturonase, catalase, NADP-dependent isocitrate dehydrogenase, pectin methylesterase and dehydrins were found to be very important for distinguishing between healthy and chill injured fruit. A categorization of the differentially accumulated proteins was performed using Gene Ontology annotation. The results showed that the 'response to stress', 'cellular homeostasis', 'metabolism of carbohydrates' and 'amino acid metabolism' biological processes were affected the most during the postharvest.
Conclusions
Using a comparative proteomic approach with 2-D DIGE allowed us to identify proteins that showed stage-specific changes in their accumulation pattern. Several proteins that are related to response to stress, cellular homeostasis, cellular component organization and carbohydrate metabolism were detected as being differentially accumulated. Finally, a significant proportion of the proteins identified had not been associated with softening, cold storage or chilling injury-altered fruit before; thus, comparative proteomics has proven to be a valuable tool for understanding fruit softening and postharvest.

Loading...
Loading...
Abstract
Background
Bacterial repetitive extragenic palindromes (REPs) compose a distinct group of genomic repeats. They usually occur in high abundance (>100 copies/genome) and are often arranged in composite repetitive structures - bacterial interspersed mosaic elements (BIMEs). In BIMEs, regularly spaced REPs are present in alternating orientations. BIMEs and REPs have been shown to serve as binding sites for several proteins and suggested to play role in chromosome organization and transcription termination. Their origins are, at present, unknown.
Results
In this report, we describe a novel class of putative transposases related to IS
Conclusions
In light of our findings, we propose that RAYTs are responsible for establishment of REPs and BIMEs in bacterial genomes, as well as for their exceptional dynamics and species-specifity. Conversely, we suggest that BIMEs are in fact a special type of nonautonomous transposable elements, mobilizable by RAYTs.

Loading...
Loading...
Abstract
Background
Chemical communication in mammals involves globular lipocalins that protect and transport pheromones during their passage out of the body. Efficient communication via this protein - pheromone complex is essential for triggering multiple responses including aggression, mate choice, copulatory behaviour, and onset and synchronization of oestrus. The roles of lipocalins in communication were studied in many organisms and especially in mice (i.e.
Results
Here we show that Aphrodisin-like proteins exist in other species, belong to a group of Odorant Binding Proteins (Obp), and contrary to the expression of Aphrodisin only in hamster genital tract and parotid glands of females, we have detected these transcripts in both sexes of
Conclusion
Our results demonstrate that three novel Obps (Obp1, Obp2, and Obp3) are predominant lipocalins in

Loading...
Loading...
Abstract
Background
Little is known regarding the pool of mobile genetic elements associated with the human gut microbiome. In this study we employed the culture independent TRACA system to isolate novel plasmids from the human gut microbiota, and a comparative metagenomic analysis to investigate the distribution and relative abundance of functions encoded by these plasmids in the human gut microbiome.
Results
Novel plasmids were acquired from the human gut microbiome, and homologous nucleotide sequences with high identity (>90%) to two plasmids (pTRACA10 and pTRACA22) were identified in the multiple human gut microbiomes analysed here. However, no homologous nucleotide sequences to these plasmids were identified in the murine gut or environmental metagenomes. Functions encoded by the plasmids pTRACA10 and pTRACA22 were found to be more prevalent in the human gut microbiome when compared to microbial communities from other environments. Among the most prevalent functions identified was a putative RelBE toxin-antitoxin (TA) addiction module, and subsequent analysis revealed that this was most closely related to putative TA modules from gut associated bacteria belonging to the
Conclusions
The application of a culture independent system to capture novel plasmids from the human gut mobile metagenome, coupled with subsequent comparative metagenomic analysis, highlighted the unexpected prevalence of plasmid encoded functions in the gut microbial ecosystem. In particular the increased relative abundance and broad phylogenetic distribution was identified for a putative RelBE toxin/antitoxin addiction module, a putative phosphohydrolase/phosphoesterase, and an ORF of unknown function. Our analysis also indicates that some plasmids or plasmid families are present in the gut microbiomes of geographically isolated human hosts with a broad global distribution (America, Japan and Europe), and are potentially unique to the human gut microbiome. Further investigation of the plasmid population associated with the human gut is likely to provide important insights into the development, functioning and evolution of the human gut microbiota.

Loading...
Loading...
Abstract
Background
The antenno-maxilary complex (AMC) forms the chemosensory system of the
Results
A total of 17 samples were first analysed with hierarchical clustering. To determine those genes affected by loss of
Conclusions
The current data contribute to the clarification of the

Loading...
Loading...
Abstract
Background
Recently, it has been discovered that the human genome contains many transcription start sites for non-coding RNA. Regulatory regions related to transcription of this non-coding RNAs are poorly studied. Some of these regulatory regions may be associated with CpG islands located far from transcription start-sites of any protein coding gene. The human genome contains many such CpG islands; however, until now their properties were not systematically studied.
Results
We studied CpG islands located in different regions of the human genome using methods of bioinformatics and comparative genomics. We have observed that CpG islands have a preference to overlap with exons, including exons located far from transcription start site, but usually extend well into introns. Synonymous substitution rate of CpG-containing codons becomes substantially reduced in regions where CpG islands overlap with protein-coding exons, even if they are located far downstream from transcription start site. CAGE tag analysis displayed frequent transcription start sites in all CpG islands, including those found far from transcription start sites of protein coding genes. Computational prediction and analysis of published ChIP-chip data revealed that CpG islands contain an increased number of sites recognized by Sp1 protein. CpG islands containing more CAGE tags usually also contain more Sp1 binding sites. This is especially relevant for CpG islands located in 3' gene regions. Various examples of transcription, confirmed by mRNAs or ESTs, but with no evidence of protein coding genes, were found in CAGE-enriched CpG islands located far from transcription start site of any known protein coding gene.
Conclusions
CpG islands located far from transcription start sites of protein coding genes have transcription initiation activity and display Sp1 binding properties. In exons, overlapping with these islands, the synonymous substitution rate of CpG containing codons is decreased. This suggests that these CpG islands are involved in transcription initiation, possibly of some non-coding RNAs.

Loading...
Loading...
Abstract
Background
Tissue gene expression is generally regulated by multiple transcription factors (TFs). A major first step toward understanding how tissues achieve their specificity is to identify, at the genome scale, interacting TFs regulating gene expression in different tissues. Despite previous discoveries, the mechanisms that control tissue gene expression are not fully understood.
Results
We have integrated a function conservation approach, which is based on evolutionary conservation of biological function, and genes with highest expression level in human tissues to predict TF pairs controlling tissue gene expression. To this end, we have identified 2549 TF pairs associated with a certain tissue. To find interacting TFs controlling tissue gene expression in a broad spatial and temporal manner, we looked for TF pairs common to the same type of tissues and identified 379 such TF pairs, based on which TF-TF interaction networks were further built. We also found that tissue-specific TFs may play an important role in recruiting non-tissue-specific TFs to the TF-TF interaction network, offering the potential for coordinating and controlling tissue gene expression across a variety of conditions.
Conclusion
The findings from this study indicate that tissue gene expression is regulated by large sets of interacting TFs either on the same promoter of a gene or through TF-TF interaction networks.

Loading...
Loading...
Abstract
Background
The Affymetrix GeneChip is a widely used gene expression profiling platform. Since the chips were originally designed, the genome databases and gene definitions have been considerably updated. Thus, more accurate interpretation of microarray data requires parallel updating of the specificity of GeneChip probes. We propose a new probe remapping protocol, using the zebrafish GeneChips as an example, by removing nonspecific probes, and grouping the probes into transcript level probe sets using an integrated zebrafish genome annotation. This genome annotation is based on combining transcript information from multiple databases. This new remapping protocol, especially the new genome annotation, is shown here to be an important factor in improving the interpretation of gene expression microarray data.
Results
Transcript data from the RefSeq, GenBank and Ensembl databases were downloaded from the UCSC genome browser, and integrated to generate a combined zebrafish genome annotation. Affymetrix probes were filtered and remapped according to the new annotation. The influence of transcript collection and gene definition methods was tested using two microarray data sets. Compared to remapping using a single database, this new remapping protocol results in up to 20% more probes being retained in the remapping, leading to approximately 1,000 more genes being detected. The differentially expressed gene lists are consequently increased by up to 30%. We are also able to detect up to three times more alternative splicing events. A small number of the bioinformatics predictions were confirmed using real-time PCR validation.
Conclusions
By combining gene definitions from multiple databases, it is possible to greatly increase the numbers of genes and splice variants that can be detected in microarray gene expression experiments.

Loading...
Loading...
Abstract
Background
Saliva of adult female mosquitoes help sugar and blood feeding by providing enzymes and polypeptides that help sugar digestion, control microbial growth and counteract their vertebrate host hemostasis and inflammation. Mosquito saliva also potentiates the transmission of vector borne pathogens, including arboviruses.
Results
A total of 1,753 clones randomly selected from an adult female
Conclusion
Comparison of the

Loading...
Loading...
Abstract
Background
In eukaryotes, microRNAs (miRNAs) have emerged as critical regulators of gene expression. The Silkworm (
Results
We generated small RNA libraries from feeding larvae, spinning larvae, pupae and adults of
Conclusions
We identified 101 homologs of conserved miRNAs, 14 species-specific and two antisense miRNAs in the silkworm. Our results provided deeper insights into changes in conserved and novel miRNA and miRNA* accumulation during development.

Loading...
Loading...
Abstract
Background
Non-coding RNAs (ncRNAs) play a crucial role in the intricate regulation of bacterial gene expression, allowing bacteria to quickly adapt to changing environments. In the past few years, a growing number of regulatory RNA elements have been predicted by computational methods, mostly in well-studied γ-proteobacteria but lately in several α-proteobacteria as well. Here, we have compared an extensive compilation of these non-coding RNA predictions to intergenic expression data of a whole-genome high-resolution tiling array in the soil-dwelling α-proteobacterium
Results
Expression of 89 candidate ncRNAs was detected, both on the chromosome and on the six megaplasmids encompassing the
Conclusions
Our data support the practical value of previous ncRNA prediction algorithms and significantly expand the list of candidate ncRNAs encoded in the intergenic regions of

Loading...
Loading...
Abstract
Background
Bacterial plant pathogens are very harmful to their host plants, which can cause devastating agricultural losses in the world. With the development of microbial genome sequencing, many strains of phytopathogens have been sequenced. However, some misannotations exist in these phytopathogen genomes. Our objective is to improve these annotations and store them in a central database DIGAP.
Description
DIGAP includes the following improved information on phytopathogen genomes. (i) All the 'hypothetical proteins' were checked, and non-coding ORFs recognized by the Z curve method were removed. (ii) The translation initiation sites (TISs) of 20% ~ 25% of all the protein-coding genes have been corrected based on the NCBI RefSeq, ProTISA database and an
Conclusion
The results in DIGAP provide useful information for understanding the pathogenetic mechanisms of phytopathogens and for finding agricultural bactericides. DIGAP is freely available at

Loading...
Loading...
Abstract
Background
Small endogenous non-coding RNAs (sncRNAs) such as small interfering RNA (siRNA), microRNA and other small RNA transcripts are derived from distinct loci in the genome and play critical roles in RNA-mediated gene silencing mechanisms in plants and metazoa. They are approximately 22 nucleotides long; regulate mRNA stability through perfect or imperfect match to the targets. The biological activities of sncRNAs have been related to many biological events, from resistance to microbe infections to cellular differentiation. The development of the zoonotic parasite
Results
The endogenous siRNAs were found mainly derived from transposable elements (TE) or transposons and the natural antisense transcripts (NAT). In contrast to other organisms, the TE-derived siRNAs in
Conclusions
Many miRNA and endogenous siRNA transcripts were identified in

Loading...
Loading...
Abstract
Background
This paper describes an efficient
Results
Applied to nine hemiascomycete yeast genomes, this method predicts that 2% of the genes are involved in TGAs and gene relics are present in 11% of TGAs. The frequency of TGAs with degenerated gene copies means that a significant fraction of tandem duplicated genes follows the birth-and-death model of evolution. A comparison of sequence identity distributions between sets of homologous gene pairs shows that the different copies of tandem arrayed paralogs are less divergent than copies of dispersed paralogs in yeast genomes. It suggests that paralogs included in tandem structures are more recent or more subject to the gene conversion mechanism than other paralogs.
Conclusion
The method reported here is a useful computational tool to provide a database of TGAs composed of functional or nonfunctional gene copies. Such a database has obvious applications in the fields of structural and comparative genomics. Notably, a detailed study of the TGA catalog will make it possible to tackle the fundamental questions of the origin and evolution of tandem gene clusters.

Loading...
Loading...
Abstract
Background
As uricoletic animals, chickens produce cleidoic eggs, which are self-contained bacteria-resistant biological packages for extra-uterine development of the chick embryo. The eggshell constitutes a natural physical barrier against bacterial penetration if it forms correctly and remains intact. The eggshell's remarkable mechanical properties are due to interactions among mineral components and the organic matrix proteins. The purpose of our study was to identify novel eggshell proteins by examining the transcriptome of the uterus during calcification of the eggshell. An extensive bioinformatic analysis on genes over-expressed in the uterus allowed us to identify novel eggshell proteins that contribute to the egg's natural defenses.
Results
Our 14 K Del-Mar Chicken Integrated Systems microarray was used for transcriptional profiling in the hen's uterus during eggshell deposition. A total of 605 transcripts were over-expressed in the uterus compared with the magnum or white isthmus across a wide range of abundance (1.1- to 79.4-fold difference). The 605 highly-expressed uterine transcripts correspond to 469 unique genes, which encode 437 different proteins. Gene Ontology (GO) analysis was used for interpretation of protein function. The most over-represented GO terms are related to genes encoding ion transport proteins, which provide eggshell mineral precursors. Signal peptide sequence was found for 54 putative proteins secreted by the uterus during eggshell formation. Many functional proteins are involved in calcium binding or biomineralization--prerequisites for interacting with the mineral phase during eggshell fabrication. While another large group of proteins could be involved in proper folding of the eggshell matrix. Many secreted uterine proteins possess antibacterial properties, which would protect the egg against microbial invasion. A final group includes proteases and protease inhibitors that regulate protein activity in the acellular uterine fluid where eggshell formation takes place.
Conclusions
Our original study provides the first detailed description of the chicken uterus transcriptome during formation of the eggshell. We have discovered a cache of about 600 functional genes and identified a large number of encoded proteins secreted into uterine fluid for fabrication of the eggshell and chemical protection of the egg. Some of these uterine genes could prove useful as biological markers for genetic improvement of phenotypic traits (i.e., egg and eggshell quality).

Loading...
Loading...
Abstract
Background
Computational methods that infer single nucleotide polymorphism (SNP) interactions from phenotype data may uncover new biological mechanisms in non-Mendelian diseases. However, practical aspects of such analysis face many problems. Present experimental studies typically use SNP arrays with hundreds of thousands of SNPs but record only hundreds of samples. Candidate SNP pairs inferred by interaction analysis may include a high proportion of false positives. Recently, Gayan et al. (2008) proposed to reduce the number of false positives by combining results of interaction analysis performed on subsets of data (replication groups), rather than analyzing the entire data set directly. If performing as hypothesized, replication groups scoring could improve interaction analysis and also any type of feature ranking and selection procedure in systems biology. Because Gayan et al. do not compare their approach to the standard interaction analysis techniques, we here investigate if replication groups indeed reduce the number of reported false positive interactions.
Results
A set of simulated and false interaction-imputed experimental SNP data sets were used to compare the inference of SNP-SNP interactions by means of replication groups to the standard approach where the entire data set was directly used to score all candidate SNP pairs. In all our experiments, the inference of interactions from the entire data set (e.g. without using the replication groups) reported fewer false positives.
Conclusions
With respect to the direct scoring approach the utility of replication groups does not reduce false positive rates, and may, depending on the data set, often perform worse.

Loading...
Loading...
Abstract
Background
Histone acetyltransferase enzymes (HATs) are implicated in regulation of transcription. HATs from different families may overlap in target and substrate specificity.
Results
We isolated the
Conclusions
Comparison of phenotypes and expression profiles in single, double and triple mutants indicate that these HAT enzymes have overlapping functions. Consistent with this, overlapping specificity in histone H3 acetylation is observed. However, there is no evidence for overlap with another HAT enzyme, encoded by the essential

Loading...
Loading...
Abstract
Background
Identical sequences with a minimal length of about 300 base pairs (bp) have been involved in the generation of various meiotic/mitotic genomic rearrangements through non-allelic homologous recombination (NAHR) events. Genomic disorders and structural variation, together with gene remodelling processes have been associated with many of these rearrangements. Based on these observations, we identified and integrated all the 100% identical repeats of at least 300 bp in the NCBI version 36.2 human genome reference assembly into non-overlapping regions, thus defining the Identical Repeated Backbone (IRB) of the reference human genome.
Results
The IRB sequences are distributed all over the genome in 66,600 regions, which correspond to ~2% of the total NCBI human genome reference assembly. Important structural and functional elements such as common repeats, segmental duplications, and genes are contained in the IRB. About 80% of the IRB bp overlap with known copy-number variants (CNVs). By analyzing the genes embedded in the IRB, we were able to detect some identical genes not previously included in the Ensembl release 50 annotation of human genes. In addition, we found evidence of IRB gene copy-number polymorphisms in raw sequence reads of two diploid sequenced genomes.
Conclusions
In general, the IRB offers new insight into the complex organization of the identical repeated sequences of the human genome. It provides an accurate map of potential NAHR sites which could be used in targeting the study of novel CNVs, predicting DNA copy-number variation in newly sequenced genomes, and improve genome annotation.

Loading...
Loading...
Abstract
Background
Recent studies have demonstrated that non-protein-coding RNAs (npcRNAs/ncRNAs) play important roles during eukaryotic development, species evolution, and in the etiology of disease. Rhesus macaques are the most widely used primate model in both biomedical research and primate evolutionary studies. However, most reports on these animals focus on the functional roles of protein-coding sequences, whereas very little is known about macaque ncRNAs.
Results
In the present study, we performed the first systematic profiling of intermediate-size ncRNAs (50 to 500 nt) from the rhesus monkey by constructing a cDNA library. We identified 117 rhesus monkey ncRNAs, including 80 small nucleolar RNAs (snoRNAs), 29 other types of known RNAs (snRNAs, Y RNA, and others), and eight unclassified ncRNAs. Comparative genomic analysis and northern blot hybridizations demonstrated that some snoRNAs were lineage- or species-specific. Paralogous sequences were found for most rhesus monkey snoRNAs, the expression of which might be attributable to extensive duplication within the rhesus monkey genome. Further investigation of snoRNA flanking sequences showed that some rhesus monkey snoRNAs are retrogenes derived from L1-mediated integration. Finally, phylogenetic analysis demonstrated that birds and primates share some snoRNAs and host genes thereof, suggesting that both the relevant host genes and the snoRNAs contained therein may be inherited from a common ancestor. However, some rhesus monkey snoRNAs hosted by non-ribosome-related genes appeared after the evolutionary divergence between birds and mammals.
Conclusions
We provide the first experimentally-derived catalog of rhesus monkey ncRNAs and uncover some interesting genomic and evolutionary features. These findings provide important information for future functional characterization of snoRNAs during primate evolution.

Loading...
Loading...
Abstract
Background
Copy number variation is an important component of genetic variation in higher eukaryotes. The extent of natural copy number variation in
Results
We have used array comparative genomic hybridization (aCGH) to detect copy number variation in the genomes of 12 natural isolates of
Conclusion
Copy number variation is extensive in

Loading...
Loading...
Abstract
Background
The availability of two genetically very similar cell lines (A and B) derived from the laboratory isolate
Results
To obtain a comprehensive picture of the differences between the cell lines, we compared their transcriptomes using an oligonucleotide-based microarray and confirmed findings with quantitative real-time PCR. Out of 6242 genes represented on the array, 87 are differentially transcribed (≥two-fold) in the two cell lines. Approximately 50% code for hypothetical proteins. Interestingly, only 19 genes show a five-fold or higher differential expression. These include three
Conclusions
In this study we have identified a defined set of genes that are differentially transcribed between the non-pathogenic cell line A and the pathogenic cell line B of

Loading...
Loading...
Abstract
Background
Results
The CGH microarray analysis estimated the core genome of
Conclusions
An overview of genomic contents of a large collection of isolates from different countries allowed us to derive a core genome and a phylogenetic structure of

Loading...
Loading...
Abstract
Background
Tungsten carbide (WC) and tungsten carbide cobalt (WC-Co) nanoparticles are of occupational health relevance because of the increasing usage in hard metal industries. Earlier studies showed an enhanced toxic potential for WC-Co compared to WC or cobalt ions alone. Therefore, we investigated the impact of these particles, compared to cobalt ions applied as CoCl2, on the global gene expression level in human keratinocytes (HaCaT)
Results
WC nanoparticles exerted very little effects on the transcriptomic level after 3 hours and 3 days of exposure. In contrast, WC-Co nanoparticles caused significant transcriptional changes that were similar to those provoked by CoCl2. However, CoCl2 exerted even more pronounced changes in the transcription patterns. Gene set enrichment analyses revealed that the differentially expressed genes were related to hypoxia response, carbohydrate metabolism, endocrine pathways, and targets of several transcription factors. The role of the transcription factor HIF1 (hypoxia inducible factor 1) is particularly highlighted and aspects of downstream events as well as the role of other transcription factors related to cobalt toxicity are considered.
Conclusions
This study provides extensive data useful for the understanding of nanoparticle and cobalt toxicity. It shows that WC nanoparticles caused low transcriptional responses while WC-Co nanoparticles are able to exert responses similar to that of free cobalt ions, particularly the induction of hypoxia-like effects via interactions with HIF1α in human keratinocytes. However, the enhanced toxicity of WC-Co particles compared to CoCl2 could not be explained by differences in gene transcription.

Loading...
Loading...
Abstract
Background
Anamika
Results
Re-examination of these kinases reveals claimed novel arrangements cannot withstand scrutiny; each is either not novel or represents over-analysis of weakly confident computer generated gene models. Additional sequence evidence available at the time of the paper's submission either directly contradict the gene models or suggest alternate gene models. These alternate models would minimize or eliminate the observed differences between human and chimp kinases.
Conclusion
None of the proposed novel chimpanzee kinase architectures are supported by experiment evidence. Guidelines to prevent such erroneous conclusions in similar papers are proposed.

Loading...
Loading...
Abstract
Background
Results
Twenty-one of the repeat-associated coding regions were determined to be potentially transcriptionally or translationally phase variable. In each case the homopolymeric tract was within the predicted promoter region or at the 5' end of the coding region, respectively. However, eight coding sequences were identified with simple sequence repeats toward the 3' end of the open reading frame. In these cases, the repeat tract would be too far into the coding region to be mediating translational phase variation. All of the 29 coding region-associated homopolymeric tracts display variability in tract length in the sequencing read data.
Conclusions
Twenty-nine coding regions have been identified in the genome sequence of

Loading...
Loading...
Abstract
Background
Food supply from the ocean is constrained by the shortage of domesticated and selected fish. Development of genomic models of economically important fishes should assist with the removal of this bottleneck. European sea bass
Results
End sequencing of a sea bass genomic BAC-library enabled the comparative mapping of the sea bass genome using the three-spined stickleback
Conclusions
The BAC map is used for sequencing single BACs or BAC-pools covering defined genomic entities by second generation sequencing technologies. Together with the WGS dataset it initiates a sea bass genome sequencing project. This will allow the quantification of polymorphisms through resequencing, which is important for selecting highly performing domesticated fish.

Loading...
Loading...
Abstract
Background
Plants engineered for abiotic stress tolerance may soon be commercialized. The engineering of these plants typically involves the manipulation of complex multigene networks and may therefore have a greater potential to introduce pleiotropic effects than the simple monogenic traits that currently dominate the plant biotechnology market. While research on unintended effects in transgenic plant systems has been instrumental in demonstrating the substantial equivalence of many transgenic plant systems, it is essential that such analyses be extended to transgenic plants engineered for stress tolerance. Drought-tolerant
Results
Microarray analysis of control plant lines that underwent Cre-mediated excision of the
Conclusions
The action of ABF3 is tightly controlled in

Loading...
Loading...
Abstract
Background
The prairie vole (
Results
We constructed a prairie vole BAC library (CHORI-232) consisting of 194,267 recombinant clones with an average insert size of 139 kb. Hybridization-based screening of the gridded library at 19 loci established that the library has an average depth of coverage of ~10×. To obtain a small-scale sampling of the prairie vole genome, we generated 3884 BAC end-sequences totaling ~2.8 Mb. One-third of these BAC-end sequences could be mapped to unique locations in the mouse genome, thereby anchoring 1003 prairie vole BAC clones to an orthologous position in the mouse genome. Fluorescence in situ hybridization (FISH) mapping of 62 prairie vole clones with BAC-end sequences mapping to orthologous positions in the mouse genome was used to develop a first-generation genome-wide prairie vole-mouse comparative cytogenetic map. While conserved synteny was observed between this pair of rodent genomes, rearrangements between the prairie vole and mouse genomes were detected, including a minimum of five inversions and 16 inter-chromosomal rearrangements.
Conclusions
The construction of the prairie vole BAC library and the vole-mouse comparative cytogenetic map represent the first genome-wide modern genomic resources developed for this species. The BAC library will support future genomic, genetic and molecular characterization of this genome and species, and the isolation of clones of high interest to the vole research community will allow for immediate characterization of the regulatory and coding sequences of genes known to play important roles in social behaviors. In addition, these resources provide an excellent platform for future higher resolution cytogenetic mapping and full genome sequencing.

Loading...
Loading...
Abstract
Background
Genes in bacteria may be organised into operons, leading to strict co-expression of the genes that participate in the same operon. However, comparisons between different bacterial genomes have shown that much of the operon structure is dynamic on an evolutionary time scale. This indicates that there are opposing effects influencing the tendency for operon formation, and these effects may be reflected in properties like evolutionary rate, complex formation, metabolic pathways and gene fusion.
Results
We have used multi-species protein-protein comparisons to generate a high-quality set of genes that are persistent in bacterial genomes (i.e. they have close to universal distribution). We have analysed these genes with respect to operon participation and important functional properties, including evolutionary rate and protein-protein interactions.
Conclusions
Genes for ribosomal proteins show a very slow rate of evolution. This is consistent with a strong tendency for the genes to participate in operons and for their proteins to be involved in essential and well defined complexes. Persistent genes for non-ribosomal proteins can be separated into two classes according to tendency to participate in operons. Those with a strong tendency for operon participation make proteins with fewer interaction partners that seem to participate in relatively static complexes and possibly linear pathways. Genes with a weak tendency for operon participation tend to produce proteins with more interaction partners, but possibly in more dynamic complexes and convergent pathways. Genes that are not regulated through operons are therefore more evolutionary constrained than the corresponding operon-associated genes and will on average evolve more slowly.

Loading...
Loading...
Abstract
Background
Daily and seasonal changes in temperature are challenges that fish within aquaculture settings cannot completely avoid, and are known to elicit complex organismal and cellular stress responses. We conducted a large-scale gene discovery and transcript expression study in order to better understand the genes that are potentially involved in the physiological and cellular aspects of stress caused by heat-shock. We used suppression subtractive hybridization (SSH) cDNA library construction and characterization to identify transcripts that were dysregulated by heat-shock in liver, skeletal muscle and head kidney of Atlantic cod. These tissues were selected due to their roles in metabolic regulation, locomotion and growth, and immune function, respectively. Fish were exposed for 3 hours to an 8°C elevation in temperature, and then allowed to recover for 24 hours at the original temperature (i.e. 10°C). Tissue samples obtained before heat-shock (BHS), at the cessation of heat-shock (CS), and 3, 12, and 24 hours after the cessation of heat-shock (ACS), were used for reciprocal SSH library construction and quantitative reverse transcription - polymerase chain reaction (QPCR) analysis of gene expression using samples from a group that was transferred but not heat-shocked (CT) as controls.
Results
We sequenced and characterized 4394 ESTs (1524 from liver, 1451 from head kidney and 1419 from skeletal muscle) from three "forward subtracted" libraries (enriched for genes up-regulated by heat-shock) and 1586 from the liver "reverse subtracted" library (enriched for genes down-regulated by heat-shock), for a total of 5980 ESTs. Several cDNAs encoding putative chaperones belonging to the heat-shock protein (HSP) family were found in these libraries, and "protein folding" was among the gene ontology (GO) terms with the highest proportion in the libraries. QPCR analysis of HSP90α and HSP70-1 (synonym:
Conclusion
Chaperones are an important part of the cellular response to stress, and genes identified in this work may play important roles in resistance to thermal-stress. Moreover, the transcript for one key immune response gene (TLR22) was down-regulated by heat-shock, and this down-regulation may be a component of heat-induced immunosuppression.

Loading...
Loading...
Abstract
Background
Glutathione S-transferases (GSTs) are the ubiquitous enzymes that play a key role in cellular detoxification. Although several GSTs have been identified and characterized in various plant species, the knowledge about their role in developmental processes and response to various stimuli is still very limited. In this study, we report genome-wide identification, characterization and comprehensive expression analysis of members of GST gene family in crop plant rice, to reveal their function(s).
Results
A systematic analysis revealed the presence of at least 79 GST genes in the rice genome. Phylogenetic analysis grouped GST proteins into seven classes. Sequence analysis together with the organization of putative motifs indicated the potential diverse functions of GST gene family members in rice. The tandem gene duplications have contributed a major role in expansion of this gene family. Microarray data analysis revealed tissue-/organ- and developmental stage-specific expression patterns of several rice GST genes. At least 31 GST genes showed response to plant hormones auxin and cytokinin. Furthermore, expression analysis showed the differential expression of quite a large number of GST genes during various abiotic stress (20), arsenate stress (32) and biotic stress (48) conditions. Many of the GST genes were commonly regulated by developmental processes, hormones, abiotic and biotic stresses.
Conclusion
The transcript profiling suggests overlapping and specific role(s) of GSTs during various stages of development in rice. Further, the study provides evidence for the role of GSTs in mediating crosstalk between various stress and hormone response pathways and represents a very useful resource for functional analysis of selected members of this family in rice.

Loading...
Loading...
Abstract
Background
Sothern blotting is a DNA analysis technique that has found widespread application in molecular biology. It has been used for gene discovery and mapping and has diagnostic and forensic applications, including mutation detection in patient samples and DNA fingerprinting in criminal investigations. Southern blotting has been employed as the definitive method for detecting transgene integration, and successful homologous recombination in gene targeting experiments.
The technique employs a labeled DNA probe to detect a specific DNA sequence in a complex DNA sample that has been separated by restriction-digest and gel electrophoresis. Critically for the technique to succeed the probe must be unique to the target locus so as not to cross-hybridize to other endogenous DNA within the sample.
Investigators routinely employ a manual approach to probe design. A genome browser is used to extract DNA sequence from the locus of interest, which is searched against the target genome using a BLAST-like tool. Ideally a single perfect match is obtained to the target, with little cross-reactivity caused by homologous DNA sequence present in the genome and/or repetitive and low-complexity elements in the candidate probe. This is a labor intensive process often requiring several attempts to find a suitable probe for laboratory testing.
Results
We have written an informatic pipeline to automatically design genomic Sothern blot probes that specifically attempts to optimize the resultant probe, employing a brute-force strategy of generating many candidate probes of acceptable length in the user-specified design window, searching all against the target genome, then scoring and ranking the candidates by uniqueness and repetitive DNA element content. Using these
We went on to experimentally validate a number of these automated designs by Southern blotting. The majority of probes we tested performed well confirming our
Conclusions
Software and supplementary information are freely available at:

Loading...
Loading...
Abstract
Background
Gene regulation at transcript level can provide a good indication of the complex signaling mechanisms underlying physiological and pathological processes. Transcriptomic methods such as microarray and quantitative real-time PCR require stable reference genes for accurate normalization of gene expression. Some but not all studies have shown that housekeeping genes (HGKs), β-actin (ACTB) and glyceraldehyde-3-phosphate dehydrogenase (GAPDH), which are routinely used for normalization, may vary significantly depending on the cell/tissue type and experimental conditions. It is currently unclear if these genes are stably expressed in cells undergoing drastic morphological changes during neuronal differentiation. Recent meta-analysis of microarray datasets showed that some but not all of the ribosomal protein genes are stably expressed. To test the hypothesis that some ribosomal protein genes can serve as reference genes for neuronal differentiation, a genome-wide analysis was performed and putative reference genes were identified based on stability of expressions. The stabilities of these potential reference genes were then analyzed by reverse transcription quantitative real-time PCR in six differentiation conditions.
Results
Twenty stably expressed genes, including thirteen ribosomal protein genes, were selected from microarray analysis of the gene expression profiles of GDNF and NGF induced differentiation of PC12 cells. The expression levels of these candidate genes as well as ACTB and GAPDH were further analyzed by reverse transcription quantitative real-time PCR in PC12 cells differentiated with a variety of stimuli including NGF, GDNF, Forskolin, KCl and ROCK inhibitor, Y27632. The performances of these candidate genes as stable reference genes were evaluated with two independent statistical approaches, geNorm and NormFinder.
Conclusions
The ribosomal protein genes, RPL19 and RPL29, were identified as suitable reference genes during neuronal differentiation of PC12 cells, regardless of the type of differentiation conditions. The combination of these two novel reference genes, but not the commonly used HKG, GAPDH, allows robust and accurate normalization of differentially expressed genes during PC12 differentiation.

Loading...
Loading...
Abstract
Background
Results
ESTs were generated from a cDNA library of
Conclusions
We successfully discovered and examined a large portion of the previously unexplored

Loading...
Loading...
Abstract
Background
The increasing interest in small non-coding RNAs (ncRNAs) such as microRNAs (miRNAs), small interfering RNAs (siRNAs) and Piwi-interacting RNAs (piRNAs) and recent advances in sequencing technology have yielded large numbers of short (18-32 nt) RNA sequences from different organisms, some of which are derived from small nucleolar RNAs (snoRNAs) and transfer RNAs (tRNAs). We observed that these short ncRNAs frequently cover the entire length of annotated snoRNAs or tRNAs, which suggests that other loci specifying similar ncRNAs can be identified by clusters of short RNA sequences.
Results
We combined publicly available datasets of tens of millions of short RNA sequence tags from
Conclusions
In this study, we have introduced a new approach to identify new members of known classes of ncRNAs based on the features of TCs corresponding to known ncRNAs. A large number of the identified TCs are yet to be examined experimentally suggesting that many more novel ncRNAs remain to be discovered.

Loading...
Loading...
Abstract
Background
Non-host resistance in rice to its bacterial pathogen,
Results
A microarray experiment was performed to reveal the molecular mechanisms underlying HR of rice to
Conclusions
In conclusion,

Loading...
Loading...
Abstract
Background
Proteins do not act in isolation; they frequently act together in protein complexes to carry out concerted cellular functions. The evolution of complexes is poorly understood, especially in organisms other than yeast, where little experimental data has been available.
Results
We generated accurate, high coverage datasets of protein complexes for
Conclusions
Our results suggest that there are differences in the way protein complexes have evolved in

Loading...
Loading...
Abstract
Background
High density genotyping arrays have become established as a valuable research tool in human genetics. Currently, more than 300 genome wide association studies were published for human reporting about 1,000 SNPs that are associated with a phenotype. Also in animal sciences high density genotyping arrays are harnessed to analyse genetic variation. To exploit the full potential of this technology single nucleotide polymorphisms (SNPs) on the chips should be well characterized and their chromosomal position should be precisely known. This, however, is a challenge if the genome sequence is still subject to changes.
Results
We have developed a mapping strategy and a suite of software scripts to update the chromosomal positions of oligomer sequences used for SNP genotyping on high density arrays. We describe the mapping procedure in detail so that scientists with moderate bioinformatics skills can reproduce it. We furthermore present a case study in which we re-mapped 54,001 oligomer sequences from Ilumina's BovineSNP50 beadchip to the bovine genome sequence. We found in 992 cases substantial discrepancies between the manufacturer's annotations and our results. The software scripts in the
Conclusions
The positions of oligomer sequences in the genome are volatile even within one build of the genome. To facilitate the analysis of data from a GWAS or from an expression study, especially with species whose genome assembly is still unstable, it is recommended to update the oligomer positions before data analysis.

Loading...
Loading...
Abstract
Background
More and more completely sequenced fungal genomes are becoming available and many more sequencing projects are in progress. This deluge of data should improve our knowledge of the various primary and secondary metabolisms of Fungi, including their synthesis of useful compounds such as antibiotics or toxic molecules such as mycotoxins. Functional annotation of many fungal genomes is imperfect, especially of genes encoding enzymes, so we need dedicated tools to analyze their metabolic pathways in depth.
Description
FUNGIpath is a new tool built using a two-stage approach. Groups of orthologous proteins predicted using complementary methods of detection were collected in a relational database. Each group was further mapped on to steps in the metabolic pathways published in the public databases KEGG and MetaCyc. As a result, FUNGIpath allows the primary and secondary metabolisms of the different fungal species represented in the database to be compared easily, making it possible to assess the level of specificity of various pathways at different taxonomic distances. It is freely accessible at
Conclusions
As more and more fungal genomes are expected to be sequenced during the coming years, FUNGIpath should help progressively to reconstruct the ancestral primary and secondary metabolisms of the main branches of the fungal tree of life and to elucidate the evolution of these ancestral fungal metabolisms to various specific derived metabolisms.

Loading...
Loading...
Abstract
Background
X monosomic mice (39,XO) have a remarkably mild phenotype when compared to women with Turner syndrome (45,XO). The generally accepted hypothesis to explain this discrepancy is that the number of genes on the mouse X chromosome which escape X inactivation, and thus are expressed at higher levels in females, is very small. However this hypothesis has never been tested and only a small number of genes have been assayed for their X-inactivation status in the mouse. We performed a global expression analysis in four somatic tissues (brain, liver, kidney and muscle) of adult 40,XX and 39,XO mice using the Illumina Mouse WG-6 v1_1 Expression BeadChip and an extensive validation by quantitative real time PCR, in order to identify which genes are expressed from both X chromosomes.
Results
We identified several genes on the X chromosome which are overexpressed in XX females, including those previously reported as escaping X inactivation, as well as new candidates. However, the results obtained by microarray and qPCR were not fully concordant, illustrating the difficulty in ascertaining modest fold changes, such as those expected for genes escaping X inactivation. Remarkably, considerable variation was observed between tissues, suggesting that inactivation patterns may be tissue-dependent. Our analysis also exposed several autosomal genes involved in mitochondrial metabolism and in protein translation which are differentially expressed between XX and XO mice, revealing secondary transcriptional changes to the alteration in X chromosome dosage.
Conclusions
Our results support the prediction that the mouse inactive X chromosome is largely silent, while providing a list of the genes potentially escaping X inactivation in rodents. Although the lower expression of X-linked genes in XO mice may not be relevant in the particular tissues/systems which are affected in human X chromosome monosomy, genes deregulated in XO mice are good candidates for further study in an involvement in Turner Syndrome phenotype.

Loading...
Loading...
Abstract
Background
Phosphatidylinositol 3-phosphate is involved in regulation of several key cellular processes, mainly endocytosis, signaling, nuclear processes, cytoskeletal remodelling, cell survival, membrane trafficking, phagosome maturation and autophagy. In most cases effector proteins bind to this lipid, using either FYVE or PX domain. These two domains are distributed amongst varied life forms such as virus, protists, fungi, viridiplantae and metazoa. As the binding ligand is identical for both domains, the goal of this study was to understand if there is any selectivity for either of these domains in different taxa. Further, to understand the different cellular functions that these domains may be involved in, we analyzed the taxonomic distribution of additional domains that associate with FYVE and PX.
Results
There is selectivity for either FYVE or PX in individual genomes where both domains are present. Fungi and metazoa encode more PX, whereas streptophytes in viridiplantae encode more FYVE. Excess of FYVE in streptophytes results from proteins containing RCC1and DZC domains and FYVE domains in these proteins have a non-canonical ligand-binding site. Within a taxonomic group the selected domain associates with a higher number of other domains and is thus expected to discharge a larger number of cellular functions. Also, while certain associated domains are present in all taxonomic groups, most of them are unique to a specific group indicating that while certain common functions are discharged by these domains in all taxonomic groups, some functions appear to be group specific.
Conclusions
Although both FYVE and PX bind to PtdIns(3)P, genomes of different taxa show distinct selectivity of encoding either of the two. Higher numbers of taxonomic group specific domains co-occur with the more abundant domain (FYVE/PX) indicating that group-specific rare domain architectures might have emerged to accomplish certain group-specific functions.

Loading...
Loading...
Abstract
Background
Identification of disease genes for Type 2 Diabetes (T2D) by traditional methods has yielded limited success. Based on our previous observation that T2D may result from disturbed protein-protein interactions affected through disrupting modular domain interactions, here we have designed an approach to rank the candidates in the T2D linked genomic regions as plausible disease genes.
Results
Our approach integrates Weight value (Wv) method followed by prioritization using clustering coefficients derived from domain interaction network. Wv for each candidate is calculated based on the assumption that disease genes might be functionally related, mainly facilitated by interactions among domains of the interacting proteins. The benchmarking using a test dataset comprising of both known T2D genes and non-T2D genes revealed that Wv method had a sensitivity and specificity of 0.74 and 0.96 respectively with 9 fold enrichment. The candidate genes having a Wv > 0.5 were called High Weight Elements (HWEs). Further, we ranked HWEs by using the network property-the clustering coefficient (Ci). Each HWE with a Ci < 0.015 was prioritized as plausible disease candidates (HWEc) as previous studies indicate that disease genes tend to avoid dense clustering (with an average Ci of 0.015). This method further prioritized the identified disease genes with a sensitivity of 0.32 and a specificity of 0.98 and enriched the candidate list by 6.8 fold. Thus, from the dataset of 4052 positional candidates the method ranked 435 to be most likely disease candidates. The gene ontology sharing for the candidates showed higher representation of metabolic and signaling processes. The approach also captured genes with unknown functions which were characterized by network motif analysis.
Conclusions
Prioritization of positional candidates is essential for cost-effective and an expedited discovery of disease genes. Here, we demonstrate a novel approach for disease candidate prioritization from numerous loci linked to T2D.

Loading...
Loading...
Abstract
Background
MicroRNAs (miRNAs) repress target genes at the post-transcriptional level, and function in the development and cell-lineage pathways of host species. Tissue-specific expression of miRNAs is highly relevant to their physiological roles in the corresponding tissues. However, to date, few miRNAs have been spatially identified in the silkworm.
Results
We establish for the first time the spatial expression patterns of nearly 100 miRNAs in multiple normal tissues (organs) of
Conclusions
In this study, we conducted preliminary spatial measurements of several miRNAs in the silkworm. Periods of rapid morphological change were associated with alterations in miRNA expression patterns in the body wall, silk glands, midgut and fat body during metamorphosis. Accordingly, we propose that corresponding ubiquitous or tissue-specific expression of miRNAs supports their critical roles in tissue specification. These results should facilitate future functional analyses.

Loading...
Loading...
Abstract
Background
While the genomic era is accumulating a tremendous amount of data, the question of how genomics can describe a bacterial species remains to be fully addressed. The recent sequencing of the genome of the
Results
The two genomes differ by ca. 130 kbp with that of 5632 being the largest (1006 kbp). The make up of this additional genetic material mainly corresponds (i) to mobile genetic elements and (ii) to expanded repertoire of gene families that encode putative surface proteins and display features of highly-variable systems. More specifically, three entire copies of a previously described integrative conjugative element are found in 5632 that accounts for ca. 80 kbp. Other mobile genetic elements, found in 5632 but not in PG2, are the more classical insertion sequences which are related to those found in two other ruminant pathogens,
Conclusion
Overall, comparative analyses of two

Loading...
Loading...
Abstract
Background
Analysis of fungal genome sequence assemblies reveals that telomeres are poorly represented even though telomeric reads tend to be superabundant. We surmised that the problem might lie in the DNA shearing conditions used to create clone libraries for genome sequencing.
Results
A shotgun strategy was used to sequence and assemble circular and linear cosmid DNAs sheared using conditions typical for a genome project. The DNA sheared in circular form assembled into a single sequence contig. However, the linearized cosmid produced an incomplete assembly because the two DNA termini, though greatly overrepresented in the clone library used for sequencing, were separated from neighboring sequences by gaps of ~1.4 and 1.8 kb. These gap sizes were reduced, but not eliminated, by shearing the linear cosmid into smaller fragments. Mapping of shearing breakpoints revealed a paucity of breaks in the subterminal regions of the linearized cosmid and also near chromosome ends of the fungus
Conclusion
Together, our data indicate that the ends of linear DNA molecules are recalcitrant to hydrodynamic shearing. We propose that this causes DNA termini to be overrepresented in the resulting fragment population but ultimately prevents their incorporation into sequence assemblies.

Loading...
Loading...
Abstract
Background
Experimental evolution of microbial populations provides a unique opportunity to study evolutionary adaptation in response to controlled selective pressures. However, until recently it has been difficult to identify the precise genetic changes underlying adaptation at a genome-wide scale. New DNA sequencing technologies now allow the genome of parental and evolved strains of microorganisms to be rapidly determined.
Results
We sequenced >93.5% of the genome of a laboratory-evolved strain of the yeast
Conclusions
This study suggests that whole-genome sequencing can provide a rapid approach to uncover the genetic basis of evolutionary adaptations, with further applications in the study of laboratory selections and mutagenesis screens. In addition, we show how single-end, short read sequencing data can provide detailed information about structural rearrangements, and generate predictions about the genomic features and processes that underlie genome plasticity.

Loading...
Loading...
Abstract
Background
Retrotransposons are key players in the evolution of eukaryotic genomes. Moreover, it is now known that some retrotransposon classes, like the abundant and plant-specific Sireviruses, have intriguingly distinctive host preferences. Yet, it is largely unknown if this bias is supported by different genome structures.
Results
We performed sensitive comparative analysis of the genomes of a large set of Ty1/
Conclusions
Our comparative analysis of retrotransposon genomes using advanced

Loading...
Loading...
Abstract
Background
Results
The exposure to H2O2 and NaOCl resulted in an upregulation of the transcription of 315 (4.4%) and 386 (5.4%) genes, respectively. Transcription of 185 (2.6%) and 331 (4.6%) genes was decreased in response to the respective treatments. Many of the upregulated genes in the NaOCl- and H2O2-treated biofilms are involved in oxidative stress as well as general stress response, emphasizing the importance of the efficient neutralization and scavenging of ROS. In addition, multiple upregulated genes encode proteins that are necessary to repair ROS-induced cellular damage. Unexpectedly, a prolonged treatment with H2O2 also resulted in an increased transcription of multiple phage-related genes. A closer inspection of hybridisation signals obtained with probes targeting intergenic regions led to the identification of a putative 6S RNA.
Conclusion
Our results reveal that the transcription of a large fraction of

Loading...
Loading...
Abstract
Background
Results
Sequencing and assembly of the
Conclusions
The data obtained by the genome project suggest that

Loading...
Loading...
Abstract
Background
The methylation of eukaryotic proteins has been proposed to be widespread, but this has not been conclusively shown to date. In this study, we examined 36,854 previously generated peptide mass spectra from 2,607
Results
A total of 83 high-confidence lysine and arginine methylation sites were found in 66 proteins. Motif analysis revealed many methylated sites were associated with MK, RGG/RXG/RGX or WXXXR motifs. Functionally, methylated proteins were significantly enriched for protein translation, ribosomal biogenesis and assembly and organellar organisation and were predominantly found in the cytoplasm and ribosome. Intriguingly, methylated proteins were seen to have significantly longer half-life than proteins for which no methylation was found. Some 43% of methylated lysine sites were predicted to be amenable to ubiquitination, suggesting methyl-lysine might block the action of ubiquitin ligase.
Conclusions
This study suggests protein methylation to be quite widespread, albeit associated with specific functions. Large-scale tandem mass spectroscopy analyses will help to further confirm the modifications reported here.

Loading...
Loading...
Abstract
Background
Even though the process of potato tuber starch biosynthesis is well understood, mechanisms regulating biosynthesis are still unclear. Transcriptome analysis provides valuable information as to how genes are regulated. Therefore, this work aimed at investigating transcriptional regulation of starch biosynthetic genes in leaves and tubers of potato plants under various conditions. More specifically we looked at gene expression diurnally in leaves and tubers, during tuber induction and in tubers growing at different velocities. To determine velocity of potato tuber growth a new method based on X-ray Computed Tomography (X-ray CT) was established.
Results
Comparative transcriptome analysis between leaves and tubers revealed striking similarities with the same genes being differentially expressed in both tissues. In tubers, oscillation of granule bound starch synthase (GBSS) expression) was observed which could be linked to sucrose supply from source leaves. X-ray CT was used to determine time-dependent changes in tuber volume and the growth velocity was calculated. Although there is not a linear correlation between growth velocity and expression of starch biosynthetic genes, there are significant differences between growing and non-growing tubers. Co-expression analysis was used to identify transcription factors positively correlating with starch biosynthetic genes possibly regulating starch biosynthesis.
Conclusion
Most starch biosynthetic enzymes are encoded by gene families. Co-expression analysis revealed that the same members of these gene families are co-regulated in leaves and tubers. This suggests that regulation of transitory and storage starch biosynthesis in leaves and tubers, respectively, is surprisingly similar. X-ray CT can be used to monitor growth and development of belowground organs and allows to link tuber growth to changes in gene expression. Comparative transcriptome analysis provides a useful tool to identify transcription factors possibly involved in the regulation of starch biosynthesis.

Loading...
Loading...
Abstract
Background
Results
cDNAs of
Conclusion
A large EST dataset with a total of 76,459 consensus sequences is generated, aiming to provide sequence information for deciphering secondary metabolism, especially for flavonoid pathway in

Loading...
Loading...
Abstract
Background
ClpB-cyt/HSP100 protein acts as chaperone, mediating disaggregation of denatured proteins. Previous studies have shown that ClpB-cyt/HSP100 gene belongs to the group class I Clp ATPase proteins and ClpB-cyt/HSP100 transcript is regulated by heat stress and developmental cues.
Results
Nine ORFs were noted to constitute rice class I Clp ATPases in the following manner: 3 ClpB proteins (ClpB-cyt, Os05g44340; ClpB-m, Os02g08490; ClpB-c, Os03g31300), 4 ClpC proteins (ClpC1, Os04g32560; ClpC2, Os12g12580; ClpC3, Os11g16590; ClpC4, Os11g16770) and 2 ClpD proteins (ClpD1, Os02g32520; ClpD2, Os04g33210). Using the respective signal sequences cloned upstream to GFP/CFP reporter proteins and transient expression studies with onion epidermal cells, evidence is provided that rice ClpB-m and Clp-c proteins are indeed localized to their respective cell locations mitochondria and chloroplasts, respectively. Associated with their diverse cell locations, domain structures of OsClpB-c, OsClpB-m and OsClpB-cyt proteins are noted to possess a high-level conservation.
Conclusions
Rice class I Clp ATPase gene family is constituted of 9 members. Of these 9, only 3 belonging to ClpB group are heat stress regulated. Distribution of ClpB proteins to different cell organelles indicates that their functioning might be critical in different cell locations. From the complementation assays, OsClpD1 appears to be more effective than OsClpB-cyt protein in rescuing the thermosensitive defect of the yeast

Loading...
Loading...
Abstract
Background
Readily accessible samples such as peripheral blood or cell lines are increasingly being used in large cohorts to characterise gene expression differences between a patient group and healthy controls. However, cell and RNA isolation procedures and the variety of cell types that make up whole blood can affect gene expression measurements. We therefore systematically investigated global gene expression profiles in peripheral blood from six individuals collected during two visits by comparing five of the following cell and RNA isolation methods: whole blood (PAXgene), peripheral blood mononuclear cells (PBMCs), lymphoblastoid cell lines (LCLs), CD19 and CD20 specific B-cell subsets.
Results
Gene expression measurements were clearly discriminated by isolation method although the reproducibility was high for all methods (range ρ = 0.90-1.00). The PAXgene samples showed a decrease in the number of expressed genes (P < 1*10-16) with higher variability (P < 1*10-16) compared to the other methods. Differentially expressed probes between PAXgene and PBMCs were correlated with the number of monocytes, lymphocytes, neutrophils or erythrocytes. The correlations (ρ = 0.83; ρ = 0.79) of the expression levels of detected probes between LCLs and B-cell subsets were much lower compared to the two B-cell isolation methods (ρ = 0.98). Gene ontology analysis of detected genes showed that genes involved in inflammatory responses are enriched in B-cells CD19 and CD20 whereas genes involved in alcohol metabolic process and the cell cycle were enriched in LCLs.
Conclusion
Gene expression profiles in blood-based samples are strongly dependent on the predominant constituent cell type(s) and RNA isolation method. It is crucial to understand the differences and variability of gene expression measurements between cell and RNA isolation procedures, and their relevance to disease processes, before application in large clinical studies.

Loading...
Loading...
Abstract
Background
Although polyploidy has long been recognized as a major force in the evolution of plants, most of what we know about the genetic consequences of polyploidy comes from the study of crops and model systems. Furthermore, although many polyploid species have formed repeatedly, patterns of genome evolution and gene expression are largely unknown for natural polyploid populations of independent origin. We therefore examined patterns of loss and expression in duplicate gene pairs (homeologs) in multiple individuals from seven natural populations of independent origin of
Results
Using cDNA-AFLPs, we found differential band patterns that could be attributable to gene silencing, novel expression, and/or maternal/paternal effects between
Conclusions
Genomic and cDNA CAPS analyses indicated that plants representing multiple populations of this young natural allopolyploid have experienced frequent and preferential elimination of homeologous loci. Comparable analyses of synthetic F1 hybrids showed only additivity. These results suggest that loss of homeologs and changes in gene expression are not the immediate result of hybridization, but are processes that occur following polyploidization, occurring during the early (<40) generations of the young polyploid. Both

Loading...
Loading...
Abstract
Background
Results
Transcriptional profiling of
Conclusions
These genes that we have identified as up-regulated in

Loading...
Loading...
Abstract
Background
Results
Data mining of the
Conclusions
The biased mucosal habitat for microbial species encoding BspA-like proteins, the characterisation of a vast structural diversity for the TvBspA proteins, differential expression of a subset of TvBspA genes and the cellular localisation and immunological data for one TvBspA; all point to the importance of the TvBspA proteins to various aspects of

Loading...
Loading...
Abstract
Background
γδ T cells differ from αβ T cells with regard to the types of antigen with which their T cell receptors interact; γδ T cell antigens are not necessarily peptides nor are they presented on MHC. Cattle are considered a "γδ T cell high" species indicating they have an increased proportion of γδ T cells in circulation relative to that in "γδ T cell low" species such as humans and mice. Prior to the onset of the studies described here, there was limited information regarding the genes that code for the T cell receptor delta chains of this γδ T cell high species.
Results
By annotating the bovine (
Conclusions
The bovine genome contains a large and diverse repertoire of TRD genes when compared to the genomes of "γδ T cell low" species. This suggests that in cattle γδ T cells play a more important role in immune function since they would be predicted to bind a greater variety of antigens.

Loading...
Loading...
Abstract
Background
In many eukaryotes, microRNAs (miRNAs) bind to complementary sites in the 3'-untranslated regions (3'-UTRs) of target messenger RNAs (mRNAs) and regulate their expression at the stage of translation. Recent studies have revealed that many miRNAs are evolutionarily conserved; however, the evolution of their target genes has yet to be systematically characterized. We sought to elucidate a set of conserved miRNA/target-gene pairs and to analyse the mechanism underlying miRNA-mediated gene regulation in the early stage of bilaterian evolution.
Results
Initially, we extracted five evolutionarily conserved miRNAs (
Conclusions
The application of our new method enables the identification of 31 miRNA/target-gene pairs that were expected to have been regulated from the era of the common bilaterian ancestor. The downregulation of all six candidate pairs suggests that orthologous information contributed to the elucidation of the primordial set of genes that has been regulated by miRNAs; it was also an efficient tool for the elimination of false positives from the predicted candidates. In conclusion, our study identified potentially important miRNA-target pairs that were evolutionarily conserved throughout diverse bilaterian animals and that may provide new insights into early-stage miRNA functions.

Loading...
Loading...
Abstract
Background
While, traditionally, regulation of gene expression can be grouped into transcriptional, translational, and post-translational mechanisms, some mechanisms of rapid genetic variation can also contribute to regulation of gene expression, e.g., phase variation.
Results
We show here that prokaryotes evolved to include homopolymeric tracts (HTs) within coding genes as a system that allows for efficient gene inactivation. Analyses of 81 bacterial and 18 archaeal genomes showed that poly(A) and poly(T) HTs are overrepresented in these genomes and preferentially located at the 5' end of coding genes. Location of HTs at the 5' end is not driven by a preferential placement of aminoacids encoded by the AAA and TTT codons at the N-terminal of proteins. The
Conclusions
As frameshift mutations in HTs can occur at high frequencies and enable efficient gene inactivation, hypermutable HTs appear to represent a universal system for regulation of gene expression in prokaryotes. Combined with other studies indicating that HTs also enable rapid diversification of both coding and regulatory genetic sequences in eukaryotes, our data suggest that hypermutable HTs represent a general and rapid evolutionary mechanism facilitating adaptation and gene regulation across diverse organisms.

Loading...
Loading...
Abstract
Background
The marine cyanobacterium
Results
Pronounced strand-specific asymmetry in synonymous codon usage is observed exclusively in LL strains. Distinct dinucleotide abundance profiles are exhibited by 2 LL strains with larger genomes and G+C-content ≈ 50% (group LLa), 4 LL strains having reduced genomes and G+C-content ≈ 35-37% (group LLb), and 6 HL strains. Taking into account the emergence of LLa, LLb and HL strains (based on 16S rRNA phylogeny), a gradual increase in average aromaticity, pI values and beta- & coil-forming propensities and a decrease in mean hydrophobicity, instability indices and helix-forming propensities of core proteins are observed. Greater variations in orthologous gene repertoire are found between LLa and LLb strains, while higher number of positively selected genes exist between LL and HL strains.
Conclusion
Strains of different

Loading...
Loading...
Abstract
Background
Infection of plants by pathogens and the subsequent disease development involves substantial changes in the biochemistry and physiology of both partners. Analysis of genes that are expressed during these interactions represents a powerful strategy to obtain insights into the molecular events underlying these changes. We have employed expressed sequence tag (EST) analysis to identify rice genes involved in defense responses against infection by the blast fungus
Results
A cDNA library was constructed with RNA from rice leaves (
Conclusion
This study shows the power of ESTs to refine genome annotation and functional characterization. Results of this work have advanced our understanding of the molecular mechanisms underpinning fungal-plant interactions and formed the basis for new hypothesis.

Loading...
Loading...
Abstract
Background
Fungi secrete various proteins that have diverse functions. Prediction of secretory proteins using only one program is unsatisfactory. To enhance prediction accuracy, we constructed Fungal Secretome Database (FSD).
Description
A three-layer hierarchical identification rule based on nine prediction programs was used to identify putative secretory proteins in 158 fungal/oomycete genomes (208,883 proteins, 15.21% of the total proteome). The presence of putative effectors containing known host targeting signals such as RXLX [EDQ] and RXLR was investigated, presenting the degree of bias along with the species. The FSD's user-friendly interface provides summaries of prediction results and diverse web-based analysis functions through Favorite, a personalized repository.
Conclusions
The FSD can serve as an integrated platform supporting researches on secretory proteins in the fungal kingdom. All data and functions described in this study can be accessed on the FSD web site at

Loading...
Loading...
Abstract
Background
Two categories of introns are known, a common U2 type and a rare U12 type. These two types of introns are removed by distinct spliceosomes. The phylogenetic distribution of spliceosomal RNAs that are characteristic of the U12 spliceosome, i.e. the U11, U12, U4atac and U6atac RNAs, suggest that U12 spliceosomes were lost in many phylogenetic groups. We have now examined the distribution of U2 and U12 introns in many of these groups.
Results
U2 and U12 introns were predicted by making use of available EST and genomic sequences. The results show that in species or branches where U12 spliceosomal components are missing, also U12 type of introns are lacking. Examples are the choanoflagellate
Conclusions
The phylogenetic distribution of U12 introns and spliceosomal RNAs give further support to an early origin of U12 dependent splicing. In addition, this distribution identifies a large number of instances during eukaryotic evolution where such splicing was lost.

Loading...
Loading...
Abstract
Background
Delineating the genetic basis of body composition is important to agriculture and medicine. In addition, the incorporation of gene-gene interactions in the statistical model provides further insight into the genetic factors that underlie body composition traits. We used Bayesian model selection to comprehensively map main, epistatic and sex-specific QTL in an F2 reciprocal intercross between two chicken lines divergently selected for high or low growth rate.
Results
We identified 17 QTL with main effects across 13 chromosomes and several sex-specific and sex-antagonistic QTL for breast meat yield, thigh + drumstick yield and abdominal fatness. Different sets of QTL were found for both breast muscles [
Conclusions
The detection of main effects, epistasis and sex-dimorphic QTL suggest complex genetic regulation of somatic growth. An understanding of such regulatory mechanisms is key to mapping specific genes that underlie QTL controlling somatic growth in an avian model.

Loading...
Loading...
Abstract
Background
The use of functional genomics has largely increased our understanding of cell biology and promises to help the development of systems biology needed to understand the complex order of events that regulates cellular differentiation
Results
In order to identify extra cellular signals that promote B lymphocyte development we created a database with approximately 400 receptor ligand pairs and software matching gene expression data from two cell populations to obtain information about possible communication pathways. Using this database and gene expression data from NIH3T3 cells (unable to support B cell development), OP-9 cells (strongly supportive of B cell development), pro-B and pre-B cells as well as mature peripheral B-lineage cells, we were able to identify a set of potential stage and stromal cell restricted communication pathways. Functional analysis of some of these potential ways of communication allowed us to identify BMP-4 as a potent stimulator of B-cell development
Conclusions
We believe that this gene expression data analysis method allows for the identification of functionally relevant interactions and therefore could be applied to other data sets to unravel novel communication pathways.

Loading...
Loading...
Abstract
The version of this article published in
Background
MicroRNAs are short (~21 base) single stranded RNAs that, in plants, are generally coded by specific genes and cleaved specifically from hairpin precursors. MicroRNAs are critical for the regulation of multiple developmental, stress related and other physiological processes in plants. The recent annotation of the genome of the grapevine (
Results
Here we use oligonucleotide arrays to provide the first indication that many of these microRNAs show differential expression patterns between tissues and during the maturation of fruit in the grapevine. Furthermore we demonstrate that whole transcriptome sequencing and deep-sequencing of small RNA fractions can be used both to identify which microRNA precursors are expressed in different tissues and to estimate genomic coordinates and patterns of splicing and alternative splicing for many primary miRNA transcripts.
Conclusions
Our results show that many microRNAs are differentially expressed in different tissues and during fruit maturation in the grapevine. Furthermore, the demonstration that whole transcriptome sequencing can be used to identify candidate splicing events and approximate primary microRNA transcript coordinates represents a significant step towards the large-scale elucidation of mechanisms regulating the expression of microRNAs at the transcriptional and post-transcriptional levels.

Loading...
Loading...
Abstract
Background
The bumblebee,
Results
A normalised cDNA library was constructed from the thorax and abdomen of
Conclusion
The resource that these

Loading...
Loading...
Abstract
Background
Cyclotides are a family of circular peptides that exhibit a range of biological activities, including anti-bacterial, cytotoxic, anti-HIV activities, and are proposed to function in plant defence. Their high stability has motivated their development as scaffolds for the stabilisation of peptide drugs.
Results
Using flow cytometry the holoploid genome size (1C-value) of
Conclusion
The EST library generated in this study provides a valuable resource for the study of the cyclisation of plant peptides. Further analysis of the candidates for cyclotide processing discovered in this work will increase our understanding and aid in reconstructing cyclotide production using transgenic systems and will benefit their development in pharmaceutical applications and insect-resistant crop plants.

Loading...
Loading...
Abstract
Background
Ratio-based analysis is the current standard for the analysis of dual-color microarray data. Indeed, this method provides a powerful means to account for potential technical variations such as differences in background signal, spot size and spot concentration. However, current high density dual-color array platforms are of very high quality, and inter-array variance has become much less pronounced. We therefore raised the question whether it is feasible to use an intensity-based analysis rather than ratio-based analysis of dual-color microarray datasets. Furthermore, we compared performance of both ratio- and intensity-based analyses in terms of reproducibility and sensitivity for differential gene expression.
Results
By analyzing three distinct and technically replicated datasets with either ratio- or intensity-based models, we determined that, when applied to the same dataset, intensity-based analysis of dual-color gene expression experiments yields 1) more reproducible results, and 2) is more sensitive in the detection of differentially expressed genes. These effects were most pronounced in experiments with large biological variation and complex hybridization designs. Furthermore, a power analysis revealed that for direct two-group comparisons above a certain sample size, ratio-based models have higher power, although the difference with intensity-based models is very small.
Conclusions
Intensity-based analysis of dual-color datasets results in more reproducible results and increased sensitivity in the detection of differential gene expression than the analysis of the same dataset with ratio-based analysis. Complex dual-color setups such as interwoven loop designs benefit most from ignoring the array factor. The applicability of our approach to array platforms other than dual-color needs to be further investigated.

Loading...
Loading...
Abstract
Background
Transposable elements are the most abundant components of all characterized genomes of higher eukaryotes. It has been documented that these elements not only contribute to the shaping and reshaping of their host genomes, but also play significant roles in regulating gene expression, altering gene function, and creating new genes. Thus, complete identification of transposable elements in sequenced genomes and construction of comprehensive transposable element databases are essential for accurate annotation of genes and other genomic components, for investigation of potential functional interaction between transposable elements and genes, and for study of genome evolution. The recent availability of the soybean genome sequence has provided an unprecedented opportunity for discovery, and structural and functional characterization of transposable elements in this economically important legume crop.
Description
Using a combination of structure-based and homology-based approaches, a total of 32,552 retrotransposons (Class I) and 6,029 DNA transposons (Class II) with clear boundaries and insertion sites were structurally annotated and clearly categorized, and a soybean transposable element database, SoyTEdb, was established. These transposable elements have been anchored in and integrated with the soybean physical map and genetic map, and are browsable and visualizable at any scale along the 20 soybean chromosomes, along with predicted genes and other sequence annotations. BLAST search and other infrastracture tools were implemented to facilitate annotation of transposable elements or fragments from soybean and other related legume species. The majority (> 95%) of these elements (particularly a few hundred low-copy-number families) are first described in this study.
Conclusion
SoyTEdb provides resources and information related to transposable elements in the soybean genome, representing the most comprehensive and the largest manually curated transposable element database for any individual plant genome completely sequenced to date. Transposable elements previously identified in legumes, the third largest family of flowering plants, are relatively scarce. Thus this database will facilitate structural, evolutionary, functional, and epigenetic analyses of transposable elements in soybean and other legume species.

Loading...
Loading...
Abstract
Background
Genome-wide computational analysis of alternative splicing (AS) in several flowering plants has revealed that pre-mRNAs from about 30% of genes undergo AS.
Results
Our analysis of AS using BLAT and a modified version of the Sircah tool revealed AS of 498 transcriptional units with 611 events, representing about 3% of the total number of genes. As in land plants, intron retention is the most prevalent form of AS. Retained introns and skipped exons tend to be shorter than their counterparts in constitutively spliced genes. The splice site signals in all types of AS events are weaker than those in constitutively spliced genes. Furthermore, in alternatively spliced genes, the prevalent splice form has a stronger splice site signal than the non-prevalent form. Analysis of constitutively spliced introns revealed an over-abundance of motifs with simple repetitive elements in comparison to introns involved in intron retention. In almost all cases, AS results in a truncated ORF, leading to a coding sequence that is around 50% shorter than the prevalent splice form. Using RT-PCR we verified AS of two genes and show that they produce more isoforms than indicated by EST data. All cDNA/EST alignments and splice graphs are provided in a website at
Conclusions
The extent of AS in

Loading...
Loading...
Abstract
Background
Results
In order to identify the genes associated to the light-induced sporulation in
Conclusions
Gene ontology analysis suggested that most significantly downregulated genes are linked to DNA repair, cell division, inhibition of cell migration, and calcium release, while highly upregulated genes were involved in cell death, cell polarization, maintenance of integrity, and differentiation. In addition, cell death- associated transcripts were overrepresented between the upregulated transcripts. These changes are associated to a network of actin-binding proteins encoded by genes that are differentially regulated before and after light induction.

Loading...
Loading...
Abstract
Background
Outbreaks of the casuarina moth,
Results
The genome of LyxyMNPV consists of 156,344 bases, has a G+C content of 53.4% and contains 157 putative open reading frames (ORFs). The gene content and gene order of LyxyMNPV were similar to those of LdMNPV, with 151 ORFs identified as homologous to those reported in the LdMNPV genome. Two genes (Lyxy49 and Lyxy123) were homologous to other baculoviruses, and four unique LyxyMNPV ORFs (Lyxy11, Lyxy19, Lyxy130 and Lyxy131) were identified in the LyxyMNPV genome, including a
Conclusion
The gene parity plot analysis, percent identity of the gene homologues and a phylogenetic analysis suggested that LyxyMNPV is a Group II NPV that is most closely related to LdMNPV but with a highly distinct genomic organisation.

Loading...
Loading...
Abstract
Background
Downy mildew is a destructive grapevine disease caused by
Results
Early transcriptional changes associated with
Conclusions
Several candidate resistance genes were identified that could be exploited in future biotechnological approaches to increase disease resistance in susceptible grapevine species. Measurements of jasmonic acid and methyl jasmonate in infected leaves suggest that this hormone may also be involved in

Loading...
Loading...
Abstract
Background
The process of translation can be affected by the use of rare versus common codons within the mRNA transcript.
Results
Here, we show that rare codons are enriched at the 5' and 3' termini of genes from
Conclusions
Potential functional roles for the enrichment of rare codons at terminal positions are explored.

Loading...
Loading...
Abstract
Background
MicroRNAs (miRNAs) are small non-coding RNAs that post-transcriptionally regulate gene expression in a variety of organisms, including insects, vertebrates, and plants. miRNAs play important roles in cell development and differentiation as well as in the cellular response to stress and infection. To date, there are limited reports of miRNA identification in mosquitoes, insects that act as essential vectors for the transmission of many human pathogens, including flaviviruses. West Nile virus (WNV) and dengue virus, members of the
Results
We identified a total of 65 miRNAs in the
Conclusions

Loading...
Loading...
Abstract
Background
A large, multi-province outbreak of listeriosis associated with ready-to-eat meat products contaminated with
Results
The chromosomes were collinear, but differences included 28 single nucleotide polymorphisms (SNPs) and three indels, including a 33 kbp prophage that accounted for the observed difference in
Conclusions
High-throughput genome sequencing provided a more detailed real-time assessment of genetic traits characteristic of the outbreak strains than could be achieved with routine subtyping methods. This study confirms that the latest generation of DNA sequencing technologies can be applied during high priority public health events, and laboratories need to prepare for this inevitability and assess how to properly analyze and interpret whole genome sequences in the context of molecular epidemiology.

Loading...
Loading...
Abstract
Background
Rice is one of the most important food crops in the world. With increasing world demand for food crops, there is an urgent need to develop new cultivars that have enhanced performance with regard to yield, disease resistance, and so on. Wild rice is expected to provide useful genetic resources that could improve the present cultivated species. However, the quantity and quality of these unexplored resources remain unclear. Recent accumulation of the genomic information of both cultivated and wild rice species allows for their comparison at the molecular level. Here, we compared the genome sequence of
Results
We found that about four to five percent of the BESs of the two wild rice species and about seven percent of the African rice could not be mapped to the
Conclusion
We found that the

Loading...
Loading...
Abstract
Background
The presence of closely related genomes in polyploid species makes the assembly of total genomic sequence from shotgun sequence reads produced by the current sequencing platforms exceedingly difficult, if not impossible. Genomes of polyploid species could be sequenced following the ordered-clone sequencing approach employing contigs of bacterial artificial chromosome (BAC) clones and BAC-based physical maps. Although BAC contigs can currently be constructed for virtually any diploid organism with the SNaPshot high-information-content-fingerprinting (HICF) technology, it is currently unknown if this is also true for polyploid species. It is possible that BAC clones from orthologous regions of homoeologous chromosomes would share numerous restriction fragments and be therefore included into common contigs. Because of this and other concerns, physical mapping utilizing the SNaPshot HICF of BAC libraries of polyploid species has not been pursued and the possibility of doing so has not been assessed. The sole exception has been in common wheat, an allohexaploid in which it is possible to construct single-chromosome or single-chromosome-arm BAC libraries from DNA of flow-sorted chromosomes and bypass the obstacles created by polyploidy.
Results
The potential of the SNaPshot HICF technology for physical mapping of polyploid plants utilizing global BAC libraries was evaluated by assembling contigs of fingerprinted clones in an
Conclusions
The negligibly low level of incorporation of clones from homoeologous chromosome arms into a contig during contig assembly suggested that it is feasible to construct contigs and physical maps using global BAC libraries of wheat and almost certainly also of other plant polyploid species with genome sizes comparable to that of wheat. Because of the high purity of the resulting assembled contigs, they can be directly used for genome sequencing. It is currently unknown but possible that equally good BAC contigs can be also constructed for polyploid species containing smaller, more gene-rich genomes.

Loading...
Loading...
Abstract
Background
The Lipase Engineering Database (LED) integrates information on sequence, structure and function of lipases, esterases and related proteins with the α/β hydrolase fold. A new superfamily for
Results
The new release 3.0 (December 2009) of the Lipase Engineering Database contains 24783 sequence entries for 18585 proteins as well as 656 experimentally determined protein structures, including the structure of CALA. In comparison to the previous release
Conclusion
The LED serves as a valuable tool for the systematic analysis of single proteins or protein families. The updated release 3.0 was used for the evaluation of α/β hydrolases. The HTML version of the database with new features is available at

Loading...
Loading...
Abstract
Background
The evolutionarily ancient parasite,
Results
Over eleven million unique tags were generated, producing expression data for 7360 genes, covering 81% of the genes in the genome. Compared to microarray analysis of the related
Conclusions
DGE analysis is extremely sensitive for detecting gene expression differences, revealing firstly that a far greater number of genes are stage-regulated than had previously been identified and secondly and more importantly, this analysis has revealed the existence of several differentially expressed clusters of genes present on what appears to be the same polycistronic units, a phenomenon which had not previously been observed in microarray studies. These differentially regulated clusters of genes are in addition to the previously identified RNA polymerase I polycistronic units of variant surface glycoproteins and procyclin expression sites, which encode the major surface proteins of the parasite. This raises a number of questions regarding the function and regulation of the gene clusters that clearly warrant further study.

Loading...
Loading...
Abstract
Background
A high-sensitivity DNA microarray platform requiring nanograms of RNA input facilitates the application of transcriptome analysis to individual skeletal muscle (SM) tissue samples. Culturing myotubes from SM-biopsies enables investigating transcriptional defects and assaying therapeutic strategies. This study compares the transcriptome of aneurally cultured human SM cells versus that of tissue biopsies.
Results
We used the Illumina expression BeadChips to determine the transcriptomic differences between tissue and cultured SM samples from five individuals. Changes in the expression of several genes were confirmed by QuantiGene Plex assay or reverse transcription real-time PCR. In cultured myotubes compared to the tissue, 1216 genes were regulated: 583 down and 633 up. Gene ontology analysis showed that downregulated genes were mainly associated with cytoplasm, particularly mitochondria, and involved in metabolism and the muscle-system/contraction process. Upregulated genes were predominantly related to cytoplasm, endoplasmic reticulum, and extracellular matrix. The most significantly regulated pathway was mitochondrial dysfunction. Apoptosis genes were also modulated. Among the most downregulated genes detected in this study were genes encoding metabolic proteins AMPD1, PYGM, CPT1B and UCP3, muscle-system proteins TMOD4, MYBPC1, MYOZ1 and XIRP2, the proteolytic CAPN3 and the myogenic regulator MYF6. Coordinated reduced expression of five members of the GIMAP gene family, which form a cluster on chromosome 7, was shown, and the GIMAP4-reduction was validated. Within the most upregulated group were genes encoding senescence/apoptosis-related proteins CDKN1A and KIAA1199 and potential regulatory factors HIF1A, TOP2A and CCDC80.
Conclusions
Cultured muscle cells display reductive metabolic and muscle-system transcriptome adaptations as observed in muscle atrophy and they activate tissue-remodeling and senescence/apoptosis processes.

Loading...
Loading...
Abstract
Background
Fast evolving genes are targets of an increasing panel of biological studies, from cancer research to population genetics and species specific adaptations. Yet, their identification and isolation are still laborious, particularly for non-model organisms. We developed a method, named the Inter-Specific Selective Hybridization (ISSH) method, for generating cDNA libraries enriched in fast evolving genes. It utilizes transcripts of homologous tissues of distinct yet related species. Experimental hybridization conditions are monitored in order to discard transcripts that do not find their homologous counterparts in the two species sets as well as transcripts that display a strong complementarity between the two species. Only heteroduplexes that disanneal at low stringency are used for constructing the resulting cDNA library.
Results
We demonstrate the efficiency of the ISSH method by generating a brain cDNA library enriched in fast evolving transcripts of a non-model catfish species as well as a control, non-enriched library. Our results indicate that the enriched library contains effectively more fast evolving sequences than the control library. Gene annotation analyses also indicate enrichment in genes with low expression levels and non-ubiquitously expressed genes, both categories encompassing the majority of fast evolving genes. Furthermore, most of the identified transcripts show higher sequence divergence between two closely related catfish species as compared to recognized fast evolving DNA markers.
Conclusions
The ISSH method offers a simple, inexpensive and efficient way to screen the transcriptome for isolating fast evolving genes. This method opens new opportunities in the investigation of biological mechanisms that include fast evolving genes, such as the evolution of lineage specific processes and traits responsible for species adaptation to their environment.

Loading...
Loading...
Abstract
Background
Werner Syndrome (WS) is a rare disorder characterized by the premature onset of a number of age-related diseases. The gene responsible for WS encodes a DNA helicase/exonuclease protein believed to affect different aspects of transcription, replication, and/or DNA repair. In addition to genomic instability, human WS cells exhibit oxidative stress. In this report, we have examined the impact of exogenous hydrogen peroxide on the expression profile of mouse embryonic fibroblasts lacking part of the helicase domain of the
Results
Conclusion
Expression profiling of

Loading...
Loading...
Abstract
Background
In recent years a number of genome sequences for different
Results
To improve the current annotation and gene identification status of the draft genomes of
Conclusions
These findings allow a more thorough investigation of the genes that are important for host specificity in malaria.

Loading...
Loading...
Abstract
Background
Results
We constructed two linkage maps covering the 19 linkage groups in willow. The most detailed consensus map, S1, contains 495 markers with a total genetic distance of 2477 cM and an average distance of 5.0 cM between the markers. The S3 consensus map contains 221 markers and has a total genetic distance of 1793 cM and an average distance of 8.1 cM between the markers. We found high degree of synteny and gene order conservation between willow and poplar. There is however evidence for two major interchromosomal rearrangements involving poplar LG I and XVI and willow LG Ib, suggesting a fission or a fusion in one of the lineages, as well as five intrachromosomal inversions. The number of silent substitutions were three times lower (median: 0.12) between orthologs than between paralogs (median: 0.37 - 0.41).
Conclusions
The relatively slow rates of genomic change between willow and poplar mean that the genomic resources in poplar will be most useful in genomic research in willow, such as identifying genes underlying QTLs of important traits. Our data suggest that the whole-genome duplication occurred long before the divergence of the two genera, events which have until now been regarded as contemporary. Estimated silent substitution rates were 1.28 × 10-9 and 1.68 × 10-9 per site and year, which are close to rates found in other perennials but much lower than rates in annuals.

Loading...
Loading...
Abstract
Background
Results
A genomic survey for DNA transposons in the sequence assemblies of
Conclusions
New families of

Loading...
Loading...
Abstract
Background
The original sequencing and annotation of the
Results
Using Illumina sequencing and MAQ software, 83% of the base pair sequence reads were aligned to the reference genome available at Wormbase, providing a 21-fold coverage of the genome. Using the software programs MAQ and Slider, we observed 1124 base pair differences between Rec-1 and the reference genome in Wormbase (WS190), and 441 between the mutagenized Rec-1 (BC313) and the wild-type N2 strain (VC2010). The most frequent base-substitution was G:C to A:T, 141 for the entire genome most of which were on chromosomes I or X, 55 and 31 respectively. With this data removed, no obvious pattern in the distribution of the base differences along the chromosomes was apparent. No major chromosomal rearrangements were observed, but additional insertions of transposable elements were detected. There are 11 extra copies of Tc1, and 8 of Tc2 in the Rec-1 genome, most likely the remains of past high-hopper activity in a progenitor strain.
Conclusion
Our analysis of high-throughput sequencing was able to detect regions of direct repeat sequences, deletions, insertions of transposable elements, and base pair differences. A subset of sequence alterations affecting coding regions were confirmed by an independent approach using oligo array comparative genome hybridization. The major phenotype of the Rec-1 strain is an alteration in the preferred position of the meiotic recombination event with no other significant phenotypic consequences. In this study, we observed no evidence of a mutator effect at the nucleotide level attributable to the Rec-1 mutation.

Loading...
Loading...
Abstract
Background
Intracranial pediatric germ cell tumors (GCTs) are rare and heterogeneous neoplasms and vary in histological differentiation, prognosis and clinical behavior. Germinoma and mature teratoma are GCTs that have a good prognosis, while other types of GCTs, termed nongerminomatous malignant germ cell tumors (NGMGCTs), are tumors with an intermediate or poor prognosis. The second group of tumors requires more extensive drug and irradiation treatment regimens. The mechanisms underlying the differences in incidence and prognosis of the various GCT subgroups are unclear.
Results
We identified a distinct mRNA profile correlating with GCT histological differentiation and prognosis, and also present in this study the first miRNA profile of pediatric primary intracranial GCTs. Most of the differentially expressed miRNAs were downregulated in germinomas, but miR-142-5p and miR-146a were upregulated. Genes responsible for self-renewal (such as POU5F1 (OCT4), NANOG and KLF4) and the immune response were abundant in germinomas, while genes associated with neuron differentiation, Wnt/β-catenin pathway, invasiveness and epithelial-mesenchymal transition (including SNAI2 (SLUG) and TWIST2) were abundant in NGMGCTs. Clear transcriptome segregation based on patient survival was observed, with malignant NGMGCTs being closest to embryonic stem cells. Chromosome copy number variations (CNVs) at cytobands 4q13.3-4q28.3 and 9p11.2-9q13 correlated with GCT malignancy and clinical risk. Six genes (BANK1, CXCL9, CXCL11, DDIT4L, ELOVL6 and HERC5) within 4q13.3-4q28.3 were more abundant in germinomas.
Conclusions
Our results integrate molecular profiles with clinical observations and provide insights into the underlying mechanisms causing GCT malignancy. The genes, pathways and microRNAs identified have the potential to be novel therapeutic targets.

Loading...
Loading...
Abstract
Background
Protein phosphorylation is responsible for a large portion of the regulatory functions of eukaryotic cells. Although the list of sequenced genomes of filamentous fungi has grown rapidly, the kinomes of recently sequenced species have not yet been studied in detail. The objective of this study is to apply a comparative analysis of the kinase distribution in different fungal phyla, and to explore its relevance to understanding the evolution of fungi and their taxonomic classification. We have analyzed in detail 12 subgroups of kinases and their distribution over 30 species, as well as their potential use as a classifier for members of the fungal kingdom.
Results
Our findings show that despite the similarity of the kinase distribution in all fungi, their domain distributions and kinome density can potentially be used to classify them and give insight into their evolutionary origin. In general, we found that the overall representation of kinase groups is similar across fungal genomes, the only exception being a large number of tyrosine kinase-like (TKL) kinases predicted in
Conclusions
Comparison of the predicted kinomes of sequenced fungi suggests essential signaling functions common to all species, but also specific adaptations of the signal transduction networks to particular species.

Loading...
Loading...
Abstract
Background
Microarray technology is a popular means of producing whole genome transcriptional profiles, however high cost and scarcity of mRNA has led many studies to be conducted based on the analysis of single samples. We exploit the design of the Illumina platform, specifically multiple arrays on each chip, to evaluate intra-experiment technical variation using repeated hybridisations of universal human reference RNA (UHRR) and duplicate hybridisations of primary breast tumour samples from a clinical study.
Results
A clear batch-specific bias was detected in the measured expressions of both the UHRR and clinical samples. This bias was found to persist following standard microarray normalisation techniques. However, when mean-centering or empirical Bayes batch-correction methods (ComBat) were applied to the data, inter-batch variation in the UHRR and clinical samples were greatly reduced. Correlation between replicate UHRR samples improved by two orders of magnitude following batch-correction using ComBat (ranging from 0.9833-0.9991 to 0.9997-0.9999) and increased the consistency of the gene-lists from the duplicate clinical samples, from 11.6% in quantile normalised data to 66.4% in batch-corrected data. The use of UHRR as an inter-batch calibrator provided a small additional benefit when used in conjunction with ComBat, further increasing the agreement between the two gene-lists, up to 74.1%.
Conclusion
In the interests of practicalities and cost, these results suggest that single samples can generate reliable data, but only after careful compensation for technical bias in the experiment. We recommend that investigators appreciate the propensity for such variation in the design stages of a microarray experiment and that the use of suitable correction methods become routine during the statistical analysis of the data.

Loading...
Loading...
Abstract
Background
Grass carp (
Results
We constructed a first generation linkage map of grass carp with a mapping panel containing two F1 families including 192 progenies. Sixteen SNPs in genes and 263 microsatellite markers were mapped to twenty-four linkage groups (LGs). The number of LGs was corresponding to the haploid chromosome number of grass carp. The sex-specific map was 1149.4 and 888.8 cM long in females and males respectively whereas the sex-averaged map spanned 1176.1 cM. The average resolution of the map was 4.2 cM/locus. BLAST searches of sequences of mapped markers of grass carp against the whole genome sequence of zebrafish revealed substantial macrosynteny relationship and extensive colinearity of markers between grass carp and zebrafish.
Conclusions
The linkage map of grass carp presented here is the first linkage map of a food fish species based on co-dominant markers in the family Cyprinidae. This map provides a valuable resource for mapping phenotypic variations and serves as a reference to approach comparative genomics and understand the evolution of fish genomes and could be complementary to grass carp genome sequencing project.

Loading...
Loading...
Abstract
Background
To understand gene expression networks leading to functional properties of the soybean seed, we have undertaken a detailed examination of soybean seed development during the stages of major accumulation of oils, proteins, and starches, as well as the desiccating and mature stages, using microarrays consisting of up to 27,000 soybean cDNAs. A subset of these genes on a highly-repetitive 70-mer oligonucleotide microarray was also used to support the results.
Results
It was discovered that genes related to cell growth and maintenance processes, as well as energy processes like photosynthesis, decreased in expression levels as the cotyledons approached the mature, dry stage. Genes involved with some storage proteins had their highest expression levels at the stage of highest fresh weight. However, genes encoding many transcription factors and DNA binding proteins showed higher expression levels in the desiccating and dry seeds than in most of the green stages.
Conclusions
Data on 27,000 cDNAs have been obtained over five stages of soybean development, including the stages of major accumulation of agronomically-important products, using two different types of microarrays. Of particular interest are the genes found to peak in expression at the desiccating and dry seed stages, such as those annotated as transcription factors, which may indicate the preparation of pathways that will be needed later in the early stages of imbibition and germination.

Loading...
Loading...
Abstract
Background
Cancer cells undergo massive alterations to their DNA methylation patterns that result in aberrant gene expression and malignant phenotypes. However, the mechanisms that underlie methylome changes are not well understood nor is the genomic distribution of DNA methylation changes well characterized.
Results
Here, we performed methylated DNA immunoprecipitation combined with high-throughput sequencing (MeDIP-seq) to obtain whole-genome DNA methylation profiles for eight human breast cancer cell (BCC) lines and for normal human mammary epithelial cells (HMEC). The MeDIP-seq analysis generated non-biased DNA methylation maps by covering almost the entire genome with sufficient depth and resolution. The most prominent feature of the BCC lines compared to HMEC was a massively reduced methylation level particularly in CpG-poor regions. While hypomethylation did not appear to be associated with particular genomic features, hypermethylation preferentially occurred at CpG-rich gene-related regions independently of the distance from transcription start sites. We also investigated methylome alterations during epithelial-to-mesenchymal transition (EMT) in MCF7 cells. EMT induction was associated with specific alterations to the methylation patterns of gene-related CpG-rich regions, although overall methylation levels were not significantly altered. Moreover, approximately 40% of the epithelial cell-specific methylation patterns in gene-related regions were altered to those typical of mesenchymal cells, suggesting a cell-type specific regulation of DNA methylation.
Conclusions
This study provides the most comprehensive analysis to date of the methylome of human mammary cell lines and has produced novel insights into the mechanisms of methylome alteration during tumorigenesis and the interdependence between DNA methylome alterations and morphological changes.

Loading...
Loading...
Abstract
Background
Coliform bacteria are the most common etiologic agents in severe mastitis of cows.
Results
After 6 h 13 probe sets of differentially expressed genes (DEG) were detected in infected quarters versus control animals. Eighteen hours later 2154 and 476 DEG were found in infected and in neighboring quarters vs. control animals. Cluster analysis revealed DEG found only in infected quarters (local response) and DEG detected in both infected and neighboring quarters (systemic response). The first group includes genes mainly involved in immune response and inflammation, while the systemic reaction comprises antigen processing and presentation, cytokines, protein degradation and apoptosis. Enhanced expression of antimicrobial genes (
Conclusions
The discovery and holistic analysis of an extensive systemic reaction in the mammary gland significantly expands the knowledge of host-pathogen interactions in mastitis which may be relevant for the development of novel therapies and for genetic selection towards mastitis resistance.

Loading...
Loading...
Abstract
Background
Genome-wide homozygosity estimation from genomic data is becoming an increasingly interesting research topic. The aim of this study was to compare different methods for estimating individual homozygosity-by-descent based on the information from human genome-wide scans rather than genealogies. We considered the four most commonly used methods and investigated their applicability to single-nucleotide polymorphism (SNP) data in both a simulation study and by using the human genotyped data. A total of 986 inhabitants from the isolated Island of Vis, Croatia (where inbreeding is present, but no pedigree-based inbreeding was observed at the level of F > 0.0625) were included in this study. All individuals were genotyped with the Illumina HumanHap300 array with 317,503 SNP markers.
Results
Simulation data suggested that multi-point FEstim is the method most strongly correlated to true homozygosity-by-descent. Correlation coefficients between the homozygosity-by-descent estimates were high but only for inbred individuals, with nearly absolute correlation between single-point measures.
Conclusions
Deciding who is really inbred is a methodological challenge where multi-point approaches can be very helpful once the set of SNP markers is filtered to remove linkage disequilibrium. The use of several different methodological approaches and hence different homozygosity measures can help to distinguish between homozygosity-by-state and homozygosity-by-descent in studies investigating the effects of genomic autozygosity on human health.

Loading...
Loading...
Abstract
Background
In addition to shotgun sequencing, next generation sequencing has been shown to be suitable for deep sequencing of many specific PCR-amplified target genes in parallel. However, unspecific product formation is a common problem in amplicon sequencing since these fragments are difficult to fully remove by gel purification, and their presence inevitably reduces the number of mappable sequence reads that can be obtained in each sequencing run.
Results
We have used a novel flow cytometric sorting approach to specifically enrich Roche/454 DNA Capture beads carrying target DNA sequences on their surface, and reject beads carrying unspecific sequences. This procedure gives a nearly three-fold increase in the fraction of informative sequences obtained. Presented results also show that there are no significant differences in the distribution or presence of different genotypes between a FACS-enriched sample and a standard-enriched control sample.
Conclusions
Target-specific FACS enrichment prior to Roche/454 sequencing provides a quick, inexpensive way of increasing the amount of high quality data obtained in a single sequencing run, without introducing any sequence bias.

Loading...
Loading...
Abstract
Background
Higher crustaceans (class Malacostraca) represent the most species-rich and morphologically diverse group of non-insect arthropods and many of its members are commercially important. Although the crustacean DNA sequence information is growing exponentially, little is known about the genome organization of Malacostraca. Here, we constructed a bacterial artificial chromosome (BAC) library and performed BAC-end sequencing to provide genomic information for kuruma shrimp (
Results
Mj024A04 BAC clone, which includes one redundant sequence, contained 27 putative genes and seemed to display a normal genomic DNA structure. Notably, of the putative genes, 3 genes encode homologous proteins to the inhibitor of apoptosis protein and 7 genes encode homologous proteins to white spot syndrome virus, a virulent pathogen known to affect crustaceans. Colony hybridization and PCR analysis of 381 BAC clones showed that almost half of the BAC clones maintain DNA segments whose sequences are homologous to the representative BAC clone Mj024A04. The Mj024A04 partial sequence was detected multiple times in the kuruma shrimp nuclear genome with a calculated copy number of at least 100. Microsatellites based BAC genotyping clearly showed that Mj024A04 homologous sequences were cloned from at least 48 different chromosomal loci. The absence of micro-syntenic relationships with the available genomic sequences of
Conclusions
Our results demonstrate that hyper-expansion of large DNA segments took place in the kuruma shrimp genome. Although we analyzed only a part of the duplicated DNA segments, our result suggested that it is difficult to analyze the shrimp genome following normal analytical methodology. Hence, it is necessary to avoid repetitive sequence (such as segmental duplications) when studying the other unique structures in the shrimp genome.

Loading...
Loading...
Abstract
Background
Transcriptomics has resulted in the development of large data sets and tools for the progression of functional genomics and systems biology in many model organisms. Currently there is no commercially available microarray to allow such expression studies in
Results
A custom designed Affymetrix tobacco expression microarray was generated from a set of over 40k unigenes and used to measure gene expression in 19 different tobacco samples to produce the Tobacco Expression Atlas (TobEA). TobEA provides a snap shot of the transcriptional activity for thousands of tobacco genes in different tissues throughout the lifecycle of the plant and enables the identification of the biological processes occurring in these different tissues. 772 of 2513 transcription factors previously identified in tobacco were mapped to the array, with 87% of them being expressed in at least one tissue in the atlas. Putative transcriptional networks were identified based on the co-expression of these transcription factors. Several interactions in a floral identity transcription factor network were consistent with previous results from other plant species. To broaden access and maximise the benefit of TobEA a set of tools were developed to provide researchers with expression information on their genes of interest via the Solanaceae Genomics Network (SGN) web site. The array has also been made available for public use via the Nottingham Arabidopsis Stock Centre microarray service.
Conclusions
The generation of a tobacco expression microarray is an important development for research in this model plant. The data provided by TobEA represents a valuable resource for plant functional genomics and systems biology research and can be used to identify gene targets for both fundamental and applied scientific applications in tobacco.

Loading...
Loading...
Abstract
Background
Results
The
Conclusions
Although the chloroplast genome of

Loading...
Loading...
Abstract
Background
Cells have the ability to respond and adapt to environmental changes through activation of stress-activated protein kinases (SAPKs). Although p38 SAPK signalling is known to participate in the regulation of gene expression little is known on the molecular mechanisms used by this SAPK to regulate stress-responsive genes and the overall set of genes regulated by p38 in response to different stimuli.
Results
Here, we report a whole genome expression analyses on mouse embryonic fibroblasts (MEFs) treated with three different p38 SAPK activating-stimuli, namely osmostress, the cytokine TNFα and the protein synthesis inhibitor anisomycin. We have found that the activation kinetics of p38α SAPK in response to these insults is different and also leads to a complex gene pattern response specific for a given stress with a restricted set of overlapping genes. In addition, we have analysed the contribution of p38α the major p38 family member present in MEFs, to the overall stress-induced transcriptional response by using both a chemical inhibitor (SB203580) and p38α deficient (p38α-/-) MEFs. We show here that p38 SAPK dependency ranged between 60% and 88% depending on the treatments and that there is a very good overlap between the inhibitor treatment and the ko cells. Furthermore, we have found that the dependency of SAPK varies depending on the time the cells are subjected to osmostress.
Conclusions
Our genome-wide transcriptional analyses shows a selective response to specific stimuli and a restricted common response of up to 20% of the stress up-regulated early genes that involves an important set of transcription factors, which might be critical for either cell adaptation or preparation for continuous extra-cellular changes. Interestingly, up to 85% of the up-regulated genes are under the transcriptional control of p38 SAPK. Thus, activation of p38 SAPK is critical to elicit the early gene expression program required for cell adaptation to stress.

Loading...
Loading...
Abstract
Background
The use of global gene expression profiling is a well established approach to understand biological processes. One of the major goals of these investigations is to identify sets of genes with similar expression patterns. Such gene signatures may be very informative and reveal new aspects of particular biological processes. A logical and systematic next step is to reduce the identified gene signatures to the regulatory components that induce the relevant gene expression changes. A central issue in this context is to identify transcription factors, or transcription factor binding sites (TFBS), likely to be of importance for the expression of the gene signatures.
Results
We develop a strategy that efficiently produces TFBS/promoter databases based on user-defined criteria. The resulting databases constitute all genes in the Santa Cruz database and the positions for all TFBS provided by the user as position weight matrices. These databases are then used for two purposes, to identify significant TFBS in the promoters in sets of genes and to identify clusters of co-occurring TFBS. We use two criteria for significance, significantly enriched TFBS in terms of total number of binding sites for the promoters, and significantly present TFBS in terms of the fraction of promoters with binding sites. Significant TFBS are identified by a re-sampling procedure in which the query gene set is compared with typically 105 gene lists of similar size randomly drawn from the TFBS/promoter database. We apply this strategy to a large number of published ChIP-Chip data sets and show that the proposed approach faithfully reproduces ChIP-Chip results. The strategy also identifies relevant TFBS when analyzing gene signatures obtained from the MSigDB database. In addition, we show that several TFBS are highly correlated and that co-occurring TFBS define functionally related sets of genes.
Conclusions
The presented approach of promoter analysis faithfully reproduces the results from several ChIP-Chip and MigDB derived gene sets and hence may prove to be an important method in the analysis of gene signatures obtained through ChIP-Chip or global gene expression experiments. We show that TFBS are organized in clusters of co-occurring TFBS that together define highly coherent sets of genes.

Loading...
Loading...
Abstract
Background
Rat models are frequently used to link genomic regions to experimentally induced arthritis in quantitative trait locus (QTL) analyses. To facilitate the search for candidate genes within such regions, we have previously developed an application (CGC) that uses weighted keywords to rank genes based on their descriptive text. In this study, CGC is used for analyzing the localization of candidate genes from two viewpoints: distribution over the rat genome and functional connections between arthritis QTLs.
Methods
To investigate if candidate genes identified by CGC are more likely to be found inside QTLs, we ranked 2403 genes genome wide in rat. The number of genes within different ranges of CGC scores localized inside and outside QTLs was then calculated. Furthermore, we investigated if candidate genes within certain QTLs share similar functions, and if these functions could be connected to genes within other QTLs. Based on references between genes in OMIM, we created connections between genes in QTLs identified in two distinct rat crosses. In this way, QTL pairs with one QTL from each cross that share an unexpectedly high number of gene connections were identified. The genes that were found to connect a pair of QTLs were then functionally analysed using a publicly available classification tool.
Results
Out of the 2403 genes ranked by the CGC application, 1160 were localized within QTL regions. No difference was observed between highly and lowly rated genes. Hence, highly rated candidate genes for arthritis seem to be distributed randomly inside and outside QTLs. Furthermore, we found five pairs of QTLs that shared a significantly high number of interconnected genes. When functionally analyzed, most genes connecting two QTLs could be included in a single functional cluster. Thus, the functional connections between these genes could very well be involved in the development of an arthritis phenotype.
Conclusions
From the genome wide CGC search, we conclude that candidate genes for arthritis in rat are randomly distributed between QTL and non-QTL regions. We do however find certain pairs of QTLs that share a large number of functionally connected candidate genes, suggesting that these QTLs contain a number of genes involved in similar functions contributing to the arthritis phenotype.

Loading...
Loading...
Abstract
Background
The apicomplexans are a diverse phylum of parasites causing an assortment of diseases including malaria in a wide variety of animals and lymphoproliferation in cattle. Little is known about how these varied parasites regulate their transcriptional regulons. Even less is known about how regulon systems, consisting of transcription factors and target genes together with their associated biological process, evolve in these diverse parasites.
Results
In order to obtain insights into the differences in transcriptional regulation between these parasites we compared the orthology profiles of putative malaria transcription factors across species and examined the enrichment patterns of four binding sites across eleven apicomplexans.
About three-fifths of the factors are broadly conserved in several phylogenetic orders of sequenced apicomplexans. This observation suggests the existence of regulons whose regulation is conserved across this ancient phylum. Transcription factors not broadly conserved across the phylum are possibly involved in regulon systems that have diverged between species. Examining binding site enrichment patterns in light of transcription factor conservation patterns suggests a second mode via which regulon systems may diverge - rewiring of existing transcription factors and their associated binding sites in specific ways. Integrating binding sites with transcription factor conservation patterns also facilitated prediction of putative regulators for one of the binding sites.
Conclusions
Even though transcription factors are underrepresented in apicomplexans, the distribution of these factors and their associated regulons reflect common and family-specific transcriptional regulatory processes.

Loading...
Loading...
Abstract
Background
MicroRNA (miRNA) and other small regulatory RNAs contribute to the modulation of a large number of cellular processes. We sequenced three small RNA libraries prepared from the whole body, and the anterior-middle and posterior silk glands of
Results
With the aid of large-scale Solexa sequencing technology, we validated 257 unique miRNA genes, including 202 novel and 55 previously reported genes, corresponding to 324 loci in the silkworm genome. Over 30 known silkworm miRNAs were further corrected in their sequence constitutes and length. A number of reads originated from the loop regions of the precursors of two previously reported miRNAs (bmo-miR-1920 and miR-1921). Interestingly, the majority of the newly identified miRNAs were silkworm-specific, 23 unique miRNAs were widely conserved from invertebrates to vertebrates, 13 unique miRNAs were limited to invertebrates, and 32 were confined to insects. We identified 24 closely positioned clusters and 45 paralogs of miRNAs in the silkworm genome. However, sequence tags showed that paralogs or clusters were not prerequisites for coordinated transcription and accumulation. The majority of silkworm-specific miRNAs were located in transposable elements, and displayed significant differences in abundance between the anterior-middle and posterior silk gland.
Conclusions
Conservative analysis revealed that miRNAs can serve as phylogenetic markers and function in evolutionary signaling. The newly identified miRNAs greatly enrich the repertoire of insect miRNAs, and provide insights into miRNA evolution, biogenesis, and expression in insects. The differential expression of miRNAs in the anterior-middle and posterior silk glands supports their involvement as new levels in the regulation of the silkworm silk gland.

Loading...
Loading...
Abstract
Background
Ultra Conserved Regions (UCRs) are a class of 481 noncoding sequences located in both intra- and inter-genic regions of the genome. The recent findings that they are significantly altered in adult chronic lymphocytic leukemias, carcinomas, and pediatric neuroblastomas lead to the hypothesis that UCRs may play a role in tumorigenesis.
Results
We present a novel application of Ribo-SPIA™ isothermal linear amplification of minute RNA quantities for quantifying Transcribed-UCR (T-UCR) expression by quantitative PCR. Direct comparison of non-amplified with amplified cDNA in two neuroblastoma cell lines showed that the amplification approach increases sensitivity and repeatability in T-UCR quantification. It is noteworthy that the Ribo-SPIA™ step allowed us to analyze all 481 T-UCRs by using 150 ng of RNA, while introducing a minimal bias and preserving the magnitude of relative expression. Only the less abundant T-UCRs have high intra-assay variability, consistently with the Poisson distribution statistics and stochastic effects on PCR repeatability.
Conclusions
We demonstrated that the quantification procedure shown here is an accurate and reliable technique for genome-wide non coding gene (i.e., UCRs) profiling using small amounts of RNA. This issue is particularly important because studies of transcription regulation are increasingly conducted in small homogeneous samples, such as laser capture microdissected or sorted cell populations.

Loading...
Loading...
Abstract
Background
With its genome sequence and other experimental attributes,
Results
Using a genomic microarray representing the majority of the transcriptome, we profiled gene expression in stem segments that spanned primary to secondary growth. We found 3,016 genes that were differentially expressed during stem development (Q-value ≤ 0.05; >2-fold expression variation), and 15% of these genes encode proteins with no significant identities to known genes. We identified all gene family members putatively involved in secondary growth for carbohydrate active enzymes, tubulins, actins, actin depolymerizing factors, fasciclin-like AGPs, and vascular development-associated transcription factors. Almost 70% of expressed transcription factors were upregulated during the transition to secondary growth. The primary shoot elongation region of the stem contained specific carbohydrate active enzyme and expansin family members that are likely to function in primary cell wall synthesis and modification. Genes involved in plant defense and protective functions were also dominant in the primary growth region.
Conclusion
Our results describe the global patterns of gene expression that occur during the transition from primary to secondary stem growth. We were able to identify three major patterns of gene expression and over-represented gene ontology categories during stem development. The new regulatory factors and cell wall biogenesis genes that we identified provide candidate genes for further functional characterization, as well as new tools for molecular breeding and biotechnology aimed at improvement of tree growth rate, crown form, and wood quality.

Loading...
Loading...
Abstract
Background
Ultraconserved elements (UCEs) are highly constrained elements of mammalian genomes, whose functional role has not been completely elucidated yet. Previous studies have shown that some of them act as enhancers in mouse, while some others are expressed in both normal and cancer-derived human tissues. Only one UCE element so far was shown to present these two functions concomitantly, as had been observed in other isolated instances of single, non ultraconserved enhancer elements.
Results
We used a custom microarray to assess the levels of UCE transcription during mouse development and integrated these data with published microarray and next-generation sequencing datasets as well as with newly produced PCR validation experiments. We show that a large fraction of non-exonic UCEs is transcribed across all developmental stages examined from only one DNA strand. Although the nature of these transcripts remains a mistery, our meta-analysis of RNA-Seq datasets indicates that they are unlikely to be short RNAs and that some of them might encode nuclear transcripts. In the majority of cases this function overlaps with the already established enhancer function of these elements during mouse development. Utilizing several next-generation sequencing datasets, we were further able to show that the level of expression observed in non-exonic UCEs is significantly higher than in random regions of the genome and that this is also seen in other regions which act as enhancers.
Conclusion
Our data shows that the concurrent presence of enhancer and transcript function in non-exonic UCE elements is more widespread than previously shown. Moreover through our own experiments as well as the use of next-generation sequencing datasets, we were able to show that the RNAs encoded by non-exonic UCEs are likely to be long RNAs transcribed from only one DNA strand.

Loading...
Loading...
Abstract
Background
Rodents represent a high-risk reservoir for the emergence of new human pathogens. The recent completion of the 2.3 Mb genome of
Results
To study the population dynamics of
Conclusion
Our study has revealed a geographic microstructure of

Loading...
Loading...
Abstract
Background
CTBT (7-chlorotetrazolo [5,1-c]benzo[1,2,4]triazine) increases efficacy of commonly used antifungal agents by an unknown mechanism. It increases the susceptibility of
Results
To identify the cellular processes affected by CTBT, we screened the systematic haploid deletion mutant collection for CTBT sensitive mutants. We identified 169 hypersensitive deletion mutants. The deleted genes encode proteins mainly involved in mitochondrial functions, DNA repair, transcription and chromatin remodeling, and oxidative stress response. We found that the susceptibility of yeast cells to CTBT depends on molecular oxygen. Transcriptome analysis of the immediate early response to CTBT revealed rapid induction of oxidant and stress response defense genes. Many of these genes depend on the transcription factors Yap1 and Cin5. Yap1 accumulates rapidly in the nucleus in CTBT treated cells suggesting acute oxidative stress. Moreover, molecular calculations supported a superoxide generating activity of CTBT. Superoxide production
Conclusion
We conclude that CTBT causes intracellular superoxide production and oxidative stress in fungal cells and is thus enhancing antimycotic drug effects by a secondary stress.

Loading...
Loading...
Abstract
Background
We have previously sequenced more than 500 kb of the duplicated MHC class I regions in Atlantic salmon. In the IA region we identified the loci for the MHC class I gene
Results
The genomic localisation of two MHC class I lineages (Z and S) has been resolved. 7 BACs were sequenced using a combination of standard Sanger and 454 sequencing. The new sequence data extended the IA region with 150 kb identifying the location of one Z-lineage locus,
Conclusion
Here we describe the genomic organization of MHC class I loci from the U-, Z-, and S-lineages in Atlantic salmon. Nine of the described class I genes are located in the extension of the duplicated IA and IB regions, while three class I genes are found on two separate linkage groups. The gene organization of the two regions indicates that the IB region is evolving at a different pace than the IA region. Expression profiling, polymorphic content, peptide binding properties and phylogenetic relationship show that Atlantic salmon has only one MHC class Ia gene (

Loading...
Loading...
Abstract
Background
MicroRNAs (miRNAs) are endogenous small RNAs that modulate gene expression at the post-transcriptional level by binding complementary sites in the 3'-UTR. In a recent genome-wide study reporting a new miRNA target class (miBridge), we identified and validated interactions between 5'-UTRs and miRNAs. Separately, upstream AUGs (uAUGs) in 5'-UTRs are known to regulate genes translationally without affecting mRNA levels, one of the mechanisms for miRNA-mediated repression.
Results
Using sequence data from whole-genome cDNA alignments we identified 1418 uAUG sequences on the 5'-UTR that specifically interact with 3'-ends of conserved miRNAs. We computationally identified miRNAs that can target six genes through their uAUGs that were previously reported to suppress translation. We extended this meta-analysis by confirming expression of these miRNAs in cell-lines used in the uAUG studies. Similarly, seven members of the
Conclusions
While both miRNAs and uAUGs are separately known to down-regulate protein expression, we show that they may be functionally related by identifying potential interactions through a sequence-specific binding mechanism. Using prior experimental evidence that shows uAUG effects on translation repression together with miRNA expression data specific to cell lines, we demonstrate through computational analysis that cell-specific down-regulation of protein expression (while maintaining mRNA levels) correlates well with the simultaneous presence of miRNA and target uAUG sequences in one cell type and not others, suggesting tissue-specific translation repression by miRNAs through uAUGs.

Loading...
Loading...
Abstract
Background
For decades, linkage mapping has been one of the most powerful and widely used approaches for elucidating the genetic architecture of phenotypic traits of medical, agricultural and evolutionary importance. However, successful mapping of Mendelian and quantitative phenotypic traits depends critically on the availability of fast and preferably high-throughput genotyping platforms. Several array-based single nucleotide polymorphism (SNP) genotyping platforms have been developed for genetic model organisms during recent years but most of these methods become prohibitively expensive for screening large numbers of individuals. Therefore, inexpensive, simple and flexible genotyping solutions that enable rapid screening of intermediate numbers of loci (~75-300) in hundreds to thousands of individuals are still needed for QTL mapping applications in a broad range of organisms.
Results
Here we describe the discovery of and application of insertion-deletion (INDEL) polymorphisms for cost-efficient medium throughput genotyping that enables analysis of >75 loci in a single automated sequencer electrophoresis column with standard laboratory equipment. Genotyping of INDELs requires low start-up costs, includes few standard sample handling steps and is applicable to a broad range of species for which expressed sequence tag (EST) collections are available. As a proof of principle, we generated a partial INDEL linkage map in Atlantic salmon (
Conclusions
The INDEL genotyping enabled fast coarse-mapping of chromosomal regions containing QTL, thus providing an efficient means for characterization of genetic architecture in multiple crosses and large pedigrees. This enables not only the discovery of larger number of QTLs with relatively smaller phenotypic effect but also provides a cost-effective means for evaluation of the frequency of segregating QTLs in outbred populations which is important for further understanding how genetic variation underlying phenotypic traits is maintained in the wild.

Loading...
Loading...
Abstract
Background
Swarming is a multicellular phenomenom characterized by the coordinated and rapid movement of bacteria across semisolid surfaces. In
Results
More than a thousand genes were identified as differentially expressed in response to growth on agar surfaces including genes for several metabolic activities, iron uptake, chemotaxis, motility and stress-related genes. Under swarming-specific conditions, the most remarkable response was the up-regulation of iron-related genes. We demonstrate that the pSymA plasmid and specifically genes required for the biosynthesis of the siderophore rhizobactin 1021 are essential for swarming of a
Conclusions
The present work represents the first transcriptomic study of rhizobium growth on surfaces including swarming inducing conditions. The results have revealed major changes in the physiology of

Loading...
Loading...
Abstract
Background
Utilization of the natural genetic variation in traditional breeding programs remains a major challenge in crop plants. The identification of candidate genes underlying, or associated with, phenotypic trait QTLs is desired for effective marker assisted breeding. With the advent of high throughput -omics technologies, screening of entire populations for association of gene expression with targeted traits is becoming feasible but remains costly. Here we present the identification of novel candidate genes for different potato tuber quality traits by employing a pooling approach reducing the number of hybridizations needed. Extreme genotypes for a quantitative trait are collected and the RNA from contrasting bulks is then profiled with the aim of finding differentially expressed genes.
Results
We have successfully implemented the pooling strategy for potato quality traits and identified candidate genes associated with potato tuber flesh color and tuber cooking type. Elevated expression level of a dominant allele of the β-carotene hydroxylase (
Conclusions
A pooling approach for RNA-profiling with the aim of identifying novel candidate genes associated with tuber quality traits was successfully implemented. The identified candidate genes for tuber flesh color (

Loading...
Loading...
Abstract
Background
Variations in recombination fraction (θ) among chromosomal regions, individuals and families have been observed and have an important impact on quantitative trait loci (QTL) mapping studies. Such variations on porcine chromosome X (SSC-X) and on other mammalian chromosome X are rarely explored. The emerging assembly of pig sequence provides exact physical location of many markers, facilitating the study of a fine-scale recombination landscape of the pig genome by comparing a clone-based physical map to a genetic map. Using large offspring of F1 females from two large-scale resource populations (Large White ♂ × Chinese Meishan ♀, and White Duroc ♂ × Chinese Erhualian ♀), we were able to evaluate the heterogeneity in θ for a specific interval among individual F1 females.
Results
Alignments between the cytogenetic map, radiation hybrid (RH) map, genetic maps and clone map of SSC-X with the physical map of human chromosome X (HSA-X) are presented. The most likely order of 60 markers on SSC-X is inferred. The average recombination rate across SSC-X is of ~1.27 cM/Mb. However, almost no recombination occurred in a large region of ~31 Mb extending from the centromere to Xq21, whereas in the surrounding regions and in the Xq telomeric region a recombination rate of 2.8-3.3 cM/Mb was observed, more than twice the chromosome-wide average rate. Significant differences in θ among F1 females within each population were observed for several chromosomal intervals. The largest variation was observed in both populations in the interval
Conclusions
This study reveals marked regional, individual and haplotype-specific differences in recombination rate on SSC-X. Lack of recombination in such a large region makes it impossible to narrow QTL interval using traditional fine-mapping approaches. The relationship between recombination variation and haplotype polymorphism is shown for the first time in pigs.

Loading...
Loading...
Abstract
Background
Tsetse flies, vectors of African trypanosomes, undergo viviparous reproduction (the deposition of live offspring). This reproductive strategy results in a large maternal investment and the deposition of a small number of progeny during a female's lifespan. The reproductive biology of tsetse has been studied on a physiological level; however the molecular analysis of tsetse reproduction requires deeper investigation. To build a foundation from which to base molecular studies of tsetse reproduction, a cDNA library was generated from female tsetse (
Results
Analysis of a nonredundant catalogue of 1391 contigs resulted in 520 predicted proteins. 475 of these proteins were full length. We predict that 412 of these represent cytoplasmic proteins while 57 are secreted. Comparison of these proteins with other tissue specific tsetse cDNA libraries (salivary gland, fat body/milk gland, and midgut) identified 51 that are unique to the reproductive/immature cDNA library. 11 unique proteins were homologus to uncharacterized putative proteins within the NR database suggesting the identification of novel genes associated with reproductive functions in other insects (hypothetical conserved). The analysis also yielded seven putative proteins without significant homology to sequences present in the public database (unknown genes). These proteins may represent unique functions associated with tsetse's viviparous reproductive cycle. RT-PCR analysis of hypothetical conserved and unknown contigs was performed to determine basic tissue and stage specificity of the expression of these genes.
Conclusion
This paper identifies 51 putative proteins specific to a tsetse reproductive/immature EST library. 11 of these proteins correspond to hypothetical conserved genes and 7 proteins are tsetse specific.

Loading...
Loading...
Abstract
Background
The diversity of placental architectures within and among mammalian orders is believed to be the result of adaptive evolution. Although, the genetic basis for these differences is unknown, some may arise from rapidly diverging and lineage-specific genes. Previously, we identified 91 novel lineage-specific transcripts (LSTs) from a cow term-placenta cDNA library, which are excellent candidates for adaptive placental functions acquired by the ruminant lineage. The aim of the present study was to infer functions of previously uncharacterized lineage-specific genes (LSGs) using co-expression, promoter, pathway and network analysis.
Results
Clusters of co-expressed genes preferentially expressed in liver, placenta and thymus were found using 49 previously uncharacterized LSTs as seeds. Over-represented composite transcription factor binding sites (TFBS) in promoters of clustered LSGs and known genes were then identified computationally. Functions were inferred for nine previously uncharacterized LSGs using co-expression analysis and pathway analysis tools. Our results predict that these LSGs may function in cell signaling, glycerophospholipid/fatty acid metabolism, protein trafficking, regulatory processes in the nucleus, and processes that initiate parturition and immune system development.
Conclusions
The placenta is a rich source of lineage-specific genes that function in the adaptive evolution of placental architecture and functions. We have shown that co-expression, promoter, and gene network analyses are useful methods to infer functions of LSGs with heretofore unknown functions. Our results indicate that many LSGs are involved in cellular recognition and developmental processes. Furthermore, they provide guidance for experimental approaches to validate the functions of LSGs and to study their evolution.

Loading...
Loading...
Abstract
Background
The hypothalamus plays a central role in regulating appetite and metabolism. However, the gene networks within the hypothalamus that regulate feed intake and metabolism, and the effects of fasting on those pathways are not completely understood in any species. The present experiment evaluated global hypothalamic gene expression in newly hatched chicks using microarray analysis to elucidate genes and pathways regulated by feeding, fasting, and delayed feeding. Ten groups of chicks were sampled over four days post-hatch, including fed, fasted, and 48 h fasted followed by access to feed for 4 h, 24 h, and 48 h. Hypothalamic samples were collected for microarray analysis (n = 4). Expression patterns of selected genes were confirmed by quantitative real-time PCR. Pathway analysis of the microarray results predicted a network of genes involved in neuropeptide or neurotransmitter signaling. To confirm the functionality of this predicted gene network, hypothalamic neurons from fed and fasted chicks were isolated and cultured in the presence of neuropeptide Y, somatostatin, α-melanocyte stimulating hormone, norepinephrine, and L-phospho-serine. Results confirmed functional relationships among members of the predicted gene network. Moreover, the effects observed were dependant upon the nutritional state of the animals (fed
Results
Differences in gene expression (≥ 1.6 fold) were detected in 1,272 genes between treatments, and of those, 119 genes were significantly (P < 0.05) different. Pathway Miner analysis revealed that six genes (
Conclusions
Microarray analysis of the hypothalamus during different nutritional states revealed that many genes are differentially regulated. We found that functional interactions exist among six differentially regulated genes associated within a putative gene network from this experiment. Considering that

Loading...
Loading...
Abstract
Background
Laser capture microdissection (LCM) has successfully isolated pure cell populations from tissue sections and the combination of LCM with standard genomic and proteomic methods has revolutionized molecular analysis of complex tissue. However, the quantity and quality of material recovered after LCM is often still limited for analysis by using whole genomic and proteomic approaches. To procure high quality and quantity of RNA after LCM, we optimized the procedures on tissue preparations and applied the approach for cell type-specific miRNA expression profiling in colorectal tumors.
Results
We found that the ethanol fixation of tissue sections for 2 hours had the maximum improvement of RNA quality (1.8 fold, p = 0.0014) and quantity (1.5 fold, p = 0.066). Overall, the quality (RNA integrity number, RIN) for the microdissected colorectal tissues was 5.2 ± 1.5 (average ± SD) for normal (n = 43), 5.7 ± 1.1 for adenomas (n = 14) and 7.2 ± 1.2 for carcinomas (n = 44). We then compared miRNA expression profiles of 18 colorectal tissues (6 normal, 6 adenomas and 6 carcinomas) between LCM selected epithelial cells versus stromal cells using Agilent miRNA microarrays. We identified 51 differentially expressed miRNAs (p <= 0.001) between these two cell types. We found that the miRNAs in the epithelial cells could differentiate adenomas from normal and carcinomas. However, the miRNAs in the stromal and mixed cells could not separate adenomas from normal tissues. Finally, we applied quantitative RT-PCR to cross-verify the expression patterns of 7 different miRNAs using 8 LCM-selected epithelial cells and found the excellent correlation of the fold changes between the two platforms (R = 0.996).
Conclusions
Our study demonstrates the feasibility and potential power of discovering cell type-specific miRNA biomarkers in complex tissue using combination of LCM with genome-wide miRNA analysis.

Loading...
Loading...
Abstract
Background
Results
The
Conclusions
The determination of the genome sequence and expressed proteome of the ulcerogenic species
The genome sequence was deposited in EMBL/GenBank/DDBJ under accession number

Loading...
Loading...
Abstract
Background
6S RNA from
Results
To solve this apparent ambiguity and to better understand the role of 6S RNA in stationary phase transition we have performed a genome-wide transcriptional analysis of wild-type and 6S RNA deficient cells growing to mid-log or early stationary phase. We found 245 genes at the exponential growth phase and 273 genes at the early stationary phase to be ≥ 1.5-fold differentially expressed. Up- and down-regulated genes include many transcriptional regulators, stress-related proteins, transporters and several enzymes involved in purine metabolism. As the most striking result during stationary phase, however, we obtained in the 6S RNA deficient strain a concerted expression reduction of genes constituting the translational apparatus. In accordance, primer extension analysis showed that transcription of ribosomal RNAs, representing the key molecules for ribosome biogenesis, is also significantly reduced under the same conditions. Consistent with this finding biochemical analysis of the 6S RNA deficient strain indicates that the lack of 6S RNA is apparently compensated by an increase of the basal ppGpp concentration, known to affect growth adaptation and ribosome biogenesis.
Conclusions
The analysis demonstrated that the effect of 6S RNA on transcription is not strictly confined to σ70-dependent promoters. Moreover, the results indicate that 6S RNA is embedded in stationary phase adaptation, which is governed by the capacity of the translational machinery.

Loading...
Loading...
Abstract
Background
The TATA-box and TATA-variants are regulatory elements involved in the formation of a transcription initiation complex. Both have been conserved throughout evolution in a restricted region close to the Transcription Start Site (TSS). However, less than half of the genes in model organisms studied so far have been found to contain either one of these elements. Indeed different core-promoter elements are involved in the recruitment of the TATA-box-binding protein. Here we assessed the possibility of identifying novel functional motifs in plant genes, sharing the TATA-box topological constraints.
Results
We developed an
Conclusions
Our observations suggest that the TC-elements might constitute a class of novel regulatory elements participating towards the complex modulation of gene expression in plants.

Loading...
Loading...
Abstract
Background
Retrotransposons are transposable elements that proliferate within eukaryotic genomes through a process involving reverse transcription. The numbers of retrotransposons within genomes and differences between closely related species may yield insight into the evolutionary history of the elements. Less is known about the ongoing dynamics of retrotransposons, as analysis of genome sequences will only reveal insertions of retrotransposons that are fixed - or near fixation - in the population or strain from which genetic material has been extracted for sequencing. One pre-requisite for retrotransposition is transcription of the elements. Given their intrinsic sequence redundancy, transcriptome-level analyses of transposable elements are scarce. We have used recently published transcriptome data from the fission yeast
Results
Two independent sets of transcriptome data reveal the presence of full-length, polyadenylated transcripts from LTR retrotransposons in
Conclusions
Presumably, the host organism negatively regulates proliferation of LTR retrotransposons. The finding of considerable transcriptional activity of retrotransposons suggests that part of this regulation is likely to take place at a post-transcriptional level. Alternatively, the transcriptional activity may signify a hitherto unrecognized activity level of retrotransposon proliferation. Our findings underline the usefulness of transcriptome data in elucidating dynamics in retrotransposon transcription.

Loading...
Loading...
Abstract
Background
The phenomenon of desiccation tolerance, also called anhydrobiosis, involves the ability of an organism to survive the loss of almost all cellular water without sustaining irreversible damage. Although there are several physiological, morphological and ecological studies on tardigrades, only limited DNA sequence information is available. Therefore, we explored the transcriptome in the active and anhydrobiotic state of the tardigrade
Results
We sequenced a total of 9984 expressed sequence tags (ESTs) from two cDNA libraries from the eutardigrade
Conclusions
This study describes novel sequence data from the tardigrade

Loading...
Loading...
Abstract
Background
In
Results
In this study, we compare the known binding sites of the DCC with oligonucleotide profiles that measure the specificity of the sequences of the
Conclusions
These results offer an explanation for the wild-type binding of the DCC along the

Loading...
Loading...
Abstract
Background
The SROs (SIMILAR TO RCD-ONE) are a group of plant-specific proteins which have important functions in stress adaptation and development. They contain the catalytic core of the poly(ADP-ribose) polymerase (PARP) domain and a C-terminal RST (RCD-SRO-TAF4) domain. In addition to these domains, several, but not all, SROs contain an N-terminal WWE domain.
Results
SROs are present in all analyzed land plants and sequence analysis differentiates between two structurally distinct groups; cryptogams and monocots possess only group I SROs whereas eudicots also contain group II. Group I SROs possess an N-terminal WWE domain (PS50918) but the WWE domain is lacking in group II SROs. Group I domain structure is widely represented in organisms as distant as humans (for example, HsPARP11). We propose a unified nomenclature for the SRO family. The SROs are able to interact with transcription factors through the C-terminal RST domain but themselves are generally not regulated at the transcriptional level. The most conserved feature of the SROs is the catalytic core of the poly(ADP-ribose) polymerase (PS51059) domain. However, bioinformatic analysis of the SRO PARP domain fold-structure and biochemical assays of AtRCD1 suggested that SROs do not possess ADP-ribosyl transferase activity.
Conclusions
The SROs are a highly conserved family of plant specific proteins. Sequence analysis of the RST domain implicates a highly preserved protein structure in that region. This might have implications for functional conservation. We suggest that, despite the presence of the catalytic core of the PARP domain, the SROs do not possess ADP-ribosyl transferase activity. Nevertheless, the function of SROs is critical for plants and might be related to transcription factor regulation and complex formation.

Loading...
Loading...
Abstract
Background
Numerous cases of horizontal transfers (HTs) have been described for eukaryote genomes, but in contrast to prokaryote genomes, no whole genome evaluation of HTs has been carried out. This is mainly due to a lack of parametric methods specially designed to take the intrinsic heterogeneity of eukaryote genomes into account. We applied a simple and tested method based on local variations of genomic signatures to analyze the genome of the pathogenic fungus
Results
We detected 189 atypical regions containing 214 genes, accounting for about 1 Mb of DNA sequences. However, the fraction of atypical DNA detected was smaller than the average amount detected in the same conditions in prokaryote genomes (3.1% vs 5.6%). It appeared that about one third of these regions contained no annotated genes, a proportion far greater than in prokaryote genomes. When analyzing the origin of these HTs by comparing their signatures to a home made database of species signatures, 3 groups of donor species emerged: bacteria (40%), fungi (25%), and viruses (22%). It is to be noticed that though inter-domain exchanges are confirmed, we only put in evidence very few exchanges between eukaryotic kingdoms.
Conclusions
In conclusion, we demonstrated that HTs are not negligible in eukaryote genomes, bearing in mind that in our stringent conditions this amount is a floor value, though of a lesser extent than in prokaryote genomes. The biological mechanisms underlying those transfers remain to be elucidated as well as the biological functions of the transferred genes.

Loading...
Loading...
Abstract
Background
Myosin performs ATP free energy transduction into mechanical work in the motor domain of the myosin heavy chain (MHC). Energy transduction is the definitive systemic feature of the myosin motor performed by coordinating in a time ordered sequence: ATP hydrolysis at the active site, actin affinity modulation at the actin binding site, and the lever-arm rotation of the power stroke. These functions are carried out by several conserved sub-domains within the motor domain. Single nucleotide polymorphisms (SNPs) affect the MHC sequence of many isoforms expressed in striated muscle, smooth muscle, and non-muscle tissue. The purpose of this work is to provide a rationale for using SNPs as a functional genomics tool to investigate structurefunction relationships in myosin. In particular, to discover SNP distribution over the conserved sub-domains and surmise what it implies about sub-domain stability and criticality in the energy transduction mechanism.
Results
An automated routine identifying human nonsynonymous SNP amino acid missense substitutions for any MHC gene mined the NCBI SNP data base. The routine tested 22 MHC genes coding muscle and non-muscle isoforms and identified 89 missense mutation positions in the motor domain with 10 already implicated in heart disease and another 8 lacking sequence homology with a skeletal MHC isoform for which a crystallographic model is available. The remaining 71 SNP substitutions were found to be distributed over MHC with 22 falling outside identified functional sub-domains and 49 in or very near to myosin sub-domains assigned specific crucial functions in energy transduction. The latter includes the active site, the actin binding site, the rigid lever-arm, and regions facilitating their communication. Most MHC isoforms contained SNPs somewhere in the motor domain.
Conclusions
Several functional-crucial sub-domains are infiltrated by a large number of SNP substitution sites suggesting these domains are engineered by evolution to be too-robust to be disturbed by otherwise intrusive sequence changes. Two functional sub-domains are SNP-free or relatively SNP-deficient but contain many disease implicated mutants. These sub-domains are apparently highly sensitive to any missense substitution suggesting they have failed to evolve a robust sequence paradigm for performing their function.

Loading...
Loading...
Abstract
Background
Insect cuticle plays essential roles in many physiological functions. During molting and metamorphosis tremendous changes occur in silkworm cuticle where multiple proteins exist and genes encoding them constitute about 1.5% of all
Results
In an effort to determine their expression profiles, a microarray-based investigation was carried out using mRNA collected from larvae to pupae. The results showed that a total of 6676 genes involved in various functions and physiological pathways were activated. The vast majority (93%) of cuticular protein genes were expressed in selected stages with varying expression patterns. There was no correlation between expression patterns and the presence of conserved motifs. Twenty-six RR genes distributed in chromosome 22 were co-expressed at the larval and wandering stages. The 2 kb upstream regions of these genes were further analyzed and three putative elements were identified.
Conclusions
Data from the present study provide, for the first time, a comprehensive expression profile of genes in silkworm epidermal tissues and evidence that putative elements exist to allow massive production of mRNAs from specific cuticular protein genes.

Loading...
Loading...
Abstract
Background
Results
Three QTLs were detected for stalk tunnel length at bins 1.02, 3.05 and 8.05 which explained 7.5% of the RILs genotypic variance. The QTL at bin 3.05 was co-located to a QTL related to plant height and grain humidity and the QTL at bin 8.05 was located near a QTL related to yield.
Conclusions
Our results, when compared with results from other authors, suggest the presence of genes involved in cell wall biosynthesis or fortification with effects on resistance to different corn borer species and digestibility for dairy cattle. Particularly, we proposed five candidate genes related to cell wall characteristics which could explain the QTL for stalk tunnelling in the region 3.05. However, the small proportion of genotypic variance explained by the QTLs suggest that there are also many other genes of small effect regulating MCB resistance and we conclude that MAS seems not promising for this trait. Two QTLs detected for stalk tunnelling overlap with QTLs for agronomic traits, indicating the presence of pleitropism or linkage between genes affecting resistance and agronomic traits.

Loading...
Loading...
Abstract
Background
RNA inhibition by siRNAs is a frequently used approach to identify genes required for specific biological processes. However RNAi screening using siRNAs is hampered by non-specific or off target effects of the siRNAs, making it difficult to separate genuine hits from false positives. It is thought that many of the off-target effects seen in RNAi experiments are due to siRNAs acting as microRNAs (miRNAs), causing a reduction in gene expression of unintended targets via matches to the 6 or 7 nt 'seed' sequence. We have conducted a careful examination of off-target effects during an siRNA screen for novel regulators of the TRAIL apoptosis induction pathway(s).
Results
We identified 3 hexamers and 3 heptamer seed sequences that appeared multiple times in the top twenty siRNAs in the TRAIL apoptosis screen. Using a novel statistical enrichment approach, we systematically identified a further 17 hexamer and 13 heptamer seed sequences enriched in high scoring siRNAs. The presence of one of these seeds sequences (which could explain 6 of 8 confirmed off-target effects) is sufficient to elicit a phenotype. Three of these seed sequences appear in the human miRNAs miR-26a, miR-145 and miR-384. Transfection of mimics of these miRNAs protects several cell types from TRAIL-induced cell death.
Conclusions
We have demonstrated a role for miR-26a, miR-145 and miR-26a in TRAIL-induced apoptosis. Further these results show that RNAi screening enriches for siRNAs with relevant off-target effects. Some of these effects can be identified by the over-representation of certain seed sequences in high-scoring siRNAs and we demonstrate the usefulness of such systematic analysis of enriched seed sequences.

Loading...
Loading...
Abstract
Background
The mouse skeletal muscle is composed of four distinct fiber types that differ in contractile function, number of mitochondria and metabolism. Every muscle type has a specific composition and distribution of the four fiber types. To find novel genes involved in specifying muscle types, we used microarray analysis to compare the gastrocnemius with the quadriceps from mice fed a low fat diet (LFD) or high fat diet (HFD) for 8 weeks. Additional qPCR analysis were performed in the gastrocnemius, quadriceps and soleus muscle from mice fed an LFD or HFD for 20 weeks.
Results
In mice fed the 8-week LFD 162 genes were differentially expressed in the gastrocnemius
Conclusions
We found that genes from the Dkk family, Hox family and Tbx family are detectably expressed in adult mouse muscle. Interestingly, expression of

Loading...
Loading...
Abstract
Background
The superfamily of ABC proteins is among the largest known in nature. Its members are mainly, but not exclusively, involved in the transport of a broad range of substrates across biological membranes. Many contribute to multidrug resistance in microbial pathogens and cancer cells. The diversity of ABC proteins in fungi is comparable with those in multicellular animals, but so far fungal ABC proteins have barely been studied.
Results
We performed a phylogenetic analysis of the ABC proteins extracted from the genomes of 27 fungal species from 18 orders representing 5 fungal phyla thereby covering the most important groups. Our analysis demonstrated that some of the subfamilies of ABC proteins remained highly conserved in fungi, while others have undergone a remarkable group-specific diversification. Members of the various fungal phyla also differed significantly in the number of ABC proteins found in their genomes, which is especially reduced in the yeast
Conclusions
Data obtained during our analysis should contribute to a better understanding of the diversity of the fungal ABC proteins and provide important clues about their possible biological functions.

Loading...
Loading...
Abstract
Background
In all eukaryotic species examined, genes that are chromosomal neighbours are more similar in their expression than random gene pairs. Currently, it is still unclear how much of this local co-expression is caused by direct transcriptional interactions, and how much is due to shared chromatin environments.
Results
We analysed neighbouring genes in
Conclusions
Classifying linked gene pairs by their orientation, we are able to partially tease apart the different levels of regional expression modulation. (i) Regional chromatin characteristics modulate the accessibility for regulation and transcription, regardless of gene orientation; the strength of this chromatin effect can be assessed from divergently or convergently transcribed distant neighbours. (ii) Shared promoter regions up to 400 bp in length enhance the co-expression of close bi-directional neighbours. (iii) Transcriptional interference of close neighbours is concentrated at the 3' ends of genes, and reduces co-expression on average by 40%.

Loading...
Loading...
Abstract
Background
The ribosomal protein SA (RPSA), previously named 37-kDa laminin receptor precursor/67-kDa laminin receptor (LRP/LR) is a multifunctional protein that plays a role in a number of pathological processes, such as cancer and prion diseases. In all investigated species,
Results
In addition to the full length functional ovine
Conclusions
In the present work we have characterized the ovine

Loading...
Loading...
Abstract
Background
Massively parallel sequencing of cDNA is now an efficient route for generating enormous sequence collections that represent expressed genes. This approach provides a valuable starting point for characterizing functional genetic variation in non-model organisms, especially where whole genome sequencing efforts are currently cost and time prohibitive. The large and complex genomes of pines (
Results
We obtained 586,732 sequencing reads from a 454 GS XLR70 Titanium pyrosequencer (mean length: 306 base pairs). A combination of reference-based and
Conclusions
This sequence collection represents a major genomic resource for

Loading...
Loading...
Abstract
Background
Results
We have sequenced the genome of the particularly persistent
Pangenome analysis facilitated the identification of strain-specific features, as well as some that are shared by two or more strains. We identified: (1) three islands related to anti-drug resistance systems; (2) a system for transport and secretion of heavy metals; (3) three systems related to DNA transfer; (4) two CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) systems, known to provide resistance against phage infections, one similar in the Lens and Alcoy strains, and another specific to the Paris strain; and (5) seven islands of phage-related proteins, five of which seem to be strain-specific and two shared.
Conclusions
The dispensable genome disclosed by the pangenomic analysis seems to be a reservoir of new traits that have mainly been acquired by horizontal gene transfer and could confer evolutionary advantages over strains lacking them.

Loading...
Loading...
Abstract
Background
Over the past decades, extensive comparative mapping research has been performed in the plant family Solanaceae. The recent identification of a large set of single-copy conserved orthologous (COSII) markers has greatly accelerated comparative mapping studies among major solanaceous species including tomato, potato, eggplant, pepper and diploid
Results
We combined data from multiple COSII studies, and other comparative mapping studies performed in tomato, potato, eggplant, pepper and diploid
Conclusions
The Solanaceae has experienced chromosomal changes at a modest rate compared with other families and the rates are likely conserved across different lineages of the family. Chromosomal inversions occur at a consistently higher rate than do translocations. Further, we find evidences for non-random positioning of the chromosomal rearrangement breakpoints. This finding is consistent with the similar finding in mammals, where hot spots for chromosomal breakages have apparently played a significant role in shaping genome evolution. Finally, by utilizing multiple genome comparisons we were able to reconstruct the most likely genome configuration for a number of now-extinct progenitor species that gave rise to the extant solanaceous species used in this research. The results from this study provide the first broad overview of chromosomal evolution in the family Solanaceae, and one of the most detailed thus far for any family of plants.

Loading...
Loading...
Abstract
Background
Cellular membranes are crucial host components utilized by positive-strand RNA viruses for replication of their genomes. Published studies have suggested that the synthesis and distribution of membrane lipids are particularly important for the assembly and function of positive-strand RNA virus replication complexes. However, the impact of specific lipid metabolism pathways in this process have not been well defined, nor have potential changes in lipid expression associated with positive-strand RNA virus replication been examined in detail.
Results
In this study we used parallel and complementary global and targeted approaches to examine the impact of lipid metabolism on the replication of the well-studied model alphanodavirus Flock House virus (FHV). We found that FHV RNA replication in cultured
Conclusion
These results indicate that glycerophospholipid metabolism, and in particular phosphatidylcholine biosynthesis, plays an important role in FHV RNA replication. Furthermore, they provide a framework in which to further explore the impact of specific steps in lipid metabolism on FHV replication, and potentially identify novel cellular targets for the development of drugs to inhibit positive-strand RNA viruses.

Loading...
Loading...
Abstract
Background
Understanding syntentic relationship between two species is critical to assessing the potential for comparative genomic analysis. Common bean (
Results
Genetically-positioned transcript loci of common bean were mapped relative to the recent soybean 1.01 pseudochromosome assembly. In nearly every case, each common bean locus mapped to two loci in soybean, a result consistent with the duplicate polyploidy history of soybean. Blocks of synteny averaging 32 cM in common bean and 4.9 Mb in soybean were observed for all 11 common bean linkage groups, and these blocks mapped to all 20 soybean pseudochromosomes. The median physical-to-genetic distance ratio in common bean (based on soybean physical distances) was ~120 kb/cM. ~15,000 common bean sequences (primarily EST contigs and EST singletons) were electronically positioned onto the common bean map using the shared syntentic blocks as references points.
Conclusion
The collected evidence from this mapping strongly supports the duplicate history of soybean. It further provides evidence that the soybean genome was fractionated and reassembled at some point following the duplication event. These well mapped syntentic relationships between common bean and soybean will enable researchers to target specific genomic regions to discover genes or loci that affect phenotypic expression in both species.

Loading...
Loading...
Abstract
Background
Transcription factors (TFs) regulate gene expression by interacting with promoters of their target genes and are classified into families based on their DNA-binding domains. Genes coding for TFs have been identified in the sequences of model plant genomes. The rice (
Results
We have created an extensive list of wheat (
Conclusions
wDBTF collates 3,820 wheat TF sequences validated by the presence of a DNA-binding domain out of 7,112 potential TF sequences identified from publicly available gene expression data. We also incorporated

Loading...
Loading...
Abstract
Background
Bovine anaplasmosis, caused by the rickettsial tick-borne pathogen
Results
Suppression subtractive hybridization libraries were constructed from infected and uninfected ticks and used to identify genes differentially expressed in male
Conclusions
Characterization of differential gene expression in salivary glands of

Loading...
Loading...
Abstract
Background
Antimicrobial peptides (AMPs) are an essential component of innate immunity which can rapidly respond to diverse microbial pathogens. Insects, as a rich source of AMPs, attract great attention of scientists in both understanding of the basic biology of the immune system and searching molecular templates for anti-infective drug design. Despite a large number of AMPs have been identified from different insect species, little information in terms of these peptides is available from parasitic insects.
Results
By using integrated computational approaches to systemically mining the Hymenopteran parasitic wasp
Conclusion
The present study established the
