<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Bioinformatics &#187; Oxford journals</title>
	<atom:link href="http://bioinformatics.me/category/oxford_journals/feed/" rel="self" type="application/rss+xml" />
	<link>http://bioinformatics.me</link>
	<description>BioData make sense!</description>
	<lastBuildDate>Thu, 18 Aug 2011 12:47:42 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>A large-scale benchmark study of existing algorithms for taxonomy-independent microbial community analysis</title>
		<link>http://bioinformatics.me/a-large-scale-benchmark-study-of-existing-algorithms-for-taxonomy-independent-microbial-community-analysis/</link>
		<comments>http://bioinformatics.me/a-large-scale-benchmark-study-of-existing-algorithms-for-taxonomy-independent-microbial-community-analysis/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/a-large-scale-benchmark-study-of-existing-algorithms-for-taxonomy-independent-microbial-community-analysis/</guid>
		<description><![CDATA[Recent advances in massively parallel sequencing technology have created new opportunities to probe the hidden world of microbes. Taxonomy-independent clustering of the 16S rRNA gene is usually the first step in analyzing microbial communities. Dozens of algorithms have been developed in the last decade, but a comprehensive benchmark study is lacking. Here, we survey algorithms [...]]]></description>
			<content:encoded><![CDATA[<p>Recent advances in massively parallel sequencing technology have created new opportunities to probe the hidden world of microbes. Taxonomy-independent clustering of the 16S rRNA gene is usually the first step in analyzing microbial communities. Dozens of algorithms have been developed in the last decade, but a comprehensive benchmark study is lacking. Here, we survey algorithms currently used by microbiologists, and compare seven representative methods in a large-scale benchmark study that addresses several issues of concern. A new experimental protocol was developed that allows different algorithms to be compared using the same platform, and several criteria were introduced to facilitate a quantitative evaluation of the clustering performance of each algorithm. We found that existing methods vary widely in their outputs, and that inappropriate use of distance levels for taxonomic assignments likely resulted in substantial overestimates of biodiversity in many studies. The benchmark study identified our recently developed ESPRIT-Tree, a fast implementation of the average linkage-based hierarchical clustering algorithm, as one of the best algorithms available in terms of computational efficiency and clustering accuracy.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/a-large-scale-benchmark-study-of-existing-algorithms-for-taxonomy-independent-microbial-community-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A computational framework for the inheritance pattern of genomic imprinting for complex traits</title>
		<link>http://bioinformatics.me/a-computational-framework-for-the-inheritance-pattern-of-genomic-imprinting-for-complex-traits/</link>
		<comments>http://bioinformatics.me/a-computational-framework-for-the-inheritance-pattern-of-genomic-imprinting-for-complex-traits/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/a-computational-framework-for-the-inheritance-pattern-of-genomic-imprinting-for-complex-traits/</guid>
		<description><![CDATA[Genetic imprinting, by which the expression of a gene depends on the parental origin of its alleles, may be subjected to reprogramming through each generation. Currently, such reprogramming is limited to qualitative description only, lacking more precise quantitative estimation for its extent, pattern and mechanism. Here, we present a computational framework for analyzing the magnitude [...]]]></description>
			<content:encoded><![CDATA[<p>Genetic imprinting, by which the expression of a gene depends on the parental origin of its alleles, may be subjected to reprogramming through each generation. Currently, such reprogramming is limited to qualitative description only, lacking more precise quantitative estimation for its extent, pattern and mechanism. Here, we present a computational framework for analyzing the magnitude of genetic imprinting and its transgenerational inheritance mode. This quantitative model is based on the breeding scheme of reciprocal backcrosses between reciprocal F<SUB>1</SUB> hybrids and original inbred parents, in which the transmission of genetic imprinting across generations can be tracked. We define a series of quantitative genetic parameters that describe the extent and transmission mode of genetic imprinting and further estimate and test these parameters within a genetic mapping framework using a new powerful computational algorithm. The model and algorithm described will enable geneticists to identify and map imprinted quantitative trait loci and dictate a comprehensive atlas of developmental and epigenetic mechanisms related to genetic imprinting. We illustrate the new discovery of the role of genetic imprinting in regulating hyperoxic acute lung injury survival time using a mouse reciprocal backcross design.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/a-computational-framework-for-the-inheritance-pattern-of-genomic-imprinting-for-complex-traits/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>OrthoDisease: tracking disease gene orthologs across 100 species</title>
		<link>http://bioinformatics.me/orthodisease-tracking-disease-gene-orthologs-across-100-species/</link>
		<comments>http://bioinformatics.me/orthodisease-tracking-disease-gene-orthologs-across-100-species/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/orthodisease-tracking-disease-gene-orthologs-across-100-species/</guid>
		<description><![CDATA[Orthology is one of the most important tools available to modern biology, as it allows making inferences from easily studied model systems to much less tractable systems of interest, such as ourselves. This becomes important not least in the study of genetic diseases. We here review work on the orthology of disease-associated genes and also [...]]]></description>
			<content:encoded><![CDATA[<p>Orthology is one of the most important tools available to modern biology, as it allows making inferences from easily studied model systems to much less tractable systems of interest, such as ourselves. This becomes important not least in the study of genetic diseases. We here review work on the orthology of disease-associated genes and also present an updated version of the InParanoid-based disease orthology database and web site OrthoDisease, with 14-fold increased species coverage since the previous version. Using this resource, we survey the taxonomic distribution of orthologs of human genes involved in different disease categories. The hypothesis that paralogs can mask the effect of deleterious mutations predicts that known heritable disease genes should have fewer close paralogs. We found large-scale support for this hypothesis as significantly fewer duplications were observed for disease genes in the OrthoDisease ortholog groups.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/orthodisease-tracking-disease-gene-orthologs-across-100-species/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Learning transcriptional regulation on a genome scale: a theoretical analysis based on gene expression data</title>
		<link>http://bioinformatics.me/learning-transcriptional-regulation-on-a-genome-scale-a-theoretical-analysis-based-on-gene-expression-data/</link>
		<comments>http://bioinformatics.me/learning-transcriptional-regulation-on-a-genome-scale-a-theoretical-analysis-based-on-gene-expression-data/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/learning-transcriptional-regulation-on-a-genome-scale-a-theoretical-analysis-based-on-gene-expression-data/</guid>
		<description><![CDATA[The recent advent of high-throughput microarray data has enabled the global analysis of the transcriptome, driving the development and application of computational approaches to study transcriptional regulation on the genome scale, by reconstructing in silico the regulatory interactions of the gene network. Although there are many in-depth reviews of such &#8216;reverse-engineering&#8217; methodologies, most have focused [...]]]></description>
			<content:encoded><![CDATA[<p>The recent advent of high-throughput microarray data has enabled the global analysis of the transcriptome, driving the development and application of computational approaches to study transcriptional regulation on the genome scale, by reconstructing <I>in silico</I> the regulatory interactions of the gene network. Although there are many in-depth reviews of such &lsquo;reverse-engineering&rsquo; methodologies, most have focused on the practical aspect of data mining, and few on the biological problem and the biological relevance of the methodology. Therefore, in this review, from a biological perspective, we used a set of yeast microarray data as a working example, to evaluate the fundamental assumptions implicit in associating transcription factor (TF)&ndash;target gene expression levels and estimating TFs&rsquo; activity, and further explore cooperative models. Finally we confirm that the detailed transcription mechanism is overly-complex for expression data alone to reveal, nevertheless, future network reconstruction studies could benefit from the incorporation of context-specific information, the modeling of multiple layers of regulation (e.g. micro-RNA), or the development of approaches for context-dependent analysis, to uncover the mechanisms of gene regulation.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/learning-transcriptional-regulation-on-a-genome-scale-a-theoretical-analysis-based-on-gene-expression-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SeqXML and OrthoXML: standards for sequence and orthology information</title>
		<link>http://bioinformatics.me/seqxml-and-orthoxml-standards-for-sequence-and-orthology-information/</link>
		<comments>http://bioinformatics.me/seqxml-and-orthoxml-standards-for-sequence-and-orthology-information/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/seqxml-and-orthoxml-standards-for-sequence-and-orthology-information/</guid>
		<description><![CDATA[There is a great need for standards in the orthology field. Users must contend with different ortholog data representations from each provider, and the providers themselves must independently gather and parse the input sequence data. These burdensome and redundant procedures make data comparison and integration difficult. We have designed two XML-based formats, SeqXML and OrthoXML, [...]]]></description>
			<content:encoded><![CDATA[<p>There is a great need for standards in the orthology field. Users must contend with different ortholog data representations from each provider, and the providers themselves must independently gather and parse the input sequence data. These burdensome and redundant procedures make data comparison and integration difficult. We have designed two XML-based formats, SeqXML and OrthoXML, to solve these problems. SeqXML is a lightweight format for sequence records&mdash;the input for orthology prediction. It stores the same sequence and metadata as typical FASTA format records, but overcomes common problems such as unstructured metadata in the header and erroneous sequence content. XML provides validation to prevent data integrity problems that are frequent in FASTA files. The range of applications for SeqXML is broad and not limited to ortholog prediction. We provide read/write functions for BioJava, BioPerl, and Biopython. OrthoXML was designed to represent ortholog assignments from any source in a consistent and structured way, yet cater to specific needs such as scoring schemes or meta-information. A unified format is particularly valuable for ortholog consumers that want to integrate data from numerous resources, e.g. for gene annotation projects. Reference proteomes for 61 organisms are already available in SeqXML, and 10 orthology databases have signed on to OrthoXML. Adoption by the entire field would substantially facilitate exchange and quality control of sequence and orthology information.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/seqxml-and-orthoxml-standards-for-sequence-and-orthology-information/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Combining literature text mining with microarray data: advances for system biology modeling</title>
		<link>http://bioinformatics.me/combining-literature-text-mining-with-microarray-data-advances-for-system-biology-modeling/</link>
		<comments>http://bioinformatics.me/combining-literature-text-mining-with-microarray-data-advances-for-system-biology-modeling/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/combining-literature-text-mining-with-microarray-data-advances-for-system-biology-modeling/</guid>
		<description><![CDATA[A huge amount of important biomedical information is hidden in the bulk of research articles in biomedical fields. At the same time, the publication of databases of biological information and of experimental datasets generated by high-throughput methods is in great expansion, and a wealth of annotated gene databases, chemical, genomic (including microarray datasets), clinical and [...]]]></description>
			<content:encoded><![CDATA[<p>A huge amount of important biomedical information is hidden in the bulk of research articles in biomedical fields. At the same time, the publication of databases of biological information and of experimental datasets generated by high-throughput methods is in great expansion, and a wealth of annotated gene databases, chemical, genomic (including microarray datasets), clinical and other types of data repositories are now available on the Web. Thus a current challenge of bioinformatics is to develop targeted methods and tools that integrate scientific literature, biological databases and experimental data for reducing the time of database curation and for accessing evidence, either in the literature or in the datasets, useful for the analysis at hand. Under this scenario, this article reviews the knowledge discovery systems that fuse information from the literature, gathered by text mining, with microarray data for enriching the lists of down and upregulated genes with elements for biological understanding and for generating and validating new biological hypothesis. Finally, an easy to use and freely accessible tool, GeneWizard, that exploits text mining and microarray data fusion for supporting researchers in discovering gene&ndash;disease relationships is described.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/combining-literature-text-mining-with-microarray-data-advances-for-system-biology-modeling/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>When orthologs diverge between human and mouse</title>
		<link>http://bioinformatics.me/when-orthologs-diverge-between-human-and-mouse/</link>
		<comments>http://bioinformatics.me/when-orthologs-diverge-between-human-and-mouse/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/when-orthologs-diverge-between-human-and-mouse/</guid>
		<description><![CDATA[Despite the common assumption that orthologs usually share the same function, there have been various reports of divergence between orthologs, even among species as close as mammals. The comparison of mouse and human is of special interest, because mouse is often used as a model organism to understand human biology. We review the literature on [...]]]></description>
			<content:encoded><![CDATA[<p>Despite the common assumption that orthologs usually share the same function, there have been various reports of divergence between orthologs, even among species as close as mammals. The comparison of mouse and human is of special interest, because mouse is often used as a model organism to understand human biology. We review the literature on evidence for divergence between human and mouse orthologous genes, and discuss it in the context of biomedical research.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/when-orthologs-diverge-between-human-and-mouse/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Computational methods for Gene Orthology inference</title>
		<link>http://bioinformatics.me/computational-methods-for-gene-orthology-inference/</link>
		<comments>http://bioinformatics.me/computational-methods-for-gene-orthology-inference/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/computational-methods-for-gene-orthology-inference/</guid>
		<description><![CDATA[Accurate inference of orthologous genes is a pre-requisite for most comparative genomics studies, and is also important for functional annotation of new genomes. Identification of orthologous gene sets typically involves phylogenetic tree analysis, heuristic algorithms based on sequence conservation, synteny analysis, or some combination of these approaches. The most direct tree-based methods typically rely on [...]]]></description>
			<content:encoded><![CDATA[<p>Accurate inference of orthologous genes is a pre-requisite for most comparative genomics studies, and is also important for functional annotation of new genomes. Identification of orthologous gene sets typically involves phylogenetic tree analysis, heuristic algorithms based on sequence conservation, synteny analysis, or some combination of these approaches. The most direct tree-based methods typically rely on the comparison of an individual gene tree with a species tree. Once the two trees are accurately constructed, orthologs are straightforwardly identified by the definition of orthology as those homologs that are related by speciation, rather than gene duplication, at their most recent point of origin. Although ideal for the purpose of orthology identification in principle, phylogenetic trees are computationally expensive to construct for large numbers of genes and genomes, and they often contain errors, especially at large evolutionary distances. Moreover, in many organisms, in particular prokaryotes and viruses, evolution does not appear to have followed a simple &lsquo;tree-like&rsquo; mode, which makes conventional tree reconciliation inapplicable. Other, heuristic methods identify probable orthologs as the closest homologous pairs or groups of genes in a set of organisms. These approaches are faster and easier to automate than tree-based methods, with efficient implementations provided by graph-theoretical algorithms enabling comparisons of thousands of genomes. Comparisons of these two approaches show that, despite conceptual differences, they produce similar sets of orthologs, especially at short evolutionary distances. Synteny also can aid in identification of orthologs. Often, tree-based, sequence similarity- and synteny-based approaches can be combined into flexible hybrid methods.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/computational-methods-for-gene-orthology-inference/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Biological network motif detection: principles and practice</title>
		<link>http://bioinformatics.me/biological-network-motif-detection-principles-and-practice/</link>
		<comments>http://bioinformatics.me/biological-network-motif-detection-principles-and-practice/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/biological-network-motif-detection-principles-and-practice/</guid>
		<description><![CDATA[Network motifs are statistically overrepresented sub-structures (sub-graphs) in a network, and have been recognized as &#8216;the simple building blocks of complex networks&#8217;. Study of biological network motifs may reveal answers to many important biological questions. The main difficulty in detecting larger network motifs in biological networks lies in the facts that the number of possible [...]]]></description>
			<content:encoded><![CDATA[<p>Network motifs are statistically overrepresented sub-structures (sub-graphs) in a network, and have been recognized as &lsquo;the simple building blocks of complex networks&rsquo;. Study of biological network motifs may reveal answers to many important biological questions. The main difficulty in detecting larger network motifs in biological networks lies in the facts that the number of possible sub-graphs increases exponentially with the network or motif size (node counts, in general), and that no known polynomial-time algorithm exists in deciding if two graphs are topologically equivalent. This article discusses the biological significance of network motifs, the motivation behind solving the motif-finding problem, and strategies to solve the various aspects of this problem. A simple classification scheme is designed to analyze the strengths and weaknesses of several existing algorithms. Experimental results derived from a few comparative studies in the literature are discussed, with conclusions that lead to future research directions.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/biological-network-motif-detection-principles-and-practice/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Positional orthology: putting genomic evolutionary relationships into context</title>
		<link>http://bioinformatics.me/positional-orthology-putting-genomic-evolutionary-relationships-into-context/</link>
		<comments>http://bioinformatics.me/positional-orthology-putting-genomic-evolutionary-relationships-into-context/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/positional-orthology-putting-genomic-evolutionary-relationships-into-context/</guid>
		<description><![CDATA[Orthology is a powerful refinement of homology that allows us to describe more precisely the evolution of genomes and understand the function of the genes they contain. However, because orthology is not concerned with genomic position, it is limited in its ability to describe genes that are likely to have equivalent roles in different genomes. [...]]]></description>
			<content:encoded><![CDATA[<p>Orthology is a powerful refinement of homology that allows us to describe more precisely the evolution of genomes and understand the function of the genes they contain. However, because orthology is not concerned with genomic position, it is limited in its ability to describe genes that are likely to have equivalent roles in different genomes. Because of this limitation, the concept of &lsquo;positional orthology&rsquo; has emerged, which describes the relation between orthologous genes that retain their ancestral genomic positions. In this review, we formally define this concept, for which we introduce the shorter term &lsquo;toporthology&rsquo;, with respect to the evolutionary events experienced by a gene&rsquo;s ancestors. Through a discussion of recent studies on the role of genomic context in gene evolution, we show that the distinction between orthology and toporthology is biologically significant. We then review a number of orthology prediction methods that take genomic context into account and thus that may be used to infer the important relation of toporthology.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/positional-orthology-putting-genomic-evolutionary-relationships-into-context/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>GO-function: deriving biologically relevant functions from statistically significant functions</title>
		<link>http://bioinformatics.me/go-function-deriving-biologically-relevant-functions-from-statistically-significant-functions/</link>
		<comments>http://bioinformatics.me/go-function-deriving-biologically-relevant-functions-from-statistically-significant-functions/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/go-function-deriving-biologically-relevant-functions-from-statistically-significant-functions/</guid>
		<description><![CDATA[In high-throughput studies of diseases, terms enriched with disease-related genes based on Gene Ontology (GO) are routinely found. However, most current algorithms used to find significant GO terms cannot handle the redundancy that results from the dependencies of GO terms. Simply based on some numerical considerations, current algorithms developed for reducing this redundancy may produce [...]]]></description>
			<content:encoded><![CDATA[<p>In high-throughput studies of diseases, terms enriched with disease-related genes based on Gene Ontology (GO) are routinely found. However, most current algorithms used to find significant GO terms cannot handle the redundancy that results from the dependencies of GO terms. Simply based on some numerical considerations, current algorithms developed for reducing this redundancy may produce results that do not account for biologically interesting cases. In this article, we present several rules used to design a tool called GO-function for extracting biologically relevant terms from statistically significant GO terms for a disease. Using one gene expression profile for colorectal cancer, we compared GO-function with four algorithms designed to treat redundancy. Then, we validated results obtained in this data set by GO-function using another data set for colorectal cancer. Our analysis showed that GO-function can identify disease-related terms that are more statistically and biologically meaningful than those found by the other four algorithms.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/go-function-deriving-biologically-relevant-functions-from-statistically-significant-functions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Ortholog identification in the presence of domain architecture rearrangement</title>
		<link>http://bioinformatics.me/ortholog-identification-in-the-presence-of-domain-architecture-rearrangement/</link>
		<comments>http://bioinformatics.me/ortholog-identification-in-the-presence-of-domain-architecture-rearrangement/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/ortholog-identification-in-the-presence-of-domain-architecture-rearrangement/</guid>
		<description><![CDATA[Ortholog identification is used in gene functional annotation, species phylogeny estimation, phylogenetic profile construction and many other analyses. Bioinformatics methods for ortholog identification are commonly based on pairwise protein sequence comparisons between whole genomes. Phylogenetic methods of ortholog identification have also been developed; these methods can be applied to protein data sets sharing a common [...]]]></description>
			<content:encoded><![CDATA[<p>Ortholog identification is used in gene functional annotation, species phylogeny estimation, phylogenetic profile construction and many other analyses. Bioinformatics methods for ortholog identification are commonly based on pairwise protein sequence comparisons between whole genomes. Phylogenetic methods of ortholog identification have also been developed; these methods can be applied to protein data sets sharing a common domain architecture or which share a single functional domain but differ outside this region of homology. While promiscuous domains represent a challenge to all orthology prediction methods, overall structural similarity is highly correlated with proximity in a phylogenetic tree, conferring a degree of robustness to phylogenetic methods. In this article, we review the issues involved in orthology prediction when data sets include sequences with structurally heterogeneous domain architectures, with particular attention to automated methods designed for high-throughput application, and present a case study to illustrate the challenges in this area.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/ortholog-identification-in-the-presence-of-domain-architecture-rearrangement/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Comparative genomics approach to detecting split-coding regions in a low-coverage genome: lessons from the chimaera Callorhinchus milii (Holocephali, Chondrichthyes)</title>
		<link>http://bioinformatics.me/comparative-genomics-approach-to-detecting-split-coding-regions-in-a-low-coverage-genome-lessons-from-the-chimaera-callorhinchus-milii-holocephali-chondrichthyes/</link>
		<comments>http://bioinformatics.me/comparative-genomics-approach-to-detecting-split-coding-regions-in-a-low-coverage-genome-lessons-from-the-chimaera-callorhinchus-milii-holocephali-chondrichthyes/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/comparative-genomics-approach-to-detecting-split-coding-regions-in-a-low-coverage-genome-lessons-from-the-chimaera-callorhinchus-milii-holocephali-chondrichthyes/</guid>
		<description><![CDATA[Recent development of deep sequencing technologies has facilitated de novo genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of [...]]]></description>
			<content:encoded><![CDATA[<p>Recent development of deep sequencing technologies has facilitated <I>de novo</I> genome sequencing projects, now conducted even by individual laboratories. However, this will yield more and more genome sequences that are not well assembled, and will hinder thorough annotation when no closely related reference genome is available. One of the challenging issues is the identification of protein-coding sequences split into multiple unassembled genomic segments, which can confound orthology assignment and various laboratory experiments requiring the identification of individual genes. In this study, using the genome of a cartilaginous fish, <I>Callorhinchus milii</I>, as test case, we performed gene prediction using a model specifically trained for this genome. We implemented an algorithm, designated <I>ESPRIT</I>, to identify possible linkages between multiple protein-coding portions derived from a single genomic locus split into multiple unassembled genomic segments. We developed a validation framework based on an artificially fragmented human genome, improvements between early and recent mouse genome assemblies, comparison with experimentally validated sequences from GenBank, and phylogenetic analyses. Our strategy provided insights into practical solutions for efficient annotation of only partially sequenced (low-coverage) genomes. To our knowledge, our study is the first formulation of a method to link unassembled genomic segments based on proteomes of relatively distantly related species as references.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/comparative-genomics-approach-to-detecting-split-coding-regions-in-a-low-coverage-genome-lessons-from-the-chimaera-callorhinchus-milii-holocephali-chondrichthyes/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Calculating transcription factor binding maps for chromatin</title>
		<link>http://bioinformatics.me/calculating-transcription-factor-binding-maps-for-chromatin/</link>
		<comments>http://bioinformatics.me/calculating-transcription-factor-binding-maps-for-chromatin/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/calculating-transcription-factor-binding-maps-for-chromatin/</guid>
		<description><![CDATA[Current high-throughput experiments already generate enough data for retrieving the DNA sequence-dependent binding affinities of transcription factors (TF) and other chromosomal proteins throughout the complete genome. However, the reverse task of calculating binding maps in a chromatin context for a given set of concentrations and TF affinities appears to be even more challenging and computationally [...]]]></description>
			<content:encoded><![CDATA[<p>Current high-throughput experiments already generate enough data for retrieving the DNA sequence-dependent binding affinities of transcription factors (TF) and other chromosomal proteins throughout the complete genome. However, the reverse task of calculating binding maps in a chromatin context for a given set of concentrations and TF affinities appears to be even more challenging and computationally demanding. The problem can be addressed by considering the DNA sequence as a one-dimensional lattice with units of one or more base pairs. To calculate protein occupancies in chromatin, one needs to consider the competition of TF and histone octamers for binding sites as well as the partial unwrapping of nucleosomal DNA. Here, we consider five different classes of algorithms to compute binding maps that include the binary variable, combinatorial, sequence generating function, transfer matrix and dynamic programming approaches. The calculation time of the binary variable algorithm scales exponentially with DNA length, which limits its use to the analysis of very small genomic regions. For regulatory regions with many overlapping binding sites, potentially applicable algorithms reduce either to the transfer matrix or dynamic programming approach. In addition to the recently proposed transfer matrix formalism for TF access to the nucleosomal organized DNA, we develop here a dynamic programming algorithm that accounts for this feature. In the absence of nucleosomes, dynamic programming outperforms the transfer matrix approach, but the latter is faster when nucleosome unwrapping has to be considered. Strategies are discussed that could further facilitate calculations to allow computing genome-wide TF binding maps.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/calculating-transcription-factor-binding-maps-for-chromatin/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Conceptual framework and pilot study to benchmark phylogenomic databases based on reference gene trees</title>
		<link>http://bioinformatics.me/conceptual-framework-and-pilot-study-to-benchmark-phylogenomic-databases-based-on-reference-gene-trees/</link>
		<comments>http://bioinformatics.me/conceptual-framework-and-pilot-study-to-benchmark-phylogenomic-databases-based-on-reference-gene-trees/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/conceptual-framework-and-pilot-study-to-benchmark-phylogenomic-databases-based-on-reference-gene-trees/</guid>
		<description><![CDATA[Phylogenomic databases provide orthology predictions for species with fully sequenced genomes. Although the goal seems well-defined, the content of these databases differs greatly. Seven ortholog databases (Ensembl Compara, eggNOG, HOGENOM, InParanoid, OMA, OrthoDB, Panther) were compared on the basis of reference trees. For three well-conserved protein families, we observed a generally high specificity of orthology [...]]]></description>
			<content:encoded><![CDATA[<p>Phylogenomic databases provide orthology predictions for species with fully sequenced genomes. Although the goal seems well-defined, the content of these databases differs greatly. Seven ortholog databases (Ensembl Compara, eggNOG, HOGENOM, InParanoid, OMA, OrthoDB, Panther) were compared on the basis of reference trees. For three well-conserved protein families, we observed a generally high specificity of orthology assignments for these databases. We show that differences in the completeness of predicted gene relationships and in the phylogenetic information are, for the great majority, not due to the methods used, but to differences in the underlying database concepts. According to our metrics, none of the databases provides a fully correct and comprehensive protein classification. Our results provide a framework for meaningful and systematic comparisons of phylogenomic databases. In the future, a sustainable set of &lsquo;Gold standard&rsquo; phylogenetic trees could provide a robust method for phylogenomic databases to assess their current quality status, measure changes following new database releases and diagnose improvements subsequent to an upgrade of the analysis procedure.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/conceptual-framework-and-pilot-study-to-benchmark-phylogenomic-databases-based-on-reference-gene-trees/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bioinformatics tools and database resources for systems genetics analysis in mice&#8211;a short review and an evaluation of future needs</title>
		<link>http://bioinformatics.me/bioinformatics-tools-and-database-resources-for-systems-genetics-analysis-in-mice-a-short-review-and-an-evaluation-of-future-needs/</link>
		<comments>http://bioinformatics.me/bioinformatics-tools-and-database-resources-for-systems-genetics-analysis-in-mice-a-short-review-and-an-evaluation-of-future-needs/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/bioinformatics-tools-and-database-resources-for-systems-genetics-analysis-in-mice-a-short-review-and-an-evaluation-of-future-needs/</guid>
		<description><![CDATA[During a meeting of the SYSGENET working group &#8216;Bioinformatics&#8217;, currently available software tools and databases for systems genetics in mice were reviewed and the needs for future developments discussed. The group evaluated interoperability and performed initial feasibility studies. To aid future compatibility of software and exchange of already developed software modules, a strong recommendation was [...]]]></description>
			<content:encoded><![CDATA[<p>During a meeting of the SYSGENET working group &lsquo;Bioinformatics&rsquo;, currently available software tools and databases for systems genetics in mice were reviewed and the needs for future developments discussed. The group evaluated interoperability and performed initial feasibility studies. To aid future compatibility of software and exchange of already developed software modules, a strong recommendation was made by the group to integrate HAPPY and R/qtl analysis toolboxes, GeneNetwork and XGAP database platforms, and TIQS and xQTL processing platforms. R should be used as the principal computer language for QTL data analysis in all platforms and a &lsquo;cloud&rsquo; should be used for software dissemination to the community. Furthermore, the working group recommended that all data models and software source code should be made visible in public repositories to allow a coordinated effort on the use of common data structures and file formats.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/bioinformatics-tools-and-database-resources-for-systems-genetics-analysis-in-mice-a-short-review-and-an-evaluation-of-future-needs/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How to cluster gene expression dynamics in response to environmental signals</title>
		<link>http://bioinformatics.me/how-to-cluster-gene-expression-dynamics-in-response-to-environmental-signals/</link>
		<comments>http://bioinformatics.me/how-to-cluster-gene-expression-dynamics-in-response-to-environmental-signals/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/how-to-cluster-gene-expression-dynamics-in-response-to-environmental-signals/</guid>
		<description><![CDATA[Organisms usually cope with change in the environment by altering the dynamic trajectory of gene expression to adjust the complement of active proteins. The identification of particular sets of genes whose expression is adaptive in response to environmental changes helps to understand the mechanistic base of gene&#8211;environment interactions essential for organismic development. We describe a [...]]]></description>
			<content:encoded><![CDATA[<p>Organisms usually cope with change in the environment by altering the dynamic trajectory of gene expression to adjust the complement of active proteins. The identification of particular sets of genes whose expression is adaptive in response to environmental changes helps to understand the mechanistic base of gene&ndash;environment interactions essential for organismic development. We describe a computational framework for clustering the dynamics of gene expression in distinct environments through Gaussian mixture fitting to the expression data measured at a set of discrete time points. We outline a number of quantitative testable hypotheses about the patterns of dynamic gene expression in changing environments and gene&ndash;environment interactions causing developmental differentiation. The future directions of gene clustering in terms of incorporations of the latest biological discoveries and statistical innovations are discussed. We provide a set of computational tools that are applicable to modeling and analysis of dynamic gene expression data measured in multiple environments.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/how-to-cluster-gene-expression-dynamics-in-response-to-environmental-signals/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>LEPSCAN&#8211;a web server for searching latent periodicity in DNA sequences</title>
		<link>http://bioinformatics.me/lepscan-a-web-server-for-searching-latent-periodicity-in-dna-sequences/</link>
		<comments>http://bioinformatics.me/lepscan-a-web-server-for-searching-latent-periodicity-in-dna-sequences/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/lepscan-a-web-server-for-searching-latent-periodicity-in-dna-sequences/</guid>
		<description><![CDATA[A web server for searching latent periodicity based on the method of modified profile analysis has been developed. This method allows searching latent periodicity in presence of insertions and deletions. During searching process, the periodicity classes are used which were found by us earlier for various groups of organisms. Period length belongs to the range [...]]]></description>
			<content:encoded><![CDATA[<p>A web server for searching latent periodicity based on the method of modified profile analysis has been developed. This method allows searching latent periodicity in presence of insertions and deletions. During searching process, the periodicity classes are used which were found by us earlier for various groups of organisms. Period length belongs to the range 2&ndash;20 nt, not including the triplet periodicity. The results obtained are subjected to various filtration steps to ensure their statistical significance. Availability: The use of web server is free for non-commercial users. No registration is required. URL of the server is <a href="http://victoria.biengi.ac.ru/lepscan">http://victoria.biengi.ac.ru/lepscan</a>. Current software version is 1.06.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/lepscan-a-web-server-for-searching-latent-periodicity-in-dna-sequences/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Letter to the Editor: Current progress in patient-specific modeling by Neal and Kerckhoffs (2010)</title>
		<link>http://bioinformatics.me/letter-to-the-editor-current-progress-in-patient-specific-modeling-by-neal-and-kerckhoffs-2010/</link>
		<comments>http://bioinformatics.me/letter-to-the-editor-current-progress-in-patient-specific-modeling-by-neal-and-kerckhoffs-2010/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/letter-to-the-editor-current-progress-in-patient-specific-modeling-by-neal-and-kerckhoffs-2010/</guid>
		<description><![CDATA[A recent review article on &#8216;Current progress in patient-specific modeling&#8217; in Briefings in Bioinformatics contains the statement summarizing the results of our previous study &#8216;On the unimportance of constitutive models in computing brain deformation for image-guided surgery&#8217; published in Biomechanics and Modeling in Mechanobiology as confirmation of adequacy of linear elastic model for such computation. [...]]]></description>
			<content:encoded><![CDATA[<p>A recent review article on &lsquo;Current progress in patient-specific modeling&rsquo; in <I>Briefings in Bioinformatics</I> contains the statement summarizing the results of our previous study &lsquo;On the unimportance of constitutive models in computing brain deformation for image-guided surgery&rsquo; published in <I>Biomechanics and Modeling in Mechanobiology</I> as confirmation of adequacy of linear elastic model for such computation. The purpose of this Letter to the Editor is to clarify this statement by informing the Readers of <I>Briefings in Bioinformatics</I> that our study indicates the following: (i) a simple linear elastic constitutive model for the brain tissue is sufficient when used with an appropriate finite deformation solution (i.e. geometrically non-linear analysis); and (ii) Linear analysis approach that assumes infinitesimally small brain deformations leads to unrealistic results.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/letter-to-the-editor-current-progress-in-patient-specific-modeling-by-neal-and-kerckhoffs-2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>A toolbox for developing bioinformatics software</title>
		<link>http://bioinformatics.me/a-toolbox-for-developing-bioinformatics-software/</link>
		<comments>http://bioinformatics.me/a-toolbox-for-developing-bioinformatics-software/#comments</comments>
		<pubDate>Thu, 18 Aug 2011 12:47:42 +0000</pubDate>
		<dc:creator>Waleed Ghalwash</dc:creator>
				<category><![CDATA[Oxford journals]]></category>

		<guid isPermaLink="false">http://bioinformatics.me/a-toolbox-for-developing-bioinformatics-software/</guid>
		<description><![CDATA[Creating useful software is a major activity of many scientists, including bioinformaticians. Nevertheless, software development in an academic setting is often unsystematic, which can lead to problems associated with maintenance and long-term availibility. Unfortunately, well-documented software development methodology is difficult to adopt, and technical measures that directly improve bioinformatic programming have not been described comprehensively. [...]]]></description>
			<content:encoded><![CDATA[<p>Creating useful software is a major activity of many scientists, including bioinformaticians. Nevertheless, software development in an academic setting is often unsystematic, which can lead to problems associated with maintenance and long-term availibility. Unfortunately, well-documented software development methodology is difficult to adopt, and technical measures that directly improve bioinformatic programming have not been described comprehensively. We have examined 22 software projects and have identified a set of practices for software development in an academic environment. We found them useful to plan a project, support the involvement of experts (e.g. experimentalists), and to promote higher quality and maintainability of the resulting programs. This article describes 12 techniques that facilitate a quick start into software engineering. We describe 3 of the 22 projects in detail and give many examples to illustrate the usage of particular techniques. We expect this toolbox to be useful for many bioinformatics programming projects and to the training of scientific programmers.</p>
]]></content:encoded>
			<wfw:commentRss>http://bioinformatics.me/a-toolbox-for-developing-bioinformatics-software/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

