<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?action=history&amp;feed=atom&amp;title=GenDBWiki%2FToolAndJobConcept</id>
	<title>GenDBWiki/ToolAndJobConcept - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?action=history&amp;feed=atom&amp;title=GenDBWiki%2FToolAndJobConcept"/>
	<link rel="alternate" type="text/html" href="https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=GenDBWiki/ToolAndJobConcept&amp;action=history"/>
	<updated>2026-06-26T11:37:01Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.39.7</generator>
	<entry>
		<id>https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=GenDBWiki/ToolAndJobConcept&amp;diff=3456&amp;oldid=prev</id>
		<title>Agoesman at 14:45, 31 October 2011</title>
		<link rel="alternate" type="text/html" href="https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=GenDBWiki/ToolAndJobConcept&amp;diff=3456&amp;oldid=prev"/>
		<updated>2011-10-31T14:45:05Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 16:45, 31 October 2011&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l1&quot;&gt;Line 1:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 1:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;= The GenDB Tool and Job Concept =&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;= The GenDB Tool and Job Concept =&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;One major improvement of the &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Gen``DB &lt;/del&gt;system in comparison to the first version, is the modular concept for the integration of bioinformatics tools (e.g. Blast). &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Gen``DB &lt;/del&gt;allows the incorporation of arbitrary programs for different kinds of bioinformatics analysis. According to the system design, each of these programs is integrated as a &amp;#039;&amp;#039;Tool&amp;#039;&amp;#039; (e.g. &amp;#039;&amp;#039;Tool::Function::Blast&amp;#039;&amp;#039;), which creates &amp;#039;&amp;#039;Observations&amp;#039;&amp;#039; for a specific kind of &amp;#039;&amp;#039;Region&amp;#039;&amp;#039;. A &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; that can be submitted to the scheduling system thus contains the information about a valid tool and region combination as illustrated below.  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;One major improvement of the &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;GenDB &lt;/ins&gt;system in comparison to the first version, is the modular concept for the integration of bioinformatics tools (e.g. Blast). &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;GenDB &lt;/ins&gt;allows the incorporation of arbitrary programs for different kinds of bioinformatics analysis. According to the system design, each of these programs is integrated as a &amp;#039;&amp;#039;Tool&amp;#039;&amp;#039; (e.g. &amp;#039;&amp;#039;Tool::Function::Blast&amp;#039;&amp;#039;), which creates &amp;#039;&amp;#039;Observations&amp;#039;&amp;#039; for a specific kind of &amp;#039;&amp;#039;Region&amp;#039;&amp;#039;. A &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; that can be submitted to the scheduling system thus contains the information about a valid tool and region combination as illustrated below.  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[File:ToolConcept.png]]&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;[[File:ToolConcept.png]]&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;For most tools, &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Gen``DB &lt;/del&gt;also features simple automatic annotators that can be activated. They are started upon completion of a tool run and create automatic annotations employing a simple &amp;quot;best hit&amp;quot; strategy based on the observations created by the tool run.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;For most tools, &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;GenDB &lt;/ins&gt;also features simple automatic annotators that can be activated. They are started upon completion of a tool run and create automatic annotations employing a simple &amp;quot;best hit&amp;quot; strategy based on the observations created by the tool run.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;For an automated large scale computation of various bioinformatics tools, a scalable framework was developed and implemented which allows a batch submission of thousands of &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; in a very simple manner. Therefore, the following steps have to be performed:&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;For an automated large scale computation of various bioinformatics tools, a scalable framework was developed and implemented which allows a batch submission of thousands of &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; in a very simple manner. Therefore, the following steps have to be performed:&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l17&quot;&gt;Line 17:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 17:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;4. When such a command line is executed by one of the compute hosts, the script &amp;#039;&amp;#039;runtool.pl&amp;#039;&amp;#039; tries to initialize the &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; object for the given id and project name. Since a &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; contains the information about a specific region and a single tool that should be computed for that region, this script can now execute the &amp;#039;&amp;#039;run&amp;#039;&amp;#039; method that has to be defined for each tool. Such a &amp;#039;&amp;#039;run&amp;#039;&amp;#039; method normally starts a bioinformatics tool (e.g. Blast, Pfam, InterPro) for the given region and stores some observations for the results obtained. During this computation the status of the current &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; is &amp;#039;&amp;#039;RUNNING&amp;#039;&amp;#039;. If the option &amp;#039;&amp;#039;-a&amp;#039;&amp;#039; was specified an automatic annotation will be started upon successful computation of the tool. These are only very simple automatic annotations since they are based on the results of a single tool and region combination. Whenever the computation itself or the automatic annotation fails, the status of a &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; is set to &amp;#039;&amp;#039;FAILED&amp;#039;&amp;#039;, otherwise the status is &amp;#039;&amp;#039;FINISHED&amp;#039;&amp;#039; and the computation is complete.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;4. When such a command line is executed by one of the compute hosts, the script &amp;#039;&amp;#039;runtool.pl&amp;#039;&amp;#039; tries to initialize the &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; object for the given id and project name. Since a &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; contains the information about a specific region and a single tool that should be computed for that region, this script can now execute the &amp;#039;&amp;#039;run&amp;#039;&amp;#039; method that has to be defined for each tool. Such a &amp;#039;&amp;#039;run&amp;#039;&amp;#039; method normally starts a bioinformatics tool (e.g. Blast, Pfam, InterPro) for the given region and stores some observations for the results obtained. During this computation the status of the current &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; is &amp;#039;&amp;#039;RUNNING&amp;#039;&amp;#039;. If the option &amp;#039;&amp;#039;-a&amp;#039;&amp;#039; was specified an automatic annotation will be started upon successful computation of the tool. These are only very simple automatic annotations since they are based on the results of a single tool and region combination. Whenever the computation itself or the automatic annotation fails, the status of a &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; is set to &amp;#039;&amp;#039;FAILED&amp;#039;&amp;#039;, otherwise the status is &amp;#039;&amp;#039;FINISHED&amp;#039;&amp;#039; and the computation is complete.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The inclusion of new tools in &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Gen``DB &lt;/del&gt;is very easy, with the most time-consuming step typically being the implementation of a parser for the result files. For the prediction of regions, such as coding sequences (CDS) or tRNAs, GLIMMER, CRITICA, tRNAscan-SE, and others have been integrated into the system.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The inclusion of new tools in &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;GenDB &lt;/ins&gt;is very easy, with the most time-consuming step typically being the implementation of a parser for the result files. For the prediction of regions, such as coding sequences (CDS) or tRNAs, GLIMMER, CRITICA, tRNAscan-SE, and others have been integrated into the system.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Homology searches on DNA or amino acid level in arbitrary sequence databases can be done using the Blast program suite. In addition to using HMMer for motif searches, we also search the BLOCKS and InterPro databases to classify sequence data based on a combination of different kinds of motif search tools. A number of additional tools have been integrated for the characterization of certain features of coding sequences, such as TMHMM for the prediction of alpha-helical transmembrane regions, SignalP for signal peptide prediction, or CoBias for analyzing trends in codon usage.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Homology searches on DNA or amino acid level in arbitrary sequence databases can be done using the Blast program suite. In addition to using HMMer for motif searches, we also search the BLOCKS and InterPro databases to classify sequence data based on a combination of different kinds of motif search tools. A number of additional tools have been integrated for the characterization of certain features of coding sequences, such as TMHMM for the prediction of alpha-helical transmembrane regions, SignalP for signal peptide prediction, or CoBias for analyzing trends in codon usage.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Agoesman</name></author>
	</entry>
	<entry>
		<id>https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=GenDBWiki/ToolAndJobConcept&amp;diff=3376&amp;oldid=prev</id>
		<title>Tk at 12:26, 28 October 2011</title>
		<link rel="alternate" type="text/html" href="https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=GenDBWiki/ToolAndJobConcept&amp;diff=3376&amp;oldid=prev"/>
		<updated>2011-10-28T12:26:52Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 14:26, 28 October 2011&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l9&quot;&gt;Line 9:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 9:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;For an automated large scale computation of various bioinformatics tools, a scalable framework was developed and implemented which allows a batch submission of thousands of &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; in a very simple manner. Therefore, the following steps have to be performed:&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;For an automated large scale computation of various bioinformatics tools, a scalable framework was developed and implemented which allows a batch submission of thousands of &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; in a very simple manner. Therefore, the following steps have to be performed:&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; &lt;/del&gt;1. The desired &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; have to be created, e.g. for region or function prediction by using the &amp;#039;&amp;#039;JobSubmitter Wizard&amp;#039;&amp;#039;. This can be done quite easily with the &amp;#039;&amp;#039;submit_job.pl&amp;#039;&amp;#039; script or via the graphical user interface. For all valid region and tool combinations as defined by the user, the requested &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; will be created and stored in the Gen``DB project database. Initially, these new &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; will then have the status &amp;#039;&amp;#039;PENDING&amp;#039;&amp;#039;.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;1. The desired &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; have to be created, e.g. for region or function prediction by using the &amp;#039;&amp;#039;JobSubmitter Wizard&amp;#039;&amp;#039;. This can be done quite easily with the &amp;#039;&amp;#039;submit_job.pl&amp;#039;&amp;#039; script or via the graphical user interface. For all valid region and tool combinations as defined by the user, the requested &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; will be created and stored in the Gen``DB project database. Initially, these new &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; will then have the status &amp;#039;&amp;#039;PENDING&amp;#039;&amp;#039;.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; &lt;/del&gt;2. Before the &amp;#039;&amp;#039;submit_job.pl&amp;#039;&amp;#039; script finishes, it calls the &amp;#039;&amp;#039;submit&amp;#039;&amp;#039; method of the &amp;#039;&amp;#039;JobSubmitter Wizard&amp;#039;&amp;#039;. Thus, all previously created &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; will be registered as a &amp;#039;&amp;#039;Job Array&amp;#039;&amp;#039; in the &amp;#039;&amp;#039;Scheduler::Codine&amp;#039;&amp;#039; using the &amp;#039;&amp;#039;Scheduler::Codine-&amp;gt;freeze&amp;#039;&amp;#039; method. Finally, the array of all &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; is submitted by calling &amp;#039;&amp;#039;Scheduler::Codine-&amp;gt;thaw&amp;#039;&amp;#039;. All &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; should now have the status &amp;#039;&amp;#039;SUBMITTED&amp;#039;&amp;#039; and a queue of &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; should appear in the status report of the Sun GridEngine&amp;#039;s &amp;#039;&amp;#039;qstat&amp;#039;&amp;#039; output.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;2. Before the &amp;#039;&amp;#039;submit_job.pl&amp;#039;&amp;#039; script finishes, it calls the &amp;#039;&amp;#039;submit&amp;#039;&amp;#039; method of the &amp;#039;&amp;#039;JobSubmitter Wizard&amp;#039;&amp;#039;. Thus, all previously created &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; will be registered as a &amp;#039;&amp;#039;Job Array&amp;#039;&amp;#039; in the &amp;#039;&amp;#039;Scheduler::Codine&amp;#039;&amp;#039; using the &amp;#039;&amp;#039;Scheduler::Codine-&amp;gt;freeze&amp;#039;&amp;#039; method. Finally, the array of all &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; is submitted by calling &amp;#039;&amp;#039;Scheduler::Codine-&amp;gt;thaw&amp;#039;&amp;#039;. All &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; should now have the status &amp;#039;&amp;#039;SUBMITTED&amp;#039;&amp;#039; and a queue of &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; should appear in the status report of the Sun GridEngine&amp;#039;s &amp;#039;&amp;#039;qstat&amp;#039;&amp;#039; output.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; &lt;/del&gt;3. In the previous step, each &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; was submitted to the scheduler by adding the command line for each single &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; computation to the list of &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039;. Actually, the script &amp;#039;&amp;#039;runtool.pl&amp;#039;&amp;#039; is called for each &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; with the corresponding arguments such as &amp;#039;&amp;#039;runtool.pl -p &amp;lt;projectname&amp;gt; -j &amp;lt;jobid&amp;gt; [-a]&amp;#039;&amp;#039;.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;3. In the previous step, each &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; was submitted to the scheduler by adding the command line for each single &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; computation to the list of &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039;. Actually, the script &amp;#039;&amp;#039;runtool.pl&amp;#039;&amp;#039; is called for each &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; with the corresponding arguments such as &amp;#039;&amp;#039;runtool.pl -p &amp;lt;projectname&amp;gt; -j &amp;lt;jobid&amp;gt; [-a]&amp;#039;&amp;#039;.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt; &lt;/del&gt;4. When such a command line is executed by one of the compute hosts, the script &amp;#039;&amp;#039;runtool.pl&amp;#039;&amp;#039; tries to initialize the &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; object for the given id and project name. Since a &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; contains the information about a specific region and a single tool that should be computed for that region, this script can now execute the &amp;#039;&amp;#039;run&amp;#039;&amp;#039; method that has to be defined for each tool. Such a &amp;#039;&amp;#039;run&amp;#039;&amp;#039; method normally starts a bioinformatics tool (e.g. Blast, Pfam, InterPro) for the given region and stores some observations for the results obtained. During this computation the status of the current &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; is &amp;#039;&amp;#039;RUNNING&amp;#039;&amp;#039;. If the option &amp;#039;&amp;#039;-a&amp;#039;&amp;#039; was specified an automatic annotation will be started upon successful computation of the tool. These are only very simple automatic annotations since they are based on the results of a single tool and region combination. Whenever the computation itself or the automatic annotation fails, the status of a &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; is set to &amp;#039;&amp;#039;FAILED&amp;#039;&amp;#039;, otherwise the status is &amp;#039;&amp;#039;FINISHED&amp;#039;&amp;#039; and the computation is complete.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;4. When such a command line is executed by one of the compute hosts, the script &amp;#039;&amp;#039;runtool.pl&amp;#039;&amp;#039; tries to initialize the &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; object for the given id and project name. Since a &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; contains the information about a specific region and a single tool that should be computed for that region, this script can now execute the &amp;#039;&amp;#039;run&amp;#039;&amp;#039; method that has to be defined for each tool. Such a &amp;#039;&amp;#039;run&amp;#039;&amp;#039; method normally starts a bioinformatics tool (e.g. Blast, Pfam, InterPro) for the given region and stores some observations for the results obtained. During this computation the status of the current &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; is &amp;#039;&amp;#039;RUNNING&amp;#039;&amp;#039;. If the option &amp;#039;&amp;#039;-a&amp;#039;&amp;#039; was specified an automatic annotation will be started upon successful computation of the tool. These are only very simple automatic annotations since they are based on the results of a single tool and region combination. Whenever the computation itself or the automatic annotation fails, the status of a &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; is set to &amp;#039;&amp;#039;FAILED&amp;#039;&amp;#039;, otherwise the status is &amp;#039;&amp;#039;FINISHED&amp;#039;&amp;#039; and the computation is complete.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The inclusion of new tools in Gen``DB is very easy, with the most time-consuming step typically being the implementation of a parser for the result files. For the prediction of regions, such as coding sequences (CDS) or tRNAs, GLIMMER, CRITICA, tRNAscan-SE, and others have been integrated into the system.&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The inclusion of new tools in Gen``DB is very easy, with the most time-consuming step typically being the implementation of a parser for the result files. For the prediction of regions, such as coding sequences (CDS) or tRNAs, GLIMMER, CRITICA, tRNAscan-SE, and others have been integrated into the system.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Tk</name></author>
	</entry>
	<entry>
		<id>https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=GenDBWiki/ToolAndJobConcept&amp;diff=3375&amp;oldid=prev</id>
		<title>Tk: Created page with &quot;= The GenDB Tool and Job Concept =  One major improvement of the Gen``DB system in comparison to the first version, is the modular concept for the integration of bioinformatics t...&quot;</title>
		<link rel="alternate" type="text/html" href="https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=GenDBWiki/ToolAndJobConcept&amp;diff=3375&amp;oldid=prev"/>
		<updated>2011-10-28T12:26:31Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;= The GenDB Tool and Job Concept =  One major improvement of the Gen``DB system in comparison to the first version, is the modular concept for the integration of bioinformatics t...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;= The GenDB Tool and Job Concept =&lt;br /&gt;
&lt;br /&gt;
One major improvement of the Gen``DB system in comparison to the first version, is the modular concept for the integration of bioinformatics tools (e.g. Blast). Gen``DB allows the incorporation of arbitrary programs for different kinds of bioinformatics analysis. According to the system design, each of these programs is integrated as a &amp;#039;&amp;#039;Tool&amp;#039;&amp;#039; (e.g. &amp;#039;&amp;#039;Tool::Function::Blast&amp;#039;&amp;#039;), which creates &amp;#039;&amp;#039;Observations&amp;#039;&amp;#039; for a specific kind of &amp;#039;&amp;#039;Region&amp;#039;&amp;#039;. A &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; that can be submitted to the scheduling system thus contains the information about a valid tool and region combination as illustrated below. &lt;br /&gt;
&lt;br /&gt;
[[File:ToolConcept.png]]&lt;br /&gt;
&lt;br /&gt;
For most tools, Gen``DB also features simple automatic annotators that can be activated. They are started upon completion of a tool run and create automatic annotations employing a simple &amp;quot;best hit&amp;quot; strategy based on the observations created by the tool run.&lt;br /&gt;
&lt;br /&gt;
For an automated large scale computation of various bioinformatics tools, a scalable framework was developed and implemented which allows a batch submission of thousands of &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; in a very simple manner. Therefore, the following steps have to be performed:&lt;br /&gt;
&lt;br /&gt;
 1. The desired &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; have to be created, e.g. for region or function prediction by using the &amp;#039;&amp;#039;JobSubmitter Wizard&amp;#039;&amp;#039;. This can be done quite easily with the &amp;#039;&amp;#039;submit_job.pl&amp;#039;&amp;#039; script or via the graphical user interface. For all valid region and tool combinations as defined by the user, the requested &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; will be created and stored in the Gen``DB project database. Initially, these new &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; will then have the status &amp;#039;&amp;#039;PENDING&amp;#039;&amp;#039;.&lt;br /&gt;
&lt;br /&gt;
 2. Before the &amp;#039;&amp;#039;submit_job.pl&amp;#039;&amp;#039; script finishes, it calls the &amp;#039;&amp;#039;submit&amp;#039;&amp;#039; method of the &amp;#039;&amp;#039;JobSubmitter Wizard&amp;#039;&amp;#039;. Thus, all previously created &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; will be registered as a &amp;#039;&amp;#039;Job Array&amp;#039;&amp;#039; in the &amp;#039;&amp;#039;Scheduler::Codine&amp;#039;&amp;#039; using the &amp;#039;&amp;#039;Scheduler::Codine-&amp;gt;freeze&amp;#039;&amp;#039; method. Finally, the array of all &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; is submitted by calling &amp;#039;&amp;#039;Scheduler::Codine-&amp;gt;thaw&amp;#039;&amp;#039;. All &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; should now have the status &amp;#039;&amp;#039;SUBMITTED&amp;#039;&amp;#039; and a queue of &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039; should appear in the status report of the Sun GridEngine&amp;#039;s &amp;#039;&amp;#039;qstat&amp;#039;&amp;#039; output.&lt;br /&gt;
&lt;br /&gt;
 3. In the previous step, each &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; was submitted to the scheduler by adding the command line for each single &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; computation to the list of &amp;#039;&amp;#039;Jobs&amp;#039;&amp;#039;. Actually, the script &amp;#039;&amp;#039;runtool.pl&amp;#039;&amp;#039; is called for each &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; with the corresponding arguments such as &amp;#039;&amp;#039;runtool.pl -p &amp;lt;projectname&amp;gt; -j &amp;lt;jobid&amp;gt; [-a]&amp;#039;&amp;#039;.&lt;br /&gt;
&lt;br /&gt;
 4. When such a command line is executed by one of the compute hosts, the script &amp;#039;&amp;#039;runtool.pl&amp;#039;&amp;#039; tries to initialize the &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; object for the given id and project name. Since a &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; contains the information about a specific region and a single tool that should be computed for that region, this script can now execute the &amp;#039;&amp;#039;run&amp;#039;&amp;#039; method that has to be defined for each tool. Such a &amp;#039;&amp;#039;run&amp;#039;&amp;#039; method normally starts a bioinformatics tool (e.g. Blast, Pfam, InterPro) for the given region and stores some observations for the results obtained. During this computation the status of the current &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; is &amp;#039;&amp;#039;RUNNING&amp;#039;&amp;#039;. If the option &amp;#039;&amp;#039;-a&amp;#039;&amp;#039; was specified an automatic annotation will be started upon successful computation of the tool. These are only very simple automatic annotations since they are based on the results of a single tool and region combination. Whenever the computation itself or the automatic annotation fails, the status of a &amp;#039;&amp;#039;Job&amp;#039;&amp;#039; is set to &amp;#039;&amp;#039;FAILED&amp;#039;&amp;#039;, otherwise the status is &amp;#039;&amp;#039;FINISHED&amp;#039;&amp;#039; and the computation is complete.&lt;br /&gt;
&lt;br /&gt;
The inclusion of new tools in Gen``DB is very easy, with the most time-consuming step typically being the implementation of a parser for the result files. For the prediction of regions, such as coding sequences (CDS) or tRNAs, GLIMMER, CRITICA, tRNAscan-SE, and others have been integrated into the system.&lt;br /&gt;
&lt;br /&gt;
Homology searches on DNA or amino acid level in arbitrary sequence databases can be done using the Blast program suite. In addition to using HMMer for motif searches, we also search the BLOCKS and InterPro databases to classify sequence data based on a combination of different kinds of motif search tools. A number of additional tools have been integrated for the characterization of certain features of coding sequences, such as TMHMM for the prediction of alpha-helical transmembrane regions, SignalP for signal peptide prediction, or CoBias for analyzing trends in codon usage.&lt;br /&gt;
&lt;br /&gt;
Since all tools have to be defined separately for each project, a tool configuration wizard was implemented to support this task.&lt;br /&gt;
&lt;br /&gt;
Whereas some tools only return a numeric score and/or an E-value as a result, other tools like Blast or HMMer additionally provide more detailed information, such as an alignment. Although the complete tool results are available to the annotator, only a minimum data subset is stored in form of observations. Based on this subset, the complete tool result record can be recomputed on demand. Storing only a minimal subset of data reduces the storage demands by two orders of magnitude when compared to the traditional &amp;quot;store everything&amp;quot; approach. Our performance measurements have shown this also to be more time efficient than data retrieval from a disk subsystem for any realistic genome project.&lt;/div&gt;</summary>
		<author><name>Tk</name></author>
	</entry>
</feed>