<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?action=history&amp;feed=atom&amp;title=SAMSWiki%2FWebDocumentation%2FIntroDuction</id>
	<title>SAMSWiki/WebDocumentation/IntroDuction - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?action=history&amp;feed=atom&amp;title=SAMSWiki%2FWebDocumentation%2FIntroDuction"/>
	<link rel="alternate" type="text/html" href="https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=SAMSWiki/WebDocumentation/IntroDuction&amp;action=history"/>
	<updated>2026-04-30T07:20:44Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.39.7</generator>
	<entry>
		<id>https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=SAMSWiki/WebDocumentation/IntroDuction&amp;diff=268&amp;oldid=prev</id>
		<title>Admin: 3 revisions</title>
		<link rel="alternate" type="text/html" href="https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=SAMSWiki/WebDocumentation/IntroDuction&amp;diff=268&amp;oldid=prev"/>
		<updated>2011-10-26T06:13:13Z</updated>

		<summary type="html">&lt;p&gt;3 revisions&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;1&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 08:13, 26 October 2011&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-notice&quot; lang=&quot;en&quot;&gt;&lt;div class=&quot;mw-diff-empty&quot;&gt;(No difference)&lt;/div&gt;
&lt;/td&gt;&lt;/tr&gt;&lt;/table&gt;</summary>
		<author><name>Admin</name></author>
	</entry>
	<entry>
		<id>https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=SAMSWiki/WebDocumentation/IntroDuction&amp;diff=267&amp;oldid=prev</id>
		<title>imported&gt;VirginieMittard at 07:09, 16 October 2006</title>
		<link rel="alternate" type="text/html" href="https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=SAMSWiki/WebDocumentation/IntroDuction&amp;diff=267&amp;oldid=prev"/>
		<updated>2006-10-16T07:09:17Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 09:09, 16 October 2006&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l4&quot;&gt;Line 4:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 4:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Every genome project generates thousands of ESTs or shotgun reads. Roughly 70% of all sequences in GenBank/EMBL/DDBJ are ESTs (Expressed Sequence Tags), which are generated by reverse transcribing mRNAs into complementary cDNAs, and then performing single-pass sequencing on those cDNAs. ESTs thus represent segments of DNA (a few hundred of nucleotides) that code for an mRNA, and are a fast, inexpensive way to determine which genes are being actively transcribed in a tissue at a given stage of development. Other common experimental methods for sequence generation include Sequence-Tagged Sites (STS) used to derive physical maps in genome construction, and Genome Survey Sequence (GSS), short random sequences used commonly to quickly sample the type of DNA sequences t hat could be found in a genome. So Users have high interest in a first look at the DNA sequence content of the individual reads, before they are assembled (in case of shotgun reads) or clustered (in case of ESTs). Several steps are necessary to provide the researcher high quality sequences, as well as an overview of their content. For all these purposes we have implemented some additional extensions to GenDB within the SAMS system. SAMS is a simple, easy to install and maintain open source system that provides the mechanisms to run a variety of tools on each read/EST, presenting the results in a web form. The EST pipeline is consisting of four parts (figure 1.1):&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Every genome project generates thousands of ESTs or shotgun reads. Roughly 70% of all sequences in GenBank/EMBL/DDBJ are ESTs (Expressed Sequence Tags), which are generated by reverse transcribing mRNAs into complementary cDNAs, and then performing single-pass sequencing on those cDNAs. ESTs thus represent segments of DNA (a few hundred of nucleotides) that code for an mRNA, and are a fast, inexpensive way to determine which genes are being actively transcribed in a tissue at a given stage of development. Other common experimental methods for sequence generation include Sequence-Tagged Sites (STS) used to derive physical maps in genome construction, and Genome Survey Sequence (GSS), short random sequences used commonly to quickly sample the type of DNA sequences t hat could be found in a genome. So Users have high interest in a first look at the DNA sequence content of the individual reads, before they are assembled (in case of shotgun reads) or clustered (in case of ESTs). Several steps are necessary to provide the researcher high quality sequences, as well as an overview of their content. For all these purposes we have implemented some additional extensions to GenDB within the SAMS system. SAMS is a simple, easy to install and maintain open source system that provides the mechanisms to run a variety of tools on each read/EST, presenting the results in a web form. The EST pipeline is consisting of four parts (figure 1.1):&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Pre-processing&lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;/PreProcessing&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Clustering&lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;/ClusTering&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Assembly&lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;/AssemBly&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;Analysis&lt;/del&gt;&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* &lt;ins style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;/AnaLysis&lt;/ins&gt;&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The preprocessing begins with importing the sequences, then running user-defined quality clipping, vector clipping against a standard database or custom sequences, and repeat masking. In the second part the pipeline continues with clustering the ESTs with user-specific parameters and afterwards the assembly is carried out, generating Tentative Consensus Sequences (TCs). Finally, the user can run the automatic annotation of the ESTs and TCs, which will launch BLAST searches against several database, such as KEGG and SwissProt, as well as InterPro and Pfam. Once the annotation is done, the user can manually go through it, checking for correctness.  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The preprocessing begins with importing the sequences, then running user-defined quality clipping, vector clipping against a standard database or custom sequences, and repeat masking. In the second part the pipeline continues with clustering the ESTs with user-specific parameters and afterwards the assembly is carried out, generating Tentative Consensus Sequences (TCs). Finally, the user can run the automatic annotation of the ESTs and TCs, which will launch BLAST searches against several database, such as KEGG and SwissProt, as well as InterPro and Pfam. Once the annotation is done, the user can manually go through it, checking for correctness.  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;attachment: file.png (Figure 1.1: EST pipeline)&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;attachment: file.png (Figure 1.1: EST pipeline)&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>imported&gt;VirginieMittard</name></author>
	</entry>
	<entry>
		<id>https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=SAMSWiki/WebDocumentation/IntroDuction&amp;diff=266&amp;oldid=prev</id>
		<title>imported&gt;VirginieMittard at 07:05, 16 October 2006</title>
		<link rel="alternate" type="text/html" href="https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=SAMSWiki/WebDocumentation/IntroDuction&amp;diff=266&amp;oldid=prev"/>
		<updated>2006-10-16T07:05:31Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 09:05, 16 October 2006&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l2&quot;&gt;Line 2:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 2:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Introduction of &amp;quot;Sequence Analysis and Management System&amp;quot; (SAMS) and general concepts&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Introduction of &amp;quot;Sequence Analysis and Management System&amp;quot; (SAMS) and general concepts&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Every genome project generates thousands of ESTs or shotgun reads. Roughly 70% of all sequences in &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;[[&lt;/del&gt;GenBank&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;]]&lt;/del&gt;/EMBL/DDBJ are ESTs (Expressed Sequence Tags), which are generated by reverse transcribing mRNAs into complementary cDNAs, and then performing single-pass sequencing on those cDNAs. ESTs thus represent segments of DNA (a few hundred of nucleotides) that code for an mRNA, and are a fast, inexpensive way to determine which genes are being actively transcribed in a tissue at a given stage of development. Other common experimental methods for sequence generation include Sequence-Tagged Sites (STS) used to derive physical maps in genome construction, and Genome Survey Sequence (GSS), short random sequences used commonly to quickly sample the type of DNA sequences t hat could be found in a genome. So Users have high interest in a first look at the DNA sequence content of the individual reads, before they are assembled (in case of shotgun reads) or clustered (in case of ESTs). Several steps are necessary to provide the researcher high quality sequences, as well as an overview of their content. For all these purposes we have implemented some additional extensions to GenDB within the SAMS system. SAMS is a simple, easy to install and maintain open source system that provides the mechanisms to run a variety of tools on each read/EST, presenting the results in a web form. The EST pipeline is consisting of four parts (figure 1.1):&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;Every genome project generates thousands of ESTs or shotgun reads. Roughly 70% of all sequences in GenBank/EMBL/DDBJ are ESTs (Expressed Sequence Tags), which are generated by reverse transcribing mRNAs into complementary cDNAs, and then performing single-pass sequencing on those cDNAs. ESTs thus represent segments of DNA (a few hundred of nucleotides) that code for an mRNA, and are a fast, inexpensive way to determine which genes are being actively transcribed in a tissue at a given stage of development. Other common experimental methods for sequence generation include Sequence-Tagged Sites (STS) used to derive physical maps in genome construction, and Genome Survey Sequence (GSS), short random sequences used commonly to quickly sample the type of DNA sequences t hat could be found in a genome. So Users have high interest in a first look at the DNA sequence content of the individual reads, before they are assembled (in case of shotgun reads) or clustered (in case of ESTs). Several steps are necessary to provide the researcher high quality sequences, as well as an overview of their content. For all these purposes we have implemented some additional extensions to GenDB within the SAMS system. SAMS is a simple, easy to install and maintain open source system that provides the mechanisms to run a variety of tools on each read/EST, presenting the results in a web form. The EST pipeline is consisting of four parts (figure 1.1):&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Pre-processing&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Pre-processing&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l9&quot;&gt;Line 9:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 9:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Analysis&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* Analysis&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;−&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The preprocessing begins with importing the sequences, then running user-defined quality clipping, vector clipping against a standard database or custom sequences, and repeat masking. In the second part the pipeline continues with clustering the ESTs with user-specific parameters and afterwards the assembly is carried out, generating Tentative Consensus Sequences (TCs). Finally, the user can run the automatic annotation of the ESTs and TCs, which will launch BLAST searches against several database, such as KEGG and &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;[[&lt;/del&gt;SwissProt&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;]]&lt;/del&gt;, as well as &lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;[[&lt;/del&gt;InterPro&lt;del style=&quot;font-weight: bold; text-decoration: none;&quot;&gt;]] &lt;/del&gt;and Pfam. Once the annotation is done, the user can manually go through it, checking for correctness.  &lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;&lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;The preprocessing begins with importing the sequences, then running user-defined quality clipping, vector clipping against a standard database or custom sequences, and repeat masking. In the second part the pipeline continues with clustering the ESTs with user-specific parameters and afterwards the assembly is carried out, generating Tentative Consensus Sequences (TCs). Finally, the user can run the automatic annotation of the ESTs and TCs, which will launch BLAST searches against several database, such as KEGG and SwissProt, as well as InterPro and Pfam. Once the annotation is done, the user can manually go through it, checking for correctness.  &lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br/&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;attachment: file.png (Figure 1.1: EST pipeline)&lt;/div&gt;&lt;/td&gt;&lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;&lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;attachment: file.png (Figure 1.1: EST pipeline)&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>imported&gt;VirginieMittard</name></author>
	</entry>
	<entry>
		<id>https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=SAMSWiki/WebDocumentation/IntroDuction&amp;diff=265&amp;oldid=prev</id>
		<title>imported&gt;VirginieMittard at 10:22, 11 October 2006</title>
		<link rel="alternate" type="text/html" href="https://wiki.cebitec.uni-bielefeld.de/brf-software/index.php?title=SAMSWiki/WebDocumentation/IntroDuction&amp;diff=265&amp;oldid=prev"/>
		<updated>2006-10-11T10:22:10Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;__NOTOC__&lt;br /&gt;
Introduction of &amp;quot;Sequence Analysis and Management System&amp;quot; (SAMS) and general concepts&lt;br /&gt;
&lt;br /&gt;
Every genome project generates thousands of ESTs or shotgun reads. Roughly 70% of all sequences in [[GenBank]]/EMBL/DDBJ are ESTs (Expressed Sequence Tags), which are generated by reverse transcribing mRNAs into complementary cDNAs, and then performing single-pass sequencing on those cDNAs. ESTs thus represent segments of DNA (a few hundred of nucleotides) that code for an mRNA, and are a fast, inexpensive way to determine which genes are being actively transcribed in a tissue at a given stage of development. Other common experimental methods for sequence generation include Sequence-Tagged Sites (STS) used to derive physical maps in genome construction, and Genome Survey Sequence (GSS), short random sequences used commonly to quickly sample the type of DNA sequences t hat could be found in a genome. So Users have high interest in a first look at the DNA sequence content of the individual reads, before they are assembled (in case of shotgun reads) or clustered (in case of ESTs). Several steps are necessary to provide the researcher high quality sequences, as well as an overview of their content. For all these purposes we have implemented some additional extensions to GenDB within the SAMS system. SAMS is a simple, easy to install and maintain open source system that provides the mechanisms to run a variety of tools on each read/EST, presenting the results in a web form. The EST pipeline is consisting of four parts (figure 1.1):&lt;br /&gt;
&lt;br /&gt;
* Pre-processing&lt;br /&gt;
* Clustering&lt;br /&gt;
* Assembly&lt;br /&gt;
* Analysis&lt;br /&gt;
&lt;br /&gt;
The preprocessing begins with importing the sequences, then running user-defined quality clipping, vector clipping against a standard database or custom sequences, and repeat masking. In the second part the pipeline continues with clustering the ESTs with user-specific parameters and afterwards the assembly is carried out, generating Tentative Consensus Sequences (TCs). Finally, the user can run the automatic annotation of the ESTs and TCs, which will launch BLAST searches against several database, such as KEGG and [[SwissProt]], as well as [[InterPro]] and Pfam. Once the annotation is done, the user can manually go through it, checking for correctness. &lt;br /&gt;
&lt;br /&gt;
attachment: file.png (Figure 1.1: EST pipeline)&lt;/div&gt;</summary>
		<author><name>imported&gt;VirginieMittard</name></author>
	</entry>
</feed>