SAMSWiki/WebDocumentation/IntroDuction/AnaLysis: Difference between revisions
Jump to navigation
Jump to search
imported>VirginieMittard No edit summary |
m (1 revision) |
(No difference)
|
Latest revision as of 07:15, 26 October 2011
Analysis using the SAMS pipeline:
- 1) Pre-processing
Four steps are used for the pre-processing in SAMS:
- Quality clipping using PHRED (Q= 13)
- Repeat masking using RepeatMasker where the regions are masked by Xs.
- Vector sequence masking using vector clipper.
- Sequences shorter than 50 bp are removed.
- 2) Clustering
SAMS is using the BioMake tool developped in the group by Thomas Bekel.
- Performs an all against all BLAST
- Builds a graph according to found overlaps
The minimum overlap's length is 40 bp and 95% of required similarity and a maximum number of mismatches of 20 bp.
- 3) Assembly
SAMS is using the CAP3 program. Fore more information, please have a look at the CAP3 paragraph.