EMMAWiki/AdministratorDocumentation/ArraylayoutGuide: Difference between revisions
imported>MichaelDondrup No edit summary |
imported>MichaelDondrup No edit summary |
||
Line 4: | Line 4: | ||
== Synopsis == | == Synopsis == | ||
An Array layout describes the physical properties of a micorarray. See these pdf | An Array layout describes the physical properties of a micorarray. See these [[Media:EMMAWiki$$AdministratorDocumentation$$ArraylayoutGuide$arraylayoutOverview.pdf]] slides for a brief overview on the different terms used in EMMA. | ||
'''An arraylayout is essential for your microarray experiment.''' All further processing and analysis is based on the layout. A wrong layout leads to a wrong calculation! Without a valid layout you will be unable to import any data. | '''An arraylayout is essential for your microarray experiment.''' All further processing and analysis is based on the layout. A wrong layout leads to a wrong calculation! Without a valid layout you will be unable to import any data. | ||
Revision as of 12:04, 18 December 2006
Arraylayout Guidelines
Synopsis
An Array layout describes the physical properties of a micorarray. See these Media:EMMAWiki$$AdministratorDocumentation$$ArraylayoutGuide$arraylayoutOverview.pdf slides for a brief overview on the different terms used in EMMA. An arraylayout is essential for your microarray experiment. All further processing and analysis is based on the layout. A wrong layout leads to a wrong calculation! Without a valid layout you will be unable to import any data.
Making an Array layout available to EMMA is a two-step process:
- Make an Array Description Format File (ADF)
- Upload the ADF to EMMA providing few addtional data.
Both steps can be performed by a Maintainer or Chief (that is possibly you :) ) and are described below.
Making an ADF
In order to provide MIAME compliant annotations, mandatory fields in the ADF format for EMMA are the same fields as in ArrayExpress.
Please have a look at the ADF Guidelines at the EBI for detailed descriptions and ADF Checklist at the EBI for a quick overview and checklist before you upload your ADF-file.
If you have a layout in the GAL-format, this can easily be converted to the ADF-format using the spotter file to ADF converter at the EBI. Do not expect to get a valid ADF file from this conversion. You will need to add additional mandatory columns.
Use the ADF checker provided by the EBI to interatively improve your file.
You need to have the role Chief or Maintainer within your project to be able to upload ADF-files to EMMA. So check with the Maintainer of your project, to see how to upload your ADF.
Hints
- The ADF file needs to be in text format. In Excel, use CSV export.
- Use only tab-delimited files
- Quotes
"" or ''
around fields will not work with the ADF-checker - Reporter identifiers must only consist of : A-Z,a-z,0-9,_,-,+,: In particular reporter identifiers may not contain:
() tab [] ;
- Do not try to conceil information on your array design (e.g. reporter sequences, annotation). You will need them later anyway. Let me tell you that the information you try to hide from the world is relevant only for yourself and your workgroup. It seems faulty to assume somebody in your project is going to steal your reporter sequence.
- Double check that Reporter Identifiers are correct. If you have the same reporter spotted in replicates, it needs to have the identical Reporter Identifier at that position. Reporter names or other details are irrelevant for this.
- Use
Reporter [[BioSequence]] Database Entry
columns whenever possible. The entries will be shown as hyperlinks in the interface, if you provide a propper accession. You can provide multiple DB entry columns. Make sure the accession keys work when doing a manual query on the Database with your web-browser. Leave the field empty only if there is no db accession for this sequence. - If you have a list of BRIDGE links, that provide a external reference to a Sequence in GenDB or SAMS (aka.:
o2xr://
) you may useReporter [[BioSequence]] Database Entry [GenDB]
orReporter [[BioSequence]] Database Entry [SAMS]
to denote them. Instead of creating a regular Database entry, a BRIDGE reference for the Sequences is made. Include the full BRIDGE URI not only the id-number. - Empty locations may not be ommited in the layout. Use the Reporter Identifier
Empty
to mark those locations. Set,Reporter Group [role]
toControl
and ControlType tocontrol_empty
. - Use the Reporter Comment field to add additional annotation to your Reporter. This will appear as a Reporter description in EMMA.
- You can leave uncommon fields in you file as a reference and they will be appended to each Reporter description in the form
[[FieldName]]=Value
. - Remember, an ADF file has to be uploaded only once for each ArrayLayout (or array type) not for each individual array.