EMMAWiki/WebDocumentation/HowTos/DatasetBrowserHowTo: Difference between revisions

From BRF-Software
Jump to navigation Jump to search
imported>SebastianJuenemann
No edit summary
 
(14 intermediate revisions by 5 users not shown)
Line 1: Line 1:
__NOTOC__
__NOTOC__
= Browse [[DataSets]] =
= Browse DataSets =


This Section describes the use of the Dataset Browser.
This Section describes the use of the Dataset Browser.
A description of shered terms can be found in [[EMMAWiki/WebDocumentation/HelpSystem|HelpSystem]] and shortcuts are explained in [[EMMAWiki/WebDocumentation/Ontology|Ontology]].
A description of shared terms can be found in [[EMMAWiki/WebDocumentation/HelpSystem|HelpSystem]].


This Howto is subdevided in 2 subsections (chronological by menu prompt):
This Howto is divided in 2 subsections (chronological by menu prompt):


* [[#select_dataset|Select Dataset(s)]]
* [[#select_dataset|Select Dataset(s)]]
Line 13: Line 13:
== Select Dataset(s) ==
== Select Dataset(s) ==


The following imgage shows the dialog whith three raw datasets of the current exemplary experiment. Each dataset has an unique Identifier, a self explainig Name and if available a short Description. The Cube-Order, #BA-Dim,. #DE-Dim and #QT-Dim gives you information about the sice and range of the dataset:
The following image shows the dialog with three examples of a raw datasets of a current experiment. Each dataset has an unique Identifier, a self explaining Name and if available a short description. The Cube-Order, #BA-Dim,. #DE-Dim and #QT-Dim gives you information about the size and range of the dataset:
* Cube-Order: lay down the following order but is at current state always '''BDQ''',
* Cube-Order: lay down the following order but is at current state always '''BDQ''',
* #'''B'''A-Dim: show how many arrays where use of this dataset and is also at current state always one,
* #'''B'''A-Dim: show how many arrays were used from this dataset and is also at current state always one,
* #'''D'''E-Dim: shows how many Designelements where used and stands for the number of rows,
* #'''D'''E-Dim: stands for the number of rows. Shows how many Designelements were used.
* #'''Q'''T-Dim: shows how many Quantitationtypes where used and stands for the number of columns.
* #'''Q'''T-Dim: stands for the number of columns. Shows how many Quantitationtypes were used.


The Quality is an index for the user (ranges from very good to insufficient).
The Quality is an index for the user. It ranges from very good to insufficient.


[[Image:EMMAWiki$$WebDocumentation$$HowTos$$DatasetBrowserHowTo$brows2_dataset_dialog.png]]  
[[File:brows2_dataset_dialog.png]]  


  === Show ===
=== Show ===
   Clicking the [[Image:EMMAWiki$$WebDocumentation$$HowTos$$DatasetBrowserHowTo$browse_dataset_dialog_show.png]] button shows the selected dataset(s) in one table as described in [[#browse_dataset|View Dataset(s)]].
   Clicking the ''Show All'' button shows the selected dataset(s) in one table as described in [[#browse_dataset|View Dataset(s)]].


  === Coincid [[QuantitationType]]-Selection ===
=== Coincid [[QuantitationType]]-Selection ===
   The [[Image:EMMAWiki$$WebDocumentation$$HowTos$$DatasetBrowserHowTo$browse_dataset_dialog_coincid.png]] button let you take a preselection of Quantitationtypes which leads in the dialog shown below. Thus you decide for all
   The ''Common Label/Column Selection'' button let you take a preselection of Quantitationtypes which leads to the dialog box (shown below). Thus you can decide which Quantitationtype should be shown for all selected datasets at once. You can get more information about each Quantitationtype by following the link.
  selected dataset at once which Quantitationtype should be shown. You can get more information about each Quantitationtype by following the link.


   [[Image:EMMAWiki$$WebDocumentation$$HowTos$$DatasetBrowserHowTo$browse_dataset_dialog_preselection.png]]
   [[File:browse_dataset_dialog_preselection.png]]


  === Individual [[QuantitationType]]-Selection ===
Remarks:
   A more sensitive preselection occour if you click the [[Image:EMMAWiki$$WebDocumentation$$HowTos$$DatasetBrowserHowTo$browse_dataset_dialog_individual.png]] button. Thus you decide not for all at once but for every selected
* [[QuantitationType]]-Selection currently works only for data display. The exported datasets always contain the full tables.
   dataset individual which Quantitationtype you want. The dialog is almost se same as shown above but as many as selected datasets.
 
=== Individual [[QuantitationType]]-Selection ===
   A more sensitive preselection can be made if you click the ''Individual Label/Column Selection'' button. Here you can select which Quantitationtype you want for each
   dataset. The dialog box is almost the same as the one shown above and includes all selected datasets.
 
Remarks:
* [[QuantitationType]]-Selection currently works only for data display. The exported datasets always contain the full tables.


<span id="browse_dataset"></span>
<span id="browse_dataset"></span>
== View Dataset(s) ==
== View Dataset(s) ==
The datasets are shown as one table which can be easily browsed with the navigation bar. Each [[DesignElement]] and Quantitationtype is linked for getting detailed information.
The datasets are shown as one table which can be browsed easily with the navigation bar. Each [[DesignElement]] and Quantitationtype is linked for further detailed information.
 
[[File:browse_dataset_de_range.png]]
[[File:browse_dataset_table2.png]]
 
=== Data Display ===
 
The actual data is displayed in the table cells. The number format is normally chosen automatically depending on the actual values.
Default is to use floating point representation with  eg <code><nowiki> 12.8032050412329 </nowiki></code> with a maximum of 15 digits. For very large numbers or numbers close to zero, exponential representation will be used, like <code><nowiki> 1.2E10 </nowiki></code> = 1.2 * 10<sup>10</sup> or <code><nowiki> 1.2E-10 </nowiki></code> = 1.2 * 10<sup>-10</sup>.


[[Image:EMMAWiki$$WebDocumentation$$HowTos$$DatasetBrowserHowTo$browse_dataset_de_range.png]]
Remarks:
[[Image:EMMAWiki$$WebDocumentation$$HowTos$$DatasetBrowserHowTo$browse_dataset_table2.png]]
* '''.''' (dot) is used as decimal character, not '','' (comma) for display and export
* The internal precision of the data might be higher than the display precision.
* The number of decimal digits depends on the value, the maximal number of total digits is 15.
* '''NaN''' means Not a Number. It represents a missing or infinite value. Often, values are set to NaN by filter procedures.
* NaN are also found in one-sample statistical tests, for values for the second group. This means, that their calculation is not applicable.


  === Range Designelements ===
=== Dimension Range ===
   The first section of this window is for specifying how many [[DesignElements]] are shown each page. Enter a start and end position in the Dimension-Range label to jump to this
   The first section of this window is for specifying how many rows ([[DesignElement]]) are shown in each page. Entering a start and end position in the Dimension-Range field enables you to go to the given position. With this option you can also set the number of Designelements per page. The field named ''Search'' offers you a simple possibility to restrict the Designelements by the Identifier.  
  position and also to commit the number of Designelements per page. The Search label offers you a simple possibility to restrict the Designelements by the Name.  


  === Sort by Quantitationtypes ===
=== Sort by Quantitationtypes ===
   The blue arrows in the first row of the table let you sort the dataset ascendending or descendending. Also you can choose to sort only the currently shown page or the whole
   The blue arrows in the first row of the table let you sort the dataset in ascendending or descendending order. Also you can choose between sorting the current shown page or the whole
   dataset(s) by switching the radio buttons above the table between '''Sort only this page''' or '''Sort whole dataset'''. If you decide to sort the whole dataset the  
   dataset(s) by switching the radio button above the table between '''Sort only this page''' or '''Sort whole dataset'''. If you pick the option ''whole dataset'' the  
   assortion remains while browsing the dataset until you deactivate the option '''Keep sort order while browsing the Dataset'''. Uncheck this option to reobtain the raw view
   assortion remains while browsing the dataset until you click '''Drop Assortment'''.
  of your table.


  === Filter Data ===
=== Filter Data ===
   The button '''filter data''' opens a new window where you are able to filter the dataset. Start by single out the wsished Quantitationtype in the first drop-down menu then
   The button '''filter data''' opens a new window where you are able to filter the dataset. Start to single out the whished Quantitationtype in the first drop-down menu then
   select an operator and leave the value for the comparison in the right text field. If you want to filter for more than one Quantitationtype or one Quantitationtype with  
   select an operator and leave the value for the comparison in the right text field. If you want to filter for more than one Quantitationtype or one Quantitationtype with  
   different  operation you have to chose '''AND''' at the first drop-down menu of ervery next line you want to include. The More button leads in as many new lines you want.
   different  operation you have to chose '''AND''' at the first drop-down menu of each next line you want to include. The More button leads in as many new lines you want.
   As by the sorting the filter keeps by as long the '''Keep sort order while browsing the Dataset''' option is checked.
   As by the sorting the filter keeps by as long until '''Drop Assortment'''. You can reactivate the sorting/filtering by pressing the '''Keep Assortment''' button. This way
  you switch between the raw dataset and your last performed sorting/filtering.
 
  [[File:browse_dataset_filter.png]]
 
  The following operators available:


   [[Image:EMMAWiki$$WebDocumentation$$HowTos$$DatasetBrowserHowTo$browse_dataset_filter.png]]
* > : greater
* < : less than
* >=: greater or equal
* <= : less or equal
* != : not equal
* approx ''value epsilon'' : get all values approximately equal to ''value'' within an range of +/-''epsilon'' (default 0.001). e.g.:
   ''approx 10 0.1'' will return all rows where the selected column value is between 9.9 and 10.1
* not missing : filter rows where selected column has a good value (not NaN)
* missing : the opposite, return all rows with missing values


  === Export ===
=== Export ===
   It is possible to download the dataset as a tab seperated text file. You can either export the foltered/sorted dataset ore the whole dataset.
   It is possible to download the dataset as a tab seperated text file. You can either export the foltered/sorted dataset ore the whole dataset.

Latest revision as of 13:51, 26 October 2011

Browse DataSets

This Section describes the use of the Dataset Browser. A description of shared terms can be found in HelpSystem.

This Howto is divided in 2 subsections (chronological by menu prompt):

Select Dataset(s)

The following image shows the dialog with three examples of a raw datasets of a current experiment. Each dataset has an unique Identifier, a self explaining Name and if available a short description. The Cube-Order, #BA-Dim,. #DE-Dim and #QT-Dim gives you information about the size and range of the dataset:

  • Cube-Order: lay down the following order but is at current state always BDQ,
  • #BA-Dim: show how many arrays were used from this dataset and is also at current state always one,
  • #DE-Dim: stands for the number of rows. Shows how many Designelements were used.
  • #QT-Dim: stands for the number of columns. Shows how many Quantitationtypes were used.

The Quality is an index for the user. It ranges from very good to insufficient.

Brows2 dataset dialog.png

Show

 Clicking the Show All button shows the selected dataset(s) in one table as described in View Dataset(s).

Coincid QuantitationType-Selection

 The Common Label/Column Selection button let you take a preselection of Quantitationtypes which leads to the dialog box (shown below). Thus you can decide which Quantitationtype should be shown for all selected datasets at once. You can get more information about each Quantitationtype by following the link.
 Browse dataset dialog preselection.png

Remarks:

  • QuantitationType-Selection currently works only for data display. The exported datasets always contain the full tables.

Individual QuantitationType-Selection

 A more sensitive preselection can be made if you click the Individual Label/Column Selection button. Here you can select which Quantitationtype you want for each
 dataset. The dialog box is almost the same as the one shown above and includes all selected datasets.

Remarks:

  • QuantitationType-Selection currently works only for data display. The exported datasets always contain the full tables.

View Dataset(s)

The datasets are shown as one table which can be browsed easily with the navigation bar. Each DesignElement and Quantitationtype is linked for further detailed information.

Browse dataset de range.png Browse dataset table2.png

Data Display

The actual data is displayed in the table cells. The number format is normally chosen automatically depending on the actual values. Default is to use floating point representation with eg 12.8032050412329 with a maximum of 15 digits. For very large numbers or numbers close to zero, exponential representation will be used, like 1.2E10 = 1.2 * 1010 or 1.2E-10 = 1.2 * 10-10.

Remarks:

  • . (dot) is used as decimal character, not , (comma) for display and export
  • The internal precision of the data might be higher than the display precision.
  • The number of decimal digits depends on the value, the maximal number of total digits is 15.
  • NaN means Not a Number. It represents a missing or infinite value. Often, values are set to NaN by filter procedures.
  • NaN are also found in one-sample statistical tests, for values for the second group. This means, that their calculation is not applicable.

Dimension Range

 The first section of this window is for specifying how many rows (DesignElement) are shown in each page. Entering a start and end position in the Dimension-Range field enables you to go to the given position. With this option you can also set the number of Designelements per page. The  field named Search offers you a simple possibility to restrict the Designelements by the Identifier. 

Sort by Quantitationtypes

 The blue arrows in the first row of the table let you sort the dataset in ascendending or descendending order. Also you can choose between sorting the current shown page or the whole
 dataset(s) by switching the radio button above the table between Sort only this page or Sort whole dataset. If you pick the option whole dataset the 
 assortion remains while browsing the dataset until you click Drop Assortment.

Filter Data

 The button filter data opens a new window where you are able to filter the dataset. Start to single out the whished Quantitationtype in the first drop-down menu then
 select an operator and leave the value for the comparison in the right text field. If you want to filter for more than one Quantitationtype or one Quantitationtype with 
 different  operation you have to chose AND at the first drop-down menu of each next line you want to include. The More button leads in as many new lines you want.
 As by the sorting the filter keeps by as long until Drop Assortment. You can reactivate the sorting/filtering by pressing the Keep Assortment button. This way 
 you switch between the raw dataset and your last performed sorting/filtering.
 Browse dataset filter.png
 The following operators available: 
  • > : greater
  • < : less than
  • >=: greater or equal
  • <= : less or equal
  • != : not equal
  • approx value epsilon : get all values approximately equal to value within an range of +/-epsilon (default 0.001). e.g.:
 approx 10 0.1 will return all rows where the selected column value is between 9.9 and 10.1
  • not missing : filter rows where selected column has a good value (not NaN)
  • missing : the opposite, return all rows with missing values

Export

 It is possible to download the dataset as a tab seperated text file. You can either export the foltered/sorted dataset ore the whole dataset.