GenDBWiki/DataModel: Difference between revisions

From BRF-Software
Jump to navigation Jump to search
(Created page with "= The GenDB Data Model = Gen``DB is based on a data model with three core types of objects. ''Regions'' describe arbitrary (sub-) sequences. A region can be related to a parent ...")
 
(added images)
Line 3: Line 3:
Gen``DB is based on a data model with three core types of objects. ''Regions'' describe arbitrary (sub-) sequences. A region can be related to a parent region, e.g. a CDS is part of a contig. ''Observations'' correspond to information computed by various tools (e.g. Blast or Inter``Pro) for those regions. ''Annotations'' store the interpretation of a (human) annotator. They describe regions based on the evidence stored in the observations.  
Gen``DB is based on a data model with three core types of objects. ''Regions'' describe arbitrary (sub-) sequences. A region can be related to a parent region, e.g. a CDS is part of a contig. ''Observations'' correspond to information computed by various tools (e.g. Blast or Inter``Pro) for those regions. ''Annotations'' store the interpretation of a (human) annotator. They describe regions based on the evidence stored in the observations.  


attachment:GenDB-UML-DataModel.jpg
[[File:GenDB-UML-DataModel.jpg]]


The figure shown above illustrates the relationships between the different core objects. As can be seen, there is a clear distinction between the results from various bioinformatics tools (observations) and their interpretation (annotations) which was implemented in the data model. While this data model seems very generic, it represents a hierarchy of classes, including the complete EM``BL feature set for prokaryotes with several extensions as illustrated in the figure below.
The figure shown above illustrates the relationships between the different core objects. As can be seen, there is a clear distinction between the results from various bioinformatics tools (observations) and their interpretation (annotations) which was implemented in the data model. While this data model seems very generic, it represents a hierarchy of classes, including the complete EM``BL feature set for prokaryotes with several extensions as illustrated in the figure below.


attachment:GenDB-Regions.jpg
[[File:GenDB-Regions.jpg]]


to be continued ...
to be continued ...

Revision as of 14:16, 28 October 2011

The GenDB Data Model

Gen``DB is based on a data model with three core types of objects. Regions describe arbitrary (sub-) sequences. A region can be related to a parent region, e.g. a CDS is part of a contig. Observations correspond to information computed by various tools (e.g. Blast or Inter``Pro) for those regions. Annotations store the interpretation of a (human) annotator. They describe regions based on the evidence stored in the observations.

GenDB-UML-DataModel.jpg

The figure shown above illustrates the relationships between the different core objects. As can be seen, there is a clear distinction between the results from various bioinformatics tools (observations) and their interpretation (annotations) which was implemented in the data model. While this data model seems very generic, it represents a hierarchy of classes, including the complete EM``BL feature set for prokaryotes with several extensions as illustrated in the figure below.

GenDB-Regions.jpg

to be continued ...