Download presentation
Presentation is loading. Please wait.
Published byRalph Wheetley Modified over 9 years ago
1
IGR-ANNOT: A Multiagent System for InterGenic Regions Annotation Sandro Camargo, João Valiati, Luis Otávio Álvares, Paulo Engel, Sergio Ceroni
2
Introduction The exponential growth of genomic data has led to an absolute requirement for computerized tools to analyze this data. A new genome sequencing does not answer all questions about the organism. Progress is more likely to come from comparing the genomes of different organisms.
3
Introduction There are many tools and techniques to compare complete genomes and coding regions, but there is a lack for techniques for compare non-coding regions of DNA, which contains regulatory elements. Many of the differences between species may be attributed to changes in the regulation of transcription and translation. Transcription and translation are often regulated via elements that lie in intergenic regions.
4
InterGenic Regions Intergenic regions are defined as the sequence between the translational stop of a gene and translational start of the next gene. For obtaining intergenic regions of an organism are necessary: –the complete genome of this organism (the nucleotides sequence) –the information about coding regions (start and stop positions, orientation, and name).
5
InterGenic Regions Our decision was to work with GenBank files because they contain all this necessary information for identifying coding regions, and this information will be used to infer the necessary information about intergenic regions.
6
InterGenic Regions The format design is based on a tabular approach and consists of the following items: –Feature Key: a single word or abbreviation indicating functional group; –Location: instructions for finding a feature; –Qualifiers: auxiliary information about a feature.
7
InterGenic Regions KeyLocation/Qualifiers CDS 23..400 /product=“alcohol dehydrogenase” /gene="adhI" An example of a feature in the feature table.
8
InterGenic Regions InterGenic Regions naming conventions: IGR-O-G1-G2 where O = {F|R|B|X} depending on the previous and next gene orientations, and G1 and G2 are the names coding regions which intergenic regions contains regulatory information.
9
InterGenic Regions Intergenic regions will be written in the GenBank file format using the feature misc_feature. According to the GenBank file format description, this feature key is used for annotate regions of biological interest which cannot be described by any other feature key.
10
IGR-ANNOT Engineering Process The multiagent approach is particularly attractive to this problem because: –information content is heterogeneous. –information can be distributed. –much of the annotation work for each gene can be done by different laboratories using different methodologies for annotate information about genes. We have used MASE and AgentTool to modelling the agent.
11
IGR-ANNOT Engineering Process User Interface Agent (UIA) File Reader Agents (FRA) Gene Agents (GA) InterGenic Regions Agents (IGRA) File Writer Agents (FWA)
12
IGR-ANNOT Engineering Process
13
To implementing this architecture, we have used the Perl language, and it can be run on any suitable platform. Perl have many features, like string manipulation facilities, that become it a very interesting language to working with DNA sequences, besides there are complete packages to implementing multiagent systems.
14
Results Discussion We have extensively used IGR-ANNOT to creating intergenic regions annotation in several genomes of Mycoplasmataceae family. To getting a graphical view of annotation created by our tool we have used the Artemis tool. The next figures are presenting the Mycoplasma Hyopneumoniae 232 genome.
15
Results Discussion
17
Len1Len2%IdyMhyMhy232 458 99,34IGR-F- MP04451_oppB-1 IGR-R-oppB 34534699,42IGR-F- MP0611_MHP0054 IGR-F-mhp057 57457298,26IGR-X- MP07135_rpsO- MP01224_MHP0106 IGR-X-mhp275- rps15 30731693,99IGR-X- MP09826_MHP0309- MP03567_baiH IGR-X-mhp321- baiH
18
Results Discussion Len1Len2%IdyMhyMhy232 1156115798,02IGR-R- MP03198_MHP0344 IGR-R-mhp354 1037103394,49IGR-B- MP18658_MHP0508- MP05045_pdhC IGR-B-mhp502- aceF 395 99,49IGR-B- MP07145_deoC- MP12669_gyrA IGR-B-deoC-gyrA 52854396,69IGR-F-MP02519_lgtIGR-R-lgt
19
Conclusions This system is now successfully in use by biologists at the UFRGS. The result of IGR-ANNOT application provides an easy way to comparing intergenic regions among different organisms. Although the positive results achieved until now in genomes of Mycoplasmataceae family, further tests will be performed, mainly using most complexes genomes.
20
Future Works Create an environment to InterGenic Regions comparison. IGR-ANNOT will be available publicly to other biologists over the web at www.inf.ufrgs.br/~scamargo in software section.
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.