Biological Databases Morten Nielsen BioSys, DTU
Different kinds of data DNA –NCBI GenBankNCBI GenBank –Organism specific databases Protein –UniProt SwissProt TrEMBL –NCBINCBI
Different kinds of data Protein Structure –PDBPDB Expression Data –NCBI GeoNCBI Geo Epitopes –IEDBIEDB
PDB
IEDB Immune Epitope Database:
UniProt UniProt database
Data redundancy! Databases have non-biological redundancy This is problematic when training data- driven prediction methods –As you saw for PSSM construction Uniprot has a feature to remove redundancy (90% or 50%). How is this done? This and much more you will find out in the next episode of...