The Open Protein Structure Annotation Network

DUFs

    PFAM DUF families solved by PSI centers

    The genome projects have unearthed an enormous diversity of novel genes of unknown function that require biological and biochemical characterization to assess their role in the organism(s) from which they were derived. These genes, like all others, can be grouped into families based on sequence similarity.

    duf_sizes.PNG

    Release 23.0 of the PFAM database contains over 2200 such families, referred to as Domains of Unknown Function (DUFs). In a coordinated effort, the four large-scale centers of the NIH Protein Structure Initiative (PSI) have determined the first three-dimensional structures for more than 250 of these DUF families. Analysis of the first 248, solved by October 2008, reveals that they vary significantly in size (with an average of 252 proteins) and in the contributions from sequenced genomes and from metagenomic data (see the chart duf_sizes.PNG).

    The analysis also shows that about two thirds of the DUF families likely represent very divergent branches of already known and well-characterized families, which allows us to propose hypotheses about their biological function. The remainder can be formally categorized as new folds or topologies, although about one third of these show significant sub-structure similarity to previously characterized folds. Homology to functionally annotated protein families remains an important clue for proposing hypotheses about the functions of DUF families, but it is usually not sufficient for a reliable functional annotation.

    The chart below shows the overall percentages of DUF families with new folds, new folds partially similar to previously known folds, putative analogs, putative homologs and recognizable homologs. The inset pie charts show the percentage of DUF families with a proposed hypothesis about function in each of these categories.

    homology.PNG

    From a more general perspective, our results suggest that, despite the enormous increase in the number and diversity of new genes being uncovered, the fold space of proteins encoded by those genes is gradually becoming saturated. These previously unexplored sectors of the protein universe are, therefore, primarily shaped by extreme diversification of known protein families, which enables organisms to evolve new functions and adapt to particular niches and habitats. Nevertheless, these DUF families still constitute the richest source for discovery of the remaining protein folds and topologies.

    Our structural analysis of the DUF families solved by PSI centers was recently published in PLoS Biology:

    http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1000205  

    A list of PFAM DUF families solved by PSI centers 

     
    DUFs: 248 (the first 10 are listed here)

    Pfam family    Solved by    Fold type
    PF01519        BSGC         Homolog
    PF01796        JCSG         Homolog
    PF01861        MCSG         Homolog
    PF01865        JCSG         Homolog
    PF01877        NYSGXRC      Putative Analog
    PF01883        JCSG         Putative Homolog
    PF01893        NESG         Putative Analog
    PF01904        JCSG         Putative Homolog
    PF01906        MCSG         Putative Analog
    PF01908        MCSG         Putative Analog
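
The category percentages discussed earlier are simple tallies over a list like this one. As a minimal sketch, the ten entries shown on this page can be counted by center and by fold type as follows. The names `ROWS` and `tally` are hypothetical and introduced only for illustration. These ten entries are just the first page of the 248 families, so the resulting shares are not the figures reported in the analysis.

```python
from collections import Counter

# (Pfam family, PSI center, fold type) copied from the ten entries on this page.
# ROWS and tally are illustrative names, not part of any TOPSAN or Pfam API.
ROWS = [
    ("PF01519", "BSGC", "Homolog"),
    ("PF01796", "JCSG", "Homolog"),
    ("PF01861", "MCSG", "Homolog"),
    ("PF01865", "JCSG", "Homolog"),
    ("PF01877", "NYSGXRC", "Putative Analog"),
    ("PF01883", "JCSG", "Putative Homolog"),
    ("PF01893", "NESG", "Putative Analog"),
    ("PF01904", "JCSG", "Putative Homolog"),
    ("PF01906", "MCSG", "Putative Analog"),
    ("PF01908", "MCSG", "Putative Analog"),
]


def tally(rows):
    """Count families per PSI center and per fold type."""
    by_center = Counter(center for _, center, _ in rows)
    by_fold = Counter(fold for _, _, fold in rows)
    return by_center, by_fold


by_center, by_fold = tally(ROWS)

# Print each fold type with its count and its share of the listed entries.
for fold, count in by_fold.most_common():
    share = 100 * count / len(ROWS)
    print(f"{fold}: {count} ({share:.0f}%)")
```

Running the same tally over the full list of 248 families, if it is available in this three-column form, would give the per-category shares for the whole set.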
    All content on this site is licensed under a Creative Commons Attribution 3.0 License