SPSS

Searching Protein Structure Space

 Coordinatore UNIVERSITY OF HAIFA 

 Organization address address: "Mount Carmel, Abba Khoushi Blvd."
city: HAIFA
postcode: 31905

contact info
Titolo: Ms.
Nome: Suzan
Cognome: Aminpour
Email: send email
Telefono: 97248240549
Fax: -8287067

 Nazionalità Coordinatore Israel [IL]
 Totale costo 100˙000 €
 EC contributo 100˙000 €
 Programma FP7-PEOPLE
Specific programme "People" implementing the Seventh Framework Programme of the European Community for research, technological development and demonstration activities (2007 to 2013)
 Code Call FP7-PEOPLE-2007-4-3-IRG
 Funding Scheme MC-IRG
 Anno di inizio 2008
 Periodo (anno-mese-giorno) 2008-05-01   -   2012-11-13

 Partecipanti

# participant  country  role  EC contrib. [€] 
1    UNIVERSITY OF HAIFA

 Organization address address: "Mount Carmel, Abba Khoushi Blvd."
city: HAIFA
postcode: 31905

contact info
Titolo: Ms.
Nome: Suzan
Cognome: Aminpour
Email: send email
Telefono: 97248240549
Fax: -8287067

IL (HAIFA) coordinator 0.00

Mappa


 Word cloud

Esplora la "nuvola delle parole (Word Cloud) per avere un'idea di massima del progetto.

protein    structural    query    neighbors    function    structure    diversity    interface    spss    designed    web    tool    computational    inverted    fragbag    strings    members    sg    refine    spatial    proteins    database    searches    represented    validated    biology    innovative    bank    data    search    retrieval    space    pdb    functional    size    structures    similarity    tools    fast    filter    index    accuracy   

 Obiettivo del progetto (Objective)

'Finding the structural neighbors of a protein is an essential task in computational structural Biology. In cases where there is no detectable sequence similarity, the identification of structural neighbors offers a powerful approach to predicting structure and function. State-of-the-art protein structural searches find structural neighbors by comparing a protein to all proteins in their database. As this is an expensive computation that depends directly on the size of the database, such searches consider only a representative subset of the Protein Data Bank (PDB). The PDB is growing dramatically to include many structures of uncharacterized function solved by high-throughput methods developed by the Structural Genomics (SG) initiative. Characterizing these structures, and addressing questions raised by the improved coverage of structure space, mandates better structure search tools. I propose to develop a search tool for protein structure space that is analogous to web search tools such as Google. The system is designed to be fast and interactive, to cover the whole data set, and to have a clear and simple interface so that it can serve as a navigation interface to structure space. For this, I propose adapting a data structure called an inverted index, which is used for fast retrieval in web search. Protein structures are described as strings of letter strings based on a structural alphabet, and placed in an inverted index. Then, given a query structure and its string description, one can quickly retrieve a short list of candidate structural neighbors. Similar to web search, I suggest using query expansion to improve the retrieval performance. The proposal also includes an application of structural search for comparing the structural novelty of contributions from different SG centers. This project will enable me to establish a new Computational Biology research team at my host institution: it preeminently fulfills the goals of the Work Program.'

Introduzione (Teaser)

Facilities such as the Protein Data Bank store vast amounts of molecular data. An effective large-scale protein search system will sift through the information for structural solutions for the biotech industry.

Descrizione progetto (Article)

The EU-funded 'Searching protein structure space' (SPSS) project has fulfilled this need using an innovative 'filter-and-refine' paradigm on large datasets. For fast and accurate retrieval of data on structural similarity, the filter methods need to be indexable. Structural alignment heuristics were used to identify the most promising candidates in the refine stage.

Project members successfully designed and validated the novel FragBag filter method where proteins are represented in terms of their backbone fragments.

Receiver operating characteristic curve analysis validated the FragBag method for accuracy. Comparisons with other filter methods such as SGM, STRUCTAL and PRIDE demonstrated the superiority of FragBag in terms of speed and accuracy. Moreover, this method covers structure prediction as a structure is represented as a collection of sub-structures.

Fixed-size vectors enable 3D spatial representation of structure. The SPSS scientists revealed previously undescribed fundamental properties by mapping the proteins to study spatial distribution as well as structural and functional diversity. A key finding was that functional diversity varies considerably across structure space.

SPSS members have delivered an innovative tool to study and describe protein structures in spatial dimensions. In the future, researchers can study protein evolution andstructure-function relationship using this novel and unique tool.

Altri progetti dello stesso programma (FP7-PEOPLE)

ENHANCE (2009)

"European Research Training Network of “New Materials: Innovative Concepts for their Fabrication, Integration and Characterization”"

Read More  

ROS-AUXIN (2013)

ROS AND AUXIN CROSSTALK DURING PLANT DEVELOPMENT AND STRESS ADAPTATION

Read More  

BIGCOW (2009)

BIoGeochemistry in a high CO2 World (BIGCOW): lessons from the Ocean Anoxic Events

Read More