JUSTA

Jumbled Strings: Theory and Applications

 Coordinatore UNIVERSITA DEGLI STUDI DI SALERNO 

 Organization address address: VIA GIOVANNI PAOLO II 132
city: FISCIANO (SA)
postcode: 84084

contact info
Titolo: Prof.
Nome: Ugo
Cognome: Vaccaro
Email: send email
Telefono: +39 089 969734

 Nazionalità Coordinatore Italy [IT]
 Totale costo 244˙075 €
 EC contributo 244˙075 €
 Programma FP7-PEOPLE
Specific programme "People" implementing the Seventh Framework Programme of the European Community for research, technological development and demonstration activities (2007 to 2013)
 Code Call FP7-PEOPLE-2010-IEF
 Funding Scheme MC-IEF
 Anno di inizio 2011
 Periodo (anno-mese-giorno) 2011-08-03   -   2011-11-02

 Partecipanti

# participant  country  role  EC contrib. [€] 
1    UNIVERSITA DEGLI STUDI DI SALERNO

 Organization address address: VIA GIOVANNI PAOLO II 132
city: FISCIANO (SA)
postcode: 84084

contact info
Titolo: Prof.
Nome: Ugo
Cognome: Vaccaro
Email: send email
Telefono: +39 089 969734

IT (FISCIANO (SA)) coordinator 244˙075.00

Mappa


 Word cloud

Esplora la "nuvola delle parole (Word Cloud) per avere un'idea di massima del progetto.

jumbled    vector    algorithms    optimal    matching    languages    searching    string    strings    data    refer    pattern    practical    biological    algorithmic    parikh    formal    bioinformatics   

 Obiettivo del progetto (Objective)

'We propose to investigate a number of algorithmic problems on jumbled strings, where we refer to a string t as a jumbled version of string s if t's positions can be permuted such that it is transformed into s. In other words, the two strings have the same Parikh vector, where the Parikh vector counts the number of occurrences of each character. For example, the strings AAGACGT and AAACGGT both have Parikh vector (3,1,2,1). All strings with the same Parikh vector build an equivalence class, which we refer to as a 'jumbled string.' We want to develop algorithms and dedicated data structures for searching, storing, comparing, and identifying jumbled strings.

Jumbled strings have important applications in bioinformatics, above all in interpretation of mass spectrometry data; but they have also been applied to alignment, pattern discovery in biological strings, or SNP detection. Searching for a jumbled pattern in a text constitutes a special case of approximate string matching, and is thus of particular interest in the pattern matching field. Similar problems regarding unique reconstruction of strings have been investigated in the area of formal languages.

The project involves both theoretical and practical parts. Besides searching for asymptotically optimal procedures for different models of the source which generates the text, we will also test on real instances of biological and textual data. We are not only interested in theoretically optimal algorithms but focus on algorithms that work well in practice. Thus, we consider also heuristics and ad hoc methods to enhance the practical implementation of our methods.

The project will enable the fellow to greatly enhance her competencies in algorithms development and formal languages, while training in information theory and extremal combinatorics, benefitting from the expertise at the host institution. This will constitute a major step in her career towards a professorship in algorithmic bioinformatics.'

Altri progetti dello stesso programma (FP7-PEOPLE)

DIVA (2008)

Genetic DIversity of AVIdins for Novel Biotech Applications

Read More  

GENPREPAR (2009)

Genetic basis and functional consequences of pre- and post-hatching parental effects

Read More  

RONET (2008)

The Reactive Oxygen Signaling Network of Cells

Read More