JUSTA

Jumbled Strings: Theory and Applications

 Coordinator UNIVERSITA DEGLI STUDI DI SALERNO 

 Organization address: VIA GIOVANNI PAOLO II 132
city: FISCIANO (SA)
postcode: 84084

Contact info
Title: Prof.
First name: Ugo
Last name: Vaccaro
Phone: +39 089 969734

 Coordinator nationality Italy [IT]
 Total cost 244,075 €
 EC contribution 244,075 €
 Programme FP7-PEOPLE
Specific programme "People" implementing the Seventh Framework Programme of the European Community for research, technological development and demonstration activities (2007 to 2013)
 Call code FP7-PEOPLE-2010-IEF
 Funding scheme MC-IEF
 Start year 2011
 Period (year-month-day) 2011-08-03 - 2011-11-02

 Participants

#  participant  country  role  EC contrib. [€]
1  UNIVERSITA DEGLI STUDI DI SALERNO  IT (FISCIANO (SA))  coordinator  244,075.00



 Project objective

We propose to investigate a number of algorithmic problems on jumbled strings. We refer to a string t as a jumbled version of a string s if the characters of t can be permuted to obtain s. In other words, the two strings have the same Parikh vector, which counts the number of occurrences of each character of the alphabet. For example, over the ordered alphabet (A, C, G, T), the strings AAGACGT and AAACGGT both have Parikh vector (3,1,2,1). All strings with the same Parikh vector form an equivalence class, which we refer to as a 'jumbled string'. We want to develop algorithms and dedicated data structures for searching, storing, comparing, and identifying jumbled strings.
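The definition above can be made concrete with a minimal Python sketch. The function names `parikh_vector` and `is_jumbled_version`, and the default alphabet order (A, C, G, T), are assumptions made for illustration; they are not taken from the project.

```python
from collections import Counter


def parikh_vector(s, alphabet="ACGT"):
    """Return the Parikh vector of s: one count per alphabet character.

    The alphabet order (A, C, G, T) is an assumption for this sketch.
    """
    counts = Counter(s)
    unknown = set(counts) - set(alphabet)
    if unknown:
        raise ValueError(f"characters outside the alphabet: {sorted(unknown)}")
    return tuple(counts[c] for c in alphabet)


def is_jumbled_version(s, t, alphabet="ACGT"):
    """True if t is a permutation of s, i.e. both have the same Parikh vector."""
    return parikh_vector(s, alphabet) == parikh_vector(t, alphabet)


print(parikh_vector("AAGACGT"))                  # (3, 1, 2, 1)
print(is_jumbled_version("AAGACGT", "AAACGGT"))  # True
```

Both example strings from the text map to (3,1,2,1), so they belong to the same equivalence class.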

Jumbled strings have important applications in bioinformatics, most notably in the interpretation of mass spectrometry data, but they have also been applied to alignment, pattern discovery in biological strings, and SNP detection. Searching for a jumbled pattern in a text is a special case of approximate string matching, and is thus of particular interest in the pattern matching field. Similar problems regarding unique reconstruction of strings have been investigated in the area of formal languages.
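To illustrate what searching for a jumbled pattern in a text means, here is a simple sliding-window baseline in Python. It is a sketch only, not the algorithms the project proposes to develop; the function name `jumbled_occurrences` is hypothetical. It reports every start position whose window of length len(pattern) has the same character counts as the pattern.

```python
from collections import Counter


def jumbled_occurrences(text, pattern):
    """Return start indices of windows of text that are permutations of pattern.

    Baseline sketch: slides a fixed-size window and updates its character
    counts incrementally, comparing against the pattern's counts each step.
    """
    m = len(pattern)
    if m == 0 or m > len(text):
        return []
    target = Counter(pattern)
    window = Counter(text[:m])
    hits = [0] if window == target else []
    for i in range(m, len(text)):
        window[text[i]] += 1          # character entering the window
        leaving = text[i - m]         # character leaving the window
        window[leaving] -= 1
        if window[leaving] == 0:
            del window[leaving]       # keep the counter free of zero entries
        if window == target:
            hits.append(i - m + 1)
    return hits


print(jumbled_occurrences("AAGACGT", "GAC"))  # [2, 3]
```

Each window comparison costs time proportional to the number of distinct characters, so this baseline is linear in the text length for a fixed alphabet. It needs no preprocessing, but it rescans the text for every query.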

The project involves both theoretical and practical parts. Besides searching for asymptotically optimal procedures for different models of the source that generates the text, we will also test on real instances of biological and textual data. We are interested not only in theoretically optimal algorithms but also in algorithms that work well in practice. Thus, we also consider heuristics and ad hoc methods to enhance the practical implementation of our methods.

The project will enable the fellow to greatly enhance her competencies in algorithm development and formal languages, while training in information theory and extremal combinatorics and benefiting from the expertise at the host institution. This will constitute a major step in her career towards a professorship in algorithmic bioinformatics.
