Boa tarde a todos, Teremos no dia 5 de Novembro, segunda-feira, pelas 13:00, na sala 0.20, Pavilhão de Informática II, IST Alameda, a apresentação e discussão da tese de mestrado da Ana Sofia Correia, "Fast mapping and querying over large scale typing data". Abstract: High-Throughput DNA Sequencing (HTS) methods gave rise to a paradigm shift in microbial typing and genomic population structure studies. The ability to partially sequence the genomes of hundreds to thousands of strains created the need for effective ways to represent relationships between strains. Single Nucleotide Polymorphism (SNP) analysis and whole or core genome MultiLocus Sequence Typing (wgMLST or cgMLST), result in profiles that have thousands of loci which can be used for outbreak investigation, epidemiological surveillance of clones of interest and bacterial population or evolutionary studies. The first step to define these profiles is to map reads obtained through genome sequencing, identify relevant genes, and query existing typing databases to find if the strain being analyzed has been identified already, or if it is a new strain. Given the size of existing typing databases, the data volume resulting from HTS, and the urgency of these analyses, namely when in presence of outbreaks, the inherent computational problem of mapping and querying typing data has become a big challenge. To solve this issue, this work intend to demonstrate and proof a new approach that relies on Linear Codes, specifically on Reed Muller codes. Saudações, Alexandre Francisco