• English
    • français
    • Deutsch
    • español
    • português (Brasil)
    • Bahasa Indonesia
    • русский
    • العربية
    • 中文
  • English 
    • English
    • français
    • Deutsch
    • español
    • português (Brasil)
    • Bahasa Indonesia
    • русский
    • العربية
    • 中文
  • Login
View Item 
  •   Home
  • OAI Data Pool
  • OAI Harvested Content
  • View Item
  •   Home
  • OAI Data Pool
  • OAI Harvested Content
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Browse

All of the LibraryCommunitiesPublication DateTitlesSubjectsAuthorsThis CollectionPublication DateTitlesSubjectsAuthorsProfilesView

My Account

Login

The Library

AboutNew SubmissionSubmission GuideSearch GuideRepository PolicyContact

Statistics

Most Popular ItemsStatistics by CountryMost Popular Authors

Fast Output-Sensitive Pattern Discovery in Massive Sequences using the Motif Trie

  • CSV
  • RefMan
  • EndNote
  • BibTex
  • RefWorks
Author(s)
Grossi, Roberto
Menconi, Giulia
Pisanti, Nadia
Trani, Roberto
Vind, Søren
Contributor(s)
Equipe de recherche européenne en algorithmique et biologie formelle et expérimentale (ERABLE) ; Inria Grenoble - Rhône-Alpes ; Institut National de Recherche en Informatique et en Automatique (Inria) - Institut National de Recherche en Informatique et en Automatique (Inria)
Dipartimento di Informatica [Pisa] (DI) ; University of Pisa [Pisa]
Technical University of Denmark [Lyngby] (DTU)
Keywords
[INFO.INFO-DS] Computer Science [cs]/Data Structures and Algorithms [cs.DS]
[INFO.INFO-BI] Computer Science [cs]/Bioinformatics [q-bio.QM]

Full record
Show full item record
URI
http://hdl.handle.net/20.500.12424/1227328
Online Access
https://hal.inria.fr/hal-01525745
https://hal.inria.fr/hal-01525745/document
https://hal.inria.fr/hal-01525745/file/Auth-TCS17.pdf
Abstract
International audience
We introduce the motif trie data structure, which has applications in pattern matching and discovery in genomic analysis, plagiarism detection, data mining, intrusion detection, spam fighting and time series analysis, to name a few. Here the extraction of recurring patterns in sequential and textual data is one of the main computational bottlenecks. For this, we address the problem of extracting maximal patterns with at most k don't care symbols and at least q occurrences, according to a maximality notion we define. We apply the motif trie to this problem, also showing how to build it efficiently. As a result, we give the first algorithm that attains a stronger notion of output-sensitivity, where the cost for an input sequence of n symbols is proportional to the actual number of occurrences of each pattern, which is at most n (much smaller in practice). This avoids the best-known cost of O(nc)O(nc) per pattern, for constant c>1c>1, which is otherwise impractical for massive sequences with large n.
Date
2017
Type
info:eu-repo/semantics/article
Identifier
oai:HAL:hal-01525745v1
hal-01525745
https://hal.inria.fr/hal-01525745
https://hal.inria.fr/hal-01525745/document
https://hal.inria.fr/hal-01525745/file/Auth-TCS17.pdf
DOI : 10.1016/j.tcs.2017.04.012
DOI
: 10.1016/j.tcs.2017.04.012
ae974a485f413a2113503eed53cd6c53
: 10.1016/j.tcs.2017.04.012
Scopus Count
Collections
OAI Harvested Content

entitlement

 
DSpace software (copyright © 2002 - 2022)  DuraSpace
Quick Guide | Contact Us
Open Repository is a service operated by 
Atmire NV
 

Export search results

The export option will allow you to export the current search results of the entered query to a file. Different formats are available for download. To export the items, click on the button corresponding with the preferred download format.

By default, clicking on the export buttons will result in a download of the allowed maximum amount of items.

To select a subset of the search results, click "Selective Export" button and make a selection of the items you want to export. The amount of items that can be exported at once is similarly restricted as the full export.

After making a selection, click one of the export format buttons. The amount of items that will be exported is indicated in the bubble next to export format.