Pattern Matching Pointers

      Combinatorial Pattern Matching addresses issues of searching and matching strings and more complicated patterns such as trees, regular expressions, graphs, point sets, and arrays. The goal is to derive non-trivial combinatorial properties for such structures and then to exploit these properties in order to achieve improved performance for the corresponding computational problem.

      A steady flow of high-quality research on this subject has changed a sparse set of isolated results into a full-fledged area of algorithmics with important applications. This area is expected to grow even further due to the increasing demand for speed and efficiency that comes from molecular biology, but also from areas such as information retrieval, pattern recognition, compiling, data compression, program analysis and security.

      Disclaimer: The purpose of this page is to serve as an index to information relevant to Pattern Matching/Computational Biologist researchers. We prefer to point to information rather than store it locally. We include all submissions that seem appropriate. However, inclusion should not be interpreted as an endorsement of a contribution's accuracy or importance.

    Contents

    • People page moved (not maintained anymore)
    • CPM Presentation Archive
    • Conference announcements
    • Resources: on-line bibliographies, journals, proceedings, software, newgroups.

    Conferences

    SIGACT News Theory Calendar
    SIAM Meetings and Conferences
    Past events (1995-)
    CPM conferences

      2008

    • Jan 4-8: PSB'08 (Big Island of Hawaii)
    • Jan 14-17: APCB'08 (Kyoto, Japan)
    • Jan 20-22: SODA'08 (San Francisco, CA)
    • Mar 27-29: DCC'08 (Snowbird, UT)
    • Mar 30- Apr 2: RECOMB'08 (Singapore)
    • Apr 24-26: SDM'08 (Atlanta, GA)
    • Jun 18-20: CPM'08 (Pisa, Italy)
    • Jul 27-29: COCOON'08 (Dalian, China)
    • Aug 25-29: CBS'08 (Stanford, CA)
    • Jul 19-23: ISMB'08 (Toronto, Canada)
    • Aug 24-27: KDD'08 (Las Vegas, NV)
    • Sep 15-19: WABI 2008 (Karlsruhe, Germany)
    • Sep 15-17: CIBCB'08 (Sun Valley, Idaho)
    • Sep 15-17: ECCB'08 (Cagliari, Sardina Italy)
    • Oct 8-10: BIBE 2008 (Athens, Greece)

    Resources

      Bibliographies
      Databases & Logic Programming Bibliography maintained by Michael Ley
      A Collection of Computer Science Bibliographies (Canadian Mirror) more than one million of CS BibTeX references
      Sequence Alignment, Analysis and Comparison: a bibliography 1,760 entries by William H. E. Day
      MathSciNet AMS bibliography on-line
      Networked Computer Science Technical Reports Library (NCSTRL)
      Computer Science Research Paper Search Engine Yahoo-style search engine for CS papers
      ResearchIndex over 7 millions pages and 5 millions citations
      ACM Computing Research Repository (CoRR)
      IEEE bibliographies on-line
      Benno's Computer Science Bio-Digest a regular survey of some CS journals for articles related to the field of Computational Biology
      A bibliography on stringology by T. Lecroq
      The Hypertext Bibliography Project by David M. Jones
      A bibliography on computational gene recognition by Wentian Li
      Groups and link collections
      Theoretical Computer Science On The Web maintained by Suresh Venkat
      Pattern Matching & Data Mining Research Group Homepage maintained by Mika Klemettinen and Jaak Vilo
      Human Genome Most Used Links hosted at LANL
      Complexity Related Links maintained by Bruce Edmonds
      ACM SIGACT Special Interest Group on Algorithms and Computation Theory
      Theoretical Computer Science Genealogy
      TCS: virtual address book lists URLs of 1105 theoretical computer scientists around the world
      A compendium of NP optimization problems maintained by P. Crescenzi and V. Kann
      Knowledge Discovery Nuggets Directory: Data Mining and Knowledge Discovery Information
      The Bioinformatics Resources
      The Rewriting Page maintained by Nachum Dershowitz and Laurent Vigneron
      Amos' WWW links page maintained by Amos Bairoch
      Books
      Pattern Matching Algoritms Alberto Apostolico, Zvi Galil, eds.
      Bioinformatics: The Machine Learning Approach Pierre Baldi, Søren Brunak
      Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins Andreas Baxevanis, B.F.Francis Ouellette (Eds.)
      Computational Molecular Biology: An Introduction Peter Clote, Rolf Backofen
      Text Algorithms Maxime Crochemore, Wojciech Rytter
      Jewels of Stringology Maxime Crochemore, Wojciech Rytter
      Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids Richard Durbin (Editor), S. Eddy, A. Krogh, G. Mitchison
      Information Retrieval Data Structures & Algorithms Bill Frakes, Ricardo Baeza-Yates (eds.)
      Developing Bioinformatics Computer Skills Cynthia Gibas, Per Jambeck
      Handbook of Algorithms and Data Structures Gaston H. Gonnet, Ricardo Baeza-Yates
      Combinatorics on Words M. Lothaire
      Algebraic Combinatorics on Words M. Lothaire
      Applied Combinatorics on Words M. Lothaire
      Algorithms on Strings, Trees and Sequences Dan Gusfield
      Rewriting Solving Proving Claude Kirchner, Helene Kirchner
      Bioinformatics: Sequence and Genome Analysis David Mount
      Introduction to Computational Molecular Biology João Meidanis & João Carlos Setubal
      Pattern Discovery in Bioinformatics: Theory & Algorithms Laxmi Parida
      Computational Molecular Biology: An Algorithmic Approach Pavel A. Pevzner
      Flexible Pattern Matching in Strings Gonzalo Navarro and Mathieu Raffinot
      Computational Methods in Molecular Biology Steven Salzberg, David Searls, Simon Kasif (eds.)
      String Searching Algorithms Graham A. Stephen
      Pattern Discovery in Biomolecular Data Jason Wang, Bruce A. Shapiro, Dennis Shasha (eds.)
      Introduction To Computational Biology Michael Waterman
      Journals
      Theory Calendar - Journal Calls
      ACM: Digital Library LOTS of papers on line
      ACM Transactions on Algorithms (TALG) papers on line
      IEEE OPeRA (Online Periodicals and Research Area) LOTS of papers on line
      IEEE conference proceedings on-line
      Algorithmica papers on line
      BMC Bioinformatics papers on line
      Bioinformatics (formerly CABIOS) papers on line
      Chicago Journal of Theoretical Computer Science papers on line
      Combinatorics, Probability and Computing papers on line
      Complexity papers on line
      Data Mining and Knowledge Discovery abstracts on line
      Discrete Applied Mathematics abstracts on line
      Discrete Mathematics and Theoretical Computer Science papers on line
      Electronic Journal of Combinatorics papers on line
      European Journal of Combinatorics abstracts on line
      Genome Research papers on line
      Genome Biology papers on line
      Journal of ACM papers on line
      Journal of Algorithms papers on line
      Journal of Bioinformatics and Computational Biology NEW
      Journal of Computational Biology recent papers online
      Journal of Complexity abstracts on line
      Journal of Molecular Biology papers on line
      Journal of Experimental Algorithmics papers on line
      Journal of Universal Computer Science papers on line
      Journal of Graph Algorithms and Applications papers on line
      IEEE Transactions on Information Theory
      IEEE Transactions on Knowledge and Data Engineering
      Information and Computation (or here) abstract on line
      Informatica abstract on line
      Information Processing Letters papers on line
      International Journal of Bioinformatics Research and Applications papers on line
      International Journal of Data Mining and Bioinformatics papers on line
      Lecture Notes in Computer Science papers on line
      Nucleic Acid Research papers on line
      Nordic Journal on Computing abstracts on line
      SIAM Journal on Computing papers on line
      SIGACT News Online papers on line
      Theoretical Computer Science papers on line
      IEEE/ACM Transactions on Computational Biology and Bioinformatics papers on line
      The Computer Journal papers on line
      Proceedings
      Bio-Informatics and Biomedical Engineering (BIBE) papers on line (2001)
      Bio-Informatics and Biomedical Engineering (BIBE) papers on line (2003)
      Bio-Informatics and Biomedical Engineering (BIBE) papers on line (2004)
      BIOKDD papers on line (2001)
      BIOKDD papers on line (2002)
      IEEE Computer Society Bioinformatics Conference (CSB)papers on line (2003)
      IEEE Computer Society Bioinformatics Conference (CSB) papers on line (2002)
      Combinatorial Pattern Matching (CPM) papers on line (2007)
      Combinatorial Pattern Matching (CPM) papers on line (2005)
      Combinatorial Pattern Matching (CPM) papers on line (2004)
      Combinatorial Pattern Matching (CPM) papers on line (2003)
      Combinatorial Pattern Matching (CPM) papers on line (2002)
      Combinatorial Pattern Matching (CPM) papers on line (2001)
      Combinatorial Pattern Matching (CPM) papers on line (2000)
      Combinatorial Pattern Matching (CPM) papers on line (1999)
      Compression and Complexity of Sequences (SEQUENCES) papers on line (1997)
      Computational System Bioinformatics (CSB) papers on line (2003-)
      Data Compression Conference (DCC) papers on line (1991-)
      European Conference on Computational Biology (ECCB) papers on line (2002)
      European Conference on Computational Biology (ECCB) papers on line (2003)
      European Conference on Computational Biology (ECCB) papers on line (2004)
      Foundations of Computer Science (FOCS) papers on line (1996-)
      Genome Informatics papers on line (1996-)
      Intelligent Systems for Molecular Biology (ISMB) papers on line (2004)
      Intelligent Systems for Molecular Biology (ISMB) papers on line (2003)
      Intelligent Systems for Molecular Biology (ISMB) papers on line (2002)
      Intelligent Systems for Molecular Biology (ISMB) papers on line (2001)
      Intelligent systems for Molecular Biology (ISMB) papers on line (2000)
      Pacific Symposium on Biocomputing (PSB) papers on line (1996-)
      Principles of Programming Languages (POPL) papers on line (1987-)
      Research in Computational Molecular Biology (RECOMB) papers on line (1997-2004)
      Research in Computational Molecular Biology (RECOMB) papers on line (2008)
      Research in Computational Molecular Biology (RECOMB) papers on line (2005)
      Research in Computational Molecular Biology (RECOMB) papers on line (2006)
      String Processing and Information Retrieval Symposium (SPIRE) papers on line (1998)
      String Processing and Information Retrieval Symposium (SPIRE) papers on line (1999)
      String Processing and Information Retrieval Symposium (SPIRE) papers on line (2000)
      String Processing and Information Retrieval Symposium (SPIRE) papers on line (2001)
      String Processing and Information Retrieval Symposium (SPIRE) papers on line (2004)
      Symposium on Discrete Algorithms (SODA) papers on line (1997-)
      Symposium on Theory of Computing (STOC) papers on line (1985-)
      Algorithms in Bioinformatics: 5th International Workshop (WABI) papers on line (2005)
      Algorithms in Bioinformatics: 4th International Workshop (WABI) papers on line (2004)
      Algorithms in Bioinformatics: 3rd International Workshop (WABI) papers on line (2003)
      Software
      Exact string matching algorithms in Java by Christian Charras - Thierry Lecroq
      Information Retrieval Projects
      PERL the "information extraction" language
      The LEDA Library Library of Efficient Datatypes and Algorithms
      The Stony Brook Algorithm Repository
      GLIMPSE and AGREP A tool to search entire file systems
      HARVEST Information Discovery and Access System
      AGREP for DOS, OS/2 and Windows 95/NT (not yet GLIMPSE)
      GAIS A General-Purposed Information Discovery System
      Handbook of Algorithms and Data Structures Source code
      Information Retrieval Data Structures & Algorithms Source code
      FIRE & SPARE C++ implementations of algorithms for finite automata, transducers, pattern matching of keywords and regular expressions
      {Codepage} listing of programming resources on the web
      Algorithms Archive maintained by Florian Schintke
      Illustrated Pattern Matching in Java
      SAM Sequence Alignment and Modeling System
      Isearch software for indexing and searching text documents
      LPM and PAISLEI IDE Laleh's pattern matcher system for Win32 systems
      Icon Programming Language homepage
      SNOBOL4 and SPITBOL Programming Languages
      strmat package exact string matching and suffix trees
      XPARAL: Graphical Computation of Parameterized Alignments
      Graph Template Library GPL
      string matching algorithm animation Smaa
      nrgrep fast and flexible online pattern matching
      Algorithm Archive by Scott Gasch (work in progress)
      grappe pattern matching with wildcards
      Boyer-Moore applet by Dieter Bühler
      Aho-Corasick applet by Dieter Bühler
      Naive, KMP, and BM pattern matching algorithms applet by Masayuki Takeda
      Multi : Efficient Multi-Suffix-Tree String Dictionary by Stefan Edelkamp
      Compact Affix Trees applet by Moritz G. Maaß
      Suffix Trees applet by Moritz G. Maaß
      Suffix Trees applet by Craig Nevill-Manning
      Tree Structures for Algorithmic Problems on Strings suffix trees, suffix arrays software
      ELAN a language where (equational) pattern matching plays a fundamental role
      Rewriting Calculus resources a calculus based on pattern matching
      A Library for Computational Biology Programs
      A C++ regular expressions library
      Sary: a suffix array library and tools
      Sufary: a suffix array library
      Boost Library Template C++ Library: contains Graph Library and RegEx++ Library
      CCP - a compact and fast algorithm for Regular Expression Search by J.-M. Champarnaud, F. Coulon, T. Paranthoen
      ANSI C implementation of a Suffix Tree by Dotan Tsadok, Shlomo Yona, Offer Kaye
      Suffix Arrays by M. Douglas McIlroy
      C Library to search over compressed texts by Paolo Ferragina, Alessandro Tommasi, Giovanni Manzini
      Linear-time and Lightweight Suffix Arrays construction by Juha Kärkkäinen
      PERL module String:Approx by Jarkko Hietaniemi
      TRE: regexp matching library by Ville Laurikari
      Regular expression (regex/grep) string matching by Nic Roets
      Yet another suffix tree implementation
      MREPS tandem repeats
      YASS local aligment
      STR Library a fast C++ string class with RegExp
      Pizza&Chili Corpus Compressed Indexes and their Testbeds
      Tom pattern matching compiler for Java or C
      News Groups and Mailing Lists
      Theory-Edge theoretical computer science on Egroups
      comp.compression data compression
      comp.compression.research data compression research
      comp.theory.info-retrieval theory of information retrieval
      comp.theory theoretical computer science
      comp.text text processing
      comp.infosystems.harvest information systems harvest

    If you have questions, complains, errata, or you would like to be included in this page, please send a message to stelo@cs.ucr.edu