Cite as: Cold Spring Harb. Protoc.; 2007; doi:10.1101/pdb.top15

This Protocol
Right arrow Full Text
Right arrow Update/discuss this protocolDiscussion icon
Right arrow Alert me when this protocol is cited
Right arrow Alert me when comments are published
Right arrow Alert me if a correction is posted
Services
Right arrow Similar protocols in this database
Right arrow Alert me to new releases of protocols
Right arrow Save to Personal Folders
Right arrow Download to citation manager
Right arrow Printer-friendly versionPrinter-friendly version
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Mount, D. W.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Mount, D. W.
Related Collections
Right arrow Bioinformatics/Genomics, general
Right arrow Alignment of Sequences
Right arrow Alignment of Sequences, general
Right arrow Alignment of Pairs of Sequences
Right arrow Multiple Sequence Alignment
Right arrow Gene Prediction and Regulation
Right arrow Sequence Database Searching
Right arrow Computational Biology
Right arrow Genetic Variation
Right arrow Proteins and Proteomics, general
Right arrow Proteomics
Right arrow Evolution
Right arrow Genome Analysis
Right arrowRelated Articles
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?
BSN globe

topic_introductionTopic Introduction

Strategies for Sequence Similarity Database Searches

David W. Mount

Adapted from "Sequence Database Searching for Similar Sequences," Chapter 6, in Bioinformatics: Sequence and Genome Analysis, 2nd edition, by David W. Mount. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, USA, 2004.


INTRODUCTION

Database similarity search programs tend to produce large volumes of output. It can become difficult to screen this volume of material and to assess whether the more remotely related sequences are really related to the query sequence. Thus, it is important to limit the sequence output; there are some relatively simple procedures that may be followed for each program, as described in this article. For searches of protein databases, avoid repetitive alignments with the same sequence by limiting searches to the protein sequence databases that are well curated, such as SwissProt and PIR, or to a specific genome, as opposed to the entire set of translated GenBank sequences (the GenPept database).


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?

Related Articles

Recommended Steps for a FASTA Search
David W. Mount
CSH Protocols 2007: 40. [Abstract] [Full Text]

Steps Used by the BLAST Algorithm
David W. Mount
CSH Protocols 2007: 41. [Abstract] [Full Text]

Using a FASTA Sequence Database Similarity Search
David W. Mount
CSH Protocols 2007: 16. [Abstract] [Full Text]

Using the Basic Local Alignment Search Tool (BLAST)
David W. Mount
CSH Protocols 2007: 17. [Abstract] [Full Text]



This article has been cited by other articles:


Home page
CSH ProtocolsHome page
D. W. Mount
Using PAM Matrices in Sequence Alignments
CSH Protocols, June 1, 2008; 2008(7): pdb.top38 - pdb.top38.
[Abstract] [Full Text]