Makeblastdb parameters in blast+

Source: Internet
Author: User
Tags create index

All relevant blast operations used in future work are done with blast+.

With the previous blast, we are still starting from the format database to the match

Generally we have a Fasta file used to format the database, the previous command is FORMATDB, and now it is MAKEBLASTDB

The general format used is as follows:

Makeblastdb-in input_file-dbtype molecule_type-title database_title-parse_seqids-out database_name-logfile File_ Name

-in followed by input file, you want to format the Fasta sequence

-dbtype followed by sequence type, Nucl as nucleic acid, prot as protein

-title give the database a name, good-looking ~ ~ (can not be used in the subsequent search-db parameters)

-parse_seqids recommendation Plus, now for what reason is not clear

-out after the database name, you start a meaningful name, and later blast+ search to use the parameters of-db

-logfile log file, if there is no default output to the screen

And the previous formatdb difference is quite big, hehe

Using the Makeblastdb connection parameter-help will print out the information:

Makeblastdb-help
USAGE
Makeblastdb [-h] [-help] [-in input_file] [-dbtype Molecule_type]
[-title Database_title] [-parse_seqids] [-hash_index]
[-mask_data Mask_data_files] [-out database_name]
[-MAX_FILE_SZ Number_of_bytes] [-taxid Taxid] [-taxid_map Taxidmapfile]
[-logfile file_name] [-version]

DESCRIPTION
Application to create BLAST databases, version 2.2.23+

OPTIONAL ARGUMENTS
-H
Print USAGE and DESCRIPTION; Ignore other arguments
-help
Print USAGE, DESCRIPTION and ARGUMENTS DESCRIPTION; Ignore other arguments
-version
Print version number; Ignore other arguments

Input Options
-in <File_In>
Input file/database name; The data type is automatically detected, it may
Be any of the following:
FASTA file (s) and/or
BLAST Database (s)
Default = '-'
-dbtype <string, ' nucl ', ' prot ' >
Molecule type of input
Default = ' Prot '

Configuration Options
-title <String>
Title for BLAST Database
Default = input file name provided to-in argument
-parse_seqids
Parse seq-ids in FASTA input
-hash_index
Create index of sequence hash values.

Sequence Masking Options
-mask_data <String>
comma-separated List of input files containing masking data as produced by
NCBI masking applications (e.g. Dustmasker, Segmasker, Windowmasker)

Output Options
-out <String>
Name of BLAST database to be created
Default = input file name provided to-in argumentrequired if multiple
File (s)/database (s) is provided as input
-MAX_FILE_SZ <String>
Maximum file size for BLAST database files
Default = ' 1GB '

Taxonomy options
-taxid <integer, >=0>
Taxonomy ID to assign to all sequences
* Incompatible With:taxid_map
-taxid_map <File_In>
Text file mapping sequence IDs to taxonomy IDs.
Format:<sequenceid> <TaxonomyId><newline>
* Incompatible WITH:TAXID
-logfile <File_Out>
File to which the program log should is redirected

Makeblastdb parameters in blast+

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.