Preparing your input sequence file
Format of the file contents
The sequences you wish to query must be saved a plain text (multi) fasta format. You can use any text editor.
Ideally the header lines (those starting with '>') will have the following format:
>customSeq~~~ID~~~ACC~~~RESISTANCES DESC
where,
customSeq
is the database name, e.g. 12P or FAEID
is the gene nameACC
is an accession number of the sequence sourceRESISTANCES
is the phenotype(s) to reportDESC
can be any textual description
Important
The maximum length of >customSeq~~~ID~~~ACC~~~RESISTANCES DESC
is 50 characters.
Use only letters, numbers and '_'.
The position of the '>', '~' and space are important.
Example
>12P~~~Lbr_pduC~~~LVIS_RS19425~~~RES pduC_Lactobacillus brevis ATCC 367
Format of the filename
Save the file as <customSeq>_sequences.fa
, replacing <customSeq>
with your chosen database name
Example
12P_sequences.fa
or FAE_sequences.fa