Preparing your input sequence file

Format of the file contents

The sequences you wish to query must be saved a plain text (multi) fasta format. You can use any text editor.

Ideally the header lines (those starting with '>') will have the following format:

>customSeq~~~ID~~~ACC~~~RESISTANCES DESC

where,

  • customSeq is the database name, e.g. 12P or FAE
  • ID is the gene name
  • ACC is an accession number of the sequence source
  • RESISTANCES is the phenotype(s) to report
  • DESC can be any textual description

Important

The maximum length of >customSeq~~~ID~~~ACC~~~RESISTANCES DESC is 50 characters.

Use only letters, numbers and '_'.

The position of the '>', '~' and space are important.

Example

>12P~~~Lbr_pduC~~~LVIS_RS19425~~~RES pduC_Lactobacillus brevis ATCC 367

Format of the filename

Save the file as <customSeq>_sequences.fa , replacing <customSeq> with your chosen database name

Example

12P_sequences.fa or FAE_sequences.fa