[2024-01-25 20:15:35,475] [INFO] DFAST_QC pipeline started. [2024-01-25 20:15:35,477] [INFO] DFAST_QC version: 0.5.7 [2024-01-25 20:15:35,477] [INFO] DQC Reference Directory: /var/lib/cwl/stg26b628dd-a605-4596-8c3d-561ff4e39201/dqc_reference [2024-01-25 20:15:36,646] [INFO] ===== Start taxonomy check using ANI ===== [2024-01-25 20:15:36,647] [INFO] Task started: Prodigal [2024-01-25 20:15:36,647] [INFO] Running command: gunzip -c /var/lib/cwl/stgcd342971-cb6e-4d86-932c-d95e54a1b109/GCF_014649455.1_ASM1464945v1_genomic.fna.gz | prodigal -d GCF_014649455.1_ASM1464945v1_genomic.fna/cds.fna -a GCF_014649455.1_ASM1464945v1_genomic.fna/protein.faa -g 11 -q > /dev/null [2024-01-25 20:16:00,163] [INFO] Task succeeded: Prodigal [2024-01-25 20:16:00,163] [INFO] Task started: HMMsearch [2024-01-25 20:16:00,163] [INFO] Running command: hmmsearch --tblout GCF_014649455.1_ASM1464945v1_genomic.fna/hmmer_result.tsv -E 1E-50 /var/lib/cwl/stg26b628dd-a605-4596-8c3d-561ff4e39201/dqc_reference/reference_markers.hmm GCF_014649455.1_ASM1464945v1_genomic.fna/protein.faa > /dev/null [2024-01-25 20:16:00,507] [INFO] Task succeeded: HMMsearch [2024-01-25 20:16:00,508] [INFO] Found 6/6 markers. [2024-01-25 20:16:00,584] [INFO] Query marker FASTA was written to GCF_014649455.1_ASM1464945v1_genomic.fna/markers.fasta [2024-01-25 20:16:00,584] [INFO] Task started: Blastn [2024-01-25 20:16:00,584] [INFO] Running command: blastn -query GCF_014649455.1_ASM1464945v1_genomic.fna/markers.fasta -db /var/lib/cwl/stg26b628dd-a605-4596-8c3d-561ff4e39201/dqc_reference/reference_markers.fasta -out GCF_014649455.1_ASM1464945v1_genomic.fna/blast.markers.tsv -outfmt 6 -max_hsps 1 -num_alignments 5 [2024-01-25 20:16:01,875] [INFO] Task succeeded: Blastn [2024-01-25 20:16:01,878] [INFO] Selected 14 target genomes. [2024-01-25 20:16:01,878] [INFO] Target genome list was writen to GCF_014649455.1_ASM1464945v1_genomic.fna/target_genomes.txt [2024-01-25 20:16:01,882] [INFO] Task started: fastANI [2024-01-25 20:16:01,882] [INFO] Running command: fastANI --query /var/lib/cwl/stgcd342971-cb6e-4d86-932c-d95e54a1b109/GCF_014649455.1_ASM1464945v1_genomic.fna.gz --refList GCF_014649455.1_ASM1464945v1_genomic.fna/target_genomes.txt --output GCF_014649455.1_ASM1464945v1_genomic.fna/fastani_result.tsv --threads 1 [2024-01-25 20:16:34,395] [INFO] Task succeeded: fastANI [2024-01-25 20:16:34,396] [INFO] Loading species specific ANI threshold from /var/lib/cwl/stg26b628dd-a605-4596-8c3d-561ff4e39201/dqc_reference/prokaryote_ANI_species_specific_threshold.txt [2024-01-25 20:16:34,396] [WARNING] Species-specific ANI threshold file not found. Will use the default threshold for all species. [/var/lib/cwl/stg26b628dd-a605-4596-8c3d-561ff4e39201/dqc_reference/prokaryote_ANI_species_specific_threshold.txt] [2024-01-25 20:16:34,405] [INFO] Found 14 fastANI hits (2 hits with ANI > threshold) [2024-01-25 20:16:34,405] [INFO] The taxonomy check result is classified as 'conclusive'. [2024-01-25 20:16:34,406] [INFO] DFAST Taxonomy check final result -------------------------------------------------------------------------------- organism_name strain accession taxid species_taxid relation_to_type validated ani matched_fragments total_fragments ani_threshold status Streptomyces coeruleorubidus strain=JCM 4359 GCA_014649455.1 116188 116188 suspected-type True 100.0 3066 3071 95 conclusive Streptomyces coeruleorubidus strain=ATCC 13740 GCA_008705135.1 116188 116188 suspected-type True 99.9965 3070 3071 95 conclusive Streptomyces swartbergensis strain=HMC13 GCA_002148965.1 487165 487165 type True 94.9931 2107 3071 95 below_threshold Streptomyces azureus strain=ATCC 14921 GCA_001270025.1 146537 146537 type True 94.9099 2289 3071 95 below_threshold Streptomyces caelestis strain=JCM 4566 GCA_014650295.1 36816 36816 type True 93.925 2325 3071 95 below_threshold Streptomyces caelestis strain=DSM 40084 GCA_014205255.1 36816 36816 type True 93.9173 2335 3071 95 below_threshold Streptomyces africanus strain=NRRL B-24243 GCA_002150735.1 231024 231024 type True 93.3768 2137 3071 95 below_threshold Streptomyces lomondensis strain=DSM 41428 GCA_021440105.1 68229 68229 type True 91.7336 2303 3071 95 below_threshold Streptomyces purpurascens strain=DSM 40310 GCA_021390235.1 1924 1924 type True 91.5156 2381 3071 95 below_threshold Streptomyces cahuitamycinicus strain=13K301 GCA_002891435.1 2070367 2070367 type True 90.2344 1824 3071 95 below_threshold Streptomyces tuirus strain=JCM 4255 GCA_014701095.1 68278 68278 type True 90.2335 2078 3071 95 below_threshold Streptomyces indiaensis strain=DSM 43803 GCA_021474405.1 284033 284033 type True 89.9425 1934 3071 95 below_threshold Streptomyces harenosi strain=PRKS01-65 GCA_011008945.1 2697029 2697029 type True 85.5136 1546 3071 95 below_threshold Lysobacter spongiae strain=119BY6-57 GCA_014145325.1 2025720 2025720 type True 74.9433 171 3071 95 below_threshold -------------------------------------------------------------------------------- [2024-01-25 20:16:34,407] [INFO] DFAST Taxonomy check result was written to GCF_014649455.1_ASM1464945v1_genomic.fna/tc_result.tsv [2024-01-25 20:16:34,407] [INFO] ===== Taxonomy check completed ===== [2024-01-25 20:16:34,407] [INFO] ===== Start completeness check using CheckM ===== [2024-01-25 20:16:34,408] [INFO] Setting CHECKM_DATA_PATH to /var/lib/cwl/stg26b628dd-a605-4596-8c3d-561ff4e39201/dqc_reference/checkm_data [2024-01-25 20:16:34,409] [INFO] Selected 'Prokaryote' markers (life, taxid=0) for CheckM [2024-01-25 20:16:34,497] [INFO] Task started: CheckM [2024-01-25 20:16:34,498] [INFO] Running command: checkm taxonomy_wf --tab_table -f GCF_014649455.1_ASM1464945v1_genomic.fna/cc_result.tsv -t 1 life "Prokaryote" GCF_014649455.1_ASM1464945v1_genomic.fna/checkm_input GCF_014649455.1_ASM1464945v1_genomic.fna/checkm_result [2024-01-25 20:18:47,520] [INFO] Task succeeded: CheckM [2024-01-25 20:18:47,521] [INFO] Completeness check finished. -------------------------------------------------------------------------------- Completeness: 100.00% Contamintation: 1.04% Strain heterogeneity: 0.00% -------------------------------------------------------------------------------- [2024-01-25 20:18:47,541] [INFO] ===== Completeness check finished ===== [2024-01-25 20:18:47,541] [INFO] ===== Start GTDB Search ===== [2024-01-25 20:18:47,542] [INFO] Query marker FASTA already exists. Will reuse it. (GCF_014649455.1_ASM1464945v1_genomic.fna/markers.fasta) [2024-01-25 20:18:47,542] [INFO] Task started: Blastn [2024-01-25 20:18:47,542] [INFO] Running command: blastn -query GCF_014649455.1_ASM1464945v1_genomic.fna/markers.fasta -db /var/lib/cwl/stg26b628dd-a605-4596-8c3d-561ff4e39201/dqc_reference/reference_markers_gtdb.fasta -out GCF_014649455.1_ASM1464945v1_genomic.fna/blast.markers.gtdb.tsv -outfmt 6 -max_hsps 1 -num_alignments 5 [2024-01-25 20:18:49,664] [INFO] Task succeeded: Blastn [2024-01-25 20:18:49,666] [INFO] Selected 13 target genomes. [2024-01-25 20:18:49,667] [INFO] Target genome list was writen to GCF_014649455.1_ASM1464945v1_genomic.fna/target_genomes_gtdb.txt [2024-01-25 20:18:49,682] [INFO] Task started: fastANI [2024-01-25 20:18:49,683] [INFO] Running command: fastANI --query /var/lib/cwl/stgcd342971-cb6e-4d86-932c-d95e54a1b109/GCF_014649455.1_ASM1464945v1_genomic.fna.gz --refList GCF_014649455.1_ASM1464945v1_genomic.fna/target_genomes_gtdb.txt --output GCF_014649455.1_ASM1464945v1_genomic.fna/fastani_result_gtdb.tsv --threads 1 [2024-01-25 20:19:19,531] [INFO] Task succeeded: fastANI [2024-01-25 20:19:19,540] [INFO] Found 13 fastANI hits (2 hits with ANI > circumscription radius) [2024-01-25 20:19:19,541] [INFO] GTDB search result -------------------------------------------------------------------------------- accession gtdb_species ani matched_fragments total_fragments gtdb_taxonomy ani_circumscription_radius mean_intra_species_ani min_intra_species_ani mean_intra_species_af min_intra_species_af num_clustered_genomes status GCF_008705135.1 s__Streptomyces coeruleorubidus 99.9965 3070 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 97.68 95.03 0.91 0.83 4 inconclusive GCF_002148965.1 s__Streptomyces swartbergensis 95.0298 2103 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 N/A N/A N/A N/A 1 inconclusive GCA_000415505.1 s__Streptomyces afghaniensis 93.9828 2284 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 N/A N/A N/A N/A 1 - GCF_014205255.1 s__Streptomyces caelestis 93.9255 2334 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 100.00 100.00 1.00 1.00 2 - GCF_900236475.1 s__Streptomyces chartreusis_D 93.632 2408 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 99.99 99.98 1.00 1.00 4 - GCF_002150735.1 s__Streptomyces africanus 93.3778 2137 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 N/A N/A N/A N/A 1 - GCF_000158955.1 s__Streptomyces viridochromogenes_B 91.002 2233 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 N/A N/A N/A N/A 1 - GCF_014648815.1 s__Streptomyces flaveolus 85.9189 1758 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 N/A N/A N/A N/A 1 - GCF_004364215.1 s__Streptomyces sp004364215 85.805 2093 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 N/A N/A N/A N/A 1 - GCF_000717595.1 s__Streptomyces flavochromogenes 81.3331 1442 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.5951 N/A N/A N/A N/A 1 - GCF_014203555.1 s__Streptomyces olivoverticillatus 81.2533 1073 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 N/A N/A N/A N/A 1 - GCF_009176265.1 s__Streptomyces angustmyceticus 80.9055 1320 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 99.96 99.96 1.00 1.00 2 - GCF_017349075.1 s__Streptomyces triculaminicus_A 80.7912 1230 3071 d__Bacteria;p__Actinobacteriota;c__Actinomycetia;o__Streptomycetales;f__Streptomycetaceae;g__Streptomyces 95.0 99.73 99.73 0.98 0.98 3 - -------------------------------------------------------------------------------- [2024-01-25 20:19:19,544] [INFO] GTDB search result was written to GCF_014649455.1_ASM1464945v1_genomic.fna/result_gtdb.tsv [2024-01-25 20:19:19,545] [INFO] ===== GTDB Search completed ===== [2024-01-25 20:19:19,550] [INFO] DFAST_QC result json was written to GCF_014649455.1_ASM1464945v1_genomic.fna/dqc_result.json [2024-01-25 20:19:19,550] [INFO] DFAST_QC completed! [2024-01-25 20:19:19,550] [INFO] Total running time: 0h3m44s