Skip to content

'run_RF.py' operation problem  #55

@ChaoXianSen

Description

@ChaoXianSen

Dear @joacjo

Question: Which part of the source code can I change, to fit my contigs file and Run the 'run_RF.py ' to get the final result ??

R63.contigs.fa.gz header format : >k141_84347 flag=1 multi=9.0000 len=118435
CCATAAATCTGATTTTAGTCAAAAAAATATGCAGTTTTTCAAAAAGGGTGTATAATTCTTTCGTTAC

'vae_clusters.tsv' format : ‘vae_1 k141_84347
vae_2 k141_92682
vae_2 k141_551576
vae_2 k141_358295’

run mode: python run_RF.py
R63.contigs.fa.gz
vae_clusters.tsv
annotations
resultdir

err report :
Traceback (most recent call last):
File "/public/home/bioinfo_wang/00_software/phamb-v.1.0.1/phamb/run_RF.py", line 223, in
reference = run_RF_modules.Reference.from_clusters(clusters = clusters, fastadict=fastadict, minimum_contig_len=2000)
File "/public/home/bioinfo_wang/00_software/miniconda3/envs/phamb/lib/python3.9/site-packages/phamb/run_RF_modules.py", line 272, in from_clust
genomes = cls._parse_clusters(clusters,fastadict,minimum_contig_len=minimum_contig_len)
File "/public/home/bioinfo_wang/00_software/miniconda3/envs/phamb/lib/python3.9/site-packages/phamb/run_RF_modules.py", line 286, in _parse_clu
contig_len = fastadict[contig].len()
KeyError: 'k141_84347'

Thanks !

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions