Skip to content

N0.tsv content changes over multiple runs for example_data

When using the example_data/, several runs with the same protein input files, will show differences in the N0.tsv file orthogroup content. For instance with Psat5g008960.1:

grep "Psat5g008960.1" run1/N0.tsv run2/N0.tsv

run1/N0.tsv:N0.HOG0000033	OG0000013	n4	Lcu.2RBY.4g047180.1, Lcu.2RBY.3g073270.1	MtrunA17Chr2g0283901.1, MtrunA17Chr2g0283741.1, MtrunA17Chr2g0319991.1, MtrunA17Chr2g0283661.1, MtrunA17Chr3g0144281.1	Psat7g070080.1, Psat5g250600.1, Psat7g166640.1, Psat5g008960.1	Vradi11g00000202.1
run2/N0.tsv:N0.HOG0000032	OG0000013	n1	Lcu.2RBY.3g073270.1	MtrunA17Chr3g0144281.1	Psat5g008960.1	Vradi11g00000202.1

All the Species[0-9].fa in the WorkingDirectory are the same in both runs, as well as the old Orthogroups.tsv file, suggesting that the construction of the N0.tsv file is non-deterministic.

This could be exacerbated by the low number of proteins in the example fasta files.

Edited by Baptiste Imbert