Skip to content

Multithreading #54

@RagnarGrootKoerkamp

Description

@RagnarGrootKoerkamp

I'm just trying a quick benchmark to get an idea of the speed of ish vs sassy.

Running against a 3Gbp human genome, I can't get the average CPU usage to go above 300%, even though I have 6 physical cores. (Larger or smaller batch sizes also don't work.) Interestingly the number of active threads seems to go up and down between 1 and 4.

Maybe this is something that can be improved?


Anyway, I'm getting around 60s wall-time or 200s CPU time to search a 23bp long pattern across a human genome. Does that sound reasonable?

Sassy does the same (with up to 4 errors) in 2s wall time or 8s CPU time, so I guess the bitpacking and edit-distance-only really make quite a big difference.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions