This is a topic that has come up a few times and deserves its own issue. Related: - #1014 mentions an algorithm requiring bit-level sampling for performance; some discussion of how to achieve this - #1031 is a WIP implementation plus some discussion