Improve performance on divisor finding function

Question

I have made a function that finds all divisors that runs decently fast but for what I am trying to do I want it to run faster. I have also already created a primeFactors function. What I am doing is finding all possible powers that can be made and multiplying them.

from functools import reduce
from itertools import product

def divisors(n):
    x = sorted(primeFactors(n))
    r = []
    for values in set(x):
        t = list(range(0, x.count(values) + 1))
        r.append(t)
    comb = list(product(*r))
    divisors = []
    for sets in comb:
        n = [a**b for a, b in zip(set(x), sets)]
        divisors.append(reduce(lambda x, y: x*y, n))
    return sorted(divisors)

Is there another approach that is faster or any improvements that can be made to this code to make it faster? For the sake of the problem lets assume my primeFactors is the fastest in the world.

kaya3 · Accepted Answer

There are quite a lot of ways to make small efficiency gains in your function:

There is no need to sort the list of prime factors. This doesn't have any effect on your algorithm.
You are converting the list of factors to a set many times, including within a loop; just do it once.
You are using .count to count each factor separately; the .count method loops over the list each time it's called. You can count occurrences of all factors in one pass using a Counter.
List comprehensions are usually a tiny bit faster than calling append in a loop.
The range objects and the product generator don't need to be converted to lists.
You are computing a**b many times for each b in the range for the factor a; you could take the product over a list of a**b instead of just the range for b.
Using reduce with operator.mul is faster than with a lambda.
The .sort method should be a little bit faster than sorted.

Some of these improvements involve removing or changing enough of the code that the other improvements are moot. Here's the resulting code:

from functools import reduce
from itertools import product
from collections import Counter
from operator import mul

def divisors2(n):
    factors = primeFactors(n)
    r = [
        [ a ** b for b in range(t + 1) ]
        for a, t in Counter(factors).items()
    ]
    d = [ reduce(mul, sets) for sets in product(*r) ]
    d.sort()
    return d

Overall this gives a modest to large improvement in running time, depending on the number of divisors:

For 1234567891 which is a prime number, the time saved is about 1%.
For 1234567890 == 2 * 3 * 3 * 5 * 3607 * 3803 the time saved is about 10%.
For 3333960000 == 2**6 * 3**5 * 5**4 * 7**3 the time saved is about 75%.

To do a bit better, we can save multiplying the same numbers over and over in the reduce stage by redesigning the implementation to use a recursive function instead of itertools.product: this allows us to do fewer multiplications overall.

def divisors3(n):
    factors = primeFactors(n)
    factor_counts = list(Counter(factors).items())

    def helper(x, i):
        if i < 0:
            d.append(x)
        else:
            a, t = factor_counts[i]
            for b in range(t + 1):
                helper(x, i - 1)
                x *= a

    d = []
    helper(1, len(factor_counts) - 1)
    d.sort()
    return d

This improves the time saved from 75% to 80% for the number with many prime factors. The difference for the other test cases is too small to matter.

I compared all three functions using the following fairly basic primeFactors implementation (from this post):

def primeFactors(n):
    i = 2
    factors = []
    while i * i <= n:
        if n % i:
            i += 1
        else:
            n //= i
            factors.append(i)
    if n > 1:
        factors.append(n)
    return factors

Improve performance on divisor finding function

Answers (2)

Related Questions