Python 多处理星图与apply_async，哪个更快?

假设我有以下两种方法来完成相同的任务：

from multiprocessing import Pool
pool = Pool(4)
def func(*args):
# do some slow operations
return something
dates = ['2011-01-01', ' 2011-01-02', ... , '2017-01-01']
other_args = [1, 2, 3, 'c', 'test', 'pdf')]
# approach 1:
res = [pool.apply_async(func, [day] + other_args) for day in dates]
list_of_results = [x.get() for x in res]
# approach 2: create an iterable of iterables
args = [[day] + other_args for day in dates]
list_of_results = pool.starmap(func, args)

我意识到apply_async立即返回，但是，如果 func 尚未完成运行，x.get() 可能仍会阻塞主线程......这两种方法之间是否必然存在性能差异？

> 在引擎盖下，starmap几乎做了你在第一种方法中所做的。它只是一个方便的包装纸。提供map系列函数是为了符合许多开发人员习惯的函数式编程范式。

它们提供了一些不错的功能，例如将可迭代对象拆分为块以最小化 IPC。性能优势可能来自此优化，但这取决于每个元素的计算成本。

我建议坚持使用更具可读性的内容，并且只有在性能真正令人担忧的情况下，才对结果进行基准测试和评估。

相关内容

最新更新

热门标签：