在Python中停止ThreadPool中的进程

我一直在尝试为一个控制某些硬件的库编写一个交互式包装器(用于ipython)。有些调用对IO来说很繁重，所以并行执行任务是有意义的。使用ThreadPool(几乎)工作得很好:

from multiprocessing.pool import ThreadPool
class hardware():
    def __init__(IPaddress):
        connect_to_hardware(IPaddress)
    def some_long_task_to_hardware(wtime):
        wait(wtime)
        result = 'blah'
        return result
pool = ThreadPool(processes=4)
Threads=[]
h=[hardware(IP1),hardware(IP2),hardware(IP3),hardware(IP4)]
for tt in range(4):
    task=pool.apply_async(h[tt].some_long_task_to_hardware,(1000))
    threads.append(task)
alive = [True]*4
Try:
    while any(alive) :
        for tt in range(4): alive[tt] = not threads[tt].ready()
        do_other_stuff_for_a_bit()
except:
    #some command I cannot find that will stop the threads...
    raise
for tt in range(4): print(threads[tt].get())

如果用户想要停止进程或do_other_stuff_for_a_bit()中有IO错误，则会出现问题。按Ctrl+C将停止主进程，但工作线程将继续运行，直到当前任务完成。
有没有什么方法可以在不重写库或让用户退出python的情况下停止这些线程?我在其他例子中看到的pool.terminate()和pool.join()似乎不起作用。

实际的例程(而不是上面的简化版本)使用日志记录，尽管所有的工作线程在某个时候都关闭了，但我可以看到它们开始运行的进程一直持续到完成(作为硬件，我可以通过观察房间的另一端来看到它们的影响)。

这是在python 2.7中。

更新:

解决方案似乎是切换到使用多处理。进程而不是线程池。我尝试的测试代码是运行foo_pulse:

class foo(object):
    def foo_pulse(self,nPulse,name): #just one method of *many*
        print('starting pulse for '+name)
        result=[]
        for ii in range(nPulse):
            print('on for '+name)
            time.sleep(2)
            print('off for '+name)
            time.sleep(2)
            result.append(ii)
        return result,name

如果你尝试使用ThreadPool运行这个，那么ctrl-C不会停止foo_pulse的运行(即使它会立即杀死线程，打印语句继续出现:

from multiprocessing.pool import ThreadPool
import time
def test(nPulse):
    a=foo()
    pool=ThreadPool(processes=4)
    threads=[]
    for rn in range(4) :
        r=pool.apply_async(a.foo_pulse,(nPulse,'loop '+str(rn)))
        threads.append(r)
    alive=[True]*4
    try:
        while any(alive) : #wait until all threads complete
            for rn in range(4):
                alive[rn] = not threads[rn].ready() 
                time.sleep(1)
    except : #stop threads if user presses ctrl-c
        print('trying to stop threads')
        pool.terminate()
        print('stopped threads') # this line prints but output from foo_pulse carried on.
        raise
    else : 
        for t in threads : print(t.get())

然而，一个使用多处理的版本。进程按预期工作:

import multiprocessing as mp
import time
def test_pro(nPulse):
    pros=[]
    ans=[]
    a=foo()
    for rn in range(4) :
        q=mp.Queue()
        ans.append(q)
        r=mp.Process(target=wrapper,args=(a,"foo_pulse",q),kwargs={'args':(nPulse,'loop '+str(rn))})
        r.start()
        pros.append(r)
    try:
        for p in pros : p.join()
        print('all done')
    except : #stop threads if user stops findRes
        print('trying to stop threads')
        for p in pros : p.terminate()
        print('stopped threads')
    else : 
        print('output here')
        for q in ans :
            print(q.get())
    print('exit time')

我为库foo定义了一个包装器(这样它就不需要重写了)。如果不需要返回值，也不需要这个包装器:

def wrapper(a,target,q,args=(),kwargs={}):
    '''Used when return value is wanted'''
    q.put(getattr(a,target)(*args,**kwargs))

从文档中我没有看到池不能工作的理由(除了bug)。

这是并行性的一个非常有趣的用法。

但是，如果您使用的是multiprocessing，则目标是让多个进程并行运行，而不是一个进程运行多个线程。

考虑使用multiprocessing来实现它的一些更改:

你有这些函数将并行运行:

import time
import multiprocessing as mp

def some_long_task_from_library(wtime):
    time.sleep(wtime)

class MyException(Exception): pass
def do_other_stuff_for_a_bit():
    time.sleep(5)
    raise MyException("Something Happened...")

让我们创建并启动进程，例如4:

procs = []  # this is not a Pool, it is just a way to handle the
            # processes instead of calling them p1, p2, p3, p4...
for _ in range(4):
    p = mp.Process(target=some_long_task_from_library, args=(1000,))
    p.start()
    procs.append(p)
mp.active_children()   # this joins all the started processes, and runs them.

进程并行运行，可能在一个单独的cpu核心，但这是由操作系统决定的。你可以检查你的系统监视器。

在此期间，您运行了一个即将中断的进程，并且您希望停止正在运行的进程，而不是让它们孤立:

try:
    do_other_stuff_for_a_bit()
except MyException as exc:
    print(exc)
    print("Now stopping all processes...")
    for p in procs:
        p.terminate()
print("The rest of the process will continue")

当一个或所有子进程终止时，如果继续主进程没有意义，你应该处理主程序的退出。

希望对您有所帮助，您可以将其中的一些内容改编为您的库。

在回答为什么pool不工作的问题时，这是由于(如文档中引用的)然后main需要被子进程导入，并且由于这个项目的性质交互式python正在使用。

同时还不清楚为什么ThreadPool会这样——尽管线索就在名字里。ThreadPool使用multiprocessing创建它的工作进程池。dummy，如这里所述，它只是Threading模块的包装器。池使用multiprocessing.Process。这可以通过下面的测试看到:

p=ThreadPool(processes=3)
p._pool[0]
<DummyProcess(Thread23, started daemon 12345)> #no terminate() method
p=Pool(processes=3)
p._pool[0]
<Process(PoolWorker-1, started daemon)> #has handy terminate() method if needed

由于线程没有terminate方法，工作线程将继续运行，直到它们完成当前的任务。终止线程是混乱的(这就是为什么我试图使用多处理模块)，但解决方案在这里。

关于使用上述解决方案的一个警告:

def wrapper(a,target,q,args=(),kwargs={}):
    '''Used when return value is wanted'''
    q.put(getattr(a,target)(*args,**kwargs))

是对对象实例内部属性的更改不会传递回主程序。作为一个例子，上面的类foo也可以有这样的方法:def addIP (newIP):self.hardwareIP = newIP调用r=mp.Process(target=a.addIP,args=(127.0.0.1))不会更新a。

对于一个复杂的对象，唯一的解决方法似乎是使用自定义的manager来共享内存，它可以访问对象a的方法和属性。对于一个基于库的非常大的复杂对象，最好使用dir(foo)来填充管理器。如果我能想出办法，我会用一个例子来更新这个答案(为了我未来的自己，也为了其他人)。

如果出于某些原因使用线程更可取，我们可以使用这个

我们可以发送一些信号给我们想要终止的线程。最简单的信号是全局变量:

import time
from multiprocessing.pool import ThreadPool
_FINISH = False
def hang():
    while True:
        if _FINISH:
            break
        print 'hanging..'
        time.sleep(10)

def main():
    global _FINISH
    pool = ThreadPool(processes=1)
    pool.apply_async(hang)
    time.sleep(10)
    _FINISH = True
    pool.terminate()
    pool.join()
    print 'main process exiting..'

if __name__ == '__main__':
    main()

相关内容

最新更新

热门标签：