MapReduce任务失败Python



我似乎得到以下错误:

14/07/02 23:29:14 INFO mapreduce.Job: Task Id : attempt_1395688818137_1239_r_000001_2, Status : FAILED
Error: java.lang.RuntimeException: PipeMapRed.waitOutputThreads(): subprocess failed with code 1
        at org.apache.hadoop.streaming.PipeMapRed.waitOutputThreads(PipeMapRed.java:330)
        at org.apache.hadoop.streaming.PipeMapRed.mapRedFinished(PipeMapRed.java:543)
        at org.apache.hadoop.streaming.PipeReducer.close(PipeReducer.java:134)
        at org.apache.hadoop.io.IOUtils.cleanup(IOUtils.java:237)
        at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:487)
        at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411)
        at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:162)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
        at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:157)

我已经包括#!/usr/bin/python按照其他论坛。但我似乎仍然得到相同的结果。谁能帮我弄清楚其他问题是什么吗?

你试过包含#!/usr/bin/env python代替#!/usr/bin/python吗?这将允许操作系统将它们作为独立的可执行文件运行

最新更新