pySpark作业在纱线上失败



我正在尝试从yarnclient提交pyspark作业。在没有任何进一步日志的情况下从RM获得以下错误。

org.apache.hadop.ipc.RemoteException(org.apache.haop.ipc.StandbyException(:在待机状态下不支持操作类别READ ENOENT:否该文件或目录位于org.apache.hoop.io.nativeio.nativeio$POSIX.chmodImpl(Native Method(在org.apache.hoop.io.nativeio.nativeio$POSIX.chmod(nativeio.java:231(在org.apache.hadop.fs.RawLocalFileSystem.setPermission(RawLocalFileSystem.java:773(在org.apache.hoop.fs.DelegateToFileSystem.setPermission(DelegateToFileSystem.java:218(网址:org.apache.hoop.fs.FilterFs.setPermission(FilterFs.java:266(org.apache.hoop.fs.FileContext$11.next(FileContext.java:1008(org.apache.hoop.fs.FileContext$11.next(FileContext.java:1004(org.apache.hadop.fs.FSLinkResolver.resolve(FSLinkResolver.java:90(org.apache.hoop.fs.FileContext.setPermission(FileContext.java:1011(网址:org.apache.hadop.yarn.util.FSDownload$3.run(FSDownload.java:483(网址:org.apache.hadop.yarn.util.FSDownload$3.run(FSDownload.java:481(位于java.security.AccessController.doPrivileged(本机方法(javax.security.auth.Subject.doAs(Subject.java:422(org.apache.hoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1875(在org.apache.hadop.yarn.util.FSDownload.changePermissions(FSDownload.java:481(网址:org.apache.hadop.yarn.util.FSDownload.call(FSDownload.java:419(org.apache.hoop.syar.server.nodemanager.containermanager.localizer.ContainerLocalizer$FSDownloadWrapper.doDownloadCall(ContainerLocalizer.java:242(在org.apache.hadop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer$FSDownloadWrapper.call(ContainerLocalizer.java:235(在org.apache.hadop.yarn.server.nodemanager.containermanager.localizer.ContainerLocalizer$FSDownloadWrapper.call(ContainerLocalizer.java:223(位于java.util.concurrent.FFutureTask.run(FutureTask.java:266(java.util.concurrent.Executors$RunnableAdapter.call(Executitors.java:511(位于java.util.concurrent.FFutureTask.run(FutureTask.java:266(java.util.concurrent.ThreadPoolExecutiator.runWorker(ThreadPoolExecutiator.java:1149(在java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624(在java.lang.Thread.run(Thread.java:748(要获得更详细的输出,检查应用程序跟踪页面:https://.com:8090/cluster/app/application_1638972290118_64750然后单击每次尝试的日志链接。失败应用

集群运行良好,其他pyspark作业运行良好。请帮助

提前感谢

你说的">集群运行良好,其他pyspark作业运行良好";?你是在Yarn上运行它们,还是只是在独立模式下运行?

然而,我认为最好先检查一下你的纱线簇,看看它是否有效(没有火花(
您可以使用hadoop MapR来完成此操作示例:

yarn jar $HadoopDir/share/hadoop/mapreduce/hadoop-mapreduce-examples-$version.jar wordcount inputFilePath OutputDir

同时检查连杆1和连杆2。它们可能会有所帮助。

相关内容

  • 没有找到相关文章

最新更新