将 ML 9 与 Hadoop2-2.2.3 连接器集成时不是可用的网络地址错误



遵循此 ML 文档,我正在使用文档中存在的配置运行示例 marklogic-hello-world.xml。我的本地主机名是 ubuntu.localdomain 。当我在我的配置文件中给出相同的内容时,它会抛出这样的错误

    18/01/04 22:39:54 INFO mapred.MapTask: (EQUATOR) 0 kvi 26214396(104857584)
18/01/04 22:39:54 INFO mapred.MapTask: mapreduce.task.io.sort.mb: 100
18/01/04 22:39:54 INFO mapred.MapTask: soft limit at 83886080
18/01/04 22:39:54 INFO mapred.MapTask: bufstart = 0; bufvoid = 104857600
18/01/04 22:39:54 INFO mapred.MapTask: kvstart = 26214396; length = 6553600
18/01/04 22:39:54 INFO mapred.MapTask: Map output collector class = org.apache.hadoop.mapred.MapTask$MapOutputBuffer
18/01/04 22:40:05 INFO mapred.MapTask: Starting flush of map output
18/01/04 22:40:05 INFO mapred.LocalJobRunner: map task executor complete.
18/01/04 22:40:05 WARN mapred.LocalJobRunner: job_local196795803_0001
java.lang.Exception: java.lang.IllegalArgumentException: Default provider - Not a usable net address: ubuntu.localdomain:8000
    at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462)
    at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:522)
Caused by: java.lang.IllegalArgumentException: Default provider - Not a usable net address: ubuntu.localdomain:8000
    at com.marklogic.xcc.ContentSourceFactory.defaultConnectionProvider(ContentSourceFactory.java:453)
    at com.marklogic.xcc.ContentSourceFactory.newContentSource(ContentSourceFactory.java:264)
    at com.marklogic.xcc.ContentSourceFactory.newContentSource(ContentSourceFactory.java:321)
    at com.marklogic.mapreduce.utilities.InternalUtilities.getInputContentSource(InternalUtilities.java:127)
    at com.marklogic.mapreduce.MarkLogicRecordReader.init(MarkLogicRecordReader.java:348)
    at com.marklogic.mapreduce.MarkLogicRecordReader.initialize(MarkLogicRecordReader.java:247)
    at org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.initialize(MapTask.java:548)
    at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:786)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:341)
    at org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:243)
    at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:514)
    at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
    at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1167)
    at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:641)
    at java.base/java.lang.Thread.run(Thread.java:844)
18/01/04 22:40:05 INFO mapreduce.Job: Job job_local196795803_0001 failed with state FAILED due to: NA
18/01/04 22:40:05 INFO mapreduce.Job: Counters: 0

我的配置文件是这样的

<configuration>
    <property>
        <name>mapreduce.marklogic.input.username</name>
        <value>admin</value>
    </property>
    <property>
        <name>mapreduce.marklogic.input.password</name>
        <value>admin</value>
    </property>
    <property>
        <name>mapreduce.marklogic.input.host</name>
        <value>ubuntu.localdomain</value>
    </property>
    <property>
        <name>mapreduce.marklogic.input.port</name>
        <value>8000</value>
    </property>
    <property>
        <name>mapreduce.marklogic.input.mode</name>
        <value>basic</value>
    </property>
    <property>
        <name>mapreduce.marklogic.input.valueclass</name>
        <value>com.marklogic.mapreduce.DatabaseDocument</value>
    </property>
    <property>
        <name>mapreduce.marklogic.output.username</name>
        <value>admin</value>
    </property>
    <property>
        <name>mapreduce.marklogic.output.password</name>
        <value>admin</value>
    </property>
    <property>
        <name>mapreduce.marklogic.output.host</name>
        <value>ubuntu.localdomain</value>
    </property>
    <property>
        <name>mapreduce.marklogic.output.port</name>
        <value>8000</value>
    </property>
    <property>
        <name>mapreduce.marklogic.output.content.type</name>
        <value>TEXT</value>
    </property>
</configuration>

我尝试过为这个mapreduce.marklogic.input.host起各种名称,我尝试过127.0.0.1 localhost但默认情况下它需要ubuntu.localdomain

我不知道为什么它采用默认文件而不是采用我在配置.xml文件中指定的文件(即127.0.0.1等(。

我使用以下命令来运行这个

hadoop jar 
  $CONNECTOR_HOME/lib/marklogic-mapreduce-examples-version.jar 
  com.marklogic.mapreduce.examples.HelloWorld -libjars $LIBJARS 
  -conf marklogic-hello-world.xml

如文件中所述。

我该如何克服这个问题?任何帮助不胜感激..

谢谢

通过将 Marklogic 配置页面中的本地主机名从 ubuntu.localdomain 更改为 localhost 解决了该问题,然后上述配置运行良好。但是仍然无法找到为什么它不从配置文件中挑选hostname而不是转到ML。

相关内容

最新更新