Using NullWritable as the OutputKeyClass, but the output is not what I expected

I only want to output the values, so I use NullWritable as the OutputKeyClass, like this:

protected void reduce(Text key, Iterable<Text> values,
        Reducer<Text, Text, NullWritable, Text>.Context context)
        throws IOException, InterruptedException {
    for (Text value : values) {
        context.write(NullWritable.get(), value);
    }
}

I configured the job like this:

job.setNumReduceTasks(1);
job.setOutputKeyClass(NullWritable.class);
job.setOutputValueClass(Text.class);
job.setMapOutputKeyClass(Text.class);
job.setMapOutputValueClass(Text.class);
FileInputFormat.addInputPath(job, new Path(baseInPath));
FileSystem.get(conf).delete(new Path(baseOutPath), true);
FileOutputFormat.setOutputPath(job, new Path(baseOutPath));
System.exit(job.waitForCompletion(true) ? 0 : 1);

But when I check the output path, I get this:

�LZO

`@��五、��/�!�Z0|res|1*"|33260580217607|2|1|0.2|23|2016-03-28 13:57:420|支付6-03-28 13:57:42

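The string starting with 0|res|1... is the value, but there is some garbage in front of it. I think it is the NullWritable pointer. How can I get rid of this garbage? Is my code correct?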

Judging by the output, LZO compression appears to be enabled. Check your mapred-site.xml and see whether a compression property is set like this:

<property>
  <name>mapred.map.output.compression.codec</name>
  <value>com.hadoop.compression.lzo.LzoCodec</value>
</property>

For more details: link
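
One way to confirm this diagnosis is to look at the first bytes of the output file. The "�LZO" at the start of the output matches the beginning of the lzop file signature (byte 0x89 followed by the ASCII letters "LZO"). NullWritable itself serializes to zero bytes, so it would not produce a prefix like this. The following is a minimal sketch, not part of the original answer; it assumes the part file has been copied to the local filesystem first, and the class and method names are made up for illustration. It only checks the four-byte prefix, so treat it as a heuristic.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical helper (not a Hadoop class): checks for the lzop signature prefix.
public class LzopHeaderCheck {
    // First four bytes of the lzop file signature: 0x89 followed by "LZO".
    private static final byte[] MAGIC_PREFIX = {(byte) 0x89, 'L', 'Z', 'O'};

    /** Returns true if the given bytes start with the lzop signature prefix. */
    public static boolean hasLzopMagic(byte[] head) {
        if (head == null || head.length < MAGIC_PREFIX.length) {
            return false;
        }
        for (int i = 0; i < MAGIC_PREFIX.length; i++) {
            if (head[i] != MAGIC_PREFIX[i]) {
                return false;
            }
        }
        return true;
    }

    /** Reads the first bytes of a local file and applies the check above. */
    public static boolean fileHasLzopMagic(Path file) throws IOException {
        byte[] head = new byte[MAGIC_PREFIX.length];
        int filled = 0;
        try (InputStream in = Files.newInputStream(file)) {
            // read() may return fewer bytes than requested, so loop until full or EOF.
            while (filled < head.length) {
                int n = in.read(head, filled, head.length - filled);
                if (n < 0) {
                    break;
                }
                filled += n;
            }
        }
        return filled == head.length && hasLzopMagic(head);
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java LzopHeaderCheck <local-copy-of-part-file>");
            return;
        }
        boolean compressed = fileHasLzopMagic(Paths.get(args[0]));
        System.out.println(compressed
                ? "File starts with the lzop signature; it is probably LZO-compressed."
                : "No lzop signature found at the start of the file.");
    }
}
```

If the check reports a signature, the reducer code is fine and the file is simply compressed. Note that the property shown above governs intermediate map output; compression of the final job output is controlled by mapreduce.output.fileoutputformat.compress and mapreduce.output.fileoutputformat.compress.codec (mapred.output.compress and mapred.output.compression.codec in older releases). To turn it off for this job only, calling FileOutputFormat.setCompressOutput(job, false) in the driver before submitting the job should be enough, assuming the cluster configuration does not mark the setting as final.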
