在hadoop上解析xml文件



如何在Hadoop上解析具有如下结构的xml文件:

<row Id="2292" PostTypeId="2" ParentId="2284" CreationDate="2008-08-05T13:28:06.700" Score="0" ViewCount="0" Body="&lt;p&gt;The first thing you should do is contact the main people who run the open source project. Ask them if it is ok to contribute to the code and go from there.&lt;/p&gt;&#xD;&#xA;&#xD;&#xA;&lt;p&gt;Simply writing your improved code and then giving it to them may result in your code being rejected.&lt;/p&gt;" OwnerUserId="383" LastActivityDate="2008-08-05T13:28:06.700" />

注意:我已经为它编写了代码,但它无法正常工作。需要一种新的方法...

提前感谢...

看看XMLInputFormat,它可能需要稍微修改一下。

相关内容

  • 没有找到相关文章

最新更新