XMLTEXTREADER忽略了dtdProcessing标志



我正在尝试使用XMLTEXTREADER加载XML内容,但是由于某种原因,XMLTEXTREADER在处理XML时忽略了dtdprocessing Flag。如果我使用XMLReader,则DTDProcessing标志正常工作。XMLReader的问题是,它会自动归一化 r n to n,我在输出字符串中不需要。

这是我的代码段:

XmlDocument xmlDocument = new XmlDocument();
string contents = @"<?xml version='1.0' encoding='ISO-8859-1' standalone='yes'?>
    <!DOCTYPE content [<!ENTITY ouml '&#246;'>]>
    <content>Test &ouml; Test
    Test</content>";
byte[] byteArray = Encoding.UTF8.GetBytes(contents);
MemoryStream stream = new MemoryStream(byteArray);
//XmlReaderSettings settings = new XmlReaderSettings();
//settings.DtdProcessing = DtdProcessing.Parse;
//settings.IgnoreWhitespace = false;
//XmlReader reader = XmlReader.Create(stream, settings);
//xmlDocument.Load(reader);
XmlTextReader reader = new XmlTextReader(stream);
reader.DtdProcessing = DtdProcessing.Parse;
xmlDocument.Load(reader);
Console.WriteLine(xmlDocument.OuterXml);

输出我从上面的处理中获得:

"<?xml version="1.0" encoding="ISO-8859-1" standalone="yes"?><!DOCTYPE content[<!ENTITY ouml '&#246;'>]><content>Test &ouml; Testrnrn    Test</content>"

相反,我想要使用DTD处理的输出字符串:

"<?xml version="1.0" encoding="ISO-8859-1" standalone="yes"?><!DOCTYPE content[<!ENTITY ouml '&#246;'>]><content>Test ö Testrnrn    Test</content>"

代码看起来像这样

using System;
using System.Collections.Generic;
using System.Linq;
using System.Text;
using System.Data;
using System.Xml;
using System.Xml.Linq;
using System.IO;

namespace ConsoleApplication31
{
    class Program
    {
        const string FILENAME = @"c:temptest.xml";
        static void Main(string[] args)
        {
            XmlDocument xmlDocument = new XmlDocument();
            try
            {
                string contents = @"<?xml version='1.0' encoding='ISO-8859-1' standalone='yes'?>
                <!DOCTYPE content [<!ENTITY ouml '&#246;'>]>
                <content>Test &ouml; Test
                Test</content>";
                MemoryStream stream = new MemoryStream();
                XmlTextWriter writer = new XmlTextWriter(stream, Encoding.GetEncoding("ISO-8859-1"));
                writer.WriteString(contents);
                writer.Flush();
                byte[] bytes = new byte[stream.Length];
                stream.Position = 0;
                stream.Read(bytes, 0, (int)stream.Length);
                Console.WriteLine(Encoding.GetEncoding("ISO-8859-1").GetString(bytes));
            }
            catch (Exception e)
            {
                Console.WriteLine(e.Message);
            }

        }
    }
}

最新更新