How do I parse/deserialize text-serialized HTML in C#? Is there a utility or class available in .NET?



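I am getting a serialized HTML response from a REST API (the content type in the response header is "text/html"). How should I parse/deserialize it and store it as a valid HTML page?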

Here is a sample:

"html lang="en-US">rnt<head>rntt<title>page1</title>rntt<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />rntt<meta name="created" content="2020-04-23T01:26:00.0000000" />rnt</head>rnt<body data-absolute-enabled="true" style="font-family:Calibri;font-size:11pt">rntt<div style="position:absolute;left:48px;top:115px;width:614px">rnttt<p style="margin-top:0pt;margin-bottom:0pt">This is test page</p>rntt</div>rntt<div style="position:absolute;left:327px;top:102px;width:624px">rnttt<table style="border:1px solid;border-collapse:collapse">rntttt<tr>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rntttt</tr>rntttt<tr>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rntttt</tr>rntttt<tr>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br 
/>rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rntttt</tr>rntttt<tr>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rntttt</tr>rntttt<tr>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rntttt</tr>rntttt<tr>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rnttttt<td style="border:1px solid"><br />rnttttt</td>rntttt</tr>rnttt</table>rnttt<br />rntt</div>rntt<!-- InkNode is not supported -->rntt<div style="position:absolute;left:399px;top:39px;width:624px">rnttt<img alt="Water Buffalo" width="256" height="256" src="https://graph.microsoft.com/v1.0/users('d2e7f1e9-3c9e-4f7a-8b6b-2302fee8af6a')/onenote/resources/1-8ea29e7dc05f4dedb229ed86f5b10b84!1-38E1572A-084A-4B2A-A530-2370B83A8542/$value" 
data-src-type="image/png" data-fullres-src="https://graph.microsoft.com/v1.0/users('d2e7f1e9-3c9e-4f7a-8b6b-2302fee8af6a')/onenote/resources/1-8ea29e7dc05f4dedb229ed86f5b10b84!1-38E1572A-084A-4B2A-A530-2370B83A8542/$value" data-fullres-src-type="image/png" />rnttt<br />rntt</div>rntt<div style="position:absolute;left:381px;top:375px;width:624px">rnttt<img alt="Water Buffalo" width="256" height="256" src="https://graph.microsoft.com/v1.0/users('d2e7f1e9-3c9e-4f7a-8b6b-2302fee8af6a')/onenote/resources/1-199df58a10154beebca3ca2ac4c69e4e!1-38E1572A-084A-4B2A-A530-2370B83A8542/$value" data-src-type="image/png" data-fullres-src="https://graph.microsoft.com/v1.0/users('d2e7f1e9-3c9e-4f7a-8b6b-2302fee8af6a')/onenote/resources/1-199df58a10154beebca3ca2ac4c69e4e!1-38E1572A-084A-4B2A-A530-2370B83A8542/$value" data-fullres-src-type="image/png" />rnttt<br />rntt</div>rntt<div style="position:absolute;left:57px;top:111px;width:624px">rnttt<img alt="Water Buffalo" width="256" height="256" src="https://graph.microsoft.com/v1.0/users('d2e7f1e9-3c9e-4f7a-8b6b-2302fee8af6a')/onenote/resources/1-8fe32926b9994022bc29527b6590f007!1-38E1572A-084A-4B2A-A530-2370B83A8542/$value" data-src-type="image/png" data-fullres-src="https://graph.microsoft.com/v1.0/users('d2e7f1e9-3c9e-4f7a-8b6b-2302fee8af6a')/onenote/resources/1-8fe32926b9994022bc29527b6590f007!1-38E1572A-084A-4B2A-A530-2370B83A8542/$value" data-fullres-src-type="image/png" />rnttt<br />rntt</div>rnt</body>rn</html>rn"

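Setting the Content-Type header to text/html seems inappropriate when the body cannot be interpreted as HTML right away. I think application/json would be more fitting if the body really is JSON-escaped, which it appears to be.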

In that case, you can simply use Regex.Unescape("...") to replace the escape sequences (for example "\n") with the actual characters they represent.
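A minimal sketch of that approach, assuming the response body is the whole escaped string including its surrounding double quotes. The class name HtmlResponseDecoder, the method Decode, the shortened sample body, and the output file name page1.html are all hypothetical and only there for illustration.

```csharp
using System;
using System.IO;
using System.Text.RegularExpressions;

public static class HtmlResponseDecoder
{
    // Hypothetical helper: converts an escaped body such as "<p>hi</p>\r\n"
    // (surrounding quotes included) into plain HTML text.
    public static string Decode(string rawBody)
    {
        string trimmed = rawBody.Trim();

        // Drop the surrounding double quotes if the body is wrapped in them.
        if (trimmed.Length >= 2 && trimmed[0] == '"' && trimmed[trimmed.Length - 1] == '"')
        {
            trimmed = trimmed.Substring(1, trimmed.Length - 2);
        }

        // Turn escape sequences such as \r, \n, \t and \" into the characters they stand for.
        return Regex.Unescape(trimmed);
    }

    public static void Main()
    {
        // Shortened stand-in for the real response body (assumption: backslash-escaped text).
        string rawBody = "\"<html lang=\\\"en-US\\\">\\r\\n\\t<head>\\r\\n\\t\\t<title>page1</title>\\r\\n\\t</head>\\r\\n</html>\\r\\n\"";

        string html = Decode(rawBody);

        // Store the decoded markup as an HTML file (file name is an assumption).
        File.WriteAllText("page1.html", html);
        Console.WriteLine(html);
    }
}
```

Regex.Unescape throws an ArgumentException on escape sequences it does not recognize, so if the body is known to be a JSON string literal, a JSON parser may be the more robust choice.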
