使用分隔符删除 Applescript 中的 HTML 标记



我如何使用分隔符从以下 HTML 中删除所有这些<br />标签,知道这只是大型 HTML 文件的一部分。我还想使用分隔符删除 html 文件中的其他标签。

<Education :<br />
<br />
- School-leaving exam type B, Zug<br />
- Basic course HSG<br />
- Business economist FH, HWV St. Gallen (1990)<br />
<br />
Professional development :<br />
<br />
- 1984-97 GESTIO Treuhand- und Verwaltungs AG, Zug (part time)<br />
- 1985-86 Financial administration Canton of St. Gallen<br />
- 1991-94 Gestinor Services AG, Zug<br />
- 1994-97 Revisuisse Price Waterhouse AG, Zug, Taxes and Law<br />
- 1997 Founding of Bohnet & Schlatter Treuhand AG<br />
<br />
Experience :<br />
<br />
- Tax consultation of legal and natural persons<br />
- Preparation of tax statements for legal and natural persons<br />
- Preparation of structural plans and execution of organizational processes<br />
- Management and support on responsibilities in finances and accounting.

提前谢谢。

替换文本是最基本的,例如,大多数人的库中都有一个处理程序

on run -- example
    set theText to "<Education :<br />
<br />
- School-leaving exam type B, Zug<br />..."
    replaceTextInString_fromOldItem_toNewItem_(theText, "<br />", "")
end run

to replaceTextInString_fromOldItem_toNewItem_(someText, oldItem, newItem)
    (*
    replace all occurances of oldItem with newItem
        parameters -    someText [text]: the text containing the item(s) to change
                        oldItem [text]: the item to be replaced
                        newItem [text]: the item to replace with
        returns [text]: the text with the item(s) replaced
    *)
    set {tempTID, AppleScript's text item delimiters} to {AppleScript's text item delimiters, oldItem}
    try
        set {pieces, AppleScript's text item delimiters} to {text items of someText, newItem}
        set {someText, AppleScript's text item delimiters} to {pieces as text, tempTID}
    on error errorMessage number errorNumber -- oops
        set AppleScript's text item delimiters to tempTID -- make sure TID's are reset
        error errorMessage number errorNumber -- pass on error
    end try
    return someText
end replaceTextInString_fromOldItem_toNewItem_

请注意,在 Snow Leopard 和 Lion 中,要替换的项目的文本分隔符(在我的示例中为 oldItem 参数)可以是项目列表。

最新更新