正则表达式 Word 宏,可在彼此的范围内查找两个单词,然后将这些单词斜体化



所以,我刚刚开始理解正则表达式,我发现学习曲线相当陡峭。然而,stackoverflow在我的实验过程中非常有帮助。我想写一个特定的单词宏,但我还没有找到一种方法。我希望能够在文档中找到 10 个左右单词中的两个单词,然后将这些单词斜体化,如果单词相距超过 10 个单词或顺序不同,我希望宏不要将这些单词斜体化。

我一直在使用以下正则表达式:

bPanamaW+(?:w+W+){0,10}?Canalb

但是,它只允许作整个字符串,包括中间的随机单词。还有.替换功能仅允许我用不同的字符串替换该字符串,而不更改格式样式。

有没有更有经验的人知道如何做到这一点?甚至可能做到吗?


编辑:这是我到目前为止所拥有的。我有两个问题。首先,我不知道如何从匹配的正则表达式中选择单词"巴拿马"和"运河",并仅替换这些单词(而不是中间单词)。其次,我只是不知道如何替换与不同格式匹配的正则表达式,只有不同的文本字符串 - 可能只是由于对单词宏不熟悉。

Sub RegText()
Dim re As regExp
Dim para As Paragraph
Dim rng As Range
Set re = New regExp
re.Pattern = "bPanamaW+(?:w+W+){0,10}?Canalb"
re.IgnoreCase = True
re.Global = True
For Each para In ActiveDocument.Paragraphs
  Set rng = para.Range
  rng.MoveEnd unit:=wdCharacter, Count:=-1
  Text$ = rng.Text + "Modified"
  rng.Text = re.Replace(rng.Text, Text$)
Next para
End Sub

好的,感谢下面蒂姆·威廉姆斯的帮助,我得到了以下解决方案,它在某些方面有点笨拙,它绝不是纯粹的正则表达式,但它确实完成了工作。如果有人对如何解决这个问题有更好的解决方案或想法,我会很着迷地听到它。同样,我使用搜索和替换功能强制进行更改有点令人尴尬的粗糙,但至少它可以工作......

Sub RegText()
Dim re As regExp
Dim para As Paragraph
Dim rng As Range
Dim txt As String
Dim allmatches As MatchCollection, m As match
Set re = New regExp
re.pattern = "bPanamaW+(?:w+W+){0,13}?Canalb"
re.IgnoreCase = True
re.Global = True
For Each para In ActiveDocument.Paragraphs
  txt = para.Range.Text
  'any match?
  If re.Test(txt) Then
    'get all matches
    Set allmatches = re.Execute(txt)
    'look at each match and hilight corresponding range
    For Each m In allmatches
        Debug.Print m.Value, m.FirstIndex, m.Length
        Set rng = para.Range
        rng.Collapse wdCollapseStart
        rng.MoveStart wdCharacter, m.FirstIndex
        rng.MoveEnd wdCharacter, m.Length
        rng.Font.ColorIndex = wdOrange
    Next m
  End If
Next para
Selection.Find.ClearFormatting
Selection.Find.Font.ColorIndex = wdOrange
Selection.Find.Replacement.ClearFormatting
Selection.Find.Replacement.Font.Italic = True
With Selection.Find
    .Text = "Panama"
    .Replacement.Text = "Panama"
    .Forward = True
    .Wrap = wdFindContinue
    .Format = True
    .MatchCase = False
    .MatchWholeWord = False
    .MatchWildcards = False
    .MatchSoundsLike = False
    .MatchAllWordForms = False
End With
Selection.Find.Execute Replace:=wdReplaceAll
Selection.Find.ClearFormatting
Selection.Find.Font.ColorIndex = wdOrange
Selection.Find.Replacement.ClearFormatting
Selection.Find.Replacement.Font.Italic = True
With Selection.Find
    .Text = "Canal"
    .Replacement.Text = "Canal"
    .Forward = True
    .Wrap = wdFindContinue
    .Format = True
    .MatchCase = False
    .MatchWholeWord = False
    .MatchWildcards = False
    .MatchSoundsLike = False
    .MatchAllWordForms = False
End With
Selection.Find.Execute Replace:=wdReplaceAll
Selection.Find.ClearFormatting
Selection.Find.Font.ColorIndex = wdOrange
Selection.Find.Replacement.ClearFormatting
Selection.Find.Replacement.Font.ColorIndex = wdBlack
With Selection.Find
    .Text = ""
    .Replacement.Text = ""
    .Forward = True
    .Wrap = wdFindContinue
    .Format = True
    .MatchCase = False
    .MatchWholeWord = False
    .MatchWildcards = False
    .MatchSoundsLike = False
    .MatchAllWordForms = False
End With
Selection.Find.Execute Replace:=wdReplaceAll
End Sub

我离成为一名体面的Word程序员还有很长的路要走,但这可能会让你开始。

编辑:更新以包括参数化版本。

Sub Tester()
    HighlightIfClose ActiveDocument, "panama", "canal", wdBrightGreen
    HighlightIfClose ActiveDocument, "red", "socks", wdRed
End Sub

Sub HighlightIfClose(doc As Document, word1 As String, _
                     word2 As String, clrIndex As WdColorIndex)
    Dim re As RegExp
    Dim para As Paragraph
    Dim rng As Range
    Dim txt As String
    Dim allmatches As MatchCollection, m As match
    Set re = New RegExp
    re.Pattern = "b" & word1 & "W+(?:w+W+){0,10}?" _
                 & word2 & "b"
    re.IgnoreCase = True
    re.Global = True
    For Each para In ActiveDocument.Paragraphs
      txt = para.Range.Text
      'any match?
      If re.Test(txt) Then
        'get all matches
        Set allmatches = re.Execute(txt)
        'look at each match and hilight corresponding range
        For Each m In allmatches
            Debug.Print m.Value, m.FirstIndex, m.Length
            Set rng = para.Range
            rng.Collapse wdCollapseStart
            rng.MoveStart wdCharacter, m.FirstIndex
            rng.MoveEnd wdCharacter, Len(word1)
            rng.HighlightColorIndex = clrIndex
            Set rng = para.Range
            rng.Collapse wdCollapseStart
            rng.MoveStart wdCharacter, m.FirstIndex + (m.Length - Len(word2))
            rng.MoveEnd wdCharacter, Len(word2)
            rng.HighlightColorIndex = clrIndex
        Next m
      End If
    Next para
End Sub

如果你一次只做两个单词,这对我有用,按照你的练习线。

foo([a-zA-Z0-9]+? ){0,10}bar

解释:将抓取单词 1 ( foo ),然后匹配任何字母数字字符 ( [a-zA-Z0-9]+?) 后跟空格 ()、10 次 (bar ) 的单词,然后是单词 2 ( . )。

不包括句号(不知道您是否想要它们),但如果您想在正则表达式中foo this that bar后添加0-9句号。

因此,您的(伪代码)语法将类似于

$matches = preg_match_all(); // Your function to get regex matches in an array
foreach (those matches) {
    replace(KEY_WORD, <i>KEY_WORD</i>);
}

希望它有所帮助。下面的测试突出显示了它的匹配项。


工作:

foo economic order war bar废话

CC_9

没用

福经济秩序。 战争酒吧

全球 foo 秩序已经存在了几个世纪,在这段时间里,人们发展了不同而复杂的贸易关系,处理农业和酒吧等情况

最新更新