查询以获取失败请求百分比 Azure 应用程序见解



我正在尝试为以下条件构建警报:在 15 分钟内,如果失败请求的数量大于收到的请求的 99%,我想发出警报。我已经编写了一个 KQL 查询,但不幸的是,即使没有发生实际问题,它也会触发,即没有真正获得大于 99% 的条件。以下是查询,我确定我在其中犯了一些愚蠢的错误,有什么帮助吗?

修复上述查询的任何帮助,因此它只有在批评时才真正给出结果,即当收到的所有请求都失败时。

requests 
| where cloud_RoleName == 'ABCDEF_cloudRName' and resultCode != '404' 
| summarize FailedPercent=((countif(success == false))/count() by timestamp, cloud_RoleName, appName)*100 
| where FailedPercent > 99 
| project RelatedCI='XYZZZ',AlarmTime=timestamp,Category="Cloud-Azure-Monitor",SubCategory="Application",Object=appName ,"Value of Metric","Percentage Failed Requests"," is ", FailedPercent

下面是在失败百分比大于 xx% 时发送警报的类似问题。

我只是写一个查询,如果它不符合您的需求,请随时修改它:

requests
| where resultCode != "404" and success == "False" 
| summarize exceptionsCount =count()
| extend a = "a"
| join
(
requests
| where resultCode != "404" 
| summarize requestsCount =count()
| extend a = "a"
)
on a
| project isFail = 1.0 * exceptionsCount / requestsCount > 0.99 //check if the failed percentage is greater than 99%.
| project rr=iff(isFail, "Fail","Pass" ) 
| where rr=="Fail"

查询代码准备就绪后,可以按照上述问题中的步骤创建基于查询的警报。

最新更新