How to build a data frame that computes a trimmed mean and reports the number of observations in R



I have this data set:

library(tibble)  # provides tribble()

Data <- tribble(
  ~Name,      ~Age, ~Gender,
  "Cristian", 74,   "Male",
  "Ana",      19,   "Female",
  "Luis",     20,   "Male",
  "Jaime",    21,   "Male",
  "Jennifer", 18,   "Female"
)

I want to summarise the Age variable into a data frame like the following:

Gender   AgeMean   Total   Trimmed
Female   18.5      2       0
Male     38.3      3       0

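mean() also has a trim argument. We can use it to get the trimmed mean, and compute "Total" (the number of values that are actually averaged) with the same formula that mean.default uses: after sorting, it keeps elements lo through hi, where lo = floor(n * trim) + 1 and hi = n + 1 - lo.
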
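library(dplyr)

Data %>%
  group_by(Gender) %>%
  # lo is the first sorted index that mean.default keeps for trim = 0.4
  mutate(lo = floor(n() * 0.4) + 1) %>%
  summarise(
    AgeMean = mean(Age, trim = 0.4),
    # values kept are lo:hi, with hi = n + 1 - lo
    Total   = length(first(lo):(n() + 1 - first(lo))),
    Trimmed = 0.4,
    .groups = "drop"
  )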
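
To check the index arithmetic on its own, here is a minimal base-R sketch of the same summary. The helper name kept_after_trim is hypothetical (it is not part of R or dplyr), and the sketch assumes 0 <= trim < 0.5, because mean.default falls back to the median for larger values. The ages are copied from the question's data.

```r
# Hypothetical helper: how many sorted values mean(x, trim = trim) averages,
# using the same lo/hi indices as mean.default. Assumes 0 <= trim < 0.5.
kept_after_trim <- function(n, trim) {
  stopifnot(trim >= 0, trim < 0.5)
  lo <- floor(n * trim) + 1  # first index kept after sorting
  hi <- n + 1 - lo           # last index kept after sorting
  hi - lo + 1
}

# Ages from the question, split by Gender
ages <- list(Female = c(19, 18), Male = c(74, 20, 21))
trim <- 0.4

result <- data.frame(
  Gender  = names(ages),
  AgeMean = vapply(ages, mean, numeric(1), trim = trim),
  Total   = vapply(ages, function(x) kept_after_trim(length(x), trim), numeric(1)),
  Trimmed = trim,
  row.names = NULL
)
print(result)
```

With trim = 0.4, both Female values are kept (AgeMean 18.5, Total 2), while only the middle Male value survives (AgeMean 21, Total 1). Setting trim to 0 keeps every value, so Total equals the group size.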
