Cloudant index:统计每个时间段内唯一用户的数量

关于这个问题，这里有一个非常相似的帖子。在cloudant中，我有一个文档结构，用于存储用户访问应用程序时的数据，如下所示:

{"username":"one","timestamp":"2015-10-07T15:04:46Z"}—|同日{"username":"one","timestamp":"2015-10-07T19:22:00Z"}——^
{"username":"one","timestamp":"2015-10-25T04:22:00Z"}
{"username":"two","timestamp":"2015-10-07T19:22:00Z"}

我想知道的是计算给定时间段内唯一用户的数量。例:

2015-10-07 = {"count": 2} 两个不同的用户访问2015-10-07
2015-10-25 = {"count": 1} 一个不同的用户访问2015-10-25
2015 = {"count" 2} 2015年访问的两个不同用户

这一切都变得棘手，因为例如在2015-10-07，username: one有两条访问时间的记录，但它应该只返回1的唯一用户总数。

我试过:

function(doc) {
    var time = new Date(Date.parse(doc['timestamp'])); 
    emit([time.getUTCFullYear(),time.getUTCMonth(),time.getUTCDay(),doc.username], 1);
}

这有几个问题，Jesus Alva在我上面链接的文章中强调了这一点。

谢谢!

可能有更好的方法，但是我想…

您可以尝试为每个粒度级别发出一个索引:

function(doc) {
    var time = new Date(Date.parse(doc['timestamp'])); 
    var year = time.getUTCFullYear();
    var month = time.getUTCMonth()+1;
    var day = time.getUTCDate();
    // day granularity
    emit([year,month,day,doc.username], null);
    // year granularity
    emit([year,doc.username], null);
}
// reduce function - `_count`

日查询(2015-10-07):

inclusive_end=true&
start_key=[2015, 10, 7, "u0000"]&
end_key=[2015, 10, 7, "uefff"]&
reduce=true&
group=true

Day查询结果-应用程序代码将计算行数:

{"rows":[
  {"key":[2015,10,7,"one"],"value":2},
  {"key":[2015,10,7,"two"],"value":1}
]}

查询:

inclusive_end=true&
start_key=[2015, "u0000"]&
end_key=[2015, "uefff"]&
reduce=true&
group=true

查询结果-您的应用程序代码将计算行数:

{"rows":[
  {"key":[2015,"one"],"value":3},
  {"key":[2015,"two"],"value":1}
]}

相关内容

最新更新

热门标签：